Python Tf IdfAn extremely simple Python library to perform TF-IDF document comparison.
CadmiumNatural Language Processing (NLP) library for Crystal
VntkVietnamese NLP Toolkit for Node
TextvecText vectorization tool to outperform TFIDF for classification tasks
SnowballImplementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
VtextSimple NLP in Rust with Python bindings
StringlifierStringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials and as a pre-processing step in unsupervised ML-based analysis of application text data.
SoqalArabic Open Domain Question Answering System using Neural Reading Comprehension
GreynirThe greynir.is natural language processing website for Icelandic
DefactonlpDeFactoNLP: An Automated Fact-checking System that uses Named Entity Recognition, TF-IDF vector comparison and Decomposable Attention models.
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
MovieboxMachine learning movie recommending system
NlpSelected Machine Learning algorithms for natural language processing and semantic analysis in Golang
PolyfuzzFuzzy string matching, grouping, and evaluation.
TextminingPython文本挖掘系统 Research of Text Mining System
text2textText2Text: Cross-lingual natural language processing and generation toolkit
iresearchIResearch is a cross-platform, high-performance document oriented search engine library written entirely in C++ with the focus on a pluggability of different ranking/similarity models
lucillaFast, efficient, in-memory Full Text Search for Kotlin
lorcaNatural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more!
watchmanWatchman: An open-source social-media event-detection system
occupationcoderGiven a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.
fb scraperFBLYZE is a Facebook scraping system and analysis system.
soanSocial Analysis based on Whatsapp data
SentimentAnalysis(BOW, TF-IDF, Word2Vec, BERT) Word Embeddings + (SVM, Naive Bayes, Decision Tree, Random Forest) Base Classifiers + Pre-trained BERT on Tensorflow Hub + 1-D CNN and Bi-Directional LSTM on IMDB Movie Reviews Dataset
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
devsearchA web search engine built with Python which uses TF-IDF and PageRank to sort search results.
Keyword-ExtracterProblem Statement: Given a particular PDF/Text document ,How to extract keywords and arrange in order of their weightage using Python?
Nepali-News-ClassifierText Classification of Nepali Language Document. This Mini Project was done for the partial fulfillment of NLP Course : COMP 473.
clusterixVisual exploration of clustered data.
Recommender-SystemsImplementing Content based and Collaborative filtering(with KNN, Matrix Factorization and Neural Networks) in Python
ResumeRiseAn NLP tool which classifies and summarizes resumes
bns-short-text-similarity📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
KeywordExtractionImplementation of algorithm in keyword extraction,including TextRank,TF-IDF and the combination of both
pygramsExtracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
koolslaFood recommendation tool with Machine learning.