bns-short-text-similarity📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Stars: ✭ 24 (-62.5%)
koolslaFood recommendation tool with Machine learning.
Stars: ✭ 21 (-67.19%)
Recommender-SystemsImplementing Content based and Collaborative filtering(with KNN, Matrix Factorization and Neural Networks) in Python
Stars: ✭ 46 (-28.12%)
lucillaFast, efficient, in-memory Full Text Search for Kotlin
Stars: ✭ 102 (+59.38%)
TextminingPython文本挖掘系统 Research of Text Mining System
Stars: ✭ 268 (+318.75%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+160.94%)
occupationcoderGiven a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.
Stars: ✭ 30 (-53.12%)
StringlifierStringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials and as a pre-processing step in unsupervised ML-based analysis of application text data.
Stars: ✭ 85 (+32.81%)
SentimentAnalysis(BOW, TF-IDF, Word2Vec, BERT) Word Embeddings + (SVM, Naive Bayes, Decision Tree, Random Forest) Base Classifiers + Pre-trained BERT on Tensorflow Hub + 1-D CNN and Bi-Directional LSTM on IMDB Movie Reviews Dataset
Stars: ✭ 40 (-37.5%)
TextAudit一个短视频app文本审核模块的实现思路及demo
Stars: ✭ 63 (-1.56%)
PolyfuzzFuzzy string matching, grouping, and evaluation.
Stars: ✭ 292 (+356.25%)
CadmiumNatural Language Processing (NLP) library for Crystal
Stars: ✭ 172 (+168.75%)
text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+193.75%)
iresearchIResearch is a cross-platform, high-performance document oriented search engine library written entirely in C++ with the focus on a pluggability of different ranking/similarity models
Stars: ✭ 121 (+89.06%)
watchmanWatchman: An open-source social-media event-detection system
Stars: ✭ 18 (-71.87%)
VtextSimple NLP in Rust with Python bindings
Stars: ✭ 108 (+68.75%)
soanSocial Analysis based on Whatsapp data
Stars: ✭ 106 (+65.63%)
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+53.13%)
SSE-PTCodes and Datasets for paper RecSys'20 "SSE-PT: Sequential Recommendation Via Personalized Transformer" and NurIPS'19 "Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers"
Stars: ✭ 103 (+60.94%)
Keyword-ExtracterProblem Statement: Given a particular PDF/Text document ,How to extract keywords and arrange in order of their weightage using Python?
Stars: ✭ 17 (-73.44%)
NlpSelected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Stars: ✭ 304 (+375%)
RecommenderSystemsNotebooksSet of notebooks analysing and discussing the ideas presented at Coursera's Recommender Systems course
Stars: ✭ 28 (-56.25%)
NewsSearch主要使用python+Scrapy框架去抓取新闻网站
Stars: ✭ 23 (-64.06%)
VntkVietnamese NLP Toolkit for Node
Stars: ✭ 170 (+165.63%)
Recommendersystem DatasetThis repository contains some datasets that I have collected in Recommender Systems.
Stars: ✭ 249 (+289.06%)
lorcaNatural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more!
Stars: ✭ 95 (+48.44%)
SnowballImplementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
Stars: ✭ 131 (+104.69%)
weibo-summary微博自动摘要系统 Chinese Microblog Automatic Summary System
Stars: ✭ 28 (-56.25%)
fb scraperFBLYZE is a Facebook scraping system and analysis system.
Stars: ✭ 61 (-4.69%)
ResumeRiseAn NLP tool which classifies and summarizes resumes
Stars: ✭ 29 (-54.69%)
SoqalArabic Open Domain Question Answering System using Neural Reading Comprehension
Stars: ✭ 72 (+12.5%)
devsearchA web search engine built with Python which uses TF-IDF and PageRank to sort search results.
Stars: ✭ 52 (-18.75%)
slopeonePHP implementation of the Weighted Slope One rating-based collaborative filtering scheme.
Stars: ✭ 85 (+32.81%)
GreynirThe greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-26.56%)
KeywordExtractionImplementation of algorithm in keyword extraction,including TextRank,TF-IDF and the combination of both
Stars: ✭ 95 (+48.44%)
Nepali-News-ClassifierText Classification of Nepali Language Document. This Mini Project was done for the partial fulfillment of NLP Course : COMP 473.
Stars: ✭ 13 (-79.69%)
DefactonlpDeFactoNLP: An Automated Fact-checking System that uses Named Entity Recognition, TF-IDF vector comparison and Decomposable Attention models.
Stars: ✭ 30 (-53.12%)
clusterixVisual exploration of clustered data.
Stars: ✭ 44 (-31.25%)
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+1134.38%)
Recommender SystemA developing recommender system in tensorflow2. Algorithm: UserCF, ItemCF, LFM, SLIM, GMF, MLP, NeuMF, FM, DeepFM, MKR, RippleNet, KGCN and so on.
Stars: ✭ 227 (+254.69%)
NGCF-PyTorchPyTorch Implementation for Neural Graph Collaborative Filtering
Stars: ✭ 200 (+212.5%)
pygramsExtracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
Stars: ✭ 52 (-18.75%)
Python Tf IdfAn extremely simple Python library to perform TF-IDF document comparison.
Stars: ✭ 214 (+234.38%)
MovieboxMachine learning movie recommending system
Stars: ✭ 504 (+687.5%)