Textblob ArArabic support for textblob
Stars: ✭ 60 (+140%)
MetaA Modern C++ Data Sciences Toolkit
Stars: ✭ 600 (+2300%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+684%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-8%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (+8%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (+332%)
FastrtextR wrapper for fastText
Stars: ✭ 103 (+312%)
JfasttextJava interface for fastText
Stars: ✭ 193 (+672%)
TorchBlocksA PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (+240%)
NSP-BERTThe code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
Stars: ✭ 166 (+564%)
yelp comments classification nlpYelp round-10 review comments classification using deep learning (LSTM and CNN) and natural language processing.
Stars: ✭ 72 (+188%)
opentcOpenTC is a text classification engine using several algorithms in machine learning
Stars: ✭ 27 (+8%)
CaverCaver: a toolkit for multilabel text classification.
Stars: ✭ 38 (+52%)
augmentyAugmenty is an augmentation library based on spaCy for augmenting texts.
Stars: ✭ 101 (+304%)
SWDMSIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
Stars: ✭ 35 (+40%)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-16%)
text-classification-svmThe missing SVM-based text classification module implementing HanLP's interface
Stars: ✭ 46 (+84%)
conecContext Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings
Stars: ✭ 20 (-20%)
codenamesCodenames AI using Word Vectors
Stars: ✭ 41 (+64%)
policy-data-analyzerBuilding a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (-12%)
fake-news-detectionThis repo is a collection of AWESOME things about fake news detection, including papers, code, etc.
Stars: ✭ 34 (+36%)
HiGitClassHiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)
Stars: ✭ 58 (+132%)
ML2017FALLMachine Learning (EE 5184) in NTU
Stars: ✭ 66 (+164%)
NewsMTSCTarget-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
Stars: ✭ 54 (+116%)
ebe-datasetEvidence-based Explanation Dataset (AACL-IJCNLP 2020)
Stars: ✭ 16 (-36%)
textgoText preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Stars: ✭ 33 (+32%)
node-fasttextNodejs binding for fasttext representation and classification.
Stars: ✭ 39 (+56%)
textlyticsText processing library for sentiment analysis and related tasks
Stars: ✭ 25 (+0%)
DaDengAndHisPython【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱
[email protected] Stars: ✭ 59 (+136%)
extremeTextLibrary for fast text representation and extreme classification.
Stars: ✭ 141 (+464%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (+32%)
BaySMMModel for learning document embeddings along with their uncertainties
Stars: ✭ 25 (+0%)
text2classMulti-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT
Stars: ✭ 15 (-40%)
MetaCatMinimally Supervised Categorization of Text with Metadata (SIGIR'20)
Stars: ✭ 52 (+108%)
MetaLifelongLanguageRepository containing code for the paper "Meta-Learning with Sparse Experience Replay for Lifelong Language Learning".
Stars: ✭ 21 (-16%)
WeSTClass[CIKM 2018] Weakly-Supervised Neural Text Classification
Stars: ✭ 67 (+168%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (+20%)
WSDM-Cup-2019[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.
Stars: ✭ 62 (+148%)
SPINECode for SPINE - Sparse Interpretable Neural Embeddings. Jhamtani H.*, Pruthi D.*, Subramanian A.*, Berg-Kirkpatrick T., Hovy E. AAAI 2018
Stars: ✭ 44 (+76%)
small-textActive Learning for Text Classification in Python
Stars: ✭ 241 (+864%)
synaptic-simple-trainerA ready to go text classification trainer based on synaptic (https://github.com/cazala/synaptic)
Stars: ✭ 19 (-24%)
HiGRUsImplementation of the paper "Hierarchical GRU for Utterance-level Emotion Recognition" in NAACL-2019.
Stars: ✭ 60 (+140%)
Word-recognition-EmbedNet-CABCode implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
Stars: ✭ 19 (-24%)
NLP-StuffPrograms with word vectors, RNN, NLP stuff, etc
Stars: ✭ 19 (-24%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+468%)
DeepClassifierDeepClassifier is aimed at building general text classification model library.It's easy and user-friendly to build any text classification task.
Stars: ✭ 25 (+0%)
NTUA-slp-nlp💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-24%)
nlp classificationImplementing nlp papers relevant to classification with PyTorch, gluonnlp
Stars: ✭ 224 (+796%)
monkeylearn-phpOfficial PHP client for the MonkeyLearn API. Build and consume machine learning models for language processing from your PHP apps.
Stars: ✭ 47 (+88%)
Filipino-Text-BenchmarksOpen-source benchmark datasets and pretrained transformer models in the Filipino language.
Stars: ✭ 22 (-12%)