DebiasweRemove problematic gender bias from word embeddings.
Stars: ✭ 175 (+212.5%)
yelp comments classification nlpYelp round-10 review comments classification using deep learning (LSTM and CNN) and natural language processing.
Stars: ✭ 72 (+28.57%)
Text SummarizerPython Framework for Extractive Text Summarization
Stars: ✭ 96 (+71.43%)
textlyticsText processing library for sentiment analysis and related tasks
Stars: ✭ 25 (-55.36%)
Question GenerationGenerating multiple choice questions from text using Machine Learning.
Stars: ✭ 227 (+305.36%)
conecContext Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings
Stars: ✭ 20 (-64.29%)
Dict2vecDict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (+62.5%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-51.79%)
LftmImproving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Stars: ✭ 168 (+200%)
Word-recognition-EmbedNet-CABCode implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
Stars: ✭ 19 (-66.07%)
Pytorch Sentiment AnalysisTutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+5630.36%)
wikidata-corpusTrain Wikidata with word2vec for word embedding tasks
Stars: ✭ 109 (+94.64%)
Textblob ArArabic support for textblob
Stars: ✭ 60 (+7.14%)
MimickCode for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Stars: ✭ 152 (+171.43%)
word2vec-tsneGoogle News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Stars: ✭ 59 (+5.36%)
Lstm Context EmbeddingsAugmenting word embeddings with their surrounding context using bidirectional RNN
Stars: ✭ 57 (+1.79%)
SIFRankThe code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
Stars: ✭ 96 (+71.43%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+250%)
Active-Explainable-ClassificationA set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification
Stars: ✭ 28 (-50%)
Elmo TutorialA short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
Stars: ✭ 145 (+158.93%)
SCL📄 Spatial Contrastive Learning for Few-Shot Classification (ECML/PKDD 2021).
Stars: ✭ 42 (-25%)
contextualLSTMContextual LSTM for NLP tasks like word prediction and word embedding creation for Deep Learning
Stars: ✭ 28 (-50%)
WordnetembeddingsObtaining word embeddings from a WordNet ontology
Stars: ✭ 33 (-41.07%)
word2vec-on-wikipediaA pipeline for training word embeddings using word2vec on wikipedia corpus.
Stars: ✭ 68 (+21.43%)
Fasttext.jsFastText for Node.js
Stars: ✭ 127 (+126.79%)
PersianNERNamed-Entity Recognition in Persian Language
Stars: ✭ 48 (-14.29%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-58.93%)
wefeWEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Stars: ✭ 164 (+192.86%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (+237.5%)
Word2VecfJavaWord2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions
Stars: ✭ 14 (-75%)
InltkNatural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Stars: ✭ 702 (+1153.57%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+2975%)
TransferlearningTransfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
Stars: ✭ 8,481 (+15044.64%)
Nlp NotebooksA collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (+816.07%)
few-shot-gan-adaptation[CVPR '21] Official repository for Few-shot Image Generation via Cross-domain Correspondence
Stars: ✭ 198 (+253.57%)
KoanA word2vec negative sampling implementation with correct CBOW update.
Stars: ✭ 232 (+314.29%)
FewCLUEFewCLUE 小样本学习测评基准,中文版
Stars: ✭ 251 (+348.21%)
Deep learning nlpKeras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
Stars: ✭ 407 (+626.79%)
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+19658.93%)
mmfewshotOpenMMLab FewShot Learning Toolbox and Benchmark
Stars: ✭ 336 (+500%)
ChakinSimple downloader for pre-trained word vectors
Stars: ✭ 323 (+476.79%)
Meta-Fine-Tuning[CVPR 2020 VL3] The repository for meta fine-tuning in cross-domain few-shot learning.
Stars: ✭ 29 (-48.21%)
Datastories Semeval2017 Task4Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (+228.57%)
WordgcnACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (+310.71%)
TextheroText preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+4198.21%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (+89.29%)
Lbl2VecLbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
Stars: ✭ 25 (-55.36%)