KoanA word2vec negative sampling implementation with correct CBOW update.
WordgcnACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Chameleon recsysSource code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Vec4irWord Embeddings for Information Retrieval
Datastories Semeval2017 Task4Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
TextheroText preprocessing, representation and visualization from zero to hero.
DebiasweRemove problematic gender bias from word embeddings.
Sifrank zh基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
LftmImproving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
GensimTopic Modelling for Humans
MimickCode for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Elmo TutorialA short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
Hash EmbeddingsPyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
ScattertextBeautiful visualizations of how language differs among document types.
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
KadotKadot, the unsupervised natural language processing library.
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
MagnitudeA fast, efficient universal vector embedding utility package.
Postgres Word2vecutils to use word embedding like word2vec vectors in a postgres database
Dict2vecDict2vec is a framework to learn word embeddings using lexical dictionaries.
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Nlp overviewOverview of Modern Deep Learning Techniques Applied to Natural Language Processing
Average Word2vec🔤 Calculate average word embeddings (word2vec) from documents for transfer learning
EmbeddingsvizVisualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Top2vecTop2Vec learns jointly embedded topic, document and word vectors.
Syntree2vecAn algorithm to augment syntactic hierarchy into word embeddings
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
InltkNatural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
MetaA Modern C++ Data Sciences Toolkit
Nlp NotebooksA collection of notebooks for Natural Language Processing from NLP Town
Bert Embedding🔡 Token level embeddings from BERT model on mxnet and gluonnlp
Deep learning nlpKeras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
WegoWord Embeddings (e.g. Word2Vec) in Go!
ChakinSimple downloader for pre-trained word vectors
BiosentvecBioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
neuralnets-semanticsWord semantics Deep Learning with Vanilla Python, Keras, Theano, TensorFlow, PyTorch
Lbl2VecLbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.