All Categories → Machine Learning → word-embeddings

Top 105 word-embeddings open source projects

Spanish Word Embeddings
Spanish word embeddings computed with different methods and from different corpora
Koan
A word2vec negative sampling implementation with correct CBOW update.
Wordgcn
ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Question Generation
Generating multiple choice questions from text using Machine Learning.
Shallowlearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Germanwordembeddings
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Datastories Semeval2017 Task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Debiaswe
Remove problematic gender bias from word embeddings.
Sifrank zh
基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Lftm
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Mimick
Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Elmo Tutorial
A short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
Spherical Text Embedding
[NeurIPS 2019] Spherical Text Embedding
Hash Embeddings
PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
Dna2vec
dna2vec: Consistent vector representations of variable-length k-mers
Danlp
DaNLP is a repository for Natural Language Processing resources for the Danish Language.
Kadot
Kadot, the unsupervised natural language processing library.
Easy Bert
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Postgres Word2vec
utils to use word embedding like word2vec vectors in a postgres database
Dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Glove As A Tensorflow Embedding Layer
Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Clustercat
Fast Word Clustering Software
Nlp overview
Overview of Modern Deep Learning Techniques Applied to Natural Language Processing
Lstm Context Embeddings
Augmenting word embeddings with their surrounding context using bidirectional RNN
Average Word2vec
🔤 Calculate average word embeddings (word2vec) from documents for transfer learning
Embeddingsviz
Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Top2vec
Top2Vec learns jointly embedded topic, document and word vectors.
Wordnetembeddings
Obtaining word embeddings from a WordNet ontology
Syntree2vec
An algorithm to augment syntactic hierarchy into word embeddings
Concise Ipython Notebooks For Deep Learning
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Inltk
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Bert Embedding
🔡 Token level embeddings from BERT model on mxnet and gluonnlp
Wego
Word Embeddings (e.g. Word2Vec) in Go!
Chakin
Simple downloader for pre-trained word vectors
Biosentvec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Hands On Deep Learning Algorithms With Python
Master Deep Learning Algorithms with Extensive Math by Implementing them using TensorFlow
neuralnets-semantics
Word semantics Deep Learning with Vanilla Python, Keras, Theano, TensorFlow, PyTorch
Lbl2Vec
Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
1-60 of 105 word-embeddings projects