All Categories → Machine Learning → word-embeddings

Top 105 word-embeddings open source projects

SPINE
Code for SPINE - Sparse Interpretable Neural Embeddings. Jhamtani H.*, Pruthi D.*, Subramanian A.*, Berg-Kirkpatrick T., Hovy E. AAAI 2018
sembei
🍘 単語分割を経由しない単語埋め込み 🍘
yelp comments classification nlp
Yelp round-10 review comments classification using deep learning (LSTM and CNN) and natural language processing.
SWDM
SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
word embedding
Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
conec
Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings
Naive-Resume-Matching
Text Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Word-recognition-EmbedNet-CAB
Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
context2vec
PyTorch implementation of context2vec from Melamud et al., CoNLL 2016
wikidata-corpus
Train Wikidata with word2vec for word embedding tasks
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
SiameseCBOW
Implementation of Siamese CBOW using keras whose backend is tensorflow.
SIFRank
The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
JoSH
[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Active-Explainable-Classification
A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification
compress-fasttext
Tools for shrinking fastText models (in gensim format)
MorphologicalPriorsForWordEmbeddings
Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings
contextualLSTM
Contextual LSTM for NLP tasks like word prediction and word embedding creation for Deep Learning
word2vec-on-wikipedia
A pipeline for training word embeddings using word2vec on wikipedia corpus.
S-WMD
Code for Supervised Word Mover's Distance (SWMD)
fuzzymax
Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
wefe
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Word2VecfJava
Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions
two-stream-cnn
A two-stream convolutional neural network for learning abitrary similarity functions over two sets of training data
HiCE
Code for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"
Simple-Sentence-Similarity
Exploring the simple sentence similarity measurements using word embeddings
61-105 of 105 word-embeddings projects