Text2vec: Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+1643.9%)
word embedding: Sample code for training Word2Vec and FastText on a wiki corpus, along with their pretrained word embeddings.
Stars: ✭ 21 (-48.78%)
word-benchmarks: Benchmarks for intrinsic word embeddings evaluation.
Stars: ✭ 45 (+9.76%)
Glove As A Tensorflow Embedding Layer: Takes a pretrained GloVe model and uses it as a TensorFlow embedding weight layer **inside the GPU**. Only the word indices then need to cross the GPU data transfer bus, which reduces data transfer overhead.
Stars: ✭ 85 (+107.32%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+31029.27%)
Text-Analysis: Explains textual analysis tools in Python, including preprocessing, skip-gram (word2vec), and topic modelling.
Stars: ✭ 48 (+17.07%)
NTUA-slp-nlp: 💻 Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-53.66%)
Fasttext.js: FastText for Node.js
Stars: ✭ 127 (+209.76%)
Dict2vec: A framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (+121.95%)
Scattertext: Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+4100%)
Germanwordembeddings: Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets.
Stars: ✭ 189 (+360.98%)
Wego: Word Embeddings (e.g. Word2Vec) in Go!
Stars: ✭ 336 (+719.51%)
wikidata-corpus: Train Wikidata with word2vec for word embedding tasks
Stars: ✭ 109 (+165.85%)
Text Summarizer: Python Framework for Extractive Text Summarization
Stars: ✭ 96 (+134.15%)
lda2vec: Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec, from the paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-34.15%)
Magnitude: A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+3300%)
Shallowlearn: An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+378.05%)
word2vec-tsne: Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Stars: ✭ 59 (+43.9%)
Dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (+185.37%)
Postgres Word2vec: Utilities for using word embeddings such as word2vec vectors in a Postgres database.
Stars: ✭ 96 (+134.15%)
Chameleon recsys: Source code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
Stars: ✭ 202 (+392.68%)
Koan: A word2vec negative sampling implementation with correct CBOW update.
Stars: ✭ 232 (+465.85%)
Deep learning nlp: Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
Stars: ✭ 407 (+892.68%)
two-stream-cnn: A two-stream convolutional neural network for learning arbitrary similarity functions over two sets of training data
Stars: ✭ 24 (-41.46%)
SWDM: SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
Stars: ✭ 35 (-14.63%)
Debiaswe: Remove problematic gender bias from word embeddings.
Stars: ✭ 175 (+326.83%)
word2vec-on-wikipedia: A pipeline for training word embeddings using word2vec on a Wikipedia corpus.
Stars: ✭ 68 (+65.85%)
SIFRank: The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
Stars: ✭ 96 (+134.15%)
cyber-matrix-ai: Collection of topics relevant to cyber security and "AI"
Stars: ✭ 69 (+68.29%)
dnn-lstm-word-segment: Chinese word segmentation based on deep learning and an LSTM neural network
Stars: ✭ 24 (-41.46%)
word2viz: Visualization of semantic similarities in word embeddings.
Stars: ✭ 86 (+109.76%)
text classifier: A text classification project for TensorFlow 2.3 that supports a variety of classification models and related tricks.
Stars: ✭ 135 (+229.27%)
datastories-semeval2017-task6: Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-51.22%)
gonnp: 📉 Deep learning from scratch using Go. Specializes in natural language processing.
Stars: ✭ 26 (-36.59%)
navec: Compact, high-quality word embeddings for the Russian language
Stars: ✭ 118 (+187.8%)
doc2vec-golang: doc2vec and word2vec implemented in Go, for word embedding representation.
Stars: ✭ 33 (-19.51%)
vector space modelling: NLP in Python: vector space modelling and document classification.
Stars: ✭ 16 (-60.98%)
Word-recognition-EmbedNet-CAB: Code implementation for our ICPR 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
Stars: ✭ 19 (-53.66%)
GraphDBLP: A graph-based instance of DBLP
Stars: ✭ 33 (-19.51%)
JoSH: [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Stars: ✭ 55 (+34.15%)
Active-Explainable-Classification: A set of tools for leveraging pre-trained embeddings, active learning and model explainability for efficient document classification
Stars: ✭ 28 (-31.71%)
word2vec: Use word2vec to improve search results
Stars: ✭ 63 (+53.66%)
Word2VecJava: Word2Vec in Java (Google's 2013 open-source word2vec)
Stars: ✭ 13 (-68.29%)
QuestionClustering: A question classifier written in Python 3, as implemented in the following video: https://youtu.be/qnlW1m6lPoY
Stars: ✭ 15 (-63.41%)
NLP PEMDC: NLP Pretrained Embeddings, Models and Datasets Collection (NLP_PEMDC). The collection will keep updating.
Stars: ✭ 58 (+41.46%)
wmd4j: A Java library for calculating Word Mover's Distance (WMD)
Stars: ✭ 31 (-24.39%)
Naive-Resume-Matching: Text similarity applied to resumes, comparing them with job descriptions and producing a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-34.15%)
SentimentAnalysis: Sentiment analysis with a deep Bi-LSTM + attention model
Stars: ✭ 32 (-21.95%)
word2vec pipeline: NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)
Stars: ✭ 108 (+163.41%)
test word2vec uyghur: An example of trying out Uyghur script with the word2vec algorithm in Python's gensim library.
Stars: ✭ 15 (-63.41%)
word2vec: Rust interface to word2vec.
Stars: ✭ 22 (-46.34%)