TextheroText preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+937.5%)
JfasttextJava interface for fastText
Stars: ✭ 193 (-16.81%)
Deep Math Machine Learning.aiA blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Stars: ✭ 173 (-25.43%)
Mlsourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
Stars: ✭ 136 (-41.38%)
Lstm Context EmbeddingsAugmenting word embeddings with their surrounding context using bidirectional RNN
Stars: ✭ 57 (-75.43%)
GraphwavemachineA scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
Stars: ✭ 151 (-34.91%)
Word2vecPython interface to Google word2vec
Stars: ✭ 2,370 (+921.55%)
Transorthogonal LinguisticsUses a distributed word representation to finds words along the hyperchord of two input words.
Stars: ✭ 93 (-59.91%)
Textfeatures👷♂️ A simple package for extracting useful features from character objects 👷♀️
Stars: ✭ 148 (-36.21%)
Word2vec訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Stars: ✭ 48 (-79.31%)
Cw2vec基于字符训练词向量
Stars: ✭ 80 (-65.52%)
Skip Thoughts.torchPorting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (-37.07%)
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-67.24%)
SplitterA Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Stars: ✭ 177 (-23.71%)
Word2vecGo library for performing computations in word2vec binary models
Stars: ✭ 143 (-38.36%)
Role2vecA scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (-42.24%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-81.47%)
ClustercatFast Word Clustering Software
Stars: ✭ 65 (-71.98%)
Deeplearning Nlp ModelsA small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.
Stars: ✭ 64 (-72.41%)
Sifrank zh基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Stars: ✭ 175 (-24.57%)
ChiveJapanese word embedding with Sudachi and NWJC 🌿
Stars: ✭ 63 (-72.84%)
Word2vec对 ansj 编写的 Word2VEC_java 的进一步包装,同时实现了常用的词语相似度和句子相似度计算。
Stars: ✭ 136 (-41.38%)
Nlp overviewOverview of Modern Deep Learning Techniques Applied to Natural Language Processing
Stars: ✭ 1,104 (+375.86%)
Question GenerationGenerating multiple choice questions from text using Machine Learning.
Stars: ✭ 227 (-2.16%)
Average Word2vec🔤 Calculate average word embeddings (word2vec) from documents for transfer learning
Stars: ✭ 52 (-77.59%)
PujanggaPujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec
Stars: ✭ 47 (-79.74%)
WordvectorsPre-trained word vectors of 30+ languages
Stars: ✭ 2,043 (+780.6%)
EmbeddingsvizVisualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Stars: ✭ 40 (-82.76%)
Word2vec Russian NovelsInspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris Orekhov
Stars: ✭ 39 (-83.19%)
Log Anomaly DetectorLog Anomaly Detection - Machine learning to detect abnormal events logs
Stars: ✭ 169 (-27.16%)
Scattertext PydataNotebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-43.1%)
Pytorch SkipgramImplementing Skip-gram Negative Sampling with pytorch
Stars: ✭ 39 (-83.19%)
Top2vecTop2Vec learns jointly embedded topic, document and word vectors.
Stars: ✭ 972 (+318.97%)
LftmImproving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Stars: ✭ 168 (-27.59%)
Ml ProjectsML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
Stars: ✭ 127 (-45.26%)
Philo2vecAn implementation of word2vec applied to [stanford philosophy encyclopedia](http://plato.stanford.edu/)
Stars: ✭ 33 (-85.78%)
Hash EmbeddingsPyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
Stars: ✭ 126 (-45.69%)
WordnetembeddingsObtaining word embeddings from a WordNet ontology
Stars: ✭ 33 (-85.78%)
Few Shot Text ClassificationFew-shot binary text classification with Induction Networks and Word2Vec weights initialization
Stars: ✭ 32 (-86.21%)
GemsecThe TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Stars: ✭ 210 (-9.48%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-18.97%)
DanmfA sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Stars: ✭ 161 (-30.6%)
ServenetService Classification based on Service Description
Stars: ✭ 21 (-90.95%)
Syntree2vecAn algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-96.12%)
Entity2recentity2rec generates item recommendation using property-specific knowledge graph embeddings
Stars: ✭ 159 (-31.47%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-90.09%)
BagofconceptsPython implementation of bag-of-concepts
Stars: ✭ 18 (-92.24%)
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+240.52%)