Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Stars: ✭ 1,290 (+783.56%)
Lmdb EmbeddingsFast word vectors with little memory usage in Python
Stars: ✭ 404 (+176.71%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+854.79%)
Embedding As ServiceOne-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Stars: ✭ 151 (+3.42%)
NLP-paper🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/
Stars: ✭ 23 (-84.25%)
ServenetService Classification based on Service Description
Stars: ✭ 21 (-85.62%)
Nlp researchNLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/序列标注/文本生成 四大任务
Stars: ✭ 141 (-3.42%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+8641.78%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+34.25%)
doc2vec-apidocument embedding and machine learning script for beginners
Stars: ✭ 92 (-36.99%)
Ml ProjectsML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
Stars: ✭ 127 (-13.01%)
biovecProtVec can be used in protein interaction predictions, structure prediction, and protein data visualization.
Stars: ✭ 23 (-84.25%)
nlpbuddyA text analysis application for performing common NLP tasks through a web dashboard interface and an API
Stars: ✭ 115 (-21.23%)
Role2vecA scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (-8.22%)
word2vec-pt-brImplementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br
Stars: ✭ 34 (-76.71%)
EmbeddingEmbedding模型代码和学习笔记总结
Stars: ✭ 25 (-82.88%)
WegoWord Embeddings (e.g. Word2Vec) in Go!
Stars: ✭ 336 (+130.14%)
walkletsA lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Stars: ✭ 94 (-35.62%)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-85.62%)
PycadlPython package with source code from the course "Creative Applications of Deep Learning w/ TensorFlow"
Stars: ✭ 356 (+143.84%)
Word2VecAndTsneScripts demo-ing how to train a Word2Vec model and reduce its vector space
Stars: ✭ 45 (-69.18%)
word-embeddings-from-scratchCreating word embeddings from scratch and visualize them on TensorBoard. Using trained embeddings in Keras.
Stars: ✭ 22 (-84.93%)
FUTUREA private, free, open-source search engine built on a P2P network
Stars: ✭ 19 (-86.99%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+389.73%)
navecCompact high quality word embeddings for Russian language
Stars: ✭ 118 (-19.18%)
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+441.1%)
Text classificationall kinds of text classification models and more with deep learning
Stars: ✭ 7,179 (+4817.12%)
text classifierTensorflow2.3的文本分类项目,支持各种分类模型,支持相关tricks。
Stars: ✭ 135 (-7.53%)
10 days of deep learning10 days 10 different practical applications of Deep Learning (primarily NLP) using Tensorflow and Keras
Stars: ✭ 28 (-80.82%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-79.45%)
RolXAn alternative implementation of Recursive Feature and Role Extraction (KDD11 & KDD12)
Stars: ✭ 52 (-64.38%)
wordfish-pythonextract relationships from standardized terms from corpus of interest with deep learning 🐟
Stars: ✭ 19 (-86.99%)
Natural Language ProcessingProgramming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
Stars: ✭ 377 (+158.22%)
AravecAraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Stars: ✭ 239 (+63.7%)
Ngram2vecFour word embedding models implemented in Python. Supporting arbitrary context features
Stars: ✭ 703 (+381.51%)
MagpieDeep neural network framework for multi-label text classification
Stars: ✭ 622 (+326.03%)
Nlp兜哥出品 <一本开源的NLP入门书籍>
Stars: ✭ 1,677 (+1048.63%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-70.55%)
EmbeddingsvizVisualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Stars: ✭ 40 (-72.6%)
Word2vec訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Stars: ✭ 48 (-67.12%)
TgcontestTelegram Data Clustering contest solution by Mindful Squirrel
Stars: ✭ 74 (-49.32%)
Sense2vec🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+710.96%)
MusaeThe reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-48.63%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-51.37%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-41.78%)
Half SizeCode for "Effective Dimensionality Reduction for Word Embeddings".
Stars: ✭ 89 (-39.04%)
GemsecThe TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Stars: ✭ 210 (+43.84%)
Cw2veccw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Stars: ✭ 224 (+53.42%)