word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-85.81%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+32.43%)
Nlp兜哥出品 <一本开源的NLP入门书籍>
Stars: ✭ 1,677 (+1033.11%)
EmbeddingEmbedding模型代码和学习笔记总结
Stars: ✭ 25 (-83.11%)
Nlp researchNLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/序列标注/文本生成 四大任务
Stars: ✭ 141 (-4.73%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+841.89%)
Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Stars: ✭ 1,290 (+771.62%)
Lmdb EmbeddingsFast word vectors with little memory usage in Python
Stars: ✭ 404 (+172.97%)
WordvectorsPre-trained word vectors of 30+ languages
Stars: ✭ 2,043 (+1280.41%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+8523.65%)
Embedding As ServiceOne-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Stars: ✭ 151 (+2.03%)
NLP-paper🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/
Stars: ✭ 23 (-84.46%)
Cw2veccw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Stars: ✭ 224 (+51.35%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+1063.51%)
Half SizeCode for "Effective Dimensionality Reduction for Word Embeddings".
Stars: ✭ 89 (-39.86%)
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-20.95%)
Cw2vec基于字符训练词向量
Stars: ✭ 80 (-45.95%)
Transorthogonal LinguisticsUses a distributed word representation to finds words along the hyperchord of two input words.
Stars: ✭ 93 (-37.16%)
Dict2vecDict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-38.51%)
Mlsourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
Stars: ✭ 136 (-8.11%)
CvtkCVTK, a Computer Vision ToolKit.
Stars: ✭ 119 (-19.59%)
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-48.65%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-42.57%)
Text Classification DemosNeural models for Text Classification in Tensorflow, such as cnn, dpcnn, fasttext, bert ...
Stars: ✭ 144 (-2.7%)
Ja.text8Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-46.62%)
Role2vecA scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (-9.46%)
MusaeThe reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-49.32%)
TgcontestTelegram Data Clustering contest solution by Mindful Squirrel
Stars: ✭ 74 (-50%)
AsneA sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Stars: ✭ 73 (-50.68%)
Sense2vec🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+700%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-52.03%)
Skip Thoughts.torchPorting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (-1.35%)
Word2vecGo library for performing computations in word2vec binary models
Stars: ✭ 143 (-3.38%)
Awesome Embedding ModelsA curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (+904.05%)
Convai Bot 1337NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Stars: ✭ 65 (-56.08%)
Kor2vecLibrary for Korean morpheme and word vector representation
Stars: ✭ 64 (-56.76%)
Numpy MlMachine learning, in numpy
Stars: ✭ 11,100 (+7400%)
Deeplearning Nlp ModelsA small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.
Stars: ✭ 64 (-56.76%)
Repo 2017Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+658.78%)
Scattertext PydataNotebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-10.81%)
TextclfTextClf :基于Pytorch/Sklearn的文本分类框架,包括逻辑回归、SVM、TextCNN、TextRNN、TextRCNN、DRNN、DPCNN、Bert等多种模型,通过简单配置即可完成数据处理、模型训练、测试等过程。
Stars: ✭ 105 (-29.05%)
ChiveJapanese word embedding with Sudachi and NWJC 🌿
Stars: ✭ 63 (-57.43%)
Fasttext NodeNode wrapper around FastText Library
Stars: ✭ 58 (-60.81%)
Fasttext.pyA Python interface for Facebook fastText
Stars: ✭ 1,091 (+637.16%)
Word2vec訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Stars: ✭ 48 (-67.57%)
WhatthelangLightning Fast Language Prediction 🚀
Stars: ✭ 130 (-12.16%)
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-30.41%)