Magnitude - A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+670.17%)
Ml - sourced.ml is a library and a set of command-line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
Stars: ✭ 136 (-24.86%)
Nlp - By Dou Ge: an open-source introductory book on NLP
Stars: ✭ 1,677 (+826.52%)
Sense2vec - 🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+554.14%)
Experiments - Some research experiments
Stars: ✭ 95 (-47.51%)
Webvectors - Web-ify your word2vec: framework to serve distributional semantic models online
Stars: ✭ 154 (-14.92%)
Ja.text8 - Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-56.35%)
Scattertext Pydata - Notebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-27.07%)
Repo 2017 - Python code for Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+520.44%)
Fasttext4j - Implementing Facebook's FastText with Java
Stars: ✭ 148 (-18.23%)
Numpy Ml - Machine learning, in numpy
Stars: ✭ 11,100 (+6032.6%)
Gensim - Topic Modelling for Humans
Stars: ✭ 12,763 (+6951.38%)
Text Summarizer - Python Framework for Extractive Text Summarization
Stars: ✭ 96 (-46.96%)
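Nlp research - NLP research: TensorFlow-based deep learning projects for NLP, supporting four tasks: text classification, sentence matching, sequence labeling, and text generation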
Stars: ✭ 141 (-22.1%)
Wordvectors - Pre-trained word vectors of 30+ languages
Stars: ✭ 2,043 (+1028.73%)
Glove As A Tensorflow Embedding Layer - Taking a pretrained GloVe model and using it as a TensorFlow embedding weight layer **inside the GPU**, so that only the word indices need to be sent over the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-53.04%)
Role2vec - A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (-25.97%)
Musae - The reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-58.56%)
Graphwavemachine - A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets" (KDD 2018).
Stars: ✭ 151 (-16.57%)
Kor2vec - Library for Korean morpheme and word vector representation
Stars: ✭ 64 (-64.64%)
Ml Projects - ML-based projects such as spam classification, time series analysis, and text classification, using Random Forest, deep learning, Bayesian methods, and XGBoost in Python
Stars: ✭ 127 (-29.83%)
Dna2vec - dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-35.36%)
Chive - Japanese word embedding with Sudachi and NWJC 🌿
Stars: ✭ 63 (-65.19%)
Textfeatures👷♂️ A simple package for extracting useful features from character objects 👷♀️
Stars: ✭ 148 (-18.23%)
Entity2rec - entity2rec generates item recommendations using property-specific knowledge graph embeddings
Stars: ✭ 159 (-12.15%)
Awesome Embedding Models - A curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (+720.99%)
Skip Thoughts.torch - Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (-19.34%)
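Textclf - TextClf: a text classification framework based on PyTorch/Sklearn, including logistic regression, SVM, TextCNN, TextRNN, TextRCNN, DRNN, DPCNN, BERT and other models; data processing, model training, and testing can all be done through simple configuration.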
Stars: ✭ 105 (-41.99%)
Deep Math Machine Learning.ai - A blog about machine learning and deep learning algorithms and the math behind them, with machine learning algorithms written from scratch.
Stars: ✭ 173 (-4.42%)
Repo 2016 - R, Python and Mathematica code for Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-43.09%)
Word2vec - Go library for performing computations in word2vec binary models
Stars: ✭ 143 (-20.99%)
Postgres Word2vec - Utils to use word embeddings such as word2vec vectors in a Postgres database
Stars: ✭ 96 (-46.96%)
Text2vec - text2vec, Chinese text to vector (a text vectorization tool covering word vectorization, sentence vectorization, and sentence similarity computation)
Stars: ✭ 155 (-14.36%)
Transorthogonal Linguistics - Uses a distributed word representation to find words along the hyperchord of two input words.
Stars: ✭ 93 (-48.62%)
Word2vec - A further wrapper around Word2VEC_java written by ansj, which also implements common word similarity and sentence similarity computations.
Stars: ✭ 136 (-24.86%)
Dict2vec - Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-49.72%)
Splitter - A PyTorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Stars: ✭ 177 (-2.21%)
Nlp Journey - Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+612.71%)
Cw2vec - Character-based training of word vectors
Stars: ✭ 80 (-55.8%)
Skip Gram Pytorch - A complete PyTorch implementation of skip-gram
Stars: ✭ 153 (-15.47%)
Russian news corpus - Russian mass media stemmed texts corpus: a corpus of lemmatized (morphologically normalized) texts from Russian mass media
Stars: ✭ 76 (-58.01%)
Asne - A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Stars: ✭ 73 (-59.67%)
Log Anomaly Detector - Log Anomaly Detection: machine learning to detect abnormal event logs
Stars: ✭ 169 (-6.63%)
Vectorsinsearch - Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015.
Stars: ✭ 71 (-60.77%)
Deeplearning Nlp Models - A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.
Stars: ✭ 64 (-64.64%)
Embedding As Service - One-stop solution to encode sentences into fixed-length vectors using various embedding techniques
Stars: ✭ 151 (-16.57%)
Debiaswe - Remove problematic gender bias from word embeddings.
Stars: ✭ 175 (-3.31%)
Danmf - A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Stars: ✭ 161 (-11.05%)
Scattertext - Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+851.38%)