
2169 open source projects that are alternatives to or similar to Gensim

Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (-89.08%)
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-99.66%)
Mutual labels:  data-science, data-mining, word2vec, gensim
Shallowlearn
An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and a nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (-98.46%)
Mutual labels:  word2vec, word-embeddings, fasttext, gensim
Scattertext
Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (-86.51%)
Text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (-94.4%)
Vec4ir
Word Embeddings for Information Retrieval
Stars: ✭ 188 (-98.53%)
Germanwordembeddings
Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets
Stars: ✭ 189 (-98.52%)
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-99.29%)
Nlp Journey
Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (-89.89%)
Mutual labels:  word2vec, fasttext, gensim
Fasttext.js
FastText for Node.js
Stars: ✭ 127 (-99%)
Mutual labels:  word2vec, word-embeddings, fasttext
word embedding
Sample code for training Word2Vec and FastText on a wiki corpus, along with their pretrained word embeddings.
Stars: ✭ 21 (-99.84%)
Mutual labels:  word2vec, word-embeddings, fasttext
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-99.86%)
Lmdb Embeddings
Fast word vectors with little memory usage in Python
Stars: ✭ 404 (-96.83%)
Mutual labels:  word2vec, fasttext, gensim
Data Science Toolkit
Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-98.68%)
Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-99.76%)
Mutual labels:  word2vec, topic-modeling, gensim
SWDM
SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
Stars: ✭ 35 (-99.73%)
ml-nlp-services
Machine learning, deep learning, natural language processing
Stars: ✭ 23 (-99.82%)
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (-93.81%)
Wordembeddings Elmo Fasttext Word2vec
Using pre-trained word embeddings (Fasttext, Word2Vec)
Stars: ✭ 146 (-98.86%)
Mutual labels:  word2vec, fasttext, gensim
Pytorch Sentiment Analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (-74.86%)
Text mining resources
Resources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (-97.2%)
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-99.79%)
Simple-Sentence-Similarity
Exploring the simple sentence similarity measurements using word embeddings
Stars: ✭ 99 (-99.22%)
Mutual labels:  word2vec, word-embeddings, fasttext
Book Socialmediaminingpython
Companion code for the book "Mastering Social Media Mining with Python"
Stars: ✭ 462 (-96.38%)
Biosentvec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (-97.59%)
How To Mine Newsfeed Data And Extract Interactive Insights In Python
A practical guide to topic mining and interactive visualizations
Stars: ✭ 61 (-99.52%)
Sense2vec
🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (-90.72%)
Dex
Dex: The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-90.3%)
Mutual labels:  data-science, data-mining
Ja.text8
Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-99.38%)
Glove As A Tensorflow Embedding Layer
Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-99.33%)
Mutual labels:  word2vec, word-embeddings
Ml
A high-level machine learning and deep learning library for the PHP language.
Stars: ✭ 1,270 (-90.05%)
Tsv Utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (-90.48%)
Mutual labels:  data-science, data-mining
Topic Modeling Tool
A point-and-click tool for creating and analyzing topic models produced by MALLET.
Stars: ✭ 85 (-99.33%)
Mutual labels:  data-science, topic-modeling
Vvedenie Mashinnoe Obuchenie
📝 A collection of machine learning resources
Stars: ✭ 1,282 (-89.96%)
Mutual labels:  data-science, data-mining
Forte
Forte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-99.3%)
Applied Ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+39.65%)
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (-99.41%)
Mutual labels:  data-science, data-mining
Tageditor
🏖 TagEditor - Annotation tool for spaCy
Stars: ✭ 92 (-99.28%)
Dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-99.29%)
Mutual labels:  word2vec, word-embeddings
Postgres Word2vec
Utilities for using word embeddings such as word2vec vectors in a PostgreSQL database
Stars: ✭ 96 (-99.25%)
Mutual labels:  word2vec, word-embeddings
D2l En
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Stars: ✭ 11,837 (-7.26%)
Papers Literature Ml Dl Rl Ai
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (-89.49%)
Mutual labels:  data-science, data-mining
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-99.22%)
Mutual labels:  data-science, data-mining
Fastrtext
R wrapper for fastText
Stars: ✭ 103 (-99.19%)
Mutual labels:  word-embeddings, fasttext
Jupyterlab Prodigy
🧬 A JupyterLab extension for annotating data with Prodigy
Stars: ✭ 97 (-99.24%)
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-89.2%)
Repo 2016
R, Python and Mathematica code for Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-99.19%)
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (-13.03%)
Mutual labels:  word2vec, topic-modeling
Easy Bert
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-99.17%)
Webvectors
Web-ify your word2vec: framework to serve distributional semantic models online
Stars: ✭ 154 (-98.79%)
Mutual labels:  word2vec, gensim
Musae
The reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-99.41%)
Mutual labels:  word2vec, gensim
Text Summarizer
Python Framework for Extractive Text Summarization
Stars: ✭ 96 (-99.25%)
Mutual labels:  word2vec, word-embeddings
Kadot
Kadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-99.15%)
Allennlp
An open-source NLP research library, built on PyTorch.
Stars: ✭ 10,699 (-16.17%)
Awesome Embedding Models
A curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (-88.36%)
Flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (-13.3%)
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (-88.12%)
Dna2vec
dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-99.08%)
Mutual labels:  word2vec, word-embeddings
Cogcomp Nlpy
CogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-99.1%)
Embedding As Service
One-stop solution for encoding sentences into fixed-length vectors using various embedding techniques
Stars: ✭ 151 (-98.82%)
Mutual labels:  word2vec, fasttext