Top 52 gensim open source projects

Aravec
AraVec is an open source project for pre-trained distributed word representations (word embeddings) that aims to provide the Arabic NLP research community with free-to-use, powerful word embedding models.
Gemsec
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Shallowlearn
An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with scikit-learn.
Germanwordembeddings
Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets
Splitter
A PyTorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Webvectors
Web-ify your word2vec: framework to serve distributional semantic models online
Wordembeddings Elmo Fasttext Word2vec
Using pre-trained word embeddings (fastText, Word2Vec)
Turkish Word2vec
Pre-trained Word2Vec Model for Turkish
Role2vec
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Ml Projects
ML-based projects such as Spam Classification, Time Series Analysis, and Text Classification using Random Forest, Deep Learning, Bayesian methods, and XGBoost in Python
Diff2vec
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Doc2vec
📓 Long(er) text representation and classification using Doc2Vec embeddings
Nlp Journey
Documents, papers and code related to Natural Language Processing, including Topic Modeling, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Musae
The reference implementation of "Multi-scale Attributed Node Embedding".
Gensim Doc Zh
Chinese documentation for gensim
Sine
A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Twitterldatopicmodeling
Uses topic modeling to identify context between follower relationships of Twitter users
Word2vec
Training Chinese word vectors with Word2vec. Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Nlp In Practice
Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with PySpark, simple text preprocessing, pre-trained embeddings and more.
Gensim Data
Data repository for pretrained NLP models and NLP corpora.
Word2vec Tutorial
Tutorial on training Chinese word vectors
Lmdb Embeddings
Fast word vectors with little memory usage in Python
Fast sentence embeddings
Compute Sentence Embeddings Fast!
resume tailor
An unsupervised analysis combining topic modeling and clustering to preserve an individual's work history and credentials while tailoring their resume towards a new career field
wordfish-python
Extract relationships from standardized terms in a corpus of interest with deep learning 🐟
pydataberlin-2017
Repo for my talk at the PyData Berlin 2017 conference
Product-Categorization-NLP
Multi-class text classification of products based on their descriptions, using machine learning algorithms and neural networks (MLP, CNN, DistilBERT).
word2vec-pt-br
Implementation and model generated by training (trigram) on the Brazilian Portuguese (pt-br) Wikipedia
10 days of deep learning
10 days, 10 different practical applications of Deep Learning (primarily NLP) using TensorFlow and Keras
biovec
ProtVec can be used in protein interaction predictions, structure prediction, and protein data visualization.
nlpbuddy
A text analysis application for performing common NLP tasks through a web dashboard interface and an API
nlp workshop odsc europe20
Extensive tutorials for the Advanced NLP Workshop at the Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular NLP tasks, including NER, Classification, Recommendation / Information Retrieval, Summarization, Language Translation, Q&A and T…
Financial-News-Analysis
China Merchants Bank FinTech competition, second round: financial news analysis
Word2VecAndTsne
Scripts demonstrating how to train a Word2Vec model and reduce the dimensionality of its vector space
word-embeddings-from-scratch
Creating word embeddings from scratch and visualizing them on TensorBoard. Using trained embeddings in Keras.