AravecAraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
GemsecThe TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
SplitterA Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
GensimTopic Modelling for Humans
WebvectorsWeb-ify your word2vec: framework to serve distributional semantic models online
Role2vecA scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Ml ProjectsML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
Diff2vecReference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
MagnitudeA fast, efficient universal vector embedding utility package.
Doc2vec📓 Long(er) text representation and classification using Doc2Vec embeddings
Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
MusaeThe reference implementation of "Multi-scale Attributed Node Embedding".
SineA PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Word2vec訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Crime AnalysisAssociation Rule Mining from Spatial Data for Crime Analysis
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Gensim DataData repository for pretrained NLP models and NLP corpora.
Adam qasADAM - A Question Answering System. Inspired from IBM Watson
resume tailorAn unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
wordfish-pythonextract relationships from standardized terms from corpus of interest with deep learning 🐟
hcnHybrid Code Networks https://arxiv.org/abs/1702.03274
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
word2vec-pt-brImplementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br
10 days of deep learning10 days 10 different practical applications of Deep Learning (primarily NLP) using Tensorflow and Keras
RolXAn alternative implementation of Recursive Feature and Role Extraction (KDD11 & KDD12)
walkletsA lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
biovecProtVec can be used in protein interaction predictions, structure prediction, and protein data visualization.
nlpbuddyA text analysis application for performing common NLP tasks through a web dashboard interface and an API
nlp workshop odsc europe20Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
doc2vec-apidocument embedding and machine learning script for beginners
Word2VecAndTsneScripts demo-ing how to train a Word2Vec model and reduce its vector space
FUTUREA private, free, open-source search engine built on a P2P network