EmbeddingsvizVisualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Stars: ✭ 40 (-67.74%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+1024.19%)
FastrtextR wrapper for fastText
Stars: ✭ 103 (-16.94%)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-83.06%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+10192.74%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+58.06%)
BiosentvecBioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (+148.39%)
Pytorch Sentiment AnalysisTutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+2487.9%)
wefeWEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Stars: ✭ 164 (+32.26%)
WordgcnACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (+85.48%)
Chameleon recsysSource code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
Stars: ✭ 202 (+62.9%)
JfasttextJava interface for fastText
Stars: ✭ 193 (+55.65%)
word2vec-on-wikipediaA pipeline for training word embeddings using word2vec on wikipedia corpus.
Stars: ✭ 68 (-45.16%)
actions-suggest-related-linksA GitHub Action to suggest related or similar issues, documents, and links. Based on the power of NLP and fastText.
Stars: ✭ 23 (-81.45%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (+51.61%)
TextheroText preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+1841.13%)
Word2VecfJavaWord2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions
Stars: ✭ 14 (-88.71%)
Sifrank zh基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Stars: ✭ 175 (+41.13%)
LftmImproving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Stars: ✭ 168 (+35.48%)
S-WMDCode for Supervised Word Mover's Distance (SWMD)
Stars: ✭ 90 (-27.42%)
fastchessPredicts the best chess move with 27.5% accuracy by a single matrix multiplication
Stars: ✭ 75 (-39.52%)
KoanA word2vec negative sampling implementation with correct CBOW update.
Stars: ✭ 232 (+87.1%)
Question GenerationGenerating multiple choice questions from text using Machine Learning.
Stars: ✭ 227 (+83.06%)
dasemDanish Semantic analysis
Stars: ✭ 17 (-86.29%)
fasttext-servingServe your fastText models for text classification and word vectors
Stars: ✭ 21 (-83.06%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (+52.42%)
NLP-paper🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/
Stars: ✭ 23 (-81.45%)
Datastories Semeval2017 Task4Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (+48.39%)
sisterSImple SenTence EmbeddeR
Stars: ✭ 66 (-46.77%)
DebiasweRemove problematic gender bias from word embeddings.
Stars: ✭ 175 (+41.13%)
german-sentimentA data set and model for german sentiment classification.
Stars: ✭ 37 (-70.16%)
two-stream-cnnA two-stream convolutional neural network for learning abitrary similarity functions over two sets of training data
Stars: ✭ 24 (-80.65%)
Hash EmbeddingsPyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
Stars: ✭ 126 (+1.61%)
MimickCode for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Stars: ✭ 152 (+22.58%)
Elmo TutorialA short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
Stars: ✭ 145 (+16.94%)
christmAIsText to abstract art generation for the holidays!
Stars: ✭ 90 (-27.42%)
fasttext-serverlessServerless hashtag recommendations using fastText and Python with AWS Lambda
Stars: ✭ 20 (-83.87%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+1288.71%)
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-5.65%)
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+8823.39%)
word-benchmarksBenchmarks for intrinsic word embeddings evaluation.
Stars: ✭ 45 (-63.71%)
PersianNERNamed-Entity Recognition in Persian Language
Stars: ✭ 48 (-61.29%)
HiCECode for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"
Stars: ✭ 56 (-54.84%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-10.48%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-12.9%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-14.52%)
ungoliant🕷️ The pipeline for the OSCAR corpus
Stars: ✭ 69 (-44.35%)
fasttextjsJavaScript implementation of the FastText prediction algorithm
Stars: ✭ 31 (-75%)
Text SummarizerPython Framework for Extractive Text Summarization
Stars: ✭ 96 (-22.58%)