An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with scikit-learn.

Stars: ✭ 196 (+58.06%)

Mutual labels: word-embeddings, fasttext

Biosentvec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Stars: ✭ 308 (+148.39%)

Mutual labels: word-embeddings, fasttext

Pytorch Sentiment Analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Stars: ✭ 3,209 (+2487.9%)

Mutual labels: word-embeddings, fasttext

wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!

Stars: ✭ 164 (+32.26%)

Mutual labels: word-embeddings

Wordgcn

ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks

Stars: ✭ 230 (+85.48%)

Mutual labels: word-embeddings

Chameleon recsys

Source code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems

Stars: ✭ 202 (+62.9%)

Mutual labels: word-embeddings

Jfasttext

Java interface for fastText

Stars: ✭ 193 (+55.65%)

Mutual labels: word-embeddings

word2vec-on-wikipedia

A pipeline for training word embeddings using word2vec on a Wikipedia corpus.

Stars: ✭ 68 (-45.16%)

Mutual labels: word-embeddings

actions-suggest-related-links

A GitHub Action to suggest related or similar issues, documents, and links. Based on the power of NLP and fastText.

Stars: ✭ 23 (-81.45%)

Mutual labels: fasttext

Vec4ir

Word Embeddings for Information Retrieval

Stars: ✭ 188 (+51.61%)

Mutual labels: word-embeddings

Texthero

Text preprocessing, representation and visualization from zero to hero.

Stars: ✭ 2,407 (+1841.13%)

Mutual labels: word-embeddings

Word2VecfJava

Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions

Stars: ✭ 14 (-88.71%)

Mutual labels: word-embeddings

Sifrank zh

A Chinese keyphrase extraction method based on pre-trained models (Chinese-language code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model")

Stars: ✭ 175 (+41.13%)

Mutual labels: word-embeddings

Lftm

Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)

Stars: ✭ 168 (+35.48%)

Mutual labels: word-embeddings

Arabic-Word-Embeddings-Word2vec

Arabic Word Embeddings Word2vec

Stars: ✭ 26 (-79.03%)

Mutual labels: word-embeddings

S-WMD

Code for Supervised Word Mover's Distance (SWMD)

Stars: ✭ 90 (-27.42%)

Mutual labels: word-embeddings

fastchess

Predicts the best chess move with 27.5% accuracy by a single matrix multiplication

Stars: ✭ 75 (-39.52%)

Mutual labels: fasttext

Awesome Sentence Embedding

A curated list of pretrained sentence and word embedding models

Stars: ✭ 1,973 (+1491.13%)

Mutual labels: word-embeddings

Koan

A word2vec negative sampling implementation with correct CBOW update.

Stars: ✭ 232 (+87.1%)

Mutual labels: word-embeddings

Word-Embeddings-and-Document-Vectors

An evaluation of word-embeddings for classification

Stars: ✭ 32 (-74.19%)

Mutual labels: fasttext-embeddings

Question Generation

Generating multiple choice questions from text using Machine Learning.

Stars: ✭ 227 (+83.06%)

Mutual labels: word-embeddings

dasem

Danish Semantic analysis

Stars: ✭ 17 (-86.29%)

Mutual labels: word-embeddings

Spherical Text Embedding

[NeurIPS 2019] Spherical Text Embedding

Stars: ✭ 143 (+15.32%)

Mutual labels: word-embeddings

fasttext-serving

Serve your fastText models for text classification and word vectors

Stars: ✭ 21 (-83.06%)

Mutual labels: fasttext

Germanwordembeddings

Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets

Stars: ✭ 189 (+52.42%)

Mutual labels: word-embeddings

NLP-paper

🎨 🎨 NLP (natural language processing) tutorial 🎨🎨 https://dataxujing.github.io/NLP-paper/

Stars: ✭ 23 (-81.45%)

Mutual labels: fasttext

Datastories Semeval2017 Task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".

Stars: ✭ 184 (+48.39%)

Mutual labels: word-embeddings

sister

SImple SenTence EmbeddeR

Stars: ✭ 66 (-46.77%)

Mutual labels: word-embeddings

Debiaswe

Remove problematic gender bias from word embeddings.

Stars: ✭ 175 (+41.13%)

Mutual labels: word-embeddings

german-sentiment

A data set and model for German sentiment classification.

Stars: ✭ 37 (-70.16%)

Mutual labels: fasttext

two-stream-cnn

A two-stream convolutional neural network for learning arbitrary similarity functions over two sets of training data

Stars: ✭ 24 (-80.65%)

Mutual labels: word-embeddings

Text Classification TF

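Implements a variety of text classification models in TensorFlow and wraps them in a RESTful interface, ready for direct production use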

Stars: ✭ 32 (-74.19%)

Mutual labels: fasttext

Hash Embeddings

PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.

Stars: ✭ 126 (+1.61%)

Mutual labels: word-embeddings

Mimick

Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)

Stars: ✭ 152 (+22.58%)

Mutual labels: word-embeddings

fasttext-serving

fastText model serving service

Stars: ✭ 54 (-56.45%)

Mutual labels: fasttext

Elmo Tutorial

A short tutorial on ELMo training (pre-trained, training on new data, incremental training)

Stars: ✭ 145 (+16.94%)

Mutual labels: word-embeddings

christmAIs

Text to abstract art generation for the holidays!

Stars: ✭ 90 (-27.42%)

Mutual labels: fasttext-embeddings

fasttext-serverless

Serverless hashtag recommendations using fastText and Python with AWS Lambda

Stars: ✭ 20 (-83.87%)

Mutual labels: fasttext

overview-and-benchmark-of-traditional-and-deep-learning-models-in-text-classification

NLP tutorial

Stars: ✭ 41 (-66.94%)

Mutual labels: word-embeddings

Scattertext

Beautiful visualizations of how language differs among document types.

Stars: ✭ 1,722 (+1288.71%)

Mutual labels: word-embeddings

Dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

Stars: ✭ 117 (-5.65%)

Mutual labels: word-embeddings

Flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Stars: ✭ 11,065 (+8823.39%)

Mutual labels: word-embeddings

word-benchmarks

Benchmarks for intrinsic word embeddings evaluation.

Stars: ✭ 45 (-63.71%)

Mutual labels: word-embeddings

PersianNER

Named-Entity Recognition in Persian Language

Stars: ✭ 48 (-61.29%)

Mutual labels: word-embeddings

HiCE

Code for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"

Stars: ✭ 56 (-54.84%)

Mutual labels: word-embeddings

Danlp

DaNLP is a repository for Natural Language Processing resources for the Danish Language.

Stars: ✭ 111 (-10.48%)

Mutual labels: word-embeddings

Kadot

Kadot, the unsupervised natural language processing library.

Stars: ✭ 108 (-12.9%)

Mutual labels: word-embeddings

Easy Bert

A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)

Stars: ✭ 106 (-14.52%)

Mutual labels: word-embeddings

ungoliant

🕷️ The pipeline for the OSCAR corpus

Stars: ✭ 69 (-44.35%)

Mutual labels: fasttext

fasttextjs

JavaScript implementation of the FastText prediction algorithm

Stars: ✭ 31 (-75%)

Mutual labels: fasttext

Text Summarizer

Python Framework for Extractive Text Summarization

Stars: ✭ 96 (-22.58%)

Mutual labels: word-embeddings

MorphologicalPriorsForWordEmbeddings

Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings

Stars: ✭ 53 (-57.26%)

Mutual labels: word-embeddings

