Koan: A word2vec negative sampling implementation with correct CBOW update.
Stars: ✭ 232 (+1060%)
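The "correct CBOW update" Koan refers to is the gradient of the averaged context vector being shared across the context words (divided by the context size) rather than applied in full to each one. A minimal NumPy sketch of one CBOW negative-sampling step under that interpretation (all names and sizes here are illustrative, not Koan's actual code):

```python
import numpy as np

rng = np.random.default_rng(0)
dim, vocab = 8, 20
W_in = rng.normal(scale=0.1, size=(vocab, dim))   # input (context) embeddings
W_out = rng.normal(scale=0.1, size=(vocab, dim))  # output embeddings

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cbow_ns_step(context_ids, target_id, negative_ids, lr=0.1):
    """One CBOW step with negative sampling; the input-side gradient
    is divided by the number of context words (the 'correct' update)."""
    h = W_in[context_ids].mean(axis=0)            # average of context vectors
    grad_h = np.zeros(dim)
    for wid, label in [(target_id, 1.0)] + [(n, 0.0) for n in negative_ids]:
        g = sigmoid(W_out[wid] @ h) - label       # dL/dscore for this sample
        grad_h += g * W_out[wid]
        W_out[wid] -= lr * g * h
    # share the gradient across context words instead of applying it in full
    W_in[context_ids] -= lr * grad_h / len(context_ids)

def loss(context_ids, target_id, negative_ids):
    """Negative-sampling loss for one (context, target) example."""
    h = W_in[context_ids].mean(axis=0)
    l = -np.log(sigmoid(W_out[target_id] @ h))
    for n in negative_ids:
        l -= np.log(sigmoid(-W_out[n] @ h))
    return l
```

Repeating `cbow_ns_step` on a fixed example should monotonically shrink `loss` for a small learning rate, which is a quick sanity check on the gradient.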
Datastories Semeval2017 Task4: Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (+820%)
word-benchmarks: Benchmarks for intrinsic evaluation of word embeddings.
Stars: ✭ 45 (+125%)
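A typical intrinsic benchmark of this kind is the word-similarity task: correlate the model's cosine similarities with human ratings using Spearman's rank correlation. A self-contained sketch (not this benchmark's own code; the function names and toy data are hypothetical):

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def spearman(a, b):
    """Spearman rank correlation of two score lists (assumes no ties):
    Pearson correlation computed on the ranks."""
    ra = np.argsort(np.argsort(a)).astype(float)  # double argsort -> ranks
    rb = np.argsort(np.argsort(b)).astype(float)
    ra -= ra.mean()
    rb -= rb.mean()
    return float(ra @ rb / np.sqrt((ra @ ra) * (rb @ rb)))

def evaluate(emb, pairs):
    """Score a word-similarity dataset of (word1, word2, human_rating)
    triples against an embedding lookup table `emb`."""
    model = [cosine(emb[w1], emb[w2]) for w1, w2, _ in pairs]
    human = [r for _, _, r in pairs]
    return spearman(model, human)
```

A score of 1.0 means the embeddings rank all pairs exactly as the human raters did; real benchmarks report values well below that.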
HiCE: Code for the ACL'19 paper "Few-Shot Representation Learning for Out-Of-Vocabulary Words".
Stars: ✭ 56 (+180%)
Elmo Tutorial: A short tutorial on ELMo training (pre-trained models, training on new data, incremental training).
Stars: ✭ 145 (+625%)
compress-fasttext: Tools for shrinking fastText models (in gensim format).
Stars: ✭ 124 (+520%)
Shallowlearn: An experiment in re-implementing supervised learning models based on shallow neural-network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with scikit-learn.
Stars: ✭ 196 (+880%)
QuestionClustering: A question classifier written in Python 3, implemented in the following video: https://youtu.be/qnlW1m6lPoY
Stars: ✭ 15 (-25%)
Lftm: Improving the topic models LDA and DMM (a one-topic-per-document model for short texts) with word embeddings (TACL 2015).
Stars: ✭ 168 (+740%)
S-WMD: Code for Supervised Word Mover's Distance (SWMD).
Stars: ✭ 90 (+350%)
two-stream-cnn: A two-stream convolutional neural network for learning arbitrary similarity functions over two sets of training data.
Stars: ✭ 24 (+20%)
Scattertext: Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+8510%)
JoSH: [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding.
Stars: ✭ 55 (+175%)
Pytorch Sentiment Analysis: Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+15945%)
lda2vec: Mixing Dirichlet topic models and word embeddings to make lda2vec, from the paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+35%)
Question Generation: Generating multiple-choice questions from text using machine learning.
Stars: ✭ 227 (+1035%)
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference.
Stars: ✭ 62 (+210%)
Germanwordembeddings: Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets.
Stars: ✭ 189 (+845%)
NTUA-slp-nlp: 💻 Speech and Natural Language Processing (SLP & NLP) lab assignments for ECE NTUA.
Stars: ✭ 19 (-5%)
Debiaswe: Remove problematic gender bias from word embeddings.
Stars: ✭ 175 (+775%)
dasem: Danish semantic analysis.
Stars: ✭ 17 (-15%)
Mimick: Code for "Mimicking Word Embeddings using Subword RNNs" (EMNLP 2017).
Stars: ✭ 152 (+660%)
SiameseCBOW: Implementation of Siamese CBOW using Keras with a TensorFlow backend.
Stars: ✭ 14 (-30%)
fuzzymax: Code for the paper "Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors" (ICLR 2019).
Stars: ✭ 43 (+115%)
Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions.
Stars: ✭ 14 (-30%)
Dna2vec: Consistent vector representations of variable-length k-mers.
Stars: ✭ 117 (+485%)
wikidata-corpus: Train word2vec on a Wikidata corpus for word-embedding tasks.
Stars: ✭ 109 (+445%)
Active-Explainable-Classification: A set of tools for leveraging pre-trained embeddings, active learning, and model explainability for efficient document classification.
Stars: ✭ 28 (+40%)
Spanish Word Embeddings: Spanish word embeddings computed with different methods and from different corpora.
Stars: ✭ 236 (+1080%)
Word-recognition-EmbedNet-CAB: Code implementation for our ICPR 2020 paper "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings".
Stars: ✭ 19 (-5%)
Wordgcn: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks (ACL 2019).
Stars: ✭ 230 (+1050%)
Chameleon recsys: Source code of CHAMELEON, a deep-learning meta-architecture for news recommender systems.
Stars: ✭ 202 (+910%)
Jfasttext: Java interface for fastText.
Stars: ✭ 193 (+865%)
Vec4ir: Word embeddings for information retrieval.
Stars: ✭ 188 (+840%)
Naive-Resume-Matching: Text similarity applied to resumes, comparing them with job descriptions and producing a score to rank them, similar to an ATS (applicant tracking system).
Stars: ✭ 27 (+35%)
Texthero: Text preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+11935%)
contextualLSTM: Contextual LSTM for NLP tasks such as word prediction and word-embedding creation in deep learning.
Stars: ✭ 28 (+40%)
Sifrank zh: Chinese keyphrase extraction based on pre-trained language models (the Chinese-language code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model").
Stars: ✭ 175 (+775%)
word2vec-tsne: Google News and Leo Tolstoy: visualizing word2vec word embeddings using t-SNE.
Stars: ✭ 59 (+195%)
Gensim: Topic modelling for humans.
Stars: ✭ 12,763 (+63715%)
word2vec-on-wikipedia: A pipeline for training word embeddings with word2vec on a Wikipedia corpus.
Stars: ✭ 68 (+240%)
PersianNER: Named-entity recognition for the Persian language.
Stars: ✭ 48 (+140%)
Hash Embeddings: PyTorch implementation of hash embeddings (NIPS 2017); a submission to the NIPS Implementation Challenge.
Stars: ✭ 126 (+530%)
SIFRank: The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model".
Stars: ✭ 96 (+380%)
wefe: WEFE, the Word Embeddings Fairness Evaluation Framework, standardizes bias measurement and mitigation in word-embedding models. Feel free to open an issue if you have questions, or a pull request if you want to contribute to the project.
Stars: ✭ 164 (+720%)
codenames: Codenames AI using word vectors.
Stars: ✭ 41 (+105%)
context2vec: PyTorch implementation of context2vec from Melamud et al. (CoNLL 2016).
Stars: ✭ 18 (-10%)
datastories-semeval2017-task6: Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (+0%)
sister: SImple SenTence EmbeddeR.
Stars: ✭ 66 (+230%)