fuzzymaxCode for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
Stars: ✭ 43 (+38.71%)
sanic-wtfSanic meets WTForms
Stars: ✭ 24 (-22.58%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+532.26%)
LftmImproving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Stars: ✭ 168 (+441.94%)
KoanA word2vec negative sampling implementation with correct CBOW update.
Stars: ✭ 232 (+648.39%)
Chameleon recsysSource code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
Stars: ✭ 202 (+551.61%)
Datastories Semeval2017 Task4Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (+493.55%)
contextualLSTMContextual LSTM for NLP tasks like word prediction and word embedding creation for Deep Learning
Stars: ✭ 28 (-9.68%)
json-headJSON microservice for performing HEAD requests
Stars: ✭ 31 (+0%)
Elmo TutorialA short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
Stars: ✭ 145 (+367.74%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+5454.84%)
Pytorch Sentiment AnalysisTutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+10251.61%)
PersianNERNamed-Entity Recognition in Persian Language
Stars: ✭ 48 (+54.84%)
Question GenerationGenerating multiple choice questions from text using Machine Learning.
Stars: ✭ 227 (+632.26%)
sanic-adminsanic-admin is a command line tool for automatically restarting sanic.
Stars: ✭ 15 (-51.61%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (+509.68%)
ethereumd-proxyProxy client-server for Ethereum node using bitcoin JSON-RPC interface.
Stars: ✭ 21 (-32.26%)
DebiasweRemove problematic gender bias from word embeddings.
Stars: ✭ 175 (+464.52%)
MimickCode for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Stars: ✭ 152 (+390.32%)
two-stream-cnnA two-stream convolutional neural network for learning abitrary similarity functions over two sets of training data
Stars: ✭ 24 (-22.58%)
Fasttext.jsFastText for Node.js
Stars: ✭ 127 (+309.68%)
word2vec-on-wikipediaA pipeline for training word embeddings using word2vec on wikipedia corpus.
Stars: ✭ 68 (+119.35%)
HiCECode for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"
Stars: ✭ 56 (+80.65%)
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+35593.55%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (+248.39%)
paitPython Modern API Tools, fast to code
Stars: ✭ 24 (-22.58%)
sanic-extExtended Sanic functionality
Stars: ✭ 26 (-16.13%)
Spanish Word EmbeddingsSpanish word embeddings computed with different methods and from different corpora
Stars: ✭ 236 (+661.29%)
wink-nlpDeveloper friendly Natural Language Processing ✨
Stars: ✭ 312 (+906.45%)
WordgcnACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (+641.94%)
Resume-RaterRates the quality of a candidate based on his/her resume using unsupervised approaches
Stars: ✭ 65 (+109.68%)
compress-fasttextTools for shrinking fastText models (in gensim format)
Stars: ✭ 124 (+300%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (+241.94%)
JfasttextJava interface for fastText
Stars: ✭ 193 (+522.58%)
wefeWEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Stars: ✭ 164 (+429.03%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (+506.45%)
word-benchmarksBenchmarks for intrinsic word embeddings evaluation.
Stars: ✭ 45 (+45.16%)
TextheroText preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+7664.52%)
sisterSImple SenTence EmbeddeR
Stars: ✭ 66 (+112.9%)
Sifrank zh基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Stars: ✭ 175 (+464.52%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+41070.97%)
Word2VecfJavaWord2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions
Stars: ✭ 14 (-54.84%)
dasemDanish Semantic analysis
Stars: ✭ 17 (-45.16%)
Metis测试题小程序 包含后端api接口 可能会改成gitbook应用了吧
Stars: ✭ 79 (+154.84%)
Hash EmbeddingsPyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
Stars: ✭ 126 (+306.45%)
pair2vecpair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
Stars: ✭ 62 (+100%)
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (+277.42%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (+258.06%)
S-WMDCode for Supervised Word Mover's Distance (SWMD)
Stars: ✭ 90 (+190.32%)
JoSH[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Stars: ✭ 55 (+77.42%)
Active-Explainable-ClassificationA set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification
Stars: ✭ 28 (-9.68%)
warbleNative Linux word-guessing game built in Vala and Gtk for elementary OS
Stars: ✭ 82 (+164.52%)
pynaivechainPython implementation of naivechain project
Stars: ✭ 18 (-41.94%)