Top 1910 NLP open source projects

Electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
Vmf vae nlp
Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"
Ie Survey
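A survey of the information extraction field by Zhang Richong's research team at Beihang University's Advanced Innovation Center for Big Data. It covers subtasks such as entity recognition, relation extraction, and attribute extraction, and surveys both academia and industry for each subtask.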
Segmentit
A Chinese word segmentation package usable in any JS environment, forked from leizongmin/node-segment
Parselawdocuments
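A series of analyses on collected legal documents, including automatic segmentation according to a specification, case similarity computation, case clustering, and recommendation of legal provisions (experiments are currently based on marriage cases and can be extended to other domains).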
Question Pairs Matching
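12th-place solution for the 3rd Mojing Cup (魔镜杯) competition on question-similarity algorithm design for intelligent customer service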
Mnemonicreader
A PyTorch implementation of Mnemonic Reader for the Machine Comprehension task
Hubot Natural
Natural Language Processing Chatbot for RocketChat
Deeplearningfornlpinpytorch
An IPython Notebook tutorial on deep learning for natural language processing, including structure prediction.
Turkish Word2vec
Pre-trained Word2Vec Model for Turkish
Guide To Swift Strings Sample Code
Xcode Playground Sample Code for the Flight School Guide to Swift Strings
Medcat
Medical Concept Annotation Tool
Finbert
BERT for Finance: UC Berkeley MIDS w266 Final Project
Jprocessing
Japanese Natural Language Processing Libraries
Rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Question Answering
TensorFlow implementation of Match-LSTM and Answer pointer for the popular SQuAD dataset.
Scattertext Pydata
Notebooks for the Seattle PyData 2017 talk on Scattertext
Snowball
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
Awesome Bert
BERT NLP papers, applications, and GitHub resources, including the newest XLNet papers and projects
Nlp estimator tutorial
Educational material on using the TensorFlow Estimator framework for text classification
Prenlp
Preprocessing Library for Natural Language Processing
Whatthelang
Lightning Fast Language Prediction 🚀
Konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Kaggle Quora Dup
Solution to Kaggle's Quora Duplicate Question Detection Competition
Id Cnn Cws
Source code and corpora for the paper "Iterated Dilated Convolutions for Chinese Word Segmentation"
Abstractive Summarization
Implementation of abstractive summarization using LSTM in the encoder-decoder architecture with local attention.
Ml Projects
ML-based projects such as Spam Classification, Time Series Analysis, and Text Classification using Random Forest, Deep Learning, Bayesian methods, and Xgboost in Python
Bnlp
BNLP is a natural language processing toolkit for Bengali Language.
Ajax Movie Recommendation System With Sentiment Analysis
A content-based recommender system that recommends movies similar to a movie the user likes and analyzes the sentiment of the reviews given by the user for that movie.
Rdrpostagger
A fast and accurate POS and morphological tagging toolkit (EACL 2014)
Neuro
🔮 Neuro.js is a machine learning library for building AI assistants and chatbots (WIP).
Fusionnet Nli
An example for applying FusionNet to Natural Language Inference
Hash Embeddings
PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
Nlpmetrics
Python code for various NLP metrics
Matilda
LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)
Laserembeddings
LASER multilingual sentence embeddings as a pip package
Fugashi
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Awesome Data Science Viz
💥 📈 A curated list of data science, analysis and visualization tools
Camel tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Cutlet
Japanese to romaji converter in Python
Kogpt2 Finetuning
🔥 Korean GPT-2, KoGPT2 FineTuning cased. Trained on Korean lyrics data 🔥
Spammessage
Chinese spam SMS detection (hand-written classifier)
Spacy Dev Resources
💫 Scripts, tools and resources for developing spaCy
Fnc 1 Baseline
A baseline implementation for FNC-1
Spacy Js
🎀 JavaScript API for spaCy with Python REST API
Syntok
Text tokenization and sentence segmentation (segtok v2)
Stog
AMR Parsing as Sequence-to-Graph Transduction
Python autocomplete
Use Transformers and LSTMs to learn Python source code
Files2rouge
Calculating ROUGE score between two files (line-by-line)