NemoNeMo: a toolkit for conversational AI
MelusineMelusine is a high-level library for emails classification and feature extraction "dédiée aux courriels français".
DeepehrChronic Disease Prediction Using Medical Notes
SarahTerminal Assistant For SemiCode OS
Chatbot一个可以自己进行训练的中文聊天机器人, 根据自己的语料训练出自己想要的聊天机器人,可以用于智能客服、在线问答、智能聊天等场景。目前包含seq2seq、seqGAN版本、tf2.0版本、pytorch版本。
Datastories Semeval2017 Task4Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
KtextUtilities for preprocessing text for deep learning with Keras
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Awesome Nlp PolishA curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Hands On Natural Language Processing With PythonThis repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Onnxt5Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
LazyLazy, AI chatbot service.
Seq2seq tutorialCode For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Dl TextText pre-processing library for deep learning (Keras, tensorflow).
G Reader2018年机器阅读理解技术竞赛模型,国内外1000多支队伍中BLEU-4评分排名第6, ROUGE-L评分排名第14。(未ensemble,未嵌入训练好的词向量,无dropout)
Lingopackage lingo provides the data structures and algorithms required for natural language processing
AtnreAdversarial Training for Neural Relation Extraction
LemminflectA python module for English lemmatization and inflection.
Textaugmentation Gpt2Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Question GenerationGiven a sentence automatically generate reading comprehension style factual questions from that sentence, such that the sentence contains answers to those questions.
Monkeylearn⛔️ ARCHIVED ⛔️ 🐒 R package for text analysis with Monkeylearn 🐒
Wiki SplitOne million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
DatascienceIt consists of examples, assignments discussed in data science course taken at algorithmica.
Doc2vec📓 Long(er) text representation and classification using Doc2Vec embeddings
SummarusModels for automatic abstractive summarization
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Gec PseudodataRepository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
News push projectReal Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers