Gpt2: PyTorch Implementation of OpenAI GPT-2
Stars: ✭ 64 (+93.94%)
Nlg Eval: Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+2390.91%)
Nonautoreggenprogress: Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+257.58%)
Textaugmentation Gpt2: Fine-tuned pre-trained GPT2 for custom topic-specific text generation. Such a system can be used for text augmentation.
Stars: ✭ 104 (+215.15%)
Gluon Nlp: NLP made easy
Stars: ✭ 2,344 (+7003.03%)
Languagetoys: Random fun with statistical language models.
Stars: ✭ 63 (+90.91%)
Nndial: NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 332 (+906.06%)
Practical Pytorch: Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
Stars: ✭ 4,329 (+13018.18%)
Ludwig: Data-centric declarative deep learning framework
Stars: ✭ 8,018 (+24196.97%)
Nlg Rl: Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Stars: ✭ 59 (+78.79%)
Transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+168815.15%)
Rnnlg: RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 487 (+1375.76%)
Pplm: Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.
Stars: ✭ 674 (+1942.42%)
Biolitmap: Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-45.45%)
Nlp tutorials: Overview of NLP tools and techniques in Python
Stars: ✭ 14 (-57.58%)
Neuralparser: NeuralParser is a very simple-to-use dependency parser, based on the Latent Syntactic Structure encoding.
Stars: ✭ 17 (-48.48%)
String To Tree Nmt: Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"
Stars: ✭ 16 (-51.52%)
Rte Speech Generator: Natural Language Processing to generate new speeches for the President of Turkey.
Stars: ✭ 22 (-33.33%)
Node Api.ai: [DEPRECATED] Ultimate Node.JS SDK for api.ai
Stars: ✭ 12 (-63.64%)
Awesome Ai Ml Dl: Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources on these topics.
Stars: ✭ 831 (+2418.18%)
Ciphey: ⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Stars: ✭ 9,116 (+27524.24%)
Kts linguistics: Spellcheck, phonetics, text processing and more
Stars: ✭ 18 (-45.45%)
Events: Repository for *SEM Paper on Event Coreference Resolution in ECB+
Stars: ✭ 20 (-39.39%)
Riceteacatpanda: Repo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-45.45%)
Entity Recognition Datasets: A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+2600%)
Twitter Bot: 👻 Markov chain-based Japanese Twitter bot
Stars: ✭ 12 (-63.64%)
Mesimp: Code for "Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method"
Stars: ✭ 16 (-51.52%)
Acl18 results: Code to reproduce results in our ACL 2018 paper "Did the Model Understand the Question?"
Stars: ✭ 31 (-6.06%)
Lightning Bolts: Toolbox of models, callbacks, and datasets for AI/ML researchers.
Stars: ✭ 829 (+2412.12%)
Language: Shared repository for open-sourced projects from the Google AI Language team.
Stars: ✭ 860 (+2506.06%)
Rex: REx: Relation Extraction. Modernized rewrite of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, and Probabalistic First-Order Logic"
Stars: ✭ 21 (-36.36%)
Pke: Python Keyphrase Extraction module
Stars: ✭ 855 (+2490.91%)
Underthesea: Underthesea - Vietnamese NLP Toolkit
Stars: ✭ 823 (+2393.94%)
Spago: Self-contained Machine Learning and Natural Language Processing library in Go
Stars: ✭ 854 (+2487.88%)
Pororo: PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+2360.61%)
Spacy Models: 💫 Models for the spaCy Natural Language Processing (NLP) library
Stars: ✭ 796 (+2312.12%)
Typing Assistant: Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster and more intelligent, and reduces effort.
Stars: ✭ 32 (-3.03%)
Rdrpostagger: R package for Ripple Down Rules-based Part-Of-Speech Tagging (RDRPOS), available for more than 45 languages.
Stars: ✭ 31 (-6.06%)
Knowledge Graphs: A collection of research on knowledge graphs
Stars: ✭ 845 (+2460.61%)
Torchmoji: 😇 A PyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm, etc.
Stars: ✭ 795 (+2309.09%)
Nlp In Practice: Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+2293.94%)
Vds: Verteego Data Suite
Stars: ✭ 9 (-72.73%)
Coursera: Quiz & Assignment of Coursera
Stars: ✭ 774 (+2245.45%)
Spyder: Official repository for Spyder - The Scientific Python Development Environment
Stars: ✭ 6,712 (+20239.39%)
Bpemb: Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Stars: ✭ 909 (+2654.55%)
Syntree2vec: An algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-72.73%)
Jcseg: Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, and summary extraction based on the TEXTRANK algorithm. Jcseg has a built-in HTTP server and search modules for the latest Lucene, Solr, and Elasticsearch.
Stars: ✭ 754 (+2184.85%)
Youtokentome: Unsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (+2106.06%)
Elyra: Elyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+2442.42%)
Ecco: Visualize and explore NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2).
Stars: ✭ 723 (+2090.91%)
Text2vec: Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+2066.67%)
Punny captions: An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".
Stars: ✭ 31 (-6.06%)