gdcCode for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation"
Stars: ✭ 94 (+422.22%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+2166.67%)
Kenlg ReadingReading list for knowledge-enhanced text generation, with a survey
Stars: ✭ 257 (+1327.78%)
miniconsUtility for analyzing Transformer based representations of language.
Stars: ✭ 28 (+55.56%)
BertweetBERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
Stars: ✭ 282 (+1466.67%)
Kogpt2Korean GPT-2 pretrained cased (KoGPT2)
Stars: ✭ 368 (+1944.44%)
MinTLMinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Stars: ✭ 61 (+238.89%)
gpt-jA GPT-J API to use with python3 to generate text, blogs, code, and more
Stars: ✭ 101 (+461.11%)
Xlnet PytorchAn implementation of Google Brain's 2019 XLNet in PyTorch
Stars: ✭ 304 (+1588.89%)
Albert pytorchA Lite Bert For Self-Supervised Learning Language Representations
Stars: ✭ 539 (+2894.44%)
SimplenlgJava API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.
Stars: ✭ 708 (+3833.33%)
Practical PytorchGo to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
Stars: ✭ 4,329 (+23950%)
uctfUnsupervised Controllable Text Generation (Applied to text Formalization)
Stars: ✭ 19 (+5.56%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+4100%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+1977.78%)
tying-wv-and-wcImplementation for "Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling"
Stars: ✭ 39 (+116.67%)
KobertKorean BERT pre-trained cased (KoBERT)
Stars: ✭ 591 (+3183.33%)
FNet-pytorchUnofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Stars: ✭ 204 (+1033.33%)
Gpt NeoxAn implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
Stars: ✭ 303 (+1583.33%)
DebertaThe implementation of DeBERTa
Stars: ✭ 541 (+2905.56%)
Transfer NlpNLP library designed for reproducible experimentation management
Stars: ✭ 287 (+1494.44%)
Tc BotUser Simulation for Task-Completion Dialogues
Stars: ✭ 733 (+3972.22%)
BluebertBlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
Stars: ✭ 273 (+1416.67%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+2838.89%)
Accelerated TextAccelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data is converted to textual descriptions varying in wording and structure.
Stars: ✭ 256 (+1322.22%)
Lm Lstm CrfEmpower Sequence Labeling with Task-Aware Language Model
Stars: ✭ 778 (+4222.22%)
few-shot-lmThe source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
Stars: ✭ 32 (+77.78%)
Tokenizers💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Stars: ✭ 5,077 (+28105.56%)
python-arpa🐍 Python library for n-gram models in ARPA format
Stars: ✭ 35 (+94.44%)
Chatito🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (+3666.67%)
JasoAI자소AI - KoGPT2를 이용한 자기소개서 도우미 프로젝트
Stars: ✭ 48 (+166.67%)
Bert PytorchGoogle AI 2018 BERT pytorch implementation
Stars: ✭ 4,642 (+25688.89%)
SDLM-pytorchCode accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
Stars: ✭ 27 (+50%)
Nlg EvalEvaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+4466.67%)
syntaxmakerThe NLG tool for Finnish
Stars: ✭ 19 (+5.56%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+2111.11%)
pyVHDLParserStreaming based VHDL parser.
Stars: ✭ 51 (+183.33%)
Dl Nlp ReadingsMy Reading Lists of Deep Learning and Natural Language Processing
Stars: ✭ 656 (+3544.44%)
PlanGenPlan-then-Generate: Controlled Data-to-Text Generation via Planning (EMNLP 2021)
Stars: ✭ 49 (+172.22%)
Tf chatbot seq2seq antilmSeq2seq chatbot with attention and anti-language model to suppress generic response, option for further improve by deep reinforcement learning.
Stars: ✭ 369 (+1950%)
CodeT5Code for CodeT5: a new code-aware pre-trained encoder-decoder model.
Stars: ✭ 390 (+2066.67%)
Lightnlp基于Pytorch和torchtext的自然语言处理深度学习框架。
Stars: ✭ 739 (+4005.56%)
Word-Prediction-NgramNext Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Stars: ✭ 25 (+38.89%)
language-plannerOfficial Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (+366.67%)
Awesome Bert NlpA curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
Stars: ✭ 567 (+3050%)
Azureml BertEnd-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Stars: ✭ 342 (+1800%)
Chinese ElectraPre-trained Chinese ELECTRA(中文ELECTRA预训练模型)
Stars: ✭ 830 (+4511.11%)
Chatbot cn基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
Stars: ✭ 791 (+4294.44%)
Nlp chinese corpus大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+36877.78%)