DebertaThe implementation of DeBERTa
Stars: ✭ 541 (+2905.56%)
CoLAKECOLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding
Stars: ✭ 86 (+377.78%)
Transfer NlpNLP library designed for reproducible experimentation management
Stars: ✭ 287 (+1494.44%)
gpt-j-apiAPI for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
Stars: ✭ 248 (+1277.78%)
Tc BotUser Simulation for Task-Completion Dialogues
Stars: ✭ 733 (+3972.22%)
Black-Box-TuningICML'2022: Black-Box Tuning for Language-Model-as-a-Service
Stars: ✭ 99 (+450%)
BluebertBlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
Stars: ✭ 273 (+1416.67%)
minGPT-TFA minimal TF2 re-implementation of the OpenAI GPT training
Stars: ✭ 36 (+100%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+2838.89%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+16.67%)
Accelerated TextAccelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data is converted to textual descriptions varying in wording and structure.
Stars: ✭ 256 (+1322.22%)
subword-lstm-lmLSTM Language Model with Subword Units Input Representations
Stars: ✭ 45 (+150%)
Lm Lstm CrfEmpower Sequence Labeling with Task-Aware Language Model
Stars: ✭ 778 (+4222.22%)
backpropBackprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Stars: ✭ 229 (+1172.22%)
few-shot-lmThe source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
Stars: ✭ 32 (+77.78%)
Tokenizers💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Stars: ✭ 5,077 (+28105.56%)
question generatorAn NLP system for generating reading comprehension questions
Stars: ✭ 188 (+944.44%)
python-arpa🐍 Python library for n-gram models in ARPA format
Stars: ✭ 35 (+94.44%)
Chatito🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (+3666.67%)
swig-srilmSWIG Wrapper for the SRILM toolkit
Stars: ✭ 33 (+83.33%)
JasoAI자소AI - KoGPT2를 이용한 자기소개서 도우미 프로젝트
Stars: ✭ 48 (+166.67%)
gap-text2sqlGAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
Stars: ✭ 83 (+361.11%)
Bert PytorchGoogle AI 2018 BERT pytorch implementation
Stars: ✭ 4,642 (+25688.89%)
CharLMCharacter-aware Neural Language Model implemented by PyTorch
Stars: ✭ 32 (+77.78%)
SDLM-pytorchCode accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
Stars: ✭ 27 (+50%)
recipe-personalizationEMNLP 2019: Generating Personalized Recipes from Historical User Preferences
Stars: ✭ 48 (+166.67%)
Nlg EvalEvaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+4466.67%)
KB-ALBERTKB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델
Stars: ✭ 215 (+1094.44%)
syntaxmakerThe NLG tool for Finnish
Stars: ✭ 19 (+5.56%)
transformer-drg-style-transferThis repository have scripts and Jupyter-notebooks to perform all the different steps involved in Transforming Delete, Retrieve, Generate Approach for Controlled Text Style Transfer
Stars: ✭ 97 (+438.89%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+2111.11%)
pd3f🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
Stars: ✭ 132 (+633.33%)
pyVHDLParserStreaming based VHDL parser.
Stars: ✭ 51 (+183.33%)
TF-NNLM-TKA toolkit for neural language modeling using Tensorflow including basic models like RNNs and LSTMs as well as more advanced models.
Stars: ✭ 20 (+11.11%)
Dl Nlp ReadingsMy Reading Lists of Deep Learning and Natural Language Processing
Stars: ✭ 656 (+3544.44%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+1277.78%)
PlanGenPlan-then-Generate: Controlled Data-to-Text Generation via Planning (EMNLP 2021)
Stars: ✭ 49 (+172.22%)
Relational Rnn PytorchAn implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.
Stars: ✭ 236 (+1211.11%)
Tf chatbot seq2seq antilmSeq2seq chatbot with attention and anti-language model to suppress generic response, option for further improve by deep reinforcement learning.
Stars: ✭ 369 (+1950%)
Pytorch NceThe Noise Contrastive Estimation for softmax output written in Pytorch
Stars: ✭ 204 (+1033.33%)
CodeT5Code for CodeT5: a new code-aware pre-trained encoder-decoder model.
Stars: ✭ 390 (+2066.67%)
LingvoLingvo
Stars: ✭ 2,361 (+13016.67%)
Lightnlp基于Pytorch和torchtext的自然语言处理深度学习框架。
Stars: ✭ 739 (+4005.56%)
Gpt ScrollsA collaborative collection of open-source safe GPT-3 prompts that work well
Stars: ✭ 195 (+983.33%)
Word-Prediction-NgramNext Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Stars: ✭ 25 (+38.89%)
Bert As Language Modelbert as language model, fork from https://github.com/google-research/bert
Stars: ✭ 185 (+927.78%)
language-plannerOfficial Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (+366.67%)
Keras BertImplementation of BERT that could load official pre-trained models for feature extraction and prediction
Stars: ✭ 2,264 (+12477.78%)
Awesome Bert NlpA curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
Stars: ✭ 567 (+3050%)
Chinese ElectraPre-trained Chinese ELECTRA(中文ELECTRA预训练模型)
Stars: ✭ 830 (+4511.11%)
Chatbot cn基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
Stars: ✭ 791 (+4294.44%)
Nlp chinese corpus大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+36877.78%)
TrankitTrankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Stars: ✭ 311 (+1627.78%)