All Categories → Machine Learning → language-model

Top 153 language-model open source projects

Tner
Language model finetuning on NER with an easy interface, and cross-domain evaluation. We released NER models finetuned on various domain via huggingface model hub.
Suggest
Top-k Approximate String Matching.
Lmchallenge
A library & tools to evaluate predictive language models.
Nlp Library
curated collection of papers for the nlp practitioner 📖👩‍🔬
Boilerplate Dynet Rnn Lm
Boilerplate code for quickly getting set up to run language modeling experiments
Bert language understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Chinese Electra
Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)
Lm Lstm Crf
Empower Sequence Labeling with Task-Aware Language Model
Lightnlp
基于Pytorch和torchtext的自然语言处理深度学习框架。
Keras Language Modeling
📖 Some language modeling tools for Keras
Kobert
Korean BERT pre-trained cased (KoBERT)
Awesome Bert Nlp
A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
Sentiment analysis fine grain
Multi-label Classification with BERT; Fine Grained Sentiment Analysis from AI challenger
Albert pytorch
A Lite Bert For Self-Supervised Learning Language Representations
Ctcdecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Bert Pytorch
Google AI 2018 BERT pytorch implementation
Ctcwordbeamsearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Tf chatbot seq2seq antilm
Seq2seq chatbot with attention and anti-language model to suppress generic response, option for further improve by deep reinforcement learning.
Kogpt2
Korean GPT-2 pretrained cased (KoGPT2)
Azureml Bert
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Gpt Neox
An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
Xlnet Pytorch
An implementation of Google Brain's 2019 XLNet in PyTorch
Transfer Nlp
NLP library designed for reproducible experimentation management
Bertweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
Bluebert
BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
A Pytorch Tutorial To Sequence Labeling
Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling
few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
python-arpa
🐍 Python library for n-gram models in ARPA format
SDLM-pytorch
Code accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
minicons
Utility for analyzing Transformer based representations of language.
pyVHDLParser
Streaming based VHDL parser.
MinTL
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
tying-wv-and-wc
Implementation for "Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling"
CodeT5
Code for CodeT5: a new code-aware pre-trained encoder-decoder model.
61-120 of 153 language-model projects