All Categories → Machine Learning → language-model

Top 153 language-model open source projects

cscg
Code Generation as a Dual Task of Code Summarization.
CoLAKE
COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding
gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
minGPT-TF
A minimal TF2 re-implementation of the OpenAI GPT training
wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
subword-lstm-lm
LSTM Language Model with Subword Units Input Representations
dasher-web
Dasher text entry in HTML, CSS, JavaScript, and SVG
LanguageModel-using-Attention
Pytorch implementation of a basic language model using Attention in LSTM network
swig-srilm
SWIG Wrapper for the SRILM toolkit
lm-scorer
📃Language Model based sentences scoring library
gap-text2sql
GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
personality-prediction
Experiments for automated personality detection using Language Models and psycholinguistic features on various famous personality datasets including the Essays dataset (Big-Five)
CharLM
Character-aware Neural Language Model implemented by PyTorch
calm
Context Aware Language Models
KB-ALBERT
KB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델
Vaaku2Vec
Language Modeling and Text Classification in Malayalam Language using ULMFiT
rnn-theano
RNN(LSTM, GRU) in Theano with mini-batch training; character-level language models in Theano
pd3f
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
TF-NNLM-TK
A toolkit for neural language modeling using Tensorflow including basic models like RNNs and LSTMs as well as more advanced models.
PLBART
Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].
121-153 of 153 language-model projects