Zeroth: Kaldi-based open-source Korean ASR (speech recognition) project
Mead Baseline: Deep-learning model exploration and development for NLP
Xlnet Zh: Pre-trained Chinese XLNet-Large model
Pytorch Nce: Noise-contrastive estimation for softmax output, written in PyTorch
Attention Mechanisms: Implementations of a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras
Gpt Scrolls: A collaborative collection of open-source, safe GPT-3 prompts that work well
Char Rnn Chinese: Multi-layer recurrent neural networks (LSTM, GRU, RNN) for character-level language models in Torch. Based on https://github.com/karpathy/char-rnn; supports Chinese and more
Nlp Learning: Learning natural language processing (NLP) with Python: language models, HMM, PCFG, Word2vec, cloze-style reading comprehension, naive Bayes classifier, TF-IDF, PCA, SVD
Keras Bert: Implementation of BERT that can load the official pre-trained models for feature extraction and prediction
Optimus: The first large-scale pre-trained VAE language model
Macbert: Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP)
Gpt Neo: An implementation of model-parallel GPT-2- and GPT-3-like models, with the ability to scale up to full GPT-3 size (and possibly beyond), using the mesh-tensorflow library
Indic Bert: BERT-based multilingual model for Indian languages
Lazynlp: Library to scrape and clean web pages to create massive datasets
Lotclass: [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach
Keras Xlnet: Implementation of XLNet that can load pre-trained checkpoints
Transformer Lm: Transformer language model (GPT-2) with a SentencePiece tokenizer
Speecht: Open-source speech-to-text software written in TensorFlow
Electra Pytorch: Pretrain and fine-tune ELECTRA with fastai and Hugging Face (the paper's results are replicated!)
Awd Lstm Lm: LSTM and QRNN language model toolkit for PyTorch
Ld Net: Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Tupe: Transformer with Untied Positional Encoding (TUPE). Code for the paper "Rethinking Positional Encoding in Language Pre-training"; improves existing models such as BERT
Clue: Chinese Language Understanding Evaluation benchmark: datasets, baselines, pre-trained models, corpora, and leaderboard
Electra: Pre-trained Chinese ELECTRA model, based on adversarial learning
Chars2vec: Character-based, RNN-backed word-embedding model for handling real-world texts
Robbert: A Dutch RoBERTa-based language model
Haystack: 🔍 An open-source NLP framework that leverages Transformer models, enabling developers to implement production-ready neural search, question answering, semantic document search, and summarization for a wide range of applications
Lingo: Package lingo provides the data structures and algorithms required for natural language processing
Getlang: Natural-language detection package in pure Go
Transformers: 🤗 State-of-the-art machine learning for PyTorch, TensorFlow, and JAX
Easy Bert: A dead-simple BERT API for Python and Java (https://github.com/google-research/bert)
Openseq2seq: Toolkit for efficient experimentation with speech recognition, text-to-speech, and NLP
Pytorch Gbw Lm: PyTorch language model for the 1-Billion-Word (LM1B / GBW) dataset
Pyclue: Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark
Tongrams: A C++ library providing fast language-model queries in compressed space
Bit Rnn: Quantize weights and activations in recurrent neural networks
Pytorch Openai Transformer Lm: 🐥 A PyTorch implementation of OpenAI's fine-tuned transformer language model, with a script to import the weights pre-trained by OpenAI
Greek Bert: A Greek edition of the BERT pre-trained language model
Cross Domain Ner: Cross-domain NER using cross-domain language modeling; code for an ACL 2019 paper
Gpt2: PyTorch implementation of OpenAI GPT-2
Phonlp: PhoNLP, a BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition, and dependency parsing (NAACL 2021)