Simple Transformers
Simple transformer implementations that I can understand.
Requirements
conda create -n transformer python=3.9
conda activate transformer
conda install -c pytorch -c conda-forge pytorch torchvision cudatoolkit=11.1
pip install -U homura-core chika rich
# For NLP, also install
pip install -U datasets tokenizers
# To use activation checkpointing
pip install -U fairscale
# To accelerate einsum operations (probably only slightly)
pip install -U opt_einsum
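As a rough sketch of what the optional fairscale dependency enables (the block below is a stand-in, not code from this repository), activation checkpointing is applied by wrapping a module:

import torch.nn as nn
from fairscale.nn import checkpoint_wrapper

# A stand-in feed-forward block; checkpoint_wrapper discards its
# intermediate activations and recomputes them during the backward
# pass, trading extra compute for lower memory use.
block = nn.Sequential(nn.Linear(512, 2048), nn.GELU(), nn.Linear(2048, 512))
block = checkpoint_wrapper(block)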
Examples
Language Modeling
GPT
Train GPT-like models on wikitext or GigaText. Currently, three Transformer block variants are available for comparison: improved pre-LN, pre-LN, and post-LN.
python gpt.py [--model.block {ipre_ln, pre_ln, post_ln}] [--amp]
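The variants differ in where layer normalization sits relative to the residual connections. The following is not this repository's implementation (and omits the improved pre-LN variant), just a minimal sketch of the pre-LN and post-LN orderings:

import torch
import torch.nn as nn

class Block(nn.Module):
    """Toy Transformer block. With pre_ln=True, inputs to each sub-layer
    are normalized; with pre_ln=False, normalization follows the
    residual addition, as in the original Transformer."""

    def __init__(self, dim: int, num_heads: int, pre_ln: bool = True):
        super().__init__()
        self.pre_ln = pre_ln
        self.ln1 = nn.LayerNorm(dim)
        self.ln2 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                                 nn.Linear(4 * dim, dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.pre_ln:
            h = self.ln1(x)
            x = x + self.attn(h, h, h, need_weights=False)[0]
            x = x + self.mlp(self.ln2(x))
        else:
            x = self.ln1(x + self.attn(x, x, x, need_weights=False)[0])
            x = self.ln2(x + self.mlp(x))
        return x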
BERT
Work in progress
Image Recognition
Train ImageNet classification models.
Currently, ViT and CaiT are implemented.
# single-process training
python vit.py [--amp] [--model.ema]
# multi-process training
python -m torch.distributed.launch --nproc_per_node=2 vit.py ...
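The --model.ema flag presumably maintains an exponential moving average (EMA) of the model weights for evaluation. A minimal sketch of a typical EMA update (not this repository's implementation; the decay value is illustrative):

import copy
import torch

@torch.no_grad()
def ema_update(ema_model, model, decay: float = 0.9999):
    # Blend each shadow parameter toward the current training parameter:
    # ema = decay * ema + (1 - decay) * param
    for e, p in zip(ema_model.parameters(), model.parameters()):
        e.mul_(decay).add_(p, alpha=1 - decay)

# usage: ema_model = copy.deepcopy(model), then call ema_update(ema_model, model)
# after every optimizer step.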
Acknowledgement
For this project, I learned a lot from Andrej Karpathy's minGPT, Ross Wightman's timm, and FAIR's ClassyVision.