Top 30 vision-transformer open source projects

Multi-Modal-Transformer
This repository collects multi-modal transformer architectures, including image, video, image-language, and video-language transformers, along with related datasets. It also collects useful tutorials and tools in these domains.
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
Splice
Official PyTorch implementation of "Splicing ViT Features for Semantic Appearance Transfer", presenting "Splice" (CVPR 2022)
MPViT
MPViT: Multi-Path Vision Transformer for Dense Prediction (CVPR 2022)
PASSL
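PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, BEiT, and MAE, as well as fundamental vision algorithms such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.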
koclip
KoCLIP: Korean port of OpenAI CLIP, in Flax
pytorch-vit
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Evo-ViT
Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
SReT
Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
ImageNet21K
Official PyTorch implementation of the paper "ImageNet-21K Pretraining for the Masses" (NeurIPS 2021)
mobilevit-pytorch
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".
transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
keras-vision-transformer
TensorFlow/Keras implementation of Swin-Transformer and Swin-UNET