All Projects → chinhsuanwu → mobilevit-pytorch

chinhsuanwu / mobilevit-pytorch

Licence: MIT license
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to mobilevit-pytorch

SReT
Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
Stars: ✭ 51 (-85.39%)
Mutual labels:  vit, vision-transformer
PASSL
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,BEiT,MAE等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法
Stars: ✭ 134 (-61.6%)
Mutual labels:  vit, vision-transformer
pytorch-vit
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Stars: ✭ 250 (-28.37%)
Mutual labels:  vit, vision-transformer
LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Stars: ✭ 1,566 (+348.71%)
Mutual labels:  vit, vision-transformer
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Stars: ✭ 821 (+135.24%)
Mutual labels:  vit, vision-transformer
PLSC
Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, DeiT, FaceViT.
Stars: ✭ 113 (-67.62%)
Mutual labels:  vit
FFCSThingy2.0
A course scheduling tool for FFCS in VIT, Vellore. Easily adaptable to any schedule/timetable. https://discord.com/invite/Un4UanH
Stars: ✭ 15 (-95.7%)
Mutual labels:  vit
Centermulti
基于CenterNet训练的目标检测&人脸对齐&姿态估计模型
Stars: ✭ 240 (-31.23%)
Mutual labels:  mobilenetv2
Lightnetplusplus
LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation
Stars: ✭ 218 (-37.54%)
Mutual labels:  mobilenetv2
transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
Stars: ✭ 201 (-42.41%)
Mutual labels:  vision-transformer
keras-vision-transformer
The Tensorflow, Keras implementation of Swin-Transformer and Swin-UNET
Stars: ✭ 91 (-73.93%)
Mutual labels:  vision-transformer
Visual-Transformer-Paper-Summary
Summary of Transformer applications for computer vision tasks.
Stars: ✭ 51 (-85.39%)
Mutual labels:  vit
pytorch-cifar-model-zoo
Implementation of Conv-based and Vit-based networks designed for CIFAR.
Stars: ✭ 62 (-82.23%)
Mutual labels:  vision-transformer
VT-UNet
[MICCAI2022] This is an official PyTorch implementation for A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation
Stars: ✭ 151 (-56.73%)
Mutual labels:  vision-transformer
Mnasnet Pytorch
A PyTorch implementation of Mnasnet: MnasNet: Platform-Aware Neural Architecture Search for Mobile.
Stars: ✭ 250 (-28.37%)
Mutual labels:  mobilenetv2
libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Stars: ✭ 284 (-18.62%)
Mutual labels:  vision-transformer
Facerecognize For Mobile Phone
适用于移动端的人脸识别模型,计算量与mobilefacenet相同,但megaface上提升了2%+
Stars: ✭ 229 (-34.38%)
Mutual labels:  mobilenetv2
HugsVision
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Stars: ✭ 154 (-55.87%)
Mutual labels:  vit
DevSoc21
Official website for DEVSOC 21, our annual flagship hackathon.
Stars: ✭ 15 (-95.7%)
Mutual labels:  vit
cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (-91.69%)
Mutual labels:  vit

MobileViT

Overview

This is a PyTorch implementation of MobileViT specified in "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer", arXiv 2021.

img

👉 Check out CoAtNet if you are interested in other Convolution + Transformer models.

Usage

import torch
from mobilevit import mobilevit_xxs

img = torch.randn(1, 3, 256, 256)
vit = mobilevit_xxs()
out = vit(img)

Citation

@article{mehta2021mobilevit,
  title={MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer},
  author={Mehta, Sachin and Rastegari, Mohammad},
  journal={arXiv preprint arXiv:2110.02178},
  year={2021}
}

Credits

Code adapted from MobileNetV2 and ViT.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].