chinhsuanwu / mobilevit-pytorch

Licence: MIT license

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

Labels

vit mobilenetv2 vision-transformer mobilevit

Projects that are alternatives to or similar to mobilevit-pytorch

Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"

Stars: ✭ 51 (-85.39%)

Mutual labels: vit, vision-transformer

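PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, BEiT, and MAE, as well as basic vision algorithms such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2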

Stars: ✭ 134 (-61.6%)

Mutual labels: vit, vision-transformer

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Stars: ✭ 250 (-28.37%)

Mutual labels: vit, vision-transformer

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Stars: ✭ 1,566 (+348.71%)

Mutual labels: vit, vision-transformer

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Stars: ✭ 821 (+135.24%)

Mutual labels: vit, vision-transformer

Paddle Large Scale Classification Tools; supports ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel. Models include ResNet, ViT, DeiT, and FaceViT.

Stars: ✭ 113 (-67.62%)

Mutual labels: vit

A course scheduling tool for FFCS in VIT, Vellore. Easily adaptable to any schedule/timetable. https://discord.com/invite/Un4UanH

Stars: ✭ 15 (-95.7%)

Mutual labels: vit

Object detection, face alignment, and pose estimation models trained with CenterNet

Stars: ✭ 240 (-31.23%)

Mutual labels: mobilenetv2

Lightnetplusplus

LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation

Stars: ✭ 218 (-37.54%)

Mutual labels: mobilenetv2

Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).

Stars: ✭ 201 (-42.41%)

Mutual labels: vision-transformer

keras-vision-transformer

The Tensorflow, Keras implementation of Swin-Transformer and Swin-UNET

Stars: ✭ 91 (-73.93%)

Mutual labels: vision-transformer

Visual-Transformer-Paper-Summary

Summary of Transformer applications for computer vision tasks.

Stars: ✭ 51 (-85.39%)

Mutual labels: vit

pytorch-cifar-model-zoo

Implementation of Conv-based and ViT-based networks designed for CIFAR.

Stars: ✭ 62 (-82.23%)

Mutual labels: vision-transformer

[MICCAI2022] This is an official PyTorch implementation for A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

Stars: ✭ 151 (-56.73%)

Mutual labels: vision-transformer

Mnasnet Pytorch

A PyTorch implementation of MnasNet: Platform-Aware Neural Architecture Search for Mobile.

Stars: ✭ 250 (-28.37%)

Mutual labels: mobilenetv2

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Stars: ✭ 284 (-18.62%)

Mutual labels: vision-transformer

Facerecognize For Mobile Phone

A face recognition model for mobile devices, with the same computational cost as mobilefacenet but an improvement of more than 2% on MegaFace

Stars: ✭ 229 (-34.38%)

Mutual labels: mobilenetv2

HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision

Stars: ✭ 154 (-55.87%)

Mutual labels: vit

Official website for DEVSOC 21, our annual flagship hackathon.

Stars: ✭ 15 (-95.7%)

Mutual labels: vit

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-91.69%)

Mutual labels: vit

MobileViT

Overview

This is a PyTorch implementation of MobileViT as described in "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer", arXiv 2021.

👉 Check out CoAtNet if you are interested in other Convolution + Transformer models.

Usage

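import torch
from mobilevit import mobilevit_xxs

# Dummy input: a batch of one 3-channel 256x256 image
img = torch.randn(1, 3, 256, 256)
# Build the extra-extra-small MobileViT variant
vit = mobilevit_xxs()
# Forward pass
out = vit(img)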
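
Since MobileViT is pitched as a light-weight model, it can be useful to check the model size and output shape after running the snippet above. The following is a minimal sketch, not part of the repository: `count_parameters` and `summarize` are hypothetical helper names, and the 256x256 input size is simply taken from the usage example.

```python
def count_parameters(model, trainable_only=True):
    """Count scalar parameters of a torch.nn.Module (or any object exposing
    .parameters() whose items have .numel() and .requires_grad)."""
    return sum(
        p.numel()
        for p in model.parameters()
        if p.requires_grad or not trainable_only
    )


def summarize(model, image_size=256):
    """Run one random image through `model` and report shape and size.

    Hypothetical helper for illustration; assumes a 3-channel square input.
    """
    import torch  # imported lazily so count_parameters works without it

    model.eval()  # inference behaviour for layers such as BatchNorm
    with torch.no_grad():  # no gradients needed for a shape check
        out = model(torch.randn(1, 3, image_size, image_size))
    return {
        "output_shape": tuple(out.shape),
        "trainable_params": count_parameters(model),
    }
```

With the repository's `mobilevit.py` importable, calling `summarize(mobilevit_xxs())` should return a dictionary with the output shape and the trainable parameter count. No expected values are listed here because they depend on the model definition in the repository.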

Citation

@article{mehta2021mobilevit,
  title={MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer},
  author={Mehta, Sachin and Rastegari, Mohammad},
  journal={arXiv preprint arXiv:2110.02178},
  year={2021}
}

Credits

Code adapted from MobileNetV2 and ViT.

Stars: ✭ 349
