Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → Alibaba-MIIL → Tresnet

Alibaba-MIIL / Tresnet

Licence: apache-2.0

Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)

Programming Languages

python

139335 projects - #7 most used programming language

Labels

architecture imagenet

Projects that are alternatives of or similar to Tresnet

Caffenet Benchmark

Evaluation of the CNN design choices performance on ImageNet-2012.

Stars: ✭ 700 (+122.93%)

Mutual labels: imagenet, architecture

Micro Company

Rest-full, Hipermedia-based distributed application. Spring boot & cloud. Angular. CQRS. Eventsourcing. Axonframework. Microservices. Docker. CloudFoundry

Stars: ✭ 307 (-2.23%)

Mutual labels: architecture

Tensorflow In Depth

《深入理解TensorFlow》项目代码与样章

Stars: ✭ 281 (-10.51%)

Mutual labels: architecture

Aofe.code

《前端架构：从入门到微前端》源码，code for Architecture of Frontend

Stars: ✭ 292 (-7.01%)

Mutual labels: architecture

Mobilenetv3.pytorch

74.3% MobileNetV3-Large and 67.2% MobileNetV3-Small model on ImageNet

Stars: ✭ 283 (-9.87%)

Mutual labels: imagenet

Computer Vision Leaderboard

Comparison of famous convolutional neural network models

Stars: ✭ 299 (-4.78%)

Mutual labels: imagenet

Cleanaspnetcorewebapi

Starter project for creating APIs built on ASP.NET Core using clean architecture.

Stars: ✭ 279 (-11.15%)

Mutual labels: architecture

React Proto

🎨 React application prototyping tool for developers and designers 🏗️

Stars: ✭ 3,261 (+938.54%)

Mutual labels: architecture

Annon.api

Configurable API gateway that acts as a reverse proxy with a plugin system.

Stars: ✭ 306 (-2.55%)

Mutual labels: architecture

Pytorch Hardnet

35% faster than ResNet: Harmonic DenseNet, A low memory traffic network

Stars: ✭ 293 (-6.69%)

Mutual labels: imagenet

Solution Architecture

Solution Architecture links, articles, books, video lessons, etc.

Stars: ✭ 289 (-7.96%)

Mutual labels: architecture

Harmonyos

A curated list of awesome things related to HarmonyOS. 华为鸿蒙操作系统。

Stars: ✭ 18,385 (+5755.1%)

Mutual labels: architecture

Segmentation models.pytorch

Segmentation models with pretrained backbones. PyTorch.

Stars: ✭ 4,584 (+1359.87%)

Mutual labels: imagenet

Game Programming Patterns

Source repo for the book

Stars: ✭ 3,096 (+885.99%)

Mutual labels: architecture

Android Viewmodelbinding

A lightweight library aiming to speed up Android app development by leveraging the new Android Data Binding together with the Model-View-ViewModel design pattern.

Stars: ✭ 308 (-1.91%)

Mutual labels: architecture

Avenging

MVP pattern example on Android: no Dagger or RxJava example

Stars: ✭ 279 (-11.15%)

Mutual labels: architecture

Cleanarchitecturerxswift

Example of Clean Architecture of iOS app using RxSwift

Stars: ✭ 3,256 (+936.94%)

Mutual labels: architecture

Clean Architecture Zh

《架构整洁之道》中文翻译

Stars: ✭ 299 (-4.78%)

Mutual labels: architecture

Norris

Showcase for Unidirectional Data Flow architecture for Android, powered by Kotlin Coroutines

Stars: ✭ 315 (+0.32%)

Mutual labels: architecture

Pnasnet.pytorch

PyTorch implementation of PNASNet-5 on ImageNet

Stars: ✭ 309 (-1.59%)

Mutual labels: imagenet

View All Similar Projects ➔

TResNet: High Performance GPU-Dedicated Architecture

paperV2 | pretrained models

Official PyTorch Implementation

Tal Ridnik, Hussam Lawen, Asaf Noy, Itamar Friedman, Emanuel Ben Baruch, Gilad Sharir
DAMO Academy, Alibaba Group

Abstract

Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work, we introduce a series of architecture modifications that aim to boost neural networks' accuracy, while retaining their GPU training and inference efficiency. We first demonstrate and discuss the bottlenecks induced by FLOPs-optimizations. We then suggest alternative designs that better utilize GPU structure and assets. Finally, we introduce a new family of GPU-dedicated models, called TResNet, which achieve better accuracy and efficiency than previous ConvNets. Using a TResNet model, with similar GPU throughput to ResNet50, we reach 80.7% top-1 accuracy on ImageNet. Our TResNet models also transfer well and achieve state-of-the-art accuracy on competitive datasets such as Stanford cars (96.0%), CIFAR-10 (99.0%), CIFAR-100 (91.5%) and Oxford-Flowers (99.1%). They also perform well on multi-label classification and object detection tasks.

07/01/2021: New SOTA Results

With better pretraining, Our medium-size newtork, TResNet-L-V2, achieved SOTA result on stanford-cars dataset, and 3rd place on CIFAR100, while being significantly faster and smaller than the competitors.

30/9/2020: New Paper Released

We released a new paper, Asymmetric Loss For Multi-Label Classification.
The paper presents a new novel loss function, that is applicable for tasks with inherent data imbalancing - first and foremost multi-label classification, and also object detection and fine-grain single-label classification. The new paper also describe and expand in details preliminary results provided in TResNet paper on MS-COCO multi-label dataset.
Github link

28/8/2020: V2 of TResNet Article Released

Sotabench Comparisons

Comparative results from sotabench benchamrk, demonstartaing that TReNset models give excellent speed-accuracy tradoff:

11/6/2020: V1 of TResNet Article Released

The main change - In addition to single label SOTA results, we also added top results for multi-label classification and object detection tasks, using TResNet. For example, we set a new SOTA record for MS-COCO multi-label dataset, surpassing the previous top results by more than 2.5% mAP !

Bacbkone	mAP
KSSNet (previous SOTA)	83.7
TResNet-L	86.4

2/6/2020: CVPR-Kaggle competitions

We participated and won top places in two major CVPR-Kaggle competitions:

2nd place in Herbarium 2020 competition, out of 153 teams.
7th place in Plant-Pathology 2020 competition, out of 1317 teams.

TResNet was a vital part of our solution for both competitions, allowing us to work on high resolutions and reach top scores while doing fast and efficient experiments.

Main Article Results

TResNet Models

TResNet models accuracy and GPU throughput on ImageNet, compared to ResNet50. All measurements were done on Nvidia V100 GPU, with mixed precision. All models are trained on input resolution of 224.

Models	Top Training Speed (img/sec)	Top Inference Speed (img/sec)	Max Train Batch Size	Top-1 Acc.
ResNet50	805	2830	288	79.0
EfficientNetB1	440	2740	196	79.2
TResNet-M	730	2930	512	80.8
TResNet-L	345	1390	316	81.5
TResNet-XL	250	1060	240	82.0

Comparison To Other Networks

Comparison of ResNet50 to top modern networks, with similar top-1 ImageNet accuracy. All measurements were done on Nvidia V100 GPU with mixed precision. For gaining optimal speeds, training and inference were measured on 90% of maximal possible batch size. Except TResNet-M, all the models' ImageNet scores were taken from the public repository, which specialized in providing top implementations for modern networks. Except EfficientNet-B1, which has input resolution of 240, all other models have input resolution of 224.

Model	Top Training Speed (img/sec)	Top Inference Speed (img/sec)	Top-1 Acc.	Flops[G]
ResNet50	805	2830	79.0	4.1
ResNet50-D	600	2670	79.3	4.4
ResNeXt50	490	1940	79.4	4.3
EfficientNetB1	440	2740	79.2	0.6
SEResNeXt50	400	1770	79.9	4.3
MixNet-L	400	1400	79.0	0.5
TResNet-M	730	2930	80.8	5.5

Transfer Learning SotA Results

Comparison of TResNet to state-of-the-art models on transfer learning datasets (only ImageNet-based transfer learning results). Models inference speed is measured on a mixed precision V100 GPU. Since no official implementation of Gpipe was provided, its inference speed is unknown

Dataset	Model	Top-1 Acc.	Speed img/sec	Input
CIFAR-10	Gpipe	99.0	-	480
CIFAR-10	TResNet-XL	99.0	1060	224
CIFAR-100	EfficientNet-B7	91.7	70	600
CIFAR-100	TResNet-XL	91.5	1060	224
Stanford Cars	EfficientNet-B7	94.7	70	600
Stanford Cars	TResNet-L	96.0	500	368
Oxford-Flowers	EfficientNet-B7	98.8	70	600
Oxford-Flowers	TResNet-L	99.1	500	368

Reproduce Article Scores

We provide code for reproducing the validation top-1 score of TResNet models on ImageNet. First, download pretrained models from here.

Then, run the infer.py script. For example, for tresnet_m (input size 224) run:

python -m infer.py \
--val_dir=/path/to/imagenet_val_folder \
--model_path=/model/path/to/tresnet_m.pth \
--model_name=tresnet_m
--input_size=224

TResNet Training

Due to IP limitations, we do not provide the exact training code that was used to obtain the article results.

However, TResNet is now an integral part of the popular rwightman / pytorch-image-models repo. Using that repo, you can reach very similar results to the one stated in the article.

For example, training tresnet_m on rwightman / pytorch-image-models with the command line:

python -u -m torch.distributed.launch --nproc_per_node=8 \
--nnodes=1 --node_rank=0 ./train.py /data/imagenet/ \
-b=190 --lr=0.6 --model-ema --aa=rand-m9-mstd0.5-inc1 \
--num-gpu=8 -j=16 --amp \
--model=tresnet_m --epochs=300 --mixup=0.2 \
--sched='cosine' --reprob=0.4 --remode=pixel

gave accuracy of 80.5%.

Also, during the merge request, we had interesting discussions and insights regarding TResNet design. I am attaching a pdf version the mentioned discussions. They can shed more light on TResNet design considerations and directions for the future.

TResNet discussion and insights

(taken with permission from here)

Tips For Working With Inplace-ABN

See INPLACE_ABN_TIPS.

Citation

@misc{ridnik2020tresnet,
    title={TResNet: High Performance GPU-Dedicated Architecture},
    author={Tal Ridnik and Hussam Lawen and Asaf Noy and Itamar Friedman},
    year={2020},
    eprint={2003.13630},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Contact

Feel free to contact me if there are any questions or issues (Tal Ridnik, [email protected]).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 314

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗