d-li14 / Involution

Licence: mit

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Programming Languages

python

Labels

pytorch object-detection semantic-segmentation image-classification operator instance-segmentation

involution

Official implementation of a neural operator as described in Involution: Inverting the Inherence of Convolution for Visual Recognition (CVPR'21)

By Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, and Qifeng Chen

TL;DR: involution is a general-purpose neural primitive that is versatile across a spectrum of deep learning models on different vision tasks. It bridges convolution and self-attention in design, while being more efficient and effective than convolution and simpler in form than self-attention.

Getting Started

This repository is fully built upon the OpenMMLab toolkits. For each individual task, the config and model files follow the same directory organization as mmcls, mmdet, and mmseg respectively, so just copy-and-paste them to the corresponding locations to get started.

For example, to evaluate detectors:

git clone https://github.com/open-mmlab/mmdetection # and install

# copy model files
cp det/mmdet/models/backbones/* mmdetection/mmdet/models/backbones
cp det/mmdet/models/necks/* mmdetection/mmdet/models/necks
cp det/mmdet/models/utils/* mmdetection/mmdet/models/utils

# copy config files
cp det/configs/_base_/models/* mmdetection/mmdet/configs/_base_/models
cp det/configs/_base_/schedules/* mmdetection/mmdet/configs/_base_/schedules
cp -r det/configs/involution mmdetection/mmdet/configs

# evaluate checkpoints
cd mmdetection
bash tools/dist_test.sh ${CONFIG_FILE} ${CHECKPOINT_FILE} ${GPU_NUM} [--out ${RESULT_FILE}] [--eval ${EVAL_METRICS}]

For more detailed guidance, please refer to the original mmcls, mmdet, and mmseg tutorials.

Currently, we provide a memory-efficient implementation of the involution operator based on CuPy. Please install this library in advance. A customized CUDA kernel would bring further acceleration on the hardware. Any contribution from the community regarding this is welcome!
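Because the operator depends on CuPy, it can help to confirm that the library is visible to your Python environment before launching a job. The snippet below is a minimal sketch that uses only the standard library; the helper name `cupy_available` is hypothetical and not part of this repository.

```python
import importlib.util


def cupy_available() -> bool:
    """Return True if the `cupy` package can be located in this environment."""
    # find_spec only locates the package; it does not import it or touch the GPU.
    return importlib.util.find_spec("cupy") is not None


if __name__ == "__main__":
    if cupy_available():
        print("CuPy found.")
    else:
        print("CuPy not found; install a build that matches your CUDA version.")
```

This only checks that the package is installed. It does not verify that the installed build matches the local CUDA toolkit.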

Model Zoo

The reduction in parameters/FLOPs (↓) and the gain in performance (↑) relative to the convolution baselines are given in parentheses. Some of these checkpoints were obtained in our reimplementation runs, so their performance may differ slightly from that reported in our paper. Models are trained with 64 GPUs on ImageNet, 8 GPUs on COCO, and 4 GPUs on Cityscapes.

Image Classification on ImageNet

Model	Params (M)	FLOPs (G)	Top-1 (%)	Top-5 (%)	Config	Download
RedNet-26	9.23 (32.8%↓)	1.73 (29.2%↓)	75.96	93.19	config	model | log
RedNet-38	12.39 (36.7%↓)	2.22 (31.3%↓)	77.48	93.57	config	model | log
RedNet-50	15.54 (39.5%↓)	2.71 (34.1%↓)	78.35	94.13	config	model | log
RedNet-101	25.65 (42.6%↓)	4.74 (40.5%↓)	78.92	94.35	config	model | log
RedNet-152	33.99 (43.5%↓)	6.79 (41.4%↓)	79.12	94.38	config	model | log
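To read the parenthesized percentages, it may help to see how a listed value relates to its baseline. The sketch below assumes that each percentage is a relative reduction, so that value = baseline × (1 − percentage/100); the function name `implied_baseline` is hypothetical. Because the table entries are rounded, the result is only approximate.

```python
def implied_baseline(value: float, reduction_pct: float) -> float:
    """Recover the baseline implied by a value and its relative reduction (in %)."""
    # Assumption: value = baseline * (1 - reduction_pct / 100)
    return value / (1.0 - reduction_pct / 100.0)


# RedNet-50 row of the ImageNet table: 15.54 M parameters, 39.5% fewer than the baseline.
params_m = 15.54
reduction_pct = 39.5

baseline = implied_baseline(params_m, reduction_pct)
print(f"Implied baseline parameters: {baseline:.2f} M")
```

The same arithmetic applies to the FLOPs columns in the tables of this section.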

Before finetuning on the following downstream tasks, download the ImageNet pre-trained RedNet-50 weights and set the pretrained argument in det/configs/_base_/models/*.py or seg/configs/_base_/models/*.py to your local path.
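As a rough illustration of that edit, the fragment below shows what the relevant part of a model config might look like. It is a sketch only: it assumes the config exposes `pretrained` as a top-level key of the `model` dict, and the checkpoint path is a placeholder, not a file shipped with this repository.

```python
# Hypothetical local path; replace it with wherever you saved the RedNet-50 checkpoint.
PRETRAINED_PATH = "/path/to/rednet50.pth"

# Only the key discussed above is shown; the real config files define many more fields.
model = dict(
    pretrained=PRETRAINED_PATH,
)
```

All other fields in the original config files should be left as they are.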

Object Detection and Instance Segmentation on COCO

Faster R-CNN

Backbone	Neck	Style	Lr schd	Params (M)	FLOPs (G)	box AP	Config	Download
RedNet-50-FPN	convolution	pytorch	1x	31.6 (23.9%↓)	177.9 (14.1%↓)	39.5 (1.8↑)	config	model | log
RedNet-50-FPN	involution	pytorch	1x	29.5 (28.9%↓)	135.0 (34.8%↓)	40.2 (2.5↑)	config	model | log

Mask R-CNN

Backbone	Neck	Style	Lr schd	Params (M)	FLOPs (G)	box AP	mask AP	Config	Download
RedNet-50-FPN	convolution	pytorch	1x	34.2 (22.6%↓)	224.2 (11.5%↓)	39.9 (1.5↑)	35.7 (0.8↑)	config	model | log
RedNet-50-FPN	involution	pytorch	1x	32.2 (27.1%↓)	181.3 (28.5%↓)	40.8 (2.4↑)	36.4 (1.3↑)	config	model | log

RetinaNet

Backbone	Neck	Style	Lr schd	Params (M)	FLOPs (G)	box AP	Config	Download
RedNet-50-FPN	convolution	pytorch	1x	27.8 (26.3%↓)	210.1 (12.2%↓)	38.2 (1.6↑)	config	model | log
RedNet-50-FPN	involution	pytorch	1x	26.3 (30.2%↓)	199.9 (16.5%↓)	38.2 (1.6↑)	config	model | log

Semantic Segmentation on Cityscapes

Method	Backbone	Neck	Crop Size	Lr schd	Params (M)	FLOPs (G)	mIoU	Config	Download
FPN	RedNet-50	convolution	512x1024	80000	18.5 (35.1%↓)	293.9 (19.0%↓)	78.0 (3.6↑)	config	model | log
FPN	RedNet-50	involution	512x1024	80000	16.4 (42.5%↓)	205.2 (43.4%↓)	79.1 (4.7↑)	config	model | log
UPerNet	RedNet-50	convolution	512x1024	80000	56.4 (15.1%↓)	1825.6 (3.6%↓)	80.6 (2.4↑)	config	model | log

Citation

If you find our work useful in your research, please cite:

@InProceedings{Li_2021_CVPR,
author = {Li, Duo and Hu, Jie and Wang, Changhu and Li, Xiangtai and She, Qi and Zhu, Lei and Zhang, Tong and Chen, Qifeng},
title = {Involution: Inverting the Inherence of Convolution for Visual Recognition},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021}
}
