Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → dddzg → Up Detr

dddzg / Up Detr

Licence: apache-2.0

[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Programming Languages

python

139335 projects - #7 most used programming language

Labels

detection cvpr coco

Projects that are alternatives of or similar to Up Detr

Fastmaskrcnn

Mask RCNN in TensorFlow

Stars: ✭ 3,069 (+2335.71%)

Mutual labels: coco, detection

keras cv attention models

Keras/Tensorflow attention models including beit,botnet,CMT,CoaT,CoAtNet,convnext,cotnet,davit,efficientdet,efficientnet,fbnet,gmlp,halonet,lcnet,levit,mlp-mixer,mobilevit,nfnets,regnet,resmlp,resnest,resnext,resnetd,swin,tinynet,uniformer,volo,wavemlp,yolor,yolox

Stars: ✭ 159 (+26.19%)

Mutual labels: detection, coco

BCNet

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

Stars: ✭ 434 (+244.44%)

Mutual labels: detection, cvpr

Efficientdet.pytorch

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

Stars: ✭ 1,383 (+997.62%)

Mutual labels: coco, detection

Math object detection

An image recognition/object detection model that detects handwritten digits and simple math operators. The output of the predicted objects (numbers & math operators) is then evaluated and solved.

Stars: ✭ 52 (-58.73%)

Mutual labels: coco, detection

Foveabox

FoveaBox: Beyond Anchor-based Object Detector

Stars: ✭ 353 (+180.16%)

Mutual labels: coco, detection

MonoRUn

[CVPR'21] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation

Stars: ✭ 85 (-32.54%)

Mutual labels: detection, cvpr

Prepare detection dataset

convert dataset to coco/voc format

Stars: ✭ 654 (+419.05%)

Mutual labels: coco, detection

Coco Annotator

✏️ Web-based image segmentation tool for object detection, localization, and keypoints

Stars: ✭ 1,138 (+803.17%)

Mutual labels: coco, detection

Pytorch Imagenet Cifar Coco Voc Training

Training examples and results for ImageNet(ILSVRC2012)/CIFAR100/COCO2017/VOC2007+VOC2012 datasets.Image Classification/Object Detection.Include ResNet/EfficientNet/VovNet/DarkNet/RegNet/RetinaNet/FCOS/CenterNet/YOLOv3.

Stars: ✭ 130 (+3.17%)

Mutual labels: coco, detection

Model Quantization

Collections of model quantization algorithms

Stars: ✭ 118 (-6.35%)

Mutual labels: detection

Face landmark factory

These are a set of tools using OpenCV, Tensorflow and Keras, with which you can generate your own model of facial landmark detection and demonstrate the effect of newly-generated model easily.

Stars: ✭ 120 (-4.76%)

Mutual labels: detection

Awesome Gan For Medical Imaging

Awesome GAN for Medical Imaging

Stars: ✭ 1,814 (+1339.68%)

Mutual labels: detection

Make Sense

Free to use online tool for labelling photos. https://makesense.ai

Stars: ✭ 2,087 (+1556.35%)

Mutual labels: detection

Mobilenet

MobileNet build with Tensorflow

Stars: ✭ 1,531 (+1115.08%)

Mutual labels: detection

Yolo label

GUI for marking bounded boxes of objects in images for training neural network Yolo v3 and v2 https://github.com/AlexeyAB/darknet, https://github.com/pjreddie/darknet

Stars: ✭ 128 (+1.59%)

Mutual labels: detection

Dstl unet

Dstl Satellite Imagery Feature Detection

Stars: ✭ 117 (-7.14%)

Mutual labels: detection

Ac Fpn

Implement of paper 《Attention-guided Context Feature Pyramid Network for Object Detection》

Stars: ✭ 117 (-7.14%)

Mutual labels: detection

Sfd.pytorch

S3FD: single shot face detector in pytorch

Stars: ✭ 116 (-7.94%)

Mutual labels: detection

Simpsonrecognition

Detect and recognize The Simpsons characters using Keras and Faster R-CNN

Stars: ✭ 131 (+3.97%)

Mutual labels: detection

View All Similar Projects ➔

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

This is the official PyTorch implementation and models for UP-DETR paper:

@article{dai2020up-detr,
  author  = {Zhigang Dai and Bolun Cai and Yugeng Lin and Junying Chen},
  title   = {UP-DETR: Unsupervised Pre-training for Object Detection with Transformers},
  journal = {arXiv preprint arXiv:2011.09094},
  year    = {2020},
}

In UP-DETR, we introduce a novel pretext named random query patch detection to pre-train transformers for object detection. UP-DETR inherits from DETR with the same ResNet-50 backbone, same Transformer encoder, decoder and same codebase. With unsupervised pre-training CNN, the whole UP-DETR model doesn't require any human annotations. UP-DETR achieves 43.1 AP on COCO with 300 epochs fine-tuning. The AP of open-source version is a little higher than paper report.

Model Zoo

We provide pre-training UP-DETR and fine-tuning UP-DETR models on COCO, and plan to include more in future. The evaluation metric is same to DETR.

Here is the UP-DETR model pre-trained on ImageNet without labels. The CNN weight is initialized from SwAV, which is fixed during the transformer pre-training:

name	backbone	epochs	url	size	md5
UP-DETR	R50 (SwAV)	60	model \| logs	164Mb	`49f01f8b`

Comparision with DETR:

name	backbone (pre-train)	epochs	box AP	url	size
DETR	R50 (Supervised)	500	42.0	-	159Mb
DETR	R50 (SwAV)	300	42.1	-	159Mb
UP-DETR	R50 (SwAV)	300	43.1	model \| logs	159Mb

COCO val5k evaluation results of UP-DETR can be found in this gist.

Usage - Object Detection

There are no extra compiled components in UP-DETR and package dependencies are same to DETR. We provide instructions how to install dependencies via conda:

git clone tbd
conda install -c pytorch pytorch torchvision
conda install cython scipy
pip install -U 'git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI'

UP-DETR follows two steps: pre-training and fine-tuning. We present the model pre-trained on ImageNet and then fine-tuned on COCO.

Unsupervised Pre-training

Data Preparation

Download and extract ILSVRC2012 train dataset.

We expect the directory structure to be the following:

path/to/imagenet/
  n06785654/  # caterogey directory
    n06785654_16140.JPEG # images
  n04584207/  # caterogey directory
    n04584207_14322.JPEG # images

Images can be organized disorderly because our pre-training is unsupervised.

Pre-training

To pr-train UP-DETR on a single node with 8 gpus for 60 epochs, run:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py \
    --lr_drop 40 \
    --epochs 60 \
    --pre_norm \
    --num_patches 10 \
    --batch_size 32 \
    --feature_recon \
    --fre_cnn \
    --imagenet_path path/to/imagenet \
    --output_dir path/to/save_model

As the size of pre-training images is relative small, so we can set a large batch size.

It takes about 2 hours for a epoch, so 60 epochs pre-training takes about 5 days with 8 V100 gpus.

In our further ablation experiment, we found that object query shuffle is not helpful. So, we remove it in the open-source version.

Fine-tuning

Data Preparation

Download and extract COCO 2017 dataset train and val dataset.

The directory structure is expected as follows:

path/to/coco/
  annotations/  # annotation json files
  train2017/    # train images
  val2017/      # val images

Fine-tuning

To fine-tune UP-DETR with 8 gpus for 300 epochs, run:

python -m torch.distributed.launch --nproc_per_node=8 --use_env detr_main.py \
    --lr_drop 200 \
    --epochs 300 \
    --lr_backbone 5e-4 \
    --pre_norm \
    --coco_path path/to/coco \
    --pretrain path/to/save_model/checkpoint.pth

The fine-tuning cost is exactly same to DETR, which takes 28 minutes with 8 V100 gpus. So, 300 epochs training takes about 6 days.

The model can also extended to panoptic segmentation, checking more details on DETR.

Notebook

We provide a notebook in colab to get the visualization result in the paper:

Visualization Notebook: This notebook shows how to perform query patch detection with the pre-training model (without any annotations fine-tuning).

License

UP-DETR is released under the Apache 2.0 license. Please see the LICENSE file for more information.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 126

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗