Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → implus → Gfocalv2

implus / Gfocalv2

Licence: apache-2.0

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection, CVPR2021

Programming Languages

139335 projects - #7 most used programming language

Labels

computer-vision object-detection detection

Projects that are alternatives of or similar to Gfocalv2

mean Average Precision - This code evaluates the performance of your neural net for object recognition.

Stars: ✭ 2,324 (+760.74%)

Mutual labels: object-detection, detection

Face detection using deep learning.

Stars: ✭ 173 (-35.93%)

Mutual labels: object-detection, detection

A Pytorch Tutorial To Object Detection

SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection

Stars: ✭ 2,398 (+788.15%)

Mutual labels: object-detection, detection

Deep Learning For Tracking And Detection

Collection of papers, datasets, code and other resources for object tracking and detection using deep learning

Stars: ✭ 1,920 (+611.11%)

Mutual labels: object-detection, detection

🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.

Stars: ✭ 202 (-25.19%)

Mutual labels: object-detection, detection

SynthDet - An end-to-end object detection pipeline using synthetic data

Stars: ✭ 148 (-45.19%)

Mutual labels: object-detection, detection

Tf deformable net

Deformable convolution net on Tensorflow

Stars: ✭ 173 (-35.93%)

Mutual labels: object-detection, detection

An Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)

Stars: ✭ 1,460 (+440.74%)

Mutual labels: object-detection, detection

Video detection library

Stars: ✭ 177 (-34.44%)

Mutual labels: object-detection, detection

Video Platform for Action Recognition and Object Detection in Pytorch

Stars: ✭ 175 (-35.19%)

Mutual labels: object-detection, detection

Free to use online tool for labelling photos. https://makesense.ai

Stars: ✭ 2,087 (+672.96%)

Mutual labels: object-detection, detection

Real time object detection and tracking

YOLOv2 and MobileNet_SSD detection algorithms used along with KCF object tracker

Stars: ✭ 241 (-10.74%)

Mutual labels: object-detection, detection

GUI for marking bounded boxes of objects in images for training neural network Yolo v3 and v2 https://github.com/AlexeyAB/darknet, https://github.com/pjreddie/darknet

Stars: ✭ 128 (-52.59%)

Mutual labels: object-detection, detection

A novel region proposal network for more general object detection ( including scene text detection ).

Stars: ✭ 155 (-42.59%)

Mutual labels: object-detection, detection

Tensorflow Object Detection Tutorial

The purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch

Stars: ✭ 113 (-58.15%)

Mutual labels: object-detection, detection

Object Detection

Object detection with ssd_mobilenet and tiny-yolo (Add: YOLOv3, tflite)

Stars: ✭ 173 (-35.93%)

Mutual labels: object-detection, detection

Efficientdet.pytorch

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

Stars: ✭ 1,383 (+412.22%)

Mutual labels: object-detection, detection

SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity

Stars: ✭ 107 (-60.37%)

Mutual labels: object-detection, detection

Research platform for 3D object detection in PyTorch.

Stars: ✭ 177 (-34.44%)

Mutual labels: object-detection, detection

Com.unity.perception

Perception toolkit for sim2real training and validation

Stars: ✭ 208 (-22.96%)

Mutual labels: object-detection, detection

View All Similar Projects ➔

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection

GFocalV2 (GFLV2) is a next generation of GFocalV1 (GFLV1), which utilizes the statistics of learned bounding box distributions to guide the reliable localization quality estimation.

Again, GFLV2 improves over GFLV1 about ~1 AP without (almost) extra computing cost! Analysis of GFocalV2 in ZhiHu: 大白话 Generalized Focal Loss V2. You can see more comments about GFocalV1 in 大白话 Generalized Focal Loss(知乎)

More news:

[2021.3] GFocalV2 has been accepted by CVPR2021 (pre-review score: 113).

[2020.11] GFocalV1 has been adopted in NanoDet, a super efficient object detector on mobile devices, achieving same performance but 2x faster than YoLoV4-Tiny! More details are in YOLO之外的另一选择，手机端97FPS的Anchor-Free目标检测模型NanoDet现已开源~.

[2020.10] Good News! GFocalV1 has been accepted in NeurIPs 2020 and GFocalV2 is on the way.

[2020.9] The winner (1st) of GigaVision (object detection and tracking) in ECCV 2020 workshop from DeepBlueAI team adopt GFocalV1 in their solutions.

[2020.7] GFocalV1 is officially included in MMDetection V2, many thanks to @ZwwWayne and @hellock for helping migrating the code.

Introduction

Localization Quality Estimation (LQE) is crucial and popular in the recent advancement of dense object detectors since it can provide accurate ranking scores that benefit the Non-Maximum Suppression processing and improve detection performance. As a common practice, most existing methods predict LQE scores through vanilla convolutional features shared with object classification or bounding box regression. In this paper, we explore a completely novel and different perspective to perform LQE -- based on the learned distributions of the four parameters of the bounding box. The bounding box distributions are inspired and introduced as ''General Distribution'' in GFLV1, which describes the uncertainty of the predicted bounding boxes well. Such a property makes the distribution statistics of a bounding box highly correlated to its real localization quality. Specifically, a bounding box distribution with a sharp peak usually corresponds to high localization quality, and vice versa. By leveraging the close correlation between distribution statistics and the real localization quality, we develop a considerably lightweight Distribution-Guided Quality Predictor (DGQP) for reliable LQE based on GFLV1, thus producing GFLV2. To our best knowledge, it is the first attempt in object detection to use a highly relevant, statistical representation to facilitate LQE. Extensive experiments demonstrate the effectiveness of our method. Notably, GFLV2 (ResNet-101) achieves 46.2 AP at 14.6 FPS, surpassing the previous state-of-the-art ATSS baseline (43.6 AP at 14.6 FPS) by absolute 2.6 AP on COCO test-dev, without sacrificing the efficiency both in training and inference.

For details see GFocalV2. The speed-accuracy trade-off is as follows:

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of MMDetection.

Train

# assume that you are under the root directory of this project,
# and you have activated your virtual environment if needed.
# and with COCO dataset in 'data/coco/'

./tools/dist_train.sh configs/gfocal/gfocal_r50_fpn_ms2x.py 8 --validate

Inference

./tools/dist_test.sh configs/gfocal/gfocal_r50_fpn_ms2x.py work_dirs/gfocal_r50_fpn_ms2x/epoch_24.pth 8 --eval bbox

Speed Test (FPS)

CUDA_VISIBLE_DEVICES=0 python3 ./tools/benchmark.py configs/gfocal/gfocal_r50_fpn_ms2x.py work_dirs/gfocal_r50_fpn_ms2x/epoch_24.pth

Models

For your convenience, we provide the following trained models (GFocalV2). All models are trained with 16 images in a mini-batch with 8 GPUs.

Model	Multi-scale training	AP (minival)	AP (test-dev)	FPS	Link
GFocal_R_50_FPN_1x	No	41.0	41.1	19.4	Google
GFocal_R_50_FPN_2x	Yes	43.9	44.4	19.4	Google
GFocal_R_101_FPN_2x	Yes	45.8	46.0	14.6	Google
GFocal_R_101_dcnv2_FPN_2x	Yes	48.0	48.2	12.7	Google
GFocal_X_101_dcnv2_FPN_2x	Yes	48.8	49.0	10.7	Google
GFocal_R2_101_dcnv2_FPN_2x	Yes	49.9	50.5	10.9	Google

[0] The reported numbers here are from new experimental trials (in the cleaned repo), which may be slightly different from the original paper.
[1] Note that the 1x performance may be slightly unstable due to insufficient training. In practice, the 2x results are considerably stable between multiple runs.
[2] All results are obtained with a single model and without any test time data augmentation such as multi-scale, flipping and etc..
[3] dcnv2 denotes deformable convolutional networks v2. Note that for ResNe(X)t based models, we apply deformable convolutions from stage c3 to c5 in backbones.
[4] Refer to more details in config files in config/gfocal/.
[5] FPS is tested with a single GeForce RTX 2080Ti GPU, using a batch size of 1.

Acknowledgement

Thanks MMDetection team for the wonderful open source project!

Citation

If you find GFocal useful in your research, please consider citing:

@article{li2020gfl,
  title={Generalized focal loss: Learning qualified and distributed bounding boxes for dense object detection},
  author={Li, Xiang and Wang, Wenhai and Wu, Lijun and Chen, Shuo and Hu, Xiaolin and Li, Jun and Tang, Jinhui and Yang, Jian},
  journal={arXiv preprint arXiv:2006.04388},
  year={2020}
}

@article{li2020gflv2,
  title={Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection},
  author={Li, Xiang and Wang, Wenhai and Hu, Xiaolin and Li, Jun and Tang, Jinhui and Yang, Jian},
  journal={arXiv preprint arXiv:2011.12885},
  year={2020}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 270

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (9) 🔗