All Projects → tjubiit → Tju Dhd

tjubiit / Tju Dhd

Licence: mit
A newly built high-resolution dataset for object detection and pedestrian detection (IEEE TIP 2020)

Projects that are alternatives of or similar to Tju Dhd

Lacmus
Lacmus is a cross-platform application that helps to find people who are lost in the forest using computer vision and neural networks.
Stars: ✭ 142 (+89.33%)
Mutual labels:  object-detection, dataset
Epic Kitchens 55 Annotations
🍴 Annotations for the EPIC KITCHENS-55 Dataset.
Stars: ✭ 120 (+60%)
Mutual labels:  object-detection, dataset
Vidvrd Helper
To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper
Stars: ✭ 81 (+8%)
Mutual labels:  object-detection, dataset
Maskrcnn Modanet
A Mask R-CNN Keras implementation with Modanet annotations on the Paperdoll dataset
Stars: ✭ 59 (-21.33%)
Mutual labels:  object-detection, dataset
Exclusively Dark Image Dataset
Exclusively Dark (ExDARK) dataset which to the best of our knowledge, is the largest collection of low-light images taken in very low-light environments to twilight (i.e 10 different conditions) to-date with image class and object level annotations.
Stars: ✭ 274 (+265.33%)
Mutual labels:  object-detection, dataset
Shape Detection
🟣 Object detection of abstract shapes with neural networks
Stars: ✭ 170 (+126.67%)
Mutual labels:  object-detection, dataset
Hands Detection
Hands video tracker using the Tensorflow Object Detection API and Faster RCNN model. The data used is the Hand Dataset from University of Oxford.
Stars: ✭ 87 (+16%)
Mutual labels:  object-detection, dataset
Taco
🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (+224%)
Mutual labels:  object-detection, dataset
Tensorflow object tracking video
Object Tracking in Tensorflow ( Localization Detection Classification ) developed to partecipate to ImageNET VID competition
Stars: ✭ 491 (+554.67%)
Mutual labels:  object-detection, dataset
Awesome machine learning solutions
A curated list of repositories for my book Machine Learning Solutions.
Stars: ✭ 65 (-13.33%)
Mutual labels:  object-detection, dataset
Autonomous driving
Ros package for basic autonomous lane tracking and object detection
Stars: ✭ 67 (-10.67%)
Mutual labels:  object-detection
Satellite Image Deep Learning
Resources for deep learning with satellite & aerial imagery
Stars: ✭ 1,141 (+1421.33%)
Mutual labels:  dataset
Csvpack
csvpack library / gem - tools 'n' scripts for working with tabular data packages using comma-separated values (CSV) datafiles in text with meta info (that is, schema, datatypes, ..) in datapackage.json; download, read into and query CSV datafiles with your SQL database (e.g. SQLite, PostgreSQL, ...) of choice and much more
Stars: ✭ 71 (-5.33%)
Mutual labels:  dataset
Data generator object detection 2d
A data generator for 2D object detection
Stars: ✭ 73 (-2.67%)
Mutual labels:  object-detection
Fish detection
Fish detection using Open Images Dataset and Tensorflow Object Detection
Stars: ✭ 67 (-10.67%)
Mutual labels:  object-detection
Panet
PANet for Instance Segmentation and Object Detection
Stars: ✭ 1,170 (+1460%)
Mutual labels:  object-detection
Traffic Rules Violation Detection
The System consists of two main components. Vehicle detection model and A graphical user interface (GUI)
Stars: ✭ 67 (-10.67%)
Mutual labels:  object-detection
Openpowerlifting
Read-Only Mirror of the OpenPowerlifting Project. Main Repo on GitLab.
Stars: ✭ 67 (-10.67%)
Mutual labels:  dataset
Chainer Ssd
Implementation of SSD (Single Shot MultiBox Detector) using Chainer
Stars: ✭ 66 (-12%)
Mutual labels:  object-detection
Kaggle Rsna
Deep Learning for Automatic Pneumonia Detection, RSNA challenge
Stars: ✭ 74 (-1.33%)
Mutual labels:  object-detection

TJU-DHD dataset (object detection and pedestrian detection)

This is the official website for "TJU-DHD: A Diverse High-Resolution Dataset for Object Detection (TIP2020)", which is a newly built high-resolution dataset for object detection and pedestrian detection.

  • 115k+ images and 700k+ instances
  • Scenes: traffic and campus, Tasks: object detection and pedestrian detection
  • High resolution: image resolution of at least 1624x1200 pixels, the object height from 11 pixels to 4152 pixels.
  • Diversity: A large variance in appearance, scale, illumination, season, and weather
  • Cross-scene evaluation and same-scene evaluation on pedestrian detection
  • If you are interested in pedestrian detection, please refer to our survey paper or our github project.

Examples of DHD

Table of Contents

  1. Introduction
  2. Object detection dataset
    2.1 TJU-DHD-traffic
    2.2 TJU-DHD-campus
  3. Pedestrian detection dataset
    3.1 TJU-Ped-traffic
    3.2 TJU-Ped-campus
  4. Benchmark
    4.1 TJU-DHD-traffic
    4.2 TJU-DHD-campus
    4.3 TJU-DHD-pedestrian
  5. Citation
  6. Contact

1. Introduction

Vehicles, pedestrians, and riders are the most important and interesting objects in the perception modules of self-driving vehicles and video surveillance. However, the state-of-the-art performance of detecting such important objects (esp. small objects) is far from satisfying the demand of the practical systems. Large-scale, rich-diversity, and high-resolution vehicle and pedestrian datasets play an important role in developing better object detection methods to satisfy the demand. Existing public large-scale datasets such as MS COCO collected from websites do not focus on these specific scenarios. Moreover, the popular datasets (e.g., KITTI and Citypersons) collected from these specific scenarios are limited in the number of images and instances, the resolution, and the diversity in seasons, weathers, and illuminations. To attempt to solve the problem, in this paper, we build a diverse high-resolution dataset (called TJU-DHD). The dataset contains 115,354 high-resolution images (52% images have a resolution of 1624x1200 pixels and 48% images have a resolution of at least 2,560x1,440 pixels) and 709,330 labeled objects in total with a large variance in scale and appearance. Meanwhile, the dataset has a rich diversity in season variance, illumination variance, and weather variance. Based on this object dataset, a new diverse pedestrian dataset is further built. With the four different detectors (i.e., the one-stage RetinaNet, anchor-free FCOS, two-stage FPN, and Cascade R-CNN), experiments about object detection and pedestrian detection are conducted. We hope that the newly built dataset can help promote the research on object detection and pedestrian detection in these two scenes.

2. Object detection dataset

name DHD-traffic (#images) DHD-traffic (#instances) DHD-campus (#images) DHD-campus (#instances)
training 45,266 239,980 39,727 267,445
validation 5,000 30,679 5,204 41,620
test 10,000 60,963 10,157 68,643
total 60,266 331,622 55,088 377,708

2.1 TJU-DHD-traffic

2.2 TJU-DHD-campus

(The training imageset is too large, thus is ziped as a 4-part archive. One should download all of them and open the .zip.001 using your favorite zip file extractor.)

3. Pedestrian detection dataset

name Ped-traffic (#images) Ped-traffic (#instances) Ped-campus (#images) Ped-campus (#instances)
training 13,858 27,650 39,727 234,455
validation 2,136 5,244 5,204 36,161
test 4,344 10,724 10,157 59,007
total 20,338 43,618 55,088 329,623

3.1 TJU-Ped-traffic

(Note that the images are same as those in the TJU-DHD-traffic)

3.2 TJU-Ped-campus

(Note that the images are same as those in the TJU-DHD-campus)

4. Benchmark

4.1 TJU-DHD-traffic

  • Results on validation

    method backbone input size AP [email protected] [email protected] AP_s AP_m AP_l
    RetinaNet ResNet50 1333x800 53.5 80.9 60.0 24.0 50.5 68.0
    FCOS ResNet50 1333x800 53.8 80.0 60.1 24.6 50.6 68.8
    FPN ResNet50 1333x800 55.4 83.4 63.0 30.4 52.2 68.2
    Cascade RCNN ResNet50 1333x800 57.9 82.7 66.6 32.6 54.4 71.4

4.2 TJU-DHD-campus

  • Results on validation

    method backbone input size AP [email protected] [email protected] AP_t AP_s AP_l AP_l
    RetinaNet ResNet50 1333x800 48.4 79.3 52.4 4.7 27.3 56.2 73.8
    FCOS ResNet50 1333x800 49.3 73.8 53.8 5.6 29.6 55.9 74.3
    FPN ResNet50 1333x800 52.4 77.5 58.4 8.5 37.4 58.6 74.9
    Cascade RCNN ResNet50 1333x800 55.1 77.6 60.9 10.8 40.1 61.2 78.8

4.3 TJU-DHD-pedestrian

  • Miss rate with same-scene evaluation

    method R/RS/HO/R+HO/A (TJU-Ped-campus) R/RS/HO/R+HO/A (TJU-Ped-traffic)
    FPN 27.92/73.14/67.52/35.67/38.08 22.30/35.19/60.30/26.71/37.78
  • Miss rate with cross-scene evaluation

    method R/R+HO (TJU-Ped-campus -> traffic) R/R+HO (TJU-Ped-traffic -> campus)
    FPN 30.62 / 33.89 42.08 / 50.55

5. Citation

If this project help your research, please consider to cite our paper.

@article{Pang_DHD_TIP_2020,
         author = {Yanwei Pang and Jiale Cao and Yazhao Li and Jin Xie and Hanqing Sun and Jinfeng Gong},
         title = {TJU-DHD: A Diverse High-Resolution Dataset for Object Detection},
         journal = {IEEE Transactions on Image Processing},
         year = 2020
        }

@article{Cao_PDR_arXiv_2020,
         author = {Jiale Cao and Yanwei Pang and Jin Xie and Fahad Shahbaz Khan and Ling Shao},
         title = {From Handcrafted to Deep Features for Pedestrian Detection: A Survey},
         journal = {arXiv:2010.00456},
         year = 2020
        }

6. Contact

If you have any questions or want to add your results, please feel free to contact us.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].