amirassov / Kaggle Global Wheat Detection

Licence: apache-2.0
9th Place Solution of Kaggle Global Wheat Detection

🌾 9th Place Solution of Global Wheat Detection

Solution

Summary

Our solution is based on the excellent MMDetection framework. We trained an ensemble of two models: DetectoRS with a ResNet50 backbone and UniverseNet+GFL with a Res2Net101 backbone.

To increase the score, a single round of pseudo labelling was applied to each model. Additionally, we used heavy augmentations to improve the generalization of our models.

Jigsaw puzzles

In the original corpus provided by the organizers, the training images were cropped from a set of larger source images. We therefore collected and reassembled the original jigsaw puzzles, resulting in a corpus of 1330 puzzle images. The puzzle assembly algorithm we adopted was based on this code. However, we were unable to recover the bounding boxes for the assembled puzzles, mainly because of boxes that lie on or near the image borders. For this reason, in addition to the training images, we generated crops from the puzzles offline and labelled them using pseudo labelling.

Validation approach

We used MultilabelStratifiedKFold with 5 folds of iterative stratification, stratified by the number of boxes, the median box area, and the source of the images. We made sure there was no leak between folds: all crops from one puzzle were used in only that particular fold.
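
As an illustration, a minimal sketch of such a split using the iterative-stratification package (column names, binning, and the grouping level are assumptions for illustration, not the exact code from this repository):

# Sketch: 5-fold split stratified by box count, median box area and source
import pandas as pd
from iterstrat.ml_stratifiers import MultilabelStratifiedKFold

df = pd.read_csv("/data/train.csv")  # one row per bounding box
df[["x", "y", "w", "h"]] = (
    df["bbox"].str.strip("[]").str.split(", ", expand=True).astype(float)
)
df["area"] = df["w"] * df["h"]

# Aggregate per image (in our setup, grouping should be done per jigsaw puzzle
# to avoid leakage between folds)
images = df.groupby("image_id").agg(
    n_boxes=("bbox", "size"),
    median_area=("area", "median"),
    source=("source", "first"),
)

# Turn the stratification targets into a multilabel indicator matrix
targets = pd.concat(
    [
        pd.get_dummies(pd.qcut(images["n_boxes"], q=5, duplicates="drop")),
        pd.get_dummies(pd.qcut(images["median_area"], q=5, duplicates="drop")),
        pd.get_dummies(images["source"]),
    ],
    axis=1,
).values

images["fold"] = -1
mskf = MultilabelStratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for fold, (_, val_idx) in enumerate(mskf.split(images, targets)):
    images.iloc[val_idx, images.columns.get_loc("fold")] = fold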

Referring to the dataset paper, one can see that the wheat heads come from several different sources. We assumed that the wheat heads from the usask_1 and ethz_1 sources are very different from the test sources (UTokyo_1, UTokyo_2, UQ_1, NAU_1), so we did not use these sources for validation.

However, our validation scores did not correlate well with the Kaggle LB. Only global improvements were reflected (for example, DetectoRS being better than UniverseNet); local improvements such as augmentation parameters, WBF parameters, etc. did not correlate. We therefore relied mainly on the LB scores.

We trained our models only on the first fold.

Augmentations

Due to the relatively small size of the training set and the different distribution of the test set, our approach relied heavily on data augmentation. During training, we used an extensive augmentation protocol:

  • Various augmentations from albumentations:
    • HorizontalFlip, ShiftScaleRotate, RandomRotate90
    • RandomBrightnessContrast, HueSaturationValue, RGBShift
    • RandomGamma
    • CLAHE
    • Blur, MotionBlur
    • GaussNoise
    • ImageCompression
    • CoarseDropout
  • RandomBBoxesSafeCrop. Randomly select N boxes in the image and take their union; then crop the image so that this union is kept intact (see the sketch after this list).
  • Image colorization
  • Style transfer. A random image from a small subset of the test set (10 images) was used as the style image.
  • Mosaic augmentation. a, b, c, d are randomly selected images of the same size; then we simply do the following:
import numpy as np  # a, b, c, d: HxWxC image arrays
top = np.concatenate([a, b], axis=1)            # stitch a|b side by side
bottom = np.concatenate([c, d], axis=1)         # stitch c|d side by side
result = np.concatenate([top, bottom], axis=0)  # stack the two rows vertically
  • Mixup augmentation. a, b are randomly selected images; then: result = (a + b) / 2
  • Multi-scale training. In each iteration, the image scale is randomly sampled from [(768 + 32 * i, 768 + 32 * i) for i in range(25)].
  • All augmentations except colorization and style transfer were applied online.
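
For reference, a rough sketch of the RandomBBoxesSafeCrop idea (a simplified illustration with an assumed box format, not the exact transform from this repository):

import random
import numpy as np

def random_bboxes_safe_crop(image, bboxes, n=3):
    # bboxes: array of [x_min, y_min, x_max, y_max] in pixel coordinates
    chosen = np.array(random.sample(list(bboxes), k=min(n, len(bboxes))))
    # union of the selected boxes
    x1, y1 = chosen[:, 0].min(), chosen[:, 1].min()
    x2, y2 = chosen[:, 2].max(), chosen[:, 3].max()
    crop = image[int(y1):int(y2), int(x1):int(x2)]
    # the remaining boxes are then shifted by (x1, y1) and clipped to the crop
    return crop, (x1, y1, x2, y2)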

External data:

SPIKE dataset

Models

We used DetectoRS with a ResNet50 backbone and UniverseNet+GFL with a Res2Net101 backbone as our main models. DetectoRS was slightly more accurate, but much slower to train, than UniverseNet:

  • Single DetectoRS Public LB score without pseudo labeling: 0.7592
  • Single UniverseNet Public LB score without pseudo labeling: 0.7567

For DetectoRS we used:

Training pipeline

In general, we used a multi-stage training pipeline:

Model inference

We used TTA6 (Test Time Augmentation) for all our models:

  • Multi-scale Testing with scales [(1408, 1408), (1536, 1536)]
  • Flips: [original, horizontal, vertical]

For TTA, we used the standard MMDetection algorithm with NMS. For two-stage detectors (DetectoRS), each augmented image is passed through the backbone, RPN, RoIAlign and heads, and the resulting detections are merged with NMS.

For one-stage detectors (UniverseNet), the algorithm is similar, only without the RoIAlign, heads, etc.
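
For illustration, a TTA6 test pipeline in MMDetection 2.x config style could look roughly like this (a sketch, not the exact config used; img_norm_cfg is assumed to be defined elsewhere in the config):

test_pipeline = [
    dict(type="LoadImageFromFile"),
    dict(
        type="MultiScaleFlipAug",
        img_scale=[(1408, 1408), (1536, 1536)],     # multi-scale testing
        flip=True,
        flip_direction=["horizontal", "vertical"],  # original + 2 flips => TTA6
        transforms=[
            dict(type="Resize", keep_ratio=True),
            dict(type="RandomFlip"),
            dict(type="Normalize", **img_norm_cfg),
            dict(type="Pad", size_divisor=32),
            dict(type="ImageToTensor", keys=["img"]),
            dict(type="Collect", keys=["img"]),
        ],
    ),
]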

Pseudo labelling

  • Sampling positive examples. We ran inference on a test image and obtained its scores and bounding boxes. Then we calculated confidence = np.mean(scores > 0.75). If the confidence was greater than 0.6, we accepted the image and used it for pseudo labelling (see the sketch after this list).
  • Sources [usask_1, ethz_1] and augmentations like mosaic, mixup, colorization, style transfer weren’t used for pseudo labelling.
  • 1 epoch, 1 round, 1 stage.
  • Data: original data + pseudo test data ✖️ 3
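
A minimal sketch of the positive-example filter described above (thresholds taken from the description; the function name is hypothetical):

import numpy as np

def accept_for_pseudo_labelling(scores, score_thr=0.75, confidence_thr=0.6):
    # scores: detection confidences predicted for one test image
    scores = np.asarray(scores)
    if len(scores) == 0:
        return False
    confidence = np.mean(scores > score_thr)  # fraction of confident boxes
    return confidence > confidence_thr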

Ensemble

We used WBF (Weighted Boxes Fusion) for the ensemble. The distributions of the DetectoRS and UniverseNet scores are different, so we rescaled them using rankdata:

scaled_scores = 0.5 * (rankdata(scores) / len(scores)) + 0.5.

WBF parameters:

  • weights=[0.65, 0.35] respectively for models [DetectoRS, UniverseNet]
  • iou_thr=0.55
  • score_thr=0.45
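
Putting the rank-based scaling and the parameters above together, a sketch using the ensemble-boxes package might look like this (variable names and the 1024x1024 image size are assumptions, not the authors' exact code):

import numpy as np
from scipy.stats import rankdata
from ensemble_boxes import weighted_boxes_fusion

def rank_scale(scores):
    # map raw scores into (0.5, 1.0] by rank so both models become comparable
    return 0.5 * (rankdata(scores) / len(scores)) + 0.5

def ensemble_predictions(detectors_boxes, detectors_scores,
                         universenet_boxes, universenet_scores, image_size=1024):
    # boxes are [x1, y1, x2, y2] in pixels; WBF expects them normalized to [0, 1]
    boxes_list = [np.asarray(detectors_boxes) / image_size,
                  np.asarray(universenet_boxes) / image_size]
    scores_list = [rank_scale(detectors_scores), rank_scale(universenet_scores)]
    labels_list = [np.zeros(len(b)) for b in boxes_list]  # single class: wheat head
    boxes, scores, labels = weighted_boxes_fusion(
        boxes_list, scores_list, labels_list,
        weights=[0.65, 0.35],  # [DetectoRS, UniverseNet]
        iou_thr=0.55,
        skip_box_thr=0.45,     # score_thr above
    )
    return boxes * image_size, scores, labels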

Some observations from our submissions:

  • Final submission: 0.6741 on Private LB and 0.7725 on Public LB
  • Pseudo crops from jigsaw puzzles (DetectoRS R50): 0.7513 -> 0.7582
  • Tuning of pseudo labeling parameters for sampling positive examples (ensemble): 0.7709 -> 0.7729
  • Pseudo labeling (DetectoRS R50): 0.7582 -> 0.7691
  • Pseudo labeling (UniverseNet Res2Net50): 0.7494 -> 0.7627
  • SPIKE dataset (DetectoRS R50): 0.7582 -> 0.7592
  • Deleting [usask_1, ethz_1] from pseudo labeling (DetectoRS R50): 0.7678 -> 0.7691

How to run

Data structure

/data/
├── train/
│   ├── d47799d91.jpg
│   ├── b57bb71b6.jpg
│   └── ...
└── train.csv
/dumps/
├── decoder.pth # checkpoint for style transfer (https://yadi.sk/d/begkgtQHxLo6kA)
├── vgg_normalised.pth # checkpoint for style transfer (https://yadi.sk/d/4BkKpSZ-4PUHqQ)
├── pix2pix_gen.pth # checkpoint for image colorization (https://yadi.sk/d/E5vAckDoFbuWYA)
└── PL_detectors_r50.pth # checkpoint of DetectoRS, which was trained without pseudo crops from jigsaw puzzles (https://yadi.sk/d/vpy2oXHFKGuMOg).

Collecting jigsaw puzzles

python gwd/jigsaw/calculate_distance.py
python gwd/jigsaw/collect_images.py
python gwd/jigsaw/collect_bboxes.py

Preparing folds

python gwd/split_folds.py
bash scripts/kaggle2coco.sh

Pseudo crops from jigsaw puzzles

python gwd/jigsaw/crop.py
bash scripts/test_crops.sh
python gwd/prepare_pseudo.py

Preparing external data

python gwd/converters/spike2kaggle.py

Colorization and style transfer

bash scripts/colorization.sh
bash scripts/stylize.sh

Training

bash scripts/train_detectors.sh
bash scripts/train_universenet.sh

Preparing the submission

It is based on lopuhin/kaggle-script-template.

You must upload the best checkpoints of the trained models to a kaggle dataset. Then change the variables in script_template.py:

  • MODELS_ROOT (name of your kaggle dataset)
  • CHECKPOINT_DETECTORS (the best checkpoint of DetectoRS)
  • CHECKPOINT_UNIVERSE (the best checkpoint of UniverseNet)
bash build.sh  # create a submission file ./build/script.py

Link to the kaggle dataset with:

References

  1. https://github.com/open-mmlab/mmdetection
  2. https://github.com/shinya7y/UniverseNet
  3. https://github.com/lopuhin/kaggle-script-template
  4. https://github.com/trent-b/iterative-stratification
  5. https://github.com/albumentations-team/albumentations
  6. https://github.com/dereyly/mmdet_sota
  7. https://github.com/eriklindernoren/PyTorch-GAN
  8. https://github.com/bethgelab/stylize-datasets