wanglimin / Mrcnn Scene Recognition

Multi-Resolution CNNs for Large-Scale Scene Recognition

This repository provides the code and models for the following paper (arXiv preprint):

Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs
Limin Wang, Sheng Guo, Weilin Huang, Yuanjun Xiong, and Yu Qiao 
in IEEE Transactions on Image Processing, 2017

Updates

  • February 21st, 2017
    • Released the code and models
  • January 3rd, 2017
    • Initialized the repository

Overview

We have made two efforts to exploit CNNs for large-scale scene recognition:

  • We design a modular framework that captures multi-level visual information for scene understanding by training CNNs at different input resolutions.
  • We propose a knowledge disambiguation strategy that uses soft labels from extra networks to deal with the label ambiguity of scene recognition.

These two efforts form the core of the team "SIAT_MMLAB" entries in the following large-scale scene recognition challenges.

| Challenge | Rank | Performance |
| --- | --- | --- |
| Places2 challenge 2015 | 2nd place | 0.1736 top-5 error |
| Places2 challenge 2016 | 4th place | 0.1042 top-5 error |
| LSUN challenge 2015 | 2nd place | 0.9030 top-1 accuracy |
| LSUN challenge 2016 | 1st place | 0.9161 top-1 accuracy |
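
For readers unfamiliar with the metrics reported here, the following is a minimal sketch of how top-k error and top-1 accuracy are commonly computed from per-class scores. The function name `top_k_error` and the toy scores are illustrative assumptions and are not taken from the repository's evaluation code.

```python
def top_k_error(scores, labels, k=5):
    """Fraction of samples whose true label is not among the k highest-scoring classes."""
    if not labels or len(scores) != len(labels):
        raise ValueError("scores and labels must be non-empty and of equal length")
    misses = 0
    for row, label in zip(scores, labels):
        # Indices of the k largest scores for this sample.
        top_k = sorted(range(len(row)), key=lambda c: row[c], reverse=True)[:k]
        if label not in top_k:
            misses += 1
    return misses / len(labels)


# Toy example with hypothetical scores: 3 samples, 4 classes.
scores = [
    [0.1, 0.6, 0.2, 0.1],
    [0.5, 0.1, 0.3, 0.1],
    [0.3, 0.1, 0.1, 0.5],
]
labels = [1, 2, 0]

top1_accuracy = 1.0 - top_k_error(scores, labels, k=1)
top2_error = top_k_error(scores, labels, k=2)
```

Top-1 accuracy is simply one minus the top-1 error, so the Places2 and LSUN rows above use two views of the same kind of measurement with different k.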

Places365 Models

We first release the learned models on the Places365 dataset.

  • Models learned at a resolution of 256 × 256

| Model | Top-5 Error Rate |
| --- | --- |
| (A0) Normal BN-Inception | 0.143 |
| (A1) Normal BN-Inception + object networks | 0.141 |
| (A2) Normal BN-Inception + scene networks | 0.134 |

  • Models learned at a resolution of 384 × 384

| Model | Top-5 Error Rate |
| --- | --- |
| (B0) Deeper BN-Inception | 0.140 |
| (B1) Deeper BN-Inception + object networks | 0.136 |
| (B2) Deeper BN-Inception + scene networks | 0.130 |

  • Download initialization and reference models

The download scripts are in the scripts/ directory.

Run bash scripts/get_init_models.sh to download the knowledge models.

Run bash scripts/get_reference_models.sh to download the reference models.

Testing Code

The testing code for the Places365 validation set is in the matlab/ directory.

The same directory also contains demo code that uses our Places365 models as generic feature extractors to perform scene recognition on the MIT Indoor67 dataset.
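
Since models are released at two resolutions (256 × 256 and 384 × 384), one plausible way to use them together at test time is to average their per-class scores. The sketch below illustrates that idea only. Score averaging, the equal weights, and the name `fuse_scores` are assumptions made for illustration; this README does not state how the released MATLAB code combines the models.

```python
def fuse_scores(score_lists, weights=None):
    """Weighted average of per-class scores from several models for one image."""
    n_models = len(score_lists)
    if weights is None:
        # Assumption: equal weight for every model.
        weights = [1.0 / n_models] * n_models
    n_classes = len(score_lists[0])
    fused = [0.0] * n_classes
    for scores, weight in zip(score_lists, weights):
        if len(scores) != n_classes:
            raise ValueError("all models must score the same set of classes")
        for c, s in enumerate(scores):
            fused[c] += weight * s
    return fused


# Hypothetical softmax outputs of a 256-resolution and a 384-resolution model.
scores_256 = [0.50, 0.30, 0.20]
scores_384 = [0.20, 0.60, 0.20]

fused = fuse_scores([scores_256, scores_384])
predicted = max(range(len(fused)), key=lambda c: fused[c])
```

In this toy case the two models disagree on the top class, and the fused scores favor the class that the second model scores with higher confidence.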

Training Code

The models are in the models/ directory and the training scripts are in the scripts/ directory.

Try bash scripts/256_inception2_train.sh to train standard CNNs.

Try bash scripts/256_kd_object_inception2_train.sh to train knowledge disambiguation networks (by object network).

Try bash scripts/256_kd_scene_inception2_train.sh to train knowledge disambiguation networks (by scene network).

The training code is based on our modified Caffe toolbox, an efficient parallel version of Caffe with an MPI implementation. We also implement a new KL-divergence loss layer for our knowledge disambiguation method:

https://github.com/yjxiong/caffe/tree/kd
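
To make the role of a KL-divergence loss concrete, here is a minimal sketch of a training objective that combines the usual cross-entropy on the ground-truth label with a KL term pulling the prediction toward a soft label from an extra (object or scene) network. The weight `lam`, the direction KL(soft || prediction), and all function names are assumptions for illustration; this is not the Caffe layer from the branch linked above.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(q, p, eps=1e-12):
    """KL(q || p) for two discrete distributions over the same classes."""
    return sum(
        qi * math.log((qi + eps) / (pi + eps))
        for qi, pi in zip(q, p)
        if qi > 0
    )


def kd_loss(logits, hard_label, soft_label, lam=1.0):
    """Cross-entropy on the hard label plus lam * KL(soft_label || prediction)."""
    p = softmax(logits)
    cross_entropy = -math.log(p[hard_label] + 1e-12)
    return cross_entropy + lam * kl_divergence(soft_label, p)


# Hypothetical values: 3 classes, ground truth is class 0,
# and the extra network spreads some mass over the other classes.
logits = [2.0, 1.0, 0.1]
soft_label = [0.7, 0.2, 0.1]

loss_plain = kd_loss(logits, 0, soft_label, lam=0.0)
loss_kd = kd_loss(logits, 0, soft_label, lam=1.0)
```

With `lam=0.0` the objective reduces to standard cross-entropy, which corresponds to training a network without the extra soft-label supervision.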

Questions

Contact
