wy1iu / Dcnets

Licence: mit
Implementation for <Decoupled Networks> in CVPR'18.

Decoupled Networks

By Weiyang Liu*, Zhen Liu*, Zhiding Yu, Bo Dai, Rongmei Lin, Yisen Wang, James Rehg, Le Song

(* equal contribution)

License

Decoupled Networks is released under the MIT License (refer to the LICENSE file for details).

Updates

  • [x] Examples for ImageNet-2012
  • [x] Examples for CIFAR-100

Contents

  1. Introduction
  2. Short Video Introduction
  3. Citation
  4. Requirements
  5. Usage

Introduction

Inner product-based convolution has been a central component of convolutional neural networks (CNNs) and the key to learning visual representations. Inspired by the observation that CNN-learned features are naturally decoupled with the norm of features corresponding to the intra-class variation and the angle corresponding to the semantic difference, we propose a generic decoupled learning framework which models the intra-class variation and semantic difference independently.

Specifically, we first reparametrize the inner product to a decoupled form and then generalize it to the decoupled convolution operator which serves as the building block of our decoupled networks. We present several effective instances of the decoupled convolution operator. Each decoupled operator is well motivated and has an intuitive geometric interpretation. Based on these decoupled operators, we further propose to directly learn the operator from data.
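The reparametrization described above can be written out as follows. This is a sketch of the notation only: the symbols h (magnitude function), g (angular activation), and f_d are chosen here for illustration and may differ from the paper's exact notation.

```latex
\langle w, x \rangle
  = \|w\| \, \|x\| \cos\theta_{(w,x)}
\quad\longrightarrow\quad
f_d(w, x) = h\bigl(\|w\|, \|x\|\bigr) \cdot g\bigl(\theta_{(w,x)}\bigr)
```

Choosing h(‖w‖, ‖x‖) = ‖w‖·‖x‖ and g(θ) = cos θ recovers the ordinary inner product, so standard convolution is one special case of the decoupled form.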

The latest version of our paper is available at arXiv and here. Our work is largely motivated by the observation that CNN-learned features are naturally decoupled, as shown below.

As illustrated below, the central idea of decoupled networks is the decoupled convolution, which replaces all the original convolution operators.
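To make the operator concrete, here is a minimal NumPy sketch of a decoupled convolution on a single-channel image. It is not the code from this repository: the function names (`decoupled_response`, `decoupled_conv2d`), the `eps` guard, and the naive sliding-window loop are all assumptions made for illustration. With the magnitude set to ‖w‖·‖x‖ and the angular function set to cosine, it should reproduce an ordinary (valid, stride-1) convolution response.

```python
import numpy as np


def decoupled_response(w, x, magnitude, angular, eps=1e-8):
    """Compute magnitude(||w||, ||x||) * angular(theta) for one filter/patch pair."""
    w = np.asarray(w, dtype=np.float64).ravel()
    x = np.asarray(x, dtype=np.float64).ravel()
    w_norm = np.linalg.norm(w)
    x_norm = np.linalg.norm(x)
    # Angle between w and x; the clip guards against rounding outside [-1, 1].
    cos_theta = np.dot(w, x) / max(w_norm * x_norm, eps)
    theta = np.arccos(np.clip(cos_theta, -1.0, 1.0))
    return magnitude(w_norm, x_norm) * angular(theta)


def decoupled_conv2d(image, kernel, magnitude, angular):
    """Slide the kernel over the image (valid padding, stride 1, one channel)."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i:i + kh, j:j + kw]
            out[i, j] = decoupled_response(kernel, patch, magnitude, angular)
    return out


def linear_magnitude(w_norm, x_norm):
    """Magnitude of the ordinary inner product: ||w|| * ||x||."""
    return w_norm * x_norm


cosine = np.cos  # angular function of the ordinary inner product

rng = np.random.RandomState(0)
image = rng.randn(6, 6)
kernel = rng.randn(3, 3)
out = decoupled_conv2d(image, kernel, linear_magnitude, cosine)
```

Swapping `linear_magnitude` or `cosine` for another callable changes how the norm and the angle contribute to the response without touching the sliding-window code.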

Short Video Introduction

The following is a short video introduction by Zhen Liu.

DCNet_talk

Citation

If you find our work useful in your research, please consider citing:

@InProceedings{Liu_2018_CVPR,
    author = {Liu, Weiyang and Liu, Zhen and Yu, Zhiding and Dai, Bo and Lin, Rongmei and Wang, Yisen and Rehg, James M. and Song, Le},
    title = {Decoupled Networks},
    booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    year = {2018}
}

Requirements

  1. Python 2.7
  2. TensorFlow (tested on version 1.0.1)
  3. numpy
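Before training, it can help to confirm which versions are actually installed. The helper below is a small sketch, not part of the repository; `report_requirements` is a hypothetical name, and it only reports versions rather than enforcing them.

```python
import importlib
import sys


def report_requirements(modules=("tensorflow", "numpy")):
    """Return the Python version and the version of each module (None if missing)."""
    report = {"python": "%d.%d" % sys.version_info[:2]}
    for name in modules:
        try:
            module = importlib.import_module(name)
            report[name] = getattr(module, "__version__", "unknown")
        except ImportError:
            report[name] = None
    return report


# Example: print one line per requirement.
for name, version in sorted(report_requirements().items()):
    print("%s: %s" % (name, version if version is not None else "not installed"))
```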

Usage

Part 1: Clone the repository

  • Clone the repository. The commands below refer to the cloned directory as $DCNET_ROOT.

    git clone https://github.com/wy1iu/DCNets.git
    

Part 2: CIFAR-100

  • Training DCNets with TanhConv + Cosine on CIFAR-100:

    cd $DCNET_ROOT/dcnet_cifar100/tanh_cos
    python train_resnet.py
    
  • To train other models, replace the model name (tanh_cos) in the path above with the one you want.
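The two steps above can also be driven from Python, which is convenient when launching several models in a row. This is a sketch under assumptions: `cifar100_train_dir` and `run_cifar100_training` are hypothetical helpers, and the only model directory name known from this README is tanh_cos.

```python
import os
import subprocess


def cifar100_train_dir(dcnet_root, model_name="tanh_cos"):
    """Build the directory that holds train_resnet.py for one model."""
    return os.path.join(dcnet_root, "dcnet_cifar100", model_name)


def run_cifar100_training(dcnet_root, model_name="tanh_cos", python="python"):
    """Run train_resnet.py inside the model directory and return its exit code."""
    train_dir = cifar100_train_dir(dcnet_root, model_name)
    if not os.path.isdir(train_dir):
        raise ValueError("no such model directory: " + train_dir)
    return subprocess.call([python, "train_resnet.py"], cwd=train_dir)


# Example (assumes DCNET_ROOT is set in the environment):
#   run_cifar100_training(os.environ["DCNET_ROOT"], "tanh_cos")
```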

Part 3: ImageNet-2012

  • Download the ImageNet-2012 dataset and process it with TensorFlow-Slim.

  • We provide one example with a modified ResNet-18 for ImageNet-2012. This implementation uses the TanhConv magnitude function with the Cosine angular activation. You can replace the magnitude function and the angular function with the other choices described in the paper or with any customized functions.

    cd $DCNET_ROOT/dcnet_imagenet
    python train_DCNet.py
    
  • We provide our result for this implementation, which matches the 88.9% result reported in the paper.
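As a rough picture of what swapping the magnitude and angular functions means, the NumPy sketch below applies them to a batch of flattened patches and a bank of filters. It is not the TensorFlow code in dcnet_imagenet. It assumes the TanhConv magnitude has the form α·tanh(‖x‖/ρ); the names `alpha`, `rho`, `decoupled_dense`, and `linear_angular`, and the default values, are illustrative assumptions.

```python
import numpy as np


def tanh_magnitude(w_norm, x_norm, alpha=1.0, rho=1.0):
    """Assumed TanhConv-style magnitude: bounded by alpha, ignores the filter norm."""
    return alpha * np.tanh(x_norm / rho)


def cosine_angular(theta):
    """Cosine angular activation."""
    return np.cos(theta)


def linear_angular(theta):
    """Illustrative custom choice: falls linearly from 1 at theta=0 to -1 at theta=pi."""
    return 1.0 - 2.0 * theta / np.pi


def decoupled_dense(patches, filters, magnitude, angular, eps=1e-8):
    """patches: (n, d) flattened inputs; filters: (d, k); returns (n, k) responses."""
    x_norm = np.linalg.norm(patches, axis=1, keepdims=True)   # shape (n, 1)
    w_norm = np.linalg.norm(filters, axis=0, keepdims=True)   # shape (1, k)
    cos_theta = patches.dot(filters) / np.maximum(x_norm * w_norm, eps)
    theta = np.arccos(np.clip(cos_theta, -1.0, 1.0))
    return magnitude(w_norm, x_norm) * angular(theta)


rng = np.random.RandomState(0)
patches = rng.randn(5, 9)   # five flattened 3x3 patches
filters = rng.randn(9, 4)   # four 3x3 filters
out = decoupled_dense(patches, filters, tanh_magnitude, cosine_angular)
```

Passing `linear_angular` (or any other callable of the angle) instead of `cosine_angular` changes the angular activation while the rest of the computation stays the same.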
