Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

jiwoon-ahn / Psa

Licence: mit

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Programming Languages

python

139335 projects - #7 most used programming language

Labels

deep-learning pytorch cvpr2018

Projects that are alternatives of or similar to Psa

Cvpr18 Inaturalist Transfer

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

Stars: ✭ 164 (-37.16%)

Mutual labels: cvpr2018

glimpse clouds

Pytorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille , G.W. Taylor, CVPR 2018

Stars: ✭ 30 (-88.51%)

Mutual labels: cvpr2018

G2LTex

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

Stars: ✭ 104 (-60.15%)

Mutual labels: cvpr2018

Sin

CVPR 2018: Structure Inference Net for Object Detection

Stars: ✭ 178 (-31.8%)

Mutual labels: cvpr2018

Image manipulation detection

Paper: CVPR2018, Learning Rich Features for Image Manipulation Detection

Stars: ✭ 210 (-19.54%)

Mutual labels: cvpr2018

IDN-pytorch

paper implement : Fast and Accurate Single Image Super-Resolution via Information Distillation Network

Stars: ✭ 40 (-84.67%)

Mutual labels: cvpr2018

Frvsr

Frame-Recurrent Video Super-Resolution (official repository)

Stars: ✭ 157 (-39.85%)

Mutual labels: cvpr2018

cvpr18-caption-eval

Learning to Evaluate Image Captioning. CVPR 2018

Stars: ✭ 79 (-69.73%)

Mutual labels: cvpr2018

Dfl Cnn

This is a pytorch re-implementation of Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition

Stars: ✭ 245 (-6.13%)

Mutual labels: cvpr2018

DisguiseNet

Code for DisguiseNet : A Contrastive Approach for Disguised Face Verification in the Wild

Stars: ✭ 20 (-92.34%)

Mutual labels: cvpr2018

Attentive Gan Derainnet

Unofficial tensorflow implemention of "Attentive Generative Adversarial Network for Raindrop Removal from A Single Image (CVPR 2018) " model https://maybeshewill-cv.github.io/attentive-gan-derainnet/

Stars: ✭ 184 (-29.5%)

Mutual labels: cvpr2018

Guided Attention Inference Network

Contains implementation of Guided Attention Inference Network (GAIN) presented in Tell Me Where to Look(CVPR 2018). This repository aims to apply GAIN on fcn8 architecture used for segmentation.

Stars: ✭ 204 (-21.84%)

Mutual labels: cvpr2018

text-detection-fots.pytorch

FOTS text detection branch reimplementation, hmean: 83.3%

Stars: ✭ 80 (-69.35%)

Mutual labels: cvpr2018

Dbnet

DBNet: A Large-Scale Dataset for Driving Behavior Learning, CVPR 2018

Stars: ✭ 172 (-34.1%)

Mutual labels: cvpr2018

DVQA dataset

DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018

Stars: ✭ 20 (-92.34%)

Mutual labels: cvpr2018

L Gm Loss

Implementation of our accepted CVPR 2018 paper "Rethinking Feature Distribution for Loss Functions in Image Classification"

Stars: ✭ 162 (-37.93%)

Mutual labels: cvpr2018

FaceAttr

CVPR2018 Face Super-resolution with supplementary Attributes

Stars: ✭ 18 (-93.1%)

Mutual labels: cvpr2018

Splatnet

SPLATNet: Sparse Lattice Networks for Point Cloud Processing (CVPR2018)

Stars: ✭ 259 (-0.77%)

Mutual labels: cvpr2018

ASNet

Salient Object Detection Driven by Fixation Prediction (CVPR2018)

Stars: ✭ 41 (-84.29%)

Mutual labels: cvpr2018

VoxelMorph-PyTorch

An unofficial PyTorch implementation of VoxelMorph- An unsupervised 3D deformable image registration method

Stars: ✭ 68 (-73.95%)

Mutual labels: cvpr2018

View All Similar Projects ➔

Learning Pixel-level Semantic Affinity with Image-level Supervision

This code is deprecated. Please see https://github.com/jiwoon-ahn/irn instead.

Introduction

The code and trained models of:

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, Jiwoon Ahn and Suha Kwak, CVPR 2018 [Paper]

We have developed a framework based on AffinityNet to generate accurate segmentation labels of training images given their image-level class labels only. A segmentation network learned with our synthesized labels outperforms previous state-of-the-arts by large margins on the PASCAL VOC 2012.

*Our code was first implemented in Tensorflow at the time of CVPR 2018 submssion, and later we migrated to PyTorch. Some trivial details (optimizer, channel size, and etc.) have been changed.

Citation

If you find the code useful, please consider citing our paper using the following BibTeX entry.

@InProceedings{Ahn_2018_CVPR,
author = {Ahn, Jiwoon and Kwak, Suha},
title = {Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}

Prerequisite

Tested on Ubuntu 16.04, with Python 3.5, PyTorch 0.4, Torchvision 0.2.1, CUDA 9.0, and 1x NVIDIA TITAN X (Pascal).
The PASCAL VOC 2012 development kit: You also need to specify the path ('voc12_root') of your downloaded dev kit.
(Optional) If you want to try with the VGG-16 based network, PyCaffe and VGG-16 ImageNet pretrained weights [vgg16_20M.caffemodel]
(Optional) If you want to try with the ResNet-38 based network, Mxnet and ResNet-38 pretrained weights [ilsvrc-cls_rna-a1_cls1000_ep-0001.params]

Usage

1. Train a classification network to get CAMs.

python3 train_cls.py --lr 0.1 --batch_size 16 --max_epoches 15 --crop_size 448 --network [network.vgg16_cls | network.resnet38_cls] --voc12_root [your_voc12_root_folder] --weights [your_weights_file] --wt_dec 5e-4

2. Generate labels for AffinityNet by applying dCRF on CAMs.

python3 infer_cls.py --infer_list voc12/train_aug.txt --voc12_root [your_voc12_root_folder] --network [network.vgg16_cls | network.resnet38_cls] --weights [your_weights_file] --out_cam [desired_folder] --out_la_crf [desired_folder] --out_ha_crf [desired_folder]

(Optional) Check the accuracy of CAMs.

python3 infer_cls.py --infer_list voc12/val.txt --voc12_root [your_voc12_root_folder] --network network.resnet38_cls --weights res38_cls.pth --out_cam_pred [desired_folder]

3. Train AffinityNet with the labels

python3 train_aff.py --lr 0.1 --batch_size 8 --max_epoches 8 --crop_size 448 --voc12_root [your_voc12_root_folder] --network [network.vgg16_aff | network.resnet38_aff] --weights [your_weights_file] --wt_dec 5e-4 --la_crf_dir [your_output_folder] --ha_crf_dir [your_output_folder]

4. Perform Random Walks on CAMs

python3 infer_aff.py --infer_list [voc12/val.txt | voc12/train.txt] --voc12_root [your_voc12_root_folder] --network [network.vgg16_aff | network.resnet38_aff] --weights [your_weights_file] --cam_dir [your_output_folder] --out_rw [desired_folder]

Results and Trained Models

Class Activation Map

Model	Train (mIoU)	Val (mIoU)
VGG-16	48.9	46.6	[Weights]
ResNet-38	47.7	47.2	[Weights]
ResNet-38	48.0	46.8	CVPR submission

Random Walk with AffinityNet

Model	alpha	Train (mIoU)	Val (mIoU)
VGG-16	4/16/32	59.6	54.0	[Weights]
ResNet-38	4/16/32	61.0	60.2	[Weights]
ResNet-38	4/16/24	58.1	57.0	CVPR submission

*beta=8, gamma=5, t=256 for all settings

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 261

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (18) 🔗