aimerykong / Pixel-Attentional-Gating

Licence: other

Pixel Attentional Gating for Parsimonious Per-Pixel Labeling

Programming Languages

matlab

3953 projects

Cuda

1817 projects

C++

36643 projects - #6 most used programming language

TeX

3793 projects

python

139335 projects - #7 most used programming language

shell

77523 projects

Projects that are alternatives of or similar to Pixel-Attentional-Gating

Simple-does-it-weakly-supervised-instance-and-semantic-segmentation

Weakly Supervised Segmentation by Tensorflow. Implements semantic segmentation in Simple Does It: Weakly Supervised Instance and Semantic Segmentation, by Khoreva et al. (CVPR 2017).

Stars: ✭ 46 (+0%)

Mutual labels: semantic-segmentation

mobilenet segmentation

Binary semantic segmentation with UNet based on MobileNetV2 encoder

Stars: ✭ 18 (-60.87%)

Mutual labels: semantic-segmentation

satellite-Image-Semantic-Segmentation-Unet-Tensorflow-keras

Collection of different Unet Variant suchas VggUnet, ResUnet, DenseUnet, Unet. AttUnet, MobileNetUnet, NestedUNet, R2AttUNet, R2UNet, SEUnet, scSEUnet, Unet_Xception_ResNetBlock

Stars: ✭ 43 (-6.52%)

Mutual labels: semantic-segmentation

recurrent-decoding-cell

[AAAI'20] Segmenting Medical MRI via Recurrent Decoding Cell (Spotlight)

Stars: ✭ 14 (-69.57%)

Mutual labels: semantic-segmentation

FCNN-example

This is a fully convolutional neural net exercise to detect houses from aerial images.

Stars: ✭ 28 (-39.13%)

Mutual labels: semantic-segmentation

improving segmentation with selfsupervised depth

[CVPR21] Implementation of our work "Three Ways to Improve Semantic Segmentation with Self-Supervised Depth Estimation"

Stars: ✭ 189 (+310.87%)

Mutual labels: semantic-segmentation

night image semantic segmentation

[ICIP 2019] : This is the official github repository for the paper "What's There in The Dark" accepted in IEEE International Conference in Image Processing 2019 (ICIP19) , Taipei, Taiwan.

Stars: ✭ 25 (-45.65%)

Mutual labels: semantic-segmentation

unet pytorch

Pytorch implementation of UNet for converting aerial satellite images into google maps kinda images.

Stars: ✭ 27 (-41.3%)

Mutual labels: semantic-segmentation

cool-papers-in-pytorch

Reimplementing cool papers in PyTorch...

Stars: ✭ 21 (-54.35%)

Mutual labels: semantic-segmentation

Segmentation-Series-Chaos

Summary and experiment includes basic segmentation, human segmentation, human or portrait matting for both image and video.

Stars: ✭ 75 (+63.04%)

Mutual labels: semantic-segmentation

LightNet

LightNet: Light-weight Networks for Semantic Image Segmentation (Cityscapes and Mapillary Vistas Dataset)

Stars: ✭ 710 (+1443.48%)

Mutual labels: semantic-segmentation

Lyft-Perception-Challenge

The 4th place and the fastest solution of the Lyft Perception Challenge (Image semantic segmentation with PyTorch)

Stars: ✭ 69 (+50%)

Mutual labels: semantic-segmentation

hsn v1

HistoSegNet: Semantic Segmentation of Histological Tissue Type in Whole Slide Images (ICCV 2019)

Stars: ✭ 65 (+41.3%)

Mutual labels: semantic-segmentation

Entity

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

Stars: ✭ 313 (+580.43%)

Mutual labels: semantic-segmentation

face normals cvpr17

Code for "Face Normals `in-the-wild' using Fully Convolutional Networks" (CVPR 17)

Stars: ✭ 45 (-2.17%)

Mutual labels: surface-normals

InstantDL

InstantDL: An easy and convenient deep learning pipeline for image segmentation and classification

Stars: ✭ 33 (-28.26%)

Mutual labels: semantic-segmentation

FCN-Segmentation-TensorFlow

FCN for Semantic Image Segmentation achieving 68.5 mIoU on PASCAL VOC

Stars: ✭ 34 (-26.09%)

Mutual labels: semantic-segmentation

Robust-Semantic-Segmentation

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation （ICCV2021）

Stars: ✭ 25 (-45.65%)

Mutual labels: semantic-segmentation

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Stars: ✭ 8,046 (+17391.3%)

Mutual labels: semantic-segmentation

segmentation-paper-reading-notes

segmentation paper reading notes

Stars: ✭ 39 (-15.22%)

Mutual labels: semantic-segmentation

View All Similar Projects ➔

Pixel-wise Attentional Gating for Scene Parsing

For paper and slides, please refer to our project page

Our entry to Robust Vision Challenge can be found here depth estimation and semantic segmentation.

To achieve parsimonious inference in per-pixel labeling tasks with a limited computational budget, we propose a Pixel-wise Attentional Gating unit (PAG) that learns to selectively process a subset of spatial locations at each layer of a deep convolutional network. PAG is a generic, architecture-independent, problem-agnostic mechanism that can be readily ``plugged in'' to an existing model with fine-tuning. We utilize PAG in two ways: 1) learning spatially varying pooling fields that improve model performance without the extra computation cost associated with multi-scale pooling, and 2) learning a dynamic computation policy for each pixel to decrease total computation while maintaining accuracy.

We extensively evaluate PAG on a variety of per-pixel labeling tasks, including semantic segmentation, boundary detection, monocular depth and surface normal estimation. We demonstrate that PAG allows competitive or state-of-the-art performance on these tasks. Our experiments show that PAG learns dynamic spatial allocation of computation over the input image which provides better performance trade-offs compared to related approaches (e.g., truncating deep models or dynamically skipping whole layers). Generally, we observe PAG can reduce computation by 10% without noticeable loss in accuracy and performance degrades gracefully when imposing stronger computational constraints.

Keywords Spatial Attention, Dynamic Computation, Per-Pixel Labeling, Semantic Segmentation, Monocular Depth, Surface Normal, Boundary Detection.

Several demos are included as below. As for details on the training, demo and code, please go into each demo folder.

demo1: Panoramic Surface Normal Estimation [Ready]
demo2: Boundary Detection [[!!!TOOD!!!]]
demo3: Semantic Segmentation [[!!!TOOD!!!]]
demo4: Monocular Depth Estimation [[!!!TOOD!!!]]

Please download those models from the google drive.

MatConvNet is used in our project, and some functions are changed/added. Please compile accordingly by adjusting the path --

LD_LIBRARY_PATH=/usr/local/cuda/lib64:local matlab 

path_to_matconvnet = './matconvnet-1.0-beta23_modifiedDagnn/';
run(fullfile(path_to_matconvnet, 'matlab', 'vl_setupnn'));
addpath(fullfile(path_to_matconvnet, 'matlab'));
vl_compilenn('enableGpu', true, ...
               'cudaRoot', '/usr/local/cuda', ...
               'cudaMethod', 'nvcc', ...
               'enableCudnn', true, ...
               'cudnnRoot', '/usr/local/cuda/cudnn/lib64') ;

See also Recurrent Scene Parsing with Perspective Understanding in-the Loop which adapts depth map for pooling field selection.

If you find our model/method/dataset useful, please cite our work (draft at arxiv):

@inproceedings{kong2019pag,
  title={Pixel-wise Attentional Gating for Scene parsing},
  author={Kong, Shu and Fowlkes, Charless},
  booktitle={IEEE Winter Conf. on Applications of Computer Vision (WACV)},
  year={2019}
}

last update: 11/06/2018

Shu Kong

aimerykong At g-m-a-i-l dot com

[email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

aimerykong / Pixel-Attentional-Gating

Programming Languages

Labels

Projects that are alternatives of or similar to Pixel-Attentional-Gating

Pixel-wise Attentional Gating for Scene Parsing