All Projects → tsaishien-chen → SPAN

tsaishien-chen / SPAN

Licence: other
Semantics-guided Part Attention Network (ECCV 2020 Oral)

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to SPAN

tfvaegan
[ECCV 2020] Official Pytorch implementation for "Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification". SOTA results for ZSL and GZSL
Stars: ✭ 107 (+463.16%)
Mutual labels:  pytorch-implementation, eccv2020
LaBERT
A length-controllable and non-autoregressive image captioning model.
Stars: ✭ 50 (+163.16%)
Mutual labels:  eccv2020
DCAN
[AAAI 2020] Code release for "Domain Conditioned Adaptation Network" https://arxiv.org/abs/2005.06717
Stars: ✭ 27 (+42.11%)
Mutual labels:  pytorch-implementation
Magic-VNet
VNet for 3d volume segmentation
Stars: ✭ 45 (+136.84%)
Mutual labels:  pytorch-implementation
loc2vec
Pytorch implementation of the Loc2Vec with some modifications for speed
Stars: ✭ 40 (+110.53%)
Mutual labels:  pytorch-implementation
DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Stars: ✭ 202 (+963.16%)
Mutual labels:  pytorch-implementation
WS3D
Official version of 'Weakly Supervised 3D object detection from Lidar Point Cloud'(ECCV2020)
Stars: ✭ 104 (+447.37%)
Mutual labels:  eccv2020
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+468.42%)
Mutual labels:  pytorch-implementation
visdial
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
Stars: ✭ 27 (+42.11%)
Mutual labels:  eccv2020
depth-map-prediction
Pytorch Implementation of Depth Map Prediction from a Single Image using a Multi-Scale Deep Network
Stars: ✭ 78 (+310.53%)
Mutual labels:  pytorch-implementation
nvae
An unofficial toy implementation for NVAE 《A Deep Hierarchical Variational Autoencoder》
Stars: ✭ 83 (+336.84%)
Mutual labels:  pytorch-implementation
Deep-MVLM
A tool for precisely placing 3D landmarks on 3D facial scans based on the paper "Multi-view Consensus CNN for 3D Facial Landmark Placement"
Stars: ✭ 71 (+273.68%)
Mutual labels:  pytorch-implementation
People-Flows
The code for our ECCV 2020 paper: Estimating People Flows to Better Count Them in Crowded Scenes
Stars: ✭ 44 (+131.58%)
Mutual labels:  eccv2020
ResNet-50-CBAM-PyTorch
Implementation of Resnet-50 with and without CBAM in PyTorch v1.8. Implementation tested on Intel Image Classification dataset from https://www.kaggle.com/puneet6060/intel-image-classification.
Stars: ✭ 31 (+63.16%)
Mutual labels:  pytorch-implementation
ViT-V-Net for 3D Image Registration Pytorch
Vision Transformer for 3D medical image registration (Pytorch).
Stars: ✭ 169 (+789.47%)
Mutual labels:  pytorch-implementation
efficientnetv2.pytorch
PyTorch implementation of EfficientNetV2 family
Stars: ✭ 366 (+1826.32%)
Mutual labels:  pytorch-implementation
PyTorch
An open source deep learning platform that provides a seamless path from research prototyping to production deployment
Stars: ✭ 17 (-10.53%)
Mutual labels:  pytorch-implementation
kg one2set
Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"
Stars: ✭ 58 (+205.26%)
Mutual labels:  pytorch-implementation
softpool
SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification - ECCV 2020 oral
Stars: ✭ 62 (+226.32%)
Mutual labels:  eccv2020
TailCalibX
Pytorch implementation of Feature Generation for Long-Tail Classification by Rahul Vigneswaran, Marc T Law, Vineeth N Balasubramaniam and Makarand Tapaswi
Stars: ✭ 32 (+68.42%)
Mutual labels:  pytorch-implementation

Python 3.6 PyTorch 1.6

Semantics-guided Part Attention Network

This is the pytorch implementatin of Semantics-guided Part Attention Network (SPAN)

Paper

Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network
Tsai-Shien Chen, Chih-Ting Liu, Chih-Wei Wu, and Shao-Yi Chien
European Conference on Computer Vision (ECCV), Oral, 2020
[Paper Website] [arXiv]

Citation

If you use SPAN, please cite this paper:

@inproceedings{SPAN,
    title        = {Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network},
    author       = {Chen, Tsai-Shien and Liu, Chih-Ting and Wu, Chih-Wei and Chien, Shao-Yi},
    booktitle    = {European Conference on Computer Vision},
    pages        = {330--346},
    year         = {2020},
    organization = {Springer}
}

Visualization Example

We visiualize some examples of vehicle images and their

  • foreground masks generated by grabcut,
  • foreground masks generated by deep learning network,
  • front, rear and side attention masks.

Get Started

Prerequisites

  • Download VeRi-776 dataset [link]
    Please note that our code is only supported on VeRi-776 dataset currently.
  • Download this repo by:
$ git clone https://github.com/tsaishien-chen/SPAN.git
$ cd SPAN
  • We run the code under Python 3.6.9
  • The versions of used python packages are listed in requirements.txt. You can install all the dependancies by:
$ sed -i 's/==/>=/g' requirements.txt
$ pip3 install -r requirements.txt

Train

To train the model from scratch, please run

$ python3 main.py --mode train --image_root <Path_to_VeRi>

For example,

$ python3 main.py --mode train --image_root ../Dataset/VeRi

The whole training process includes five steps:

  1. generating the foreground masks by grabcut,
  2. training network to generate more robust foreground mask,
  3. generating the foreground masks by deep learning network,
  4. training network to generate part (front, rear and side) attention mask, and
  5. generating part attention masks.

Implement

We have given the pretrained model of part attention generator;
therefore, you can simply generate the part attention mask without training by

$ python3 main.py --mode implement --image_root <Path_to_VeRi>

For example,

$ python3 main.py --mode implement --image_root ../Dataset/VeRi

Visualize

After training and implementation process, the code will automatically visualize generated masks as above.
Or, you can uncomment the visualize function in main.py and can independently visualize the masks after being generated in each step.

Contact

Tsai-Shien Chen, Media IC and System Lab, National Taiwan University
E-mail : [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].