samyak0210 / saliency

License: MIT License
PyTorch implementation of the paper "Tidying Deep Saliency Prediction Architectures"

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to saliency

nlp classification
Implementing nlp papers relevant to classification with PyTorch, gluonnlp
Stars: ✭ 224 (+729.63%)
Mutual labels:  pytorch-implementation
torch-metrics
Metrics for model evaluation in pytorch
Stars: ✭ 99 (+266.67%)
Mutual labels:  pytorch-implementation
Restoring-Extremely-Dark-Images-In-Real-Time
The project is the official implementation of our CVPR 2021 paper, "Restoring Extremely Dark Images in Real Time"
Stars: ✭ 79 (+192.59%)
Mutual labels:  pytorch-implementation
Walk-Transformer
From Random Walks to Transformer for Learning Node Embeddings (ECML-PKDD 2020) (In Pytorch and Tensorflow)
Stars: ✭ 26 (-3.7%)
Mutual labels:  pytorch-implementation
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-29.63%)
Mutual labels:  pytorch-implementation
NoisyStudent
"Self-training with Noisy Student improves ImageNet classification" pytorch implementation
Stars: ✭ 31 (+14.81%)
Mutual labels:  pytorch-implementation
ConvLSTM-PyTorch
ConvLSTM/ConvGRU (Encoder-Decoder) with PyTorch on Moving-MNIST
Stars: ✭ 202 (+648.15%)
Mutual labels:  pytorch-implementation
neural-question-generation
Pytorch implementation of Paragraph-level Neural Question Generation with Maxout Pointer and Gated Self-attention Networks
Stars: ✭ 126 (+366.67%)
Mutual labels:  pytorch-implementation
attention-sampling-pytorch
This is a PyTorch implementation of the paper: "Processing Megapixel Images with Deep Attention-Sampling Models".
Stars: ✭ 25 (-7.41%)
Mutual labels:  pytorch-implementation
abae-pytorch
PyTorch implementation of 'An Unsupervised Neural Attention Model for Aspect Extraction' by He et al., ACL 2017
Stars: ✭ 52 (+92.59%)
Mutual labels:  pytorch-implementation
Relation-Network-PyTorch
Implementation of Relation Network and Recurrent Relational Network using PyTorch v1.3. Original papers: (RN) https://arxiv.org/abs/1706.01427 (RRN): https://arxiv.org/abs/1711.08028
Stars: ✭ 17 (-37.04%)
Mutual labels:  pytorch-implementation
differentiable-morphogenesis
experimenting with differentiable models of morphogenesis 🔬 🦠
Stars: ✭ 38 (+40.74%)
Mutual labels:  pytorch-implementation
FedLab-benchmarks
Standard federated learning implementations in FedLab and FL benchmarks.
Stars: ✭ 49 (+81.48%)
Mutual labels:  pytorch-implementation
SelfOrganizingMap-SOM
Pytorch implementation of Self-Organizing Map(SOM). Use MNIST dataset as a demo.
Stars: ✭ 33 (+22.22%)
Mutual labels:  pytorch-implementation
subjectiveqe-esrgan
PyTorch implementation of ESRGAN (ECCVW 2018) for compressed image subjective quality enhancement.
Stars: ✭ 12 (-55.56%)
Mutual labels:  pytorch-implementation
ElasticFace
Official repository for ElasticFace: Elastic Margin Loss for Deep Face Recognition
Stars: ✭ 86 (+218.52%)
Mutual labels:  pytorch-implementation
simclr-pytorch
PyTorch implementation of SimCLR: supports multi-GPU training and closely reproduces results
Stars: ✭ 89 (+229.63%)
Mutual labels:  pytorch-implementation
Multi-Agent-Diverse-Generative-Adversarial-Networks
Easy-to-follow Pytorch tutorial Notebook for Multi-Agent-Diverse-Generative-Adversarial-Networks
Stars: ✭ 23 (-14.81%)
Mutual labels:  pytorch-implementation
pytorch-serving
[UNMAINTAINED] A starter pack for creating a lightweight responsive web app for Fast.AI PyTorch models.
Stars: ✭ 16 (-40.74%)
Mutual labels:  pytorch-implementation
lowshot-shapebias
Learning low-shot object classification with explicit shape bias learned from point clouds
Stars: ✭ 37 (+37.04%)
Mutual labels:  pytorch-implementation

Tidying Deep Saliency Prediction Architectures

This repository contains the PyTorch implementation of SimpleNet and MDNSal, appearing in the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020).

Cite

Please cite with the following Bibtex code:

@inproceedings{Navya-IROS-2020,
               AUTHOR = {Navyasri Reddy and Samyak Jain and Pradeep Yarlagadda and Vineet Gandhi},
               TITLE = {Tidying Deep Saliency Prediction Architectures},
               BOOKTITLE = {IROS},
               YEAR = {2020}
}

Abstract

Learning computational models for visual attention (saliency estimation) is an effort to inch machines/robots closer to human visual cognitive abilities. Data-driven efforts have dominated the landscape since the introduction of deep neural network architectures. In deep learning research, the choices in architecture design are often empirical and frequently lead to more complex models than necessary. The complexity, in turn, hinders application requirements. In this paper, we identify four key components of saliency models, i.e., input features, multi-level integration, readout architecture, and loss functions. We review the existing state-of-the-art models on these four components and propose novel and simpler alternatives. As a result, we propose two novel end-to-end architectures called SimpleNet and MDNSal, which are neater, minimal, and more interpretable, and achieve state-of-the-art performance on public saliency benchmarks. SimpleNet is an optimized encoder-decoder architecture and brings notable performance gains on the SALICON dataset (the largest saliency benchmark). MDNSal is a parametric model that directly predicts the parameters of a GMM distribution and aims to bring more interpretability to the prediction maps. The proposed saliency models run at 25 fps, making them ideal for real-time applications.
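To make MDNSal's parametric readout concrete, the sketch below renders a saliency map from a set of predicted GMM parameters (mixture weights, means, and diagonal variances). It is an illustrative reconstruction under assumed tensor shapes, not the repository's actual readout code.

import math
import torch

def render_gmm_saliency(weights, means, variances, height, width):
    # Illustrative sketch (assumed shapes, not the repository's code):
    # weights:   (K,)   mixture weights, assumed to sum to 1
    # means:     (K, 2) component means in normalized (x, y) coordinates
    # variances: (K, 2) per-axis variances (diagonal covariance)
    ys = torch.linspace(0, 1, height)
    xs = torch.linspace(0, 1, width)
    gy, gx = torch.meshgrid(ys, xs, indexing="ij")
    grid = torch.stack([gx, gy], dim=-1)                  # (H, W, 2)

    diff = grid.unsqueeze(0) - means.view(-1, 1, 1, 2)    # (K, H, W, 2)
    exponent = -0.5 * (diff ** 2 / variances.view(-1, 1, 1, 2)).sum(-1)
    norm = 2 * math.pi * variances.prod(dim=1).sqrt()     # (K,) Gaussian normalizers
    density = torch.exp(exponent) / norm.view(-1, 1, 1)   # (K, H, W)

    saliency = (weights.view(-1, 1, 1) * density).sum(0)  # (H, W) mixture density
    return saliency / saliency.max()                      # rescale to [0, 1]

Training such a readout then reduces to comparing the rendered map against ground-truth fixation distributions with the losses discussed below.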

Architecture

SimpleNet Architecture

MDNSal Architecture

Testing

Clone this repository and download the pretrained SimpleNet weights, for multiple encoders, trained on the SALICON dataset from this link. The trained weights for MobileNetV2 can be found here.

Then run the test script using

$ python3 test.py --val_img_dir path/to/test/images --results_dir path/to/results --model_val_path path/to/saved/models

This will generate saliency maps for all images in the images directory and dump them into the results directory.
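Internally, the test script broadly follows a load-predict-save loop. The sketch below shows that flow in generic PyTorch; the checkpoint format, preprocessing, and output handling here are assumptions for illustration, not the repository's exact code.

import os
import torch
from PIL import Image
from torchvision import transforms

# Assumed preprocessing; the repository may use a different size and normalization.
preprocess = transforms.Compose([
    transforms.Resize((256, 256)),
    transforms.ToTensor(),
])

device = "cuda" if torch.cuda.is_available() else "cpu"
model = torch.load("path/to/saved/model.pt", map_location=device)  # hypothetical checkpoint
model.eval()

img_dir, out_dir = "path/to/test/images", "path/to/results"
os.makedirs(out_dir, exist_ok=True)

with torch.no_grad():
    for name in os.listdir(img_dir):
        img = Image.open(os.path.join(img_dir, name)).convert("RGB")
        inp = preprocess(img).unsqueeze(0).to(device)
        pred = model(inp).squeeze().cpu()                  # (H, W) saliency map
        pred = (255 * pred / pred.max()).byte().numpy()    # scale to 8-bit grayscale
        Image.fromarray(pred).save(os.path.join(out_dir, name))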

Training

To train the model from scratch, download the pretrained PNASNet weights from here and place them in the PNAS/ folder. Run the following command to train:

$ python3 train.py --dataset_dir path/to/dataset 

The dataset directory structure should be

└── Dataset
    ├── fixations
    │   ├── train
    │   └── val
    ├── images
    │   ├── train
    │   └── val
    └── maps
        ├── train
        └── val
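Before training, a small helper like the following (an optional sanity check, not part of the repository) can confirm that the layout is in place:

from pathlib import Path

def check_dataset(root):
    # Warn about any missing train/val split in the expected layout.
    root = Path(root)
    for sub in ("fixations", "images", "maps"):
        for split in ("train", "val"):
            path = root / sub / split
            if not path.is_dir():
                print(f"missing: {path}")

check_dataset("path/to/dataset")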

To train the model on the MIT1003 or CAT2000 dataset, first train it on the SALICON dataset and then fine-tune the weights on MIT1003 or CAT2000, as sketched below.
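The fine-tuning step itself is standard PyTorch: load the SALICON-trained checkpoint and continue training at a reduced learning rate. A minimal sketch, with hypothetical file names:

import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
model = torch.load("salicon_pretrained.pt", map_location=device)  # hypothetical checkpoint

# A lower learning rate is typical when fine-tuning on the smaller
# MIT1003/CAT2000 datasets; the exact value is a hyperparameter choice.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-5)
# ...then run the usual training loop over the new dataset.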

Experiments

  • Multiple Encoders

For training the model, we provide encoders based on PNASNet, DenseNet-161, VGG-16, ResNet-50, and MobileNetV2. Run the command:

$ python3 train.py --enc_model <model> --train_enc <boolean value> 
<model> : {"pnas", "densenet", "resnet", "vgg", "mobilenet"}

Set train_enc to 1 to fine-tune the encoder, and 0 otherwise.
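Conceptually, the --enc_model flag maps a name to a pretrained backbone with its classification head removed, and --train_enc decides whether its weights stay frozen. The helper below is an illustrative sketch using torchvision's older pretrained=True API (PNASNet ships separately via the PNAS/ weights, so it is omitted); it is not the repository's actual code.

import torch.nn as nn
import torchvision.models as models

def build_encoder(name, train_enc=False):
    # Hypothetical helper: return a pretrained feature extractor by name.
    backbones = {
        "densenet": lambda: models.densenet161(pretrained=True).features,
        "resnet": lambda: nn.Sequential(*list(models.resnet50(pretrained=True).children())[:-2]),
        "vgg": lambda: models.vgg16(pretrained=True).features,
        "mobilenet": lambda: models.mobilenet_v2(pretrained=True).features,
    }
    encoder = backbones[name]()
    for p in encoder.parameters():
        p.requires_grad = bool(train_enc)  # --train_enc 0 freezes the encoder
    return encoder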

Similarly, for testing the model:

$ python3 test.py --enc_model <model> --model_val_path path/to/pretrained/model --save_results <binary> --validate <binary> 

To save the generated maps, set the save_results flag to 1; to evaluate the model quantitatively, set the validate flag to 1.

  • Multiple Loss functions

To train the model with a combination of loss functions, run the following command:

$ python3 train.py --<loss_function> True --<loss_function>_coeff <coefficient of the loss>
<loss_function> : {"kldiv", "cc", "nss", "sim"}

By default, the loss function is KLDiv with a coefficient of 1.0.
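For reference, the KLDiv and CC terms are straightforward to express in PyTorch. The sketch below shows standard formulations and how the coefficients combine them; normalization details may differ from the repository's implementation.

import torch

EPS = 1e-8

def kldiv(pred, gt):
    # KL divergence between ground-truth and predicted saliency distributions.
    pred = pred / (pred.sum() + EPS)  # normalize each map to a probability distribution
    gt = gt / (gt.sum() + EPS)
    return (gt * torch.log(gt / (pred + EPS) + EPS)).sum()

def cc(pred, gt):
    # Pearson correlation coefficient between the two maps.
    pred = (pred - pred.mean()) / (pred.std() + EPS)
    gt = (gt - gt.mean()) / (gt.std() + EPS)
    return (pred * gt).mean()

def total_loss(pred, gt, kldiv_coeff=1.0, cc_coeff=-1.0):
    # CC measures similarity, so it enters with a negative coefficient when minimized.
    return kldiv_coeff * kldiv(pred, gt) + cc_coeff * cc(pred, gt)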

Quantitative Results

  • SALICON Test

The results of our models on the SALICON test set can be viewed here under the names SimpleNet and MDNSal. Comparison with other state-of-the-art saliency detection models:

  • MIT Test

Comparison with other state-of-the-art saliency detection models on the MIT300 test set:

Qualitative Results

Contact

For any questions, please contact [email protected], [email protected], or [email protected], or use the public issues section of this repository.

License

This code is distributed under the MIT License.
