Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → imatge-upc → Salgan

imatge-upc / Salgan

Licence: mit

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Programming Languages

139335 projects - #7 most used programming language

Labels

deep-learning convolutional-neural-networks deeplearning visual

Projects that are alternatives of or similar to Salgan

Deeplearning.ai

deeplearning.ai , By Andrew Ng, All video link

Stars: ✭ 625 (+99.04%)

Mutual labels: convolutional-neural-networks, deeplearning

Sru Deeplearning Workshop

دوره 12 ساعته یادگیری عمیق با چارچوب Keras

Stars: ✭ 66 (-78.98%)

Mutual labels: convolutional-neural-networks, deeplearning

T81 558 deep learning

Washington University (in St. Louis) Course T81-558: Applications of Deep Neural Networks

Stars: ✭ 4,152 (+1222.29%)

Mutual labels: convolutional-neural-networks, deeplearning

Tf Mobilenet V2

Mobilenet V2(Inverted Residual) Implementation & Trained Weights Using Tensorflow

Stars: ✭ 85 (-72.93%)

Mutual labels: convolutional-neural-networks, deeplearning

🙈A PyTorch implementation of the paper "Visualizing and Understanding Convolutional Networks." （ECCV 2014)

Stars: ✭ 132 (-57.96%)

Mutual labels: convolutional-neural-networks, deeplearning

Bilinear Cnn Tensorflow

This is an implementation of Bilinear CNN for fine grained visual recognition using TensorFlow.

Stars: ✭ 187 (-40.45%)

Mutual labels: convolutional-neural-networks, deeplearning

Switchable Normalization

Code for Switchable Normalization from "Differentiable Learning-to-Normalize via Switchable Normalization", https://arxiv.org/abs/1806.10779

Stars: ✭ 804 (+156.05%)

Mutual labels: convolutional-neural-networks, deeplearning

Human Activity Recognition (HAR) with 1D Convolutional Neural Network in Python and Keras

Stars: ✭ 97 (-69.11%)

Mutual labels: convolutional-neural-networks, deeplearning

All Convolutional Network: (https://arxiv.org/abs/1412.6806#) implementation in Keras

Stars: ✭ 115 (-63.38%)

Mutual labels: convolutional-neural-networks, deeplearning

MotionSense Dataset for Human Activity and Attribute Recognition ( time-series data generated by smartphone's sensors: accelerometer and gyroscope)

Stars: ✭ 159 (-49.36%)

Mutual labels: convolutional-neural-networks, deeplearning

Deeplearning.ai Assignments

Stars: ✭ 268 (-14.65%)

Mutual labels: convolutional-neural-networks, deeplearning

Deep Learning In Cloud

List of Deep Learning Cloud Providers

Stars: ✭ 298 (-5.1%)

Mutual labels: deeplearning

Geospatial Machine Learning

A curated list of resources focused on Machine Learning in Geospatial Data Science.

Stars: ✭ 289 (-7.96%)

Mutual labels: convolutional-neural-networks

Cs229 Ml Implementation

Implementation of cs229(Machine Learning taught by Andrew Ng) in python.

Stars: ✭ 288 (-8.28%)

Mutual labels: deeplearning

Pytorch Explain Black Box

PyTorch implementation of Interpretable Explanations of Black Boxes by Meaningful Perturbation

Stars: ✭ 287 (-8.6%)

Mutual labels: deeplearning

Accelerating Cnn With Fpga

This project accelerates CNN computation with the help of FPGA, for more than 50x speed-up compared with CPU.

Stars: ✭ 301 (-4.14%)

Mutual labels: convolutional-neural-networks

Segmentation keras

DilatedNet in Keras for image segmentation

Stars: ✭ 300 (-4.46%)

Mutual labels: deeplearning

Multi-view CNN (MVCNN) for shape recognition

Stars: ✭ 285 (-9.24%)

Mutual labels: convolutional-neural-networks

Outdated, see new https://github.com/braindecode/braindecode

Stars: ✭ 284 (-9.55%)

Mutual labels: convolutional-neural-networks

Holistically-Nested Edge Detection

Stars: ✭ 277 (-11.78%)

Mutual labels: deeplearning

View All Similar Projects ➔

SalGAN: Visual Saliency Prediction with Adversarial Networks


Junting Pan	Cristian Canton Ferrer	Kevin McGuinness	Noel O'Connor	Jordi Torres	Elisa Sayrol	Xavier Giro-i-Nieto

A joint collaboration between:


Insight Centre for Data Analytics	Dublin City University (DCU)	Microsoft	Facebook	Barcelona Supercomputing Center	Universitat Politecnica de Catalunya (UPC)

Abstract

We introduce SalGAN, a deep convolutional neural network for visual saliency prediction trained with adversarial examples. The first stage of the network consists of a generator model whose weights are learned by back-propagation computed from a binary cross entropy (BCE) loss over downsampled versions of the saliency maps. The resulting prediction is processed by a discriminator network trained to solve a binary classification task between the saliency maps generated by the generative stage and the ground truth ones. Our experiments show how adversarial training allows reaching state-of-the-art performance across different metrics when combined with a widely-used loss function like BCE.

Slides

<iframe src="//www.slideshare.net/slideshow/embed_code/key/5cXl80Fm2c3ksg" width="595" height="485" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" style="border:1px solid #CCC; border-width:1px; margin-bottom:5px; max-width: 100%;" allowfullscreen> </iframe>

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks from Xavier Giro

Publication

Find the extended pre-print version of our work on arXiv. The shorter extended abstract presented as spotlight in the CVPR 2017 Scene Understanding Workshop (SUNw) is available here.

Please cite with the following Bibtex code:

@InProceedings{Pan_2017_SalGAN,
author = {Pan, Junting and Canton, Cristian and McGuinness, Kevin and O'Connor, Noel E. and Torres, Jordi and Sayrol, Elisa and Giro-i-Nieto, Xavier and},
title = {SalGAN: Visual Saliency Prediction with Generative Adversarial Networks},
booktitle = {arXiv},
month = {January},
year = {2017}
}

You may also want to refer to our publication with the more human-friendly Chicago style:

Junting Pan, Cristian Canton, Kevin McGuinness, Noel E. O'Connor, Jordi Torres, Elisa Sayrol and Xavier Giro-i-Nieto. "SalGAN: Visual Saliency Prediction with Generative Adversarial Networks." arXiv. 2017.

Architecture

Model parameters

The parameters to run SalGAN can be downloaded here:

If you wanted to train the model, you will also need this additional file

[vgg16.pkl (528 MB)]

Visual Results

Datasets

Training

As explained in our paper, our networks were trained on the training and validation data provided by SALICON.

Test

Two different dataset were used for test:

Test partition of SALICON dataset.
MIT300.

Software frameworks

Our paper presents two convolutional neural networks, one correspends to the Generator (Saliency Prediction Network) and the another is the Discriminator for the adversarial training. To compute saliency maps only the Generator is needed.

SalGAN on Lasagne

SalGAN is implemented in Lasagne, which at its time is developed over Theano.

pip install -r https://raw.githubusercontent.com/imatge-upc/saliency-salgan-2017/master/requirements.txt

SalGAN on a docker

We have prepared this Docker container with all necessary dependencies for computing saliency maps with SalGAN. You will need to use nvidia-docker.

Using the container is like connecting via ssh to a machine. To start an interactive session run:

    >> sudo nvidia-docker run -it --entrypoint='bash' -w /home/ evamohe/salgan

This will open a terminal within the container located in the '/home' folder.

Yo will find Salgan code in "/home/salgan". So if you want to test the installation, within the container, run:

   >> cd /home/salgan/scripts
   >> THEANO_FLAGS=mode=FAST_RUN,device=gpu0,floatX=float32,lib.cnmem=0.5,optimizer_including=cudnn python 03-predict.py

That will process the sample images located in "/home/salgan/images" and store them in "/home/salgan/saliency". To exit the container, run:

   >> exit

You migh want to process your own data with your own custom scripts. For that, you can mount different local folders in the container. For example:

>> sudo nvidia-docker run -v $PATH_TO_MY_CODE:/home/code -v $PATH_TO_MY_DATA:/home/data -it --entrypoint='bash' -w /home/

will open a new session in the container, with '/home/code' and '/home/data' folders that will be share with your computer. If you edit your code locally, the changes will be updated automatically in the container. Similarly, all the files generated in '/home/data' will be available in your original data folder.

Usage

To train our model from scrath you need to run the following command:

THEANO_FLAGS=mode=FAST_RUN,device=gpu,floatX=float32,lib.cnmem=1,optimizer_including=cudnn python 02-train.py

In order to run the test script to predict saliency maps, you can run the following command after specifying the path to you images and the path to the output saliency maps:

THEANO_FLAGS=mode=FAST_RUN,device=gpu,floatX=float32,lib.cnmem=1,optimizer_including=cudnn python 03-predict.py

With the provided model weights you should obtain the follwing result:

Download the pretrained VGG-16 weights from: vgg16.pkl

External implementation in PyTorch

Bat-Orgil Batsaikhan and Catherine Qi Zhao from the University of Minnesota released a PyTorch implementation in 2018 as part of their poster "Generative Adversarial Network for Videos and Saliency Map".

Acknowledgements

We would like to especially thank Albert Gil Moreno and Josep Pujal from our technical support team at the Image Processing Group at the UPC.


Albert Gil	Josep Pujal


We gratefully acknowledge the support of NVIDIA Corporation with the donation of the GeoForce GTX Titan Z and Titan X used in this work.
The Image ProcessingGroup at the UPC is a SGR14 Consolidated Research Group recognized and sponsored by the Catalan Government (Generalitat de Catalunya) through its AGAUR office.
This work has been developed in the framework of the projects BigGraph TEC2013-43935-R and Malegra TEC2016-75976-R, funded by the Spanish Ministerio de Economía y Competitividad and the European Regional Development Fund (ERDF).
This publication has emanated from research conducted with the financial support of Science Foundation Ireland (SFI) under grant number SFI/12/RC/2289.

Contact

If you have any general doubt about our work or code which may be of interest for other researchers, please use the public issues section on this github repo. Alternatively, drop us an e-mail at mailto:[email protected].

<script> (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){ (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o), m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m) })(window,document,'script','https://www.google-analytics.com/analytics.js','ga'); ga('create', 'UA-7678045-13', 'auto'); ga('send', 'pageview'); </script>

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 314

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (23) 🔗