yassersouri / ghiaseddin

License: MIT
Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)


Ghiaseddin

Ghiaseddin - قیاس الدین

This repo contains the code for the paper "Deep Relative Attributes" by Yaser Souri, Erfan Noury, and Ehsan Adeli Mosabbeb.

The paper

Deep Relative Attributes by Yaser Souri (@yassersouri), Erfan Noury (@erfannoury), Ehsan Adeli Mosabbeb (@eadeli). ACCV 2016.

The paper on arXiv: arxiv:1512.04103

The name

The name is "Ghiaseddin", written as "قیاس الدین" in Persian/Arabic and pronounced "Ghiyāth ad-Dīn". Ghias, or "قیاس", is the Persian/Arabic word for the act of comparing two things (which is what we actually do in relative attributes). Furthermore, Ghiaseddin is related to Ghiyāth al-Dīn Jamshīd al-Kāshī "غیاث الدین جمشید کاشانی": "Ghiaseddin" is pronounced similarly to the first name of "Jamshīd al-Kāshī" but is written with different letters in Persian/Arabic ("قیاس الدین" vs "غیاث الدین").

Dependencies

The code is written in Python 2.7 and uses the Lasagne deep learning framework, which is based on the amazing Theano. These two are the main dependencies of the project. Besides these, you will need CUDA 7 and cuDNN 4. It might work without CUDA or with lower versions, but I have not tested it.
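A minimal sanity check, assuming Theano and Lasagne are already installed, that prints the installed versions (to compare against requirements.txt) and the device Theano is configured to use:

import theano
import lasagne

# Print installed versions; compare these against requirements.txt
print(theano.__version__)
print(lasagne.__version__)

# Print the device Theano is configured to use; with CUDA and cuDNN set up correctly
# (e.g. THEANO_FLAGS=device=gpu), this should report a GPU device rather than 'cpu'
print(theano.config.device)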

To visualize the training procedure I have used pastalog. If you want to see the loss decrease in real time, you will have to install it (this is optional).

For a complete list of dependencies and their versions see requirements.txt.

Downloading files

If you want to perform training yourself, you need to download some files (initial weights files and dataset images).

Downloading datasets

Zappos50K

python /path/to/project/ghiaseddin/scripts/download-dataset-zappos.py

LFW10

python /path/to/project/ghiaseddin/scripts/download-dataset-lfw10.py

OSR and PubFig

python /path/to/project/ghiaseddin/scripts/download-dataset-osr_pubfig.py

Downloading initial weights (models pretrained on ILSVRC)

GoogLeNet

python /path/to/project/ghiaseddin/scripts/download-weights-googlenet.py

VGG16

python /path/to/project/ghiaseddin/scripts/download-weights-vgg16.py

Running our experiments (reproducing our results)

We have used Titan Black, Titan X, and GTX 980 Ti GPUs to produce our results.

The random seed can be set in ghiaseddin/settings.py. We have used 0, 1 and 2 as our random seeds for the Zappos50k2, LFW10, OSR and PubFig experiments. (Zappos50k1 already comes with 10 different splits of the training data, so we have run that full experiment only once, with 0 as the random seed.)
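For example, the seed could be fixed by editing a line like the following in ghiaseddin/settings.py (the variable name here is hypothetical; check the file for the actual one):

# In ghiaseddin/settings.py -- hypothetical variable name, check the file for the actual one
random_seed = 0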

To reproduce our results, you can run the following scripts, which will output the accuracies.

./run-zappos1.sh # for Zappos50k1 experiment
./run-zappos2.sh # for Zappos50k2 experiment
./run-lfw.sh # for LFW10 experiment
./run-osr.sh # for OSR experiment
./run-pubfig.sh # for PubFig experiment

Our results

We report the mean and standard deviation of ranking prediction accuracy over 3 different runs for OSR, PubFig, LFW10 and Zappos50k2 (fine-grained), and over the 10 splits provided with the dataset for Zappos50k1.

Currently (7 September 2016) our results on OSR, PubFig, Zappos50k1 and Zappos50k2 are, to our knowledge, state-of-the-art.
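As a minimal illustration of how the reported numbers are aggregated (the accuracies below are made-up placeholder values, not results from the paper):

import numpy as np

# Hypothetical per-run ranking accuracies for a single attribute (e.g. runs with seeds 0, 1 and 2)
run_accuracies = [95.37, 94.90, 95.85]

# The tables below report the mean and standard deviation over such runs
print('%.2f (+/- %.2f)' % (np.mean(run_accuracies), np.std(run_accuracies)))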

OSR

Method | Natural | Open | Perspective | Large Size | Diagonal Plane | Depth Close | Mean
Ours (VGG16) | 99.40 (±0.10) | 97.44 (±0.16) | 96.88 (±0.13) | 96.79 (±0.32) | 98.43 (±0.23) | 97.65 (±0.16) | 97.77 (±0.10)

PubFig

Method | Male | White | Young | Smiling | Chubby | Visible Forehead | Bushy Eyebrows | Narrow Eyes | Pointy Nose | Big Lips | Round Face | Mean
Ours (VGG16) | 95.50 (±0.36) | 94.60 (±0.55) | 94.33 (±0.36) | 95.36 (±0.56) | 92.32 (±0.36) | 97.28 (±0.49) | 94.53 (±0.64) | 93.19 (±0.51) | 94.24 (±0.24) | 93.62 (±0.20) | 94.76 (±0.24) | 94.52 (±0.08)

LFW10

Method | Bald Head | Dark Hair | Eyes Open | Good Looking | Masculine Looking | Mouth Open | Smile | Visible Teeth | Visible Forehead | Young | Mean
Ours (VGG16) | 81.14 (±3.39) | 88.92 (±0.75) | 74.44 (±5.97) | 70.28 (±0.54) | 98.08 (±0.33) | 85.46 (±0.70) | 82.49 (±1.41) | 82.77 (±2.15) | 81.90 (±2.00) | 76.33 (±0.43) | 82.18 (±1.08)

Zappos50k1

Method | Open | Pointy | Sporty | Comfort | Mean
Ours (VGG16) | 95.37 (±0.82) | 94.43 (±0.75) | 97.30 (±0.81) | 95.57 (±0.97) | 95.67 (±0.49)

Zappos50k2 (fine-grained)

Method | Open | Pointy | Sporty | Comfort | Mean
Ours (VGG16) | 73.45 (±1.23) | 68.20 (±0.18) | 73.07 (±0.75) | 70.31 (±1.50) | 71.26 (±0.50)

Doing your own experiments

Training a new model

First, start the pastalog server (optional):

/path/to/project/ghiaseddin/scripts/start_pastalog.sh

Then you can use ghiaseddin to train:

import sys
sys.path.append('/path/to/ghiaseddin/')
import ghiaseddin

zappos = ghiaseddin.Zappos50K1(ghiaseddin.settings.zappos_root, attribute_index=0, split_index=0)
googlenet = ghiaseddin.GoogeLeNet(ghiaseddin.settings.googlenet_ilsvrc_weights)
model = ghiaseddin.Ghiaseddin(extractor=googlenet, dataset=zappos) # other options can be passed here as well

# train the model for 10 epochs
losses = []
for i in range(10):
    loss = model.train_one_epoch()
    losses.append(loss)

# or like this
losses = model.train_n_epoch(10) # here losses is a list of size 10

# save the trained model
model.save('/path/to/model.pkl')

Calculating accuracy of a model

# calculates the relative attribute prediction accuracy
print model.eval_accuracy()

Visualizing saliency

# randomly generates saliency maps for 10 samples of the testing set
fig = model.generate_saliency(size=10)
# or you can specify which pairs
fig = model.generate_saliency([10, 20, 30, 40])
# and you can easily save the figure
fig.savefig('/path/to/file/saliency.png')

Here are some example saliency maps (not all saliency maps are as easily interpretable as these):

[Example saliency figures: OSR - Natural, Zappos50k1 - Open, Zappos50k1 - Pointy, LFW10 - Bald Head, LFW10 - Good Looking]

Reference

If you use this code in your research please consider citing our paper:

@inproceedings{souri2016dra,
  title={Deep Relative Attributes},
  author={Souri, Yaser and Noury, Erfan and Adeli, Ehsan},
  booktitle={ACCV},
  year={2016}
}

Feedback

We are not experts in Theano, Lasagne, or deep learning, so please provide us with your feedback. If you find any issues in the paper, please contact Yaser Souri ([email protected]). If you have issues or feedback related to the code, please use the GitHub issues section and file a new issue.
