All Projects → imatge-upc → salbow

imatge-upc / salbow

Licence: other
Saliency Weighted Convolutional Features for Instance Search

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to salbow

Lucenenet
Apache Lucene.NET
Stars: ✭ 1,704 (+2998.18%)
Mutual labels:  retrieval
CVPR2020 PADS
(CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet mining with Reinforcement Learning.
Stars: ✭ 57 (+3.64%)
Mutual labels:  retrieval
FieldedSDM
Fielded Sequential Dependence Model (code and runs)
Stars: ✭ 32 (-41.82%)
Mutual labels:  retrieval
beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Stars: ✭ 738 (+1241.82%)
Mutual labels:  retrieval
shrec17
Supplementary code for SHREC 2017 RGB-D Object-to-CAD Retrieval track
Stars: ✭ 27 (-50.91%)
Mutual labels:  retrieval
RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Stars: ✭ 473 (+760%)
Mutual labels:  retrieval
OpenDialog
An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)
Stars: ✭ 94 (+70.91%)
Mutual labels:  retrieval
CondensedMovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Stars: ✭ 78 (+41.82%)
Mutual labels:  retrieval
cineast
Cineast is a multi-feature content-based mulitmedia retrieval engine. It is capable of retrieving images, audio- and video sequences as well as 3d models based on edge or color sketches, textual descriptions and example objects.
Stars: ✭ 51 (-7.27%)
Mutual labels:  retrieval
MoTIS
Mobile(iOS) Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP). Accepted at NAACL 2022.
Stars: ✭ 60 (+9.09%)
Mutual labels:  retrieval
image embeddings
Using efficientnet to provide embeddings for retrieval
Stars: ✭ 107 (+94.55%)
Mutual labels:  retrieval
awesome-visual-localization-papers
The relocalization task aims to estimate the 6-DoF pose of a novel (unseen) frame in the coordinate system given by the prior model of the world.
Stars: ✭ 60 (+9.09%)
Mutual labels:  retrieval
UDLF
An Unsupervised Distance Learning Framework for Multimedia Retrieval
Stars: ✭ 40 (-27.27%)
Mutual labels:  retrieval
palladian
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Stars: ✭ 32 (-41.82%)
Mutual labels:  retrieval
cottontaildb
Cottontail DB is a column store aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.
Stars: ✭ 16 (-70.91%)
Mutual labels:  retrieval
deep recommenders
Deep Recommenders
Stars: ✭ 214 (+289.09%)
Mutual labels:  retrieval
tf retrieval baseline
A Tensorflow retrieval (space embedding) baseline. Metric learning baseline on CUB and Stanford Online Products.
Stars: ✭ 39 (-29.09%)
Mutual labels:  retrieval
libfmp
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)
Stars: ✭ 71 (+29.09%)
Mutual labels:  retrieval
AudioAlign
Audio Synchronization and Analysis Tool
Stars: ✭ 80 (+45.45%)
Mutual labels:  retrieval
MHCLN
Deep Metric and Hash Code Learning Network for Content Based Retrieval of Remote Sensing Images
Stars: ✭ 30 (-45.45%)
Mutual labels:  retrieval

Saliency Weighted Convolutional features for Instance Search

Eva Mohedano Kevin McGuinness Xavier Giro-i-Nieto Noel O'Connor
Eva Mohedano Kevin McGuinness Xavier Giro-i-Nieto Noel O'Connor

A joint collaboration between:

logo-insight logo-dcu logo-upc logo-etsetb logo-gpi
Insight Centre for Data Analytics Dublin City University (DCU) Universitat Politecnica de Catalunya (UPC) UPC ETSETB TelecomBCN UPC Image Processing Group

Publication

Find our paper at arXiv

@article{mohedano2017saliency,
  title={Saliency Weighted Convolutional Features for Instance Search},
  author={Mohedano, Eva and McGuinness, Kevin and Giro-i-Nieto, Xavier and O'Connor, Noel E},
  journal={arXiv preprint arXiv:1711.10795},
  year={2017}
}

Abstract

This work explores attention models to weight the contribution of local convolutional representations for the instance search task. We present a retrieval framework based on bags of local convolutional features (BLCF) that benefits from saliency weighting to build an efficient image representation. The use of human visual attention models (saliency) allows significant improvements in retrieval performance without the need to conduct region analysis or spatial verification, and without requiring any feature fine tuning. We investigate the impact of different saliency models, finding that higher performance on saliency benchmarks does not necessarily equate to improved performance when used in instance search tasks. The proposed approach outperforms the state-of-the-art on the challenging INSTRE benchmark by a large margin, and provides similar performance on the Oxford and Paris benchmarks compared to more complex methods that use off-the-shelf representations.

Code Instructions

This repo contains python scripts to build Bag of Visual Words based on local CNN features to perform instance search in three different datasets:

Prerequisits

 virtualenv ~/salbow
 source ~/salbow/bin/activate
  • The code runs with CUDA Version 7.5.18. For python dependencies run:
 pip install --upgrade pip
 pip install -r requirements.txt
  • Then, install the custom python-vlfeat library by running:
 python install_vlfeat.py
  • Lastly, modify config.py file to set custom paths: Dataset images are automatically downloaded in:
PATH_DATASET='custom/dataset/path'

Precomputed data contained saliency predictions for the three datasets, and BLCF models, assignment maps and raw features.

saliency masks for each of the datasets are stored in:

PATH_SALIENCY='custom/saliency/path'

BLCF models, features and assignment maps are stored in:

PATH_OUTPUT='custom/output/path'

How to run it

Usage: python evaluation.py [OPTIONS]

Options:
  --dataset TEXT     Selected dataset for extraction  (availables 'instre' (default), 'oxford', 'paris')
  --layer TEXT       layer from vgg16                 (default 'conv5_1')
  --max_dim INTEGER  Max dimension of images          (default '340')
  --weighting TEXT   Spatial weighting scheme         (availables None (default), 'gaussian', 'l2norm', 'SALGAN')
  --global_search    Flag to apply global search for queries        
  --query_expansion  Flag to apply Average Query Expansion    

Example:

python evaluation.py --dataset 'instre' --query_expansion --weighting 'SALGAN'

ret:
mAP = 0.697773325515
+QE mAP = 0.757181174096

The command above applies saliency weighting from SalGAN to the assignment maps of Instre, reporting mAP when performing query expansion (top 10 retrieved images). Results using precomputed data.

Acknowledgements

This publication has emanated from research conducted with the financial support of Science Foundation Ireland (SFI) under grant number SFI/12/RC/2289 and SFI/15/SIRG/3283. logo-ireland
The Image ProcessingGroup at the UPC is a SGR14 Consolidated Research Group recognized and sponsored by the Catalan Government (Generalitat de Catalunya) through its AGAUR office. logo-catalonia
This work has been developed in the framework of the project BigGraph TEC2013-43935-R, funded by the Spanish Ministerio de Economía y Competitividad and the European Regional Development Fund (ERDF). logo-spain

Contact

If you have any general doubt about our work or code which may be of interest for other researchers, please use the public issues section on this github repo. Alternatively, drop us an e-mail at mailto:[email protected].

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].