ilija139 / vqa-soft

Licence: other
Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 VQA workshop paper.

Programming Languages

C, Lua, Python, CUDA, C++, Shell

The repo contains code for reproducing the paper's experiments and an efficient GPU implementation of the proposed loss function for torch, pytorch, and caffe.

Requirements

To run the experiments, first install torch from https://github.com/torch/distro/. We used the torch version at commit 5c1d3cfda8101123628a45e70435d545ae1bc771, but later versions will probably work too.

After installing torch, install the following lua libraries:

C data structures for torch (https://github.com/torch/tds), which let us allocate data in C memory instead of lua's and thus avoid lua's memory limit and garbage collection.

luarocks install tds

RNN library for torch (https://github.com/Element-Research/rnn), used for its mask-zero lookup table and other useful modules.

luarocks install rnn

threads for lua (https://github.com/torch/threads), used for multi-threaded code.

luarocks install threads

The following libraries are used by the code as provided, but they are not essential: you can modify the code to run the experiments without them. However, we recommend installing them anyway.

fb-debugger, a source-level debugger for lua.

Follow the install instructions at https://github.com/facebook/fblualib/blob/master/fblualib/debugger/README.md.

OptNet - Reducing memory usage in torch neural nets https://github.com/fmassa/optimize-net.

luarocks install optnet

Visdom for visualization https://github.com/facebookresearch/visdom.

pip install visdom
luarocks install visdom

Installation

We provide a GPU implementation of the loss function for torch, pytorch, and caffe. See the instructions in the loss_implementations folder for how to add the loss function to your framework's installation.
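For orientation, here is a minimal CPU-only sketch of the general idea behind a soft cross-entropy loss. It is not the GPU code shipped in loss_implementations, and it rests on an assumption: each ground-truth answer is weighted by the fraction of annotators who gave it, and the loss is the weighted sum of the negative log-probabilities of those answers. The function names soft_cross_entropy, cross_entropy, and log_softmax are illustrative, not names from this repo; check the paper and the repo's implementations for the exact definition.

```python
import math
from collections import Counter


def log_softmax(scores):
    """Numerically stable log-softmax over a list of raw class scores."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def cross_entropy(scores, target):
    """Standard cross entropy against a single ground-truth class index."""
    return -log_softmax(scores)[target]


def soft_cross_entropy(scores, answers):
    """Cross entropy against several ground-truth answers.

    `answers` holds one class index per annotator. ASSUMPTION: each distinct
    answer is weighted by its share of the annotator votes.
    """
    counts = Counter(answers)
    total = len(answers)
    log_probs = log_softmax(scores)
    return -sum((n / total) * log_probs[c] for c, n in counts.items())
```

When every annotator gives the same answer, the weights collapse to a single 1.0 and the sketch reduces to standard cross entropy, which is a quick sanity check for any implementation of this kind.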

Experiments

First, head over to the image_preprocess folder and follow the instructions there to extract feature tensors for MS COCO images.

After obtaining the image feature tensors, head over to experiments/pool or experiments/avg to run the experiments reported in the paper. To run the models with the proposed loss function, run:

gpu=0 ./run_soft_cross_entropy.sh

For standard cross entropy, run:

gpu=0 ./run_cross_entropy.sh

gpu=0 specifies the ID of the GPU to be used, i.e. it is an alias for CUDA_VISIBLE_DEVICES=0.
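If you prefer to launch the runs from Python, for example to queue both loss variants, the shell prefix gpu=0 can be reproduced by passing gpu as an environment variable to the child process. The sketch below only builds the environment and the command; the helper names env_for_gpu and run_command are hypothetical and not part of this repo, and it assumes the scripts read nothing beyond the gpu variable.

```python
import os


def env_for_gpu(gpu_id, base_env=None):
    """Return a copy of the environment with `gpu` set, like `gpu=<id> ./script.sh`."""
    env = dict(os.environ if base_env is None else base_env)
    env["gpu"] = str(gpu_id)
    return env


def run_command(soft=True):
    """Return the command for one of the two run scripts named in this README."""
    script = "./run_soft_cross_entropy.sh" if soft else "./run_cross_entropy.sh"
    return [script]


# Usage (run from experiments/pool or experiments/avg):
#   import subprocess
#   subprocess.run(run_command(soft=True), env=env_for_gpu(0), check=True)
```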
