All Projects → TheShadow29 → visual-commonsense-pytorch

TheShadow29 / visual-commonsense-pytorch

Licence: other
For visual commonsense model

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

visual-commonsense-pytorch

For visual commonsense model. Paper: https://arxiv.org/pdf/1811.10830.pdf

Note that this is unofficial implementation.

Steps for getting Bert-results:

  1. I am using pytorch pretrained bert from huggingface here: https://github.com/huggingface/pytorch-pretrained-BERT. So that needs to be installed first.
  2. Download data from http://visualcommonsense.com/download.html. You should have two folders - images and annotations.
  3. cd code
  4. Change the cfg.json file to set the vcr_tdir
  5. python bert_main.py "some_unique_id" --task_type $T. The task type can be QA or QA_R. Q_AR is under progress.

Results:

QA gives 59% on validation set, QA_R gives 66%, both of which are higher than what is reported in the paper (53% and 64%). I am not sure why this the case though. Any inputs are welcome.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].