Openvqa: A lightweight, scalable, and general framework for visual question answering research
Clipbert: [CVPR 2021 Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks
Pytorch Vqa: Strong baseline for visual question answering
Vqa regat: Research code for the ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
Papers: Some computer vision papers I have read, covering image captioning, weakly supervised segmentation, and related topics
Vqa Tensorflow: TensorFlow implementation of Deeper LSTM + normalized CNN for Visual Question Answering
Mullowbivqa: Hadamard Product for Low-rank Bilinear Pooling; a minimal sketch of this fusion appears after this list
Vqa: CloudCV Visual Question Answering Demo
Conditional Batch Norm: PyTorch implementation of the NIPS 2017 paper "Modulating early visual processing by language"; see the conditioning sketch after this list
Bottom Up Attention: Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Vizwiz Vqa Pytorch: PyTorch VQA implementation that achieved top performance in the ECCV 2018 VizWiz Grand Challenge on answering visual questions from blind people
Mmf: A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Mac Network: Implementation of the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
Tbd Nets: PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"
Awesome Visual Question Answering: A curated list of resources for Visual Question Answering (VQA, over both images and video), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas.
MICCAI21 MMQ: Multiple Meta-model Quantifying for Medical Visual Question Answering
rosita: ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
vqa-soft: Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" (CVPR 2017 VQA workshop); see the soft-loss sketch after this list.
FigureQA-baseline: TensorFlow implementation of the CNN-LSTM, Relation Network, and text-only baselines for the paper "FigureQA: An Annotated Figure Dataset for Visual Reasoning"
DVQA dataset: A bar-chart question answering dataset presented at CVPR 2018
just-ask: [TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
AoA-pytorch: A PyTorch implementation of the Attention on Attention module (both self and guided variants) for Visual Question Answering; see the gating sketch after this list
probnmn-clevr: Code for the ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long oral]
iMIX: A framework for multimodal intelligence research from Inspur HSSLAB
Transformer-MM-Explainability: [ICCV 2021 Oral] Official PyTorch implementation of "Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers", a novel method to visualize any Transformer-based network, including examples for DETR and VQA.
mmgnn textvqa: A PyTorch implementation of the CVPR 2020 paper "Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text"
hcrn-videoqa: Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
cfvqa: [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
ZS-F-VQA: Code and data for the paper "Zero-shot Visual Question Answering using Knowledge Graph" [ISWC 2021]
VideoNavQA: An alternative EQA paradigm and informative benchmark + models (BMVC 2019; ViGIL 2019 spotlight)
self critical vqa: Code for the NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"
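
A few of the entries above name techniques compact enough to sketch. For the Mullowbivqa entry: low-rank bilinear pooling replaces a full bilinear interaction between question and image features with two learned projections fused by a Hadamard (element-wise) product. A minimal PyTorch sketch of that idea follows; the class name, dimensions, and defaults are illustrative assumptions, not values from the repository.

```python
import torch
import torch.nn as nn

class LowRankBilinearPooling(nn.Module):
    """Hadamard-product low-rank bilinear fusion of question and image features."""

    def __init__(self, q_dim=1024, v_dim=2048, rank=1200, out_dim=1000):
        super().__init__()
        self.proj_q = nn.Linear(q_dim, rank)  # project question to a shared rank-d space
        self.proj_v = nn.Linear(v_dim, rank)  # project image features likewise
        self.out = nn.Linear(rank, out_dim)   # answer classifier head

    def forward(self, q, v):
        # The element-wise product of the two projections approximates a
        # full bilinear map with far fewer parameters.
        fused = torch.tanh(self.proj_q(q)) * torch.tanh(self.proj_v(v))
        return self.out(fused)

# usage: answer logits for a batch of 8 question/image feature pairs
logits = LowRankBilinearPooling()(torch.randn(8, 1024), torch.randn(8, 2048))
```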
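For the Conditional Batch Norm entry: the paper's central idea is to let a language embedding predict per-channel offsets to the batch-norm scale and shift, so the question modulates early visual features. A minimal sketch, assuming a generic question embedding `lang` of size `lang_dim` (both hypothetical names):

```python
import torch
import torch.nn as nn

class ConditionalBatchNorm2d(nn.Module):
    """Batch norm whose affine scale/shift are modulated by a language vector."""

    def __init__(self, num_channels, lang_dim):
        super().__init__()
        self.bn = nn.BatchNorm2d(num_channels, affine=False)  # normalization only
        self.delta_gamma = nn.Linear(lang_dim, num_channels)  # predicted scale offset
        self.delta_beta = nn.Linear(lang_dim, num_channels)   # predicted shift offset

    def forward(self, x, lang):
        # Start from the identity affine transform (gamma=1, beta=0) and
        # add language-predicted deltas, broadcast over spatial positions.
        gamma = 1.0 + self.delta_gamma(lang)
        beta = self.delta_beta(lang)
        out = self.bn(x)
        return gamma[:, :, None, None] * out + beta[:, :, None, None]

# usage: modulate a 64-channel feature map with a 256-d question embedding
cbn = ConditionalBatchNorm2d(64, 256)
y = cbn(torch.randn(8, 64, 14, 14), torch.randn(8, 256))
```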
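For the vqa-soft entry: one common reading of the "soft" loss is a cross-entropy whose targets are annotator-agreement weights over all candidate answers rather than a single one-hot label. A hedged sketch of that idea, not the repository's exact code:

```python
import torch
import torch.nn.functional as F

def soft_cross_entropy(logits, target_scores):
    """Cross-entropy against soft answer targets.

    logits:        (batch, num_answers) raw model scores
    target_scores: (batch, num_answers) non-negative annotator-agreement weights
    """
    log_probs = F.log_softmax(logits, dim=1)
    # Weight each answer's log-likelihood by how often annotators gave it.
    return -(target_scores * log_probs).sum(dim=1).mean()

# usage: three candidate answers; annotators split 0.9/0.1 on the first example
loss = soft_cross_entropy(torch.randn(2, 3),
                          torch.tensor([[0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]))
```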
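For the AoA-pytorch entry: Attention on Attention concatenates a standard attention result with its query, then forms an "information" vector and a sigmoid gate whose element-wise product is the output. A minimal sketch with illustrative shapes:

```python
import torch
import torch.nn as nn

class AttentionOnAttention(nn.Module):
    """Gates an attention result with a second, sigmoid 'attention' over it."""

    def __init__(self, dim):
        super().__init__()
        self.info = nn.Linear(2 * dim, dim)  # information vector
        self.gate = nn.Linear(2 * dim, dim)  # sigmoid attention gate

    def forward(self, query, attended):
        # Concatenate the attention result with its query, then gate.
        joint = torch.cat([attended, query], dim=-1)
        return torch.sigmoid(self.gate(joint)) * self.info(joint)

# usage: gate a 512-d attended vector with its 512-d query
out = AttentionOnAttention(512)(torch.randn(8, 512), torch.randn(8, 512))
```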