All Projects → Conditional Batch Norm → Similar Projects or Alternatives

39 Open source projects that are alternatives of or similar to Conditional Batch Norm

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Stars: ✭ 989 (+1839.22%)

Mutual labels: vqa

PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind People

Stars: ✭ 33 (-35.29%)

Mutual labels: vqa

Visual Question Answering

📷 ❓ Visual Question Answering Demo and Algorithmia API

Stars: ✭ 18 (-64.71%)

Mutual labels: vqa

Bottom Up Attention Vqa

An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.

Stars: ✭ 667 (+1207.84%)

Mutual labels: vqa

Vqa.pytorch

Visual Question Answering in Pytorch

Stars: ✭ 602 (+1080.39%)

Mutual labels: vqa

Mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Stars: ✭ 4,713 (+9141.18%)

Mutual labels: vqa

Mac Network

Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)

Stars: ✭ 444 (+770.59%)

Mutual labels: vqa

Awesome Vqa

Visual Q&A reading list

Stars: ✭ 403 (+690.2%)

Mutual labels: vqa

Oscar

Oscar and VinVL

Stars: ✭ 396 (+676.47%)

Mutual labels: vqa

Tbd Nets

PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"

Stars: ✭ 345 (+576.47%)

Mutual labels: vqa

Awesome Visual Question Answering

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

Stars: ✭ 295 (+478.43%)

Mutual labels: vqa

Nscl Pytorch Release

PyTorch implementation for the Neuro-Symbolic Concept Learner (NS-CL).

Stars: ✭ 276 (+441.18%)

Mutual labels: vqa

MICCAI21 MMQ

Multiple Meta-model Quantifying for Medical Visual Question Answering

Stars: ✭ 16 (-68.63%)

Mutual labels: vqa

bottom-up-features

Bottom-up features extractor implemented in PyTorch.

Stars: ✭ 62 (+21.57%)

Mutual labels: vqa

rosita

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

Stars: ✭ 36 (-29.41%)

Mutual labels: vqa

vqa-soft

Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 VQA workshop paper.

Stars: ✭ 14 (-72.55%)

Mutual labels: vqa

FigureQA-baseline

TensorFlow implementation of the CNN-LSTM, Relation Network and text-only baselines for the paper "FigureQA: An Annotated Figure Dataset for Visual Reasoning"

Stars: ✭ 28 (-45.1%)

Mutual labels: vqa

DVQA dataset

DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018

Stars: ✭ 20 (-60.78%)

Mutual labels: vqa

just-ask

[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Stars: ✭ 57 (+11.76%)

Mutual labels: vqa

AoA-pytorch

A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering

Stars: ✭ 33 (-35.29%)

Mutual labels: vqa

probnmn-clevr

Code for ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]

Stars: ✭ 63 (+23.53%)

Mutual labels: vqa

iMIX

A framework for Multimodal Intelligence research from Inspur HSSLAB.

Stars: ✭ 21 (-58.82%)

Mutual labels: vqa

Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Stars: ✭ 484 (+849.02%)

Mutual labels: vqa

mmgnn textvqa

A Pytorch implementation of CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Stars: ✭ 41 (-19.61%)

Mutual labels: vqa

hcrn-videoqa

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

Stars: ✭ 111 (+117.65%)

Mutual labels: vqa

cfvqa

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias