TensorFlow implementation of the CNN-LSTM, Relation Network and text-only baselines for the paper "FigureQA: An Annotated Figure Dataset for Visual Reasoning"

Stars: ✭ 28 (-50.88%)

Mutual labels: vqa, visual-question-answering

TRAR-VQA

[ICCV 2021] TRAR: Routing the Attention Spans in Transformers for Visual Question Answering -- Official Implementation

Stars: ✭ 49 (-14.04%)

Mutual labels: visual-question-answering, vision-and-language

self critical vqa

Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''

Stars: ✭ 39 (-31.58%)

Mutual labels: vqa, visual-question-answering

iMIX

A framework for Multimodal Intelligence research from Inspur HSSLAB.

Stars: ✭ 21 (-63.16%)

Mutual labels: vqa, vision-and-language

WeSHClass

[AAAI 2019] Weakly-Supervised Hierarchical Text Classification

Stars: ✭ 83 (+45.61%)

Mutual labels: weakly-supervised-learning

weasel

Weakly Supervised End-to-End Learning (NeurIPS 2021)

Stars: ✭ 117 (+105.26%)

Mutual labels: weakly-supervised-learning

MIA

Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）

Stars: ✭ 57 (+0%)

Mutual labels: vision-and-language

wikiHow paper list

A paper list of research conducted based on wikiHow

Stars: ✭ 25 (-56.14%)

Mutual labels: vision-and-language

VCML

PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019

Stars: ✭ 45 (-21.05%)

Mutual labels: visual-question-answering

SPML

Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning

Stars: ✭ 81 (+42.11%)

Mutual labels: weakly-supervised-learning

awesome-graph-self-supervised-learning

Awesome Graph Self-Supervised Learning

Stars: ✭ 805 (+1312.28%)

Mutual labels: pre-training

MTL-AQA

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]

Stars: ✭ 38 (-33.33%)

Mutual labels: video-understanding

VarCLR

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

Stars: ✭ 30 (-47.37%)

Mutual labels: pre-training

WS3D

Official version of 'Weakly Supervised 3D object detection from Lidar Point Cloud'(ECCV2020)

Stars: ✭ 104 (+82.46%)

Mutual labels: weakly-supervised-learning

Transformer-QG-on-SQuAD

Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)

Stars: ✭ 28 (-50.88%)

Mutual labels: question-generation

TS-CAM

Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

Stars: ✭ 96 (+68.42%)

Mutual labels: weakly-supervised-learning

iPerceive

Stars: ✭ 52 (-8.77%)

Mutual labels: videoqa

knodle

A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently develop and compare their own methods.

Stars: ✭ 76 (+33.33%)

Mutual labels: weakly-supervised-learning

KorQuAD-Question-Generation

question generation model with KorQuAD dataset

Stars: ✭ 27 (-52.63%)

Mutual labels: question-generation

X-VLM

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Stars: ✭ 283 (+396.49%)

Mutual labels: vision-and-language

VidSitu

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Stars: ✭ 41 (-28.07%)

Mutual labels: vision-and-language

Zero-shot-Fact-Verification

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

Stars: ✭ 39 (-31.58%)

Mutual labels: question-generation

CBP

Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"

Stars: ✭ 52 (-8.77%)

Mutual labels: vision-and-language

Simple-does-it-weakly-supervised-instance-and-semantic-segmentation

Weakly Supervised Segmentation by Tensorflow. Implements semantic segmentation in Simple Does It: Weakly Supervised Instance and Semantic Segmentation, by Khoreva et al. (CVPR 2017).

Stars: ✭ 46 (-19.3%)

Mutual labels: weakly-supervised-learning

WSDEC

Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised manner.

Stars: ✭ 95 (+66.67%)

Mutual labels: weakly-supervised-learning

Tianchi2020ChineseMedicineQuestionGeneration

2020 阿里云天池大数据竞赛-中医药文献问题生成挑战赛

Stars: ✭ 20 (-64.91%)

Mutual labels: question-generation

Learning-Action-Completeness-from-Points

Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization' (ICCV-21 Oral)

Stars: ✭ 53 (-7.02%)

Mutual labels: weakly-supervised-learning

explicit memory tracker

[ACL 2020] Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading

Stars: ✭ 35 (-38.6%)

Mutual labels: question-generation

Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Stars: ✭ 484 (+749.12%)

Mutual labels: vqa

STCNet

STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

Stars: ✭ 29 (-49.12%)

Mutual labels: video-understanding

synse-zsl

Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'

Stars: ✭ 14 (-75.44%)

Mutual labels: vision-and-language

Awesome-Weak-Shot-Learning

A curated list of papers, code and resources pertaining to weak-shot classification, detection, and segmentation.

Stars: ✭ 142 (+149.12%)

Mutual labels: weakly-supervised-learning

question generator

An NLP system for generating reading comprehension questions

Stars: ✭ 188 (+229.82%)

Mutual labels: question-generation

TopicNet

Interface for easier topic modelling.

Stars: ✭ 127 (+122.81%)

Mutual labels: multimodal-learning

calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Stars: ✭ 105 (+84.21%)

Mutual labels: vision-and-language

GAL-fWSD

Generative Adversarial Learning Towards Fast Weakly Supervised Detection

Stars: ✭ 18 (-68.42%)

Mutual labels: weakly-supervised-learning

clip playground

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities

Stars: ✭ 80 (+40.35%)

Mutual labels: vision-and-language

Learning-From-Rules

Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net/forum?id=SkeuexBtDr)

Stars: ✭ 46 (-19.3%)

Mutual labels: weakly-supervised-learning

dcsp segmentation

No description or website provided.

Stars: ✭ 34 (-40.35%)

Mutual labels: weakly-supervised-learning

deviation-network

Source code of the KDD19 paper "Deep anomaly detection with deviation networks", weakly/partially supervised anomaly detection, few-shot anomaly detection

Stars: ✭ 94 (+64.91%)

Mutual labels: weakly-supervised-learning

trove

Weakly supervised medical named entity classification

Stars: ✭ 55 (-3.51%)

Mutual labels: weakly-supervised-learning

hexia

Mid-level PyTorch Based Framework for Visual Question Answering.

Stars: ✭ 24 (-57.89%)

Mutual labels: visual-question-answering

mmgnn textvqa

A Pytorch implementation of CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Stars: ✭ 41 (-28.07%)

Mutual labels: vqa

C2C

Implementation of Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification approach.

Stars: ✭ 30 (-47.37%)

Mutual labels: weakly-supervised-learning

concept-based-xai

Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI

Stars: ✭ 41 (-28.07%)

Mutual labels: weakly-supervised-learning

multimodal-vae-public

A PyTorch implementation of "Multimodal Generative Models for Scalable Weakly-Supervised Learning" (https://arxiv.org/abs/1802.05335)

Stars: ✭ 98 (+71.93%)

Mutual labels: multimodal-learning

MSAF

Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"

Stars: ✭ 47 (-17.54%)

Mutual labels: multimodal-learning

detect-shortcuts

Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering

Stars: ✭ 17 (-70.18%)

Mutual labels: visual-question-answering

WSL4MIS

Scribbles or Points-based weakly-supervised learning for medical image segmentation, a strong baseline, and tutorial for research and application.

Stars: ✭ 100 (+75.44%)

Mutual labels: weakly-supervised-learning

RelationNetworks-CLEVR

A pytorch implementation for "A simple neural network module for relational reasoning", working on the CLEVR dataset

Stars: ✭ 83 (+45.61%)

Mutual labels: visual-question-answering

MLH-Quizzet

This is a smart Quiz Generator that generates a dynamic quiz from any uploaded text/PDF document using NLP. This can be used for self-analysis, question paper generation, and evaluation, thus reducing human effort.

Stars: ✭ 23 (-59.65%)

Mutual labels: question-generation

stanford-cs231n-assignments-2020

This repository contains my solutions to the assignments for Stanford's CS231n "Convolutional Neural Networks for Visual Recognition" (Spring 2020).

Stars: ✭ 84 (+47.37%)

Mutual labels: vision-and-language

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Stars: ✭ 738 (+1194.74%)

Mutual labels: question-generation

robo-vln

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Stars: ✭ 34 (-40.35%)

Mutual labels: vision-and-language

1-60 of 155 similar projects

›