40 open-source projects that are alternatives to or similar to NExT-QA

[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Stars: ✭ 57 (+14%)

Mutual labels: video-understanding, videoqa, video-question-answering

pytorch violet

A PyTorch implementation of VIOLET

Stars: ✭ 119 (+138%)

Mutual labels: video-question-answering

glimpse clouds

PyTorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille, G.W. Taylor, CVPR 2018

Stars: ✭ 30 (-40%)

Mutual labels: video-understanding

hcrn-videoqa

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

Stars: ✭ 111 (+122%)

Mutual labels: videoqa

Kaleido-BERT

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.

Stars: ✭ 252 (+404%)

Mutual labels: vision-language

SSTDA

[CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)

Stars: ✭ 150 (+200%)

Mutual labels: video-understanding

Awesome Grounding

awesome grounding: A curated list of research papers in visual grounding

Stars: ✭ 247 (+394%)

Mutual labels: video-understanding

Paddlevideo

Comprehensive, up-to-date, and deployable video deep learning algorithms, covering video recognition, action localization, and temporal action detection tasks. It is a high-performance, lightweight codebase that provides practical models for video understanding research and applications.

Stars: ✭ 218 (+336%)

Mutual labels: video-understanding

Actionvlad

ActionVLAD for video action classification (CVPR 2017)

Stars: ✭ 217 (+334%)

Mutual labels: video-understanding

Step

STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)

Stars: ✭ 196 (+292%)

Mutual labels: video-understanding

Youtube 8m

The 2nd place Solution to the Youtube-8M Video Understanding Challenge by Team Monkeytyping (based on tensorflow)

Stars: ✭ 171 (+242%)

Mutual labels: video-understanding

Object level visual reasoning

PyTorch implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori, ECCV 2018

Stars: ✭ 163 (+226%)

Mutual labels: video-understanding

Awesome Activity Prediction

Paper list for activity prediction and related areas

Stars: ✭ 147 (+194%)

Mutual labels: video-understanding

Video2tfrecord

Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format, for example to train a neural network in TensorFlow. This implementation allows limiting the number of frames per video stored in the tfrecords.

Stars: ✭ 137 (+174%)

Mutual labels: video-understanding

Multiverse

Dataset, code, and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction", and for the ECCV'20 SimAug paper.

Stars: ✭ 131 (+162%)

Mutual labels: video-understanding

Mmaction

An open-source toolbox for action understanding based on PyTorch

Stars: ✭ 1,711 (+3322%)

Mutual labels: video-understanding

I3d finetune

TensorFlow code for finetuning I3D model on UCF101.

Stars: ✭ 128 (+156%)

Mutual labels: video-understanding

Movienet Tools

Tools for movie and video research

Stars: ✭ 113 (+126%)

Mutual labels: video-understanding

Temporal Segment Networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Stars: ✭ 1,287 (+2474%)

Mutual labels: video-understanding

Temporal Shift Module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Stars: ✭ 1,282 (+2464%)

Mutual labels: video-understanding

Temporally Language Grounding

A PyTorch implementation of some state-of-the-art models for "Temporally Language Grounding in Untrimmed Videos"

Stars: ✭ 73 (+46%)

Mutual labels: video-understanding

Tdn

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

Stars: ✭ 72 (+44%)

Mutual labels: video-understanding

Tsn Pytorch

Temporal Segment Networks (TSN) in PyTorch

Stars: ✭ 895 (+1690%)

Mutual labels: video-understanding

Mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Stars: ✭ 684 (+1268%)

Mutual labels: video-understanding

Action Detection

temporal action detection with SSN

Stars: ✭ 597 (+1094%)

Mutual labels: video-understanding

Activity Recognition With Cnn And Rnn

Temporal Segments LSTM and Temporal-Inception for Activity Recognition

Stars: ✭ 415 (+730%)

Mutual labels: video-understanding

Video Understanding Dataset

A collection of recent video understanding datasets, under construction!

Stars: ✭ 387 (+674%)

Mutual labels: video-understanding

Awesome Action Recognition

A curated list of action recognition and related area resources

Stars: ✭ 3,202 (+6304%)

Mutual labels: video-understanding

DEAR

[ICCV 2021 Oral] Deep Evidential Action Recognition

Stars: ✭ 36 (-28%)

Mutual labels: video-understanding

PyAnomaly

Useful Toolbox for Anomaly Detection

Stars: ✭ 95 (+90%)

Mutual labels: video-understanding

DIN-Group-Activity-Recognition-Benchmark

A new codebase for Group Activity Recognition. It contains code for the ICCV 2021 paper "Spatio-Temporal Dynamic Inference Network for Group Activity Recognition" and some other methods.

Stars: ✭ 26 (-48%)

Mutual labels: video-understanding

CP-360-Weakly-Supervised-Saliency

Stars: ✭ 20 (-60%)

Mutual labels: video-understanding

Awesome-Temporally-Language-Grounding

A curated list of “Temporally Language Grounding” and related area

Stars: ✭ 97 (+94%)

Mutual labels: video-understanding

STCNet

STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

Stars: ✭ 29 (-42%)