Top 33 video-understanding open source projects

awesome grounding

A curated list of research papers in visual grounding

✭ 247

awesome-list computer-vision natural-language-processing paper papers arxiv video-understanding

Comprehensive, up-to-date, and deployable video deep learning algorithms, covering video recognition, action localization, and temporal action detection tasks. It is a high-performance, lightweight codebase that provides practical models for video understanding research and applications.

✭ 218

python action-recognition ava video-understanding

Actionvlad

ActionVLAD for video action classification (CVPR 2017)

✭ 217

python deep-learning tensorflow video-processing action-recognition video-understanding

Step

STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)

✭ 196

python apex action-recognition ava amp activity-recognition video-understanding

Youtube 8m

The 2nd-place solution to the YouTube-8M Video Understanding Challenge by Team Monkeytyping (based on TensorFlow)

✭ 171

python deep-learning tensorflow computer-vision ensemble-learning video-understanding

Object level visual reasoning

PyTorch implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori, ECCV 2018

✭ 163

python computer-vision video-understanding

Awesome Activity Prediction

Paper list for activity prediction and related areas

✭ 147

awesome-list action-recognition activity-recognition video-understanding

Video2tfrecord

Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format, e.g. for training a neural network in TensorFlow. This implementation lets you limit the number of frames per video that are stored in the tfrecords.

✭ 137

python deep-learning tensorflow neural-network opencv optical-flow preprocessor video-understanding

Multiverse

Dataset, code, and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction" and for the ECCV'20 SimAug paper.

✭ 131

python computer-vision video-understanding

Mmaction

An open-source toolbox for action understanding based on PyTorch

✭ 1,711

python Cuda C++ shell Dockerfile pytorch action-recognition video-understanding action-detection temporal-action-detection temporal-action-localization spatial-temporal-action-detection

I3d finetune

TensorFlow code for finetuning I3D model on UCF101.

✭ 128

python deep-learning cnn action-recognition video-understanding

Movienet Tools

Tools for movie and video research

✭ 113

deep-learning computer-vision movie action-recognition video-understanding

Temporal Segment Networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

✭ 1,287

python action-recognition video-understanding

Temporal Shift Module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

✭ 1,282

python low-latency video-understanding

Temporally Language Grounding

A PyTorch implementation of some state-of-the-art models for "Temporally Language Grounding in Untrimmed Videos"

✭ 73

python video-understanding

Tdn

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

✭ 72

python pytorch action-recognition video-understanding

Tsn Pytorch

Temporal Segment Networks (TSN) in PyTorch

✭ 895

python deep-learning pytorch action-recognition video-understanding

Mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

✭ 684

python pytorch benchmark action-recognition ava video-understanding

Action Detection

Temporal action detection with SSN

✭ 597

python action-recognition video-understanding

Activity Recognition With Cnn And Rnn

Temporal Segments LSTM and Temporal-Inception for Activity Recognition

✭ 415

lua convolutional-neural-networks torch lstm-neural-networks activity-recognition video-understanding

Video Understanding Dataset

A collection of recent video understanding datasets, under construction!

✭ 387

computer-vision datasets action-recognition video-understanding

Awesome Action Recognition

A curated list of resources on action recognition and related areas

✭ 3,202

awesome awesome-list pose-estimation video-processing action-recognition activity-recognition video-understanding object-recognition activity-understanding action-detection video-recognition action-classification

DEAR

[ICCV 2021 Oral] Deep Evidential Action Recognition

✭ 36

python shell c C++ cython Dockerfile model-calibration uncertainty-quantification action-recognition video-understanding debiasing evidential-deep-learning ood-detection openset-recognition

PyAnomaly

Useful Toolbox for Anomaly Detection

✭ 95

python Cuda C++ shell Dockerfile Batchfile Makefile machine-learning computer-vision multimedia artificial-intelligence artificial-neural-networks video-understanding anomaly-detection video-analysis video-anomaly-detection pytroch

DIN-Group-Activity-Recognition-Benchmark

A new codebase for Group Activity Recognition. It contains code for the ICCV 2021 paper "Spatio-Temporal Dynamic Inference Network for Group Activity Recognition" and some other methods.

✭ 26

python Dockerfile action-recognition video-understanding dynamic-networks graph-neural-networks group-activity-recognition

CP-360-Weakly-Supervised-Saliency

✭ 20

python shell deep-neural-networks computer-vision 360-video video-understanding saliency-map

Awesome-Temporally-Language-Grounding

A curated list of “Temporally Language Grounding” and related areas

✭ 97

video-understanding temporal-activity-localization language-grounding charades activitynet-captions temporal-language-grounding

just-ask

[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

✭ 57

Jupyter Notebook python HTML vqa video-understanding weakly-supervised-learning multimodal-learning visual-question-answering question-generation vision-and-language videoqa pre-training video-question-answering

STCNet

STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

✭ 29

python video-understanding smoke-detector rise-dataset

MTL-AQA

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]

✭ 38

python shell pytorch video-processing lstm representation-learning action-recognition video-understanding c3d video-captioning captioning fine-grained-classification multitask-learning dilated-convolution action-quality-assessment mtl-aqa fine-grained-action-recognition dilated-c3d

NExT-QA

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

✭ 50

python shell video-understanding videoqa vision-language video-question-answering multi-object-interaction causal-temporal-action-reasoning

glimpse clouds

PyTorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille, G.W. Taylor, CVPR 2018

✭ 30

python shell computer-vision activity-recognition video-understanding cvpr2018

SSTDA

[CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)

✭ 150

python shell video pytorch video-understanding domain-adaptation self-supervised-learning action-segmentation domain-discrepancy temporal-dynamics cvpr2020
