open-mmlab / mmaction2

License: Apache-2.0
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Programming Languages

Python: 139,335 projects; #7 most used programming language

Projects that are alternatives of or similar to Mmaction2

PaddleVideo
A comprehensive, up-to-date, and deployable video deep learning codebase covering video recognition, action localization, and temporal action detection. It is a high-performance, lightweight codebase that provides practical models for video understanding research and applications.
Stars: ✭ 218 (-68.13%)
Mutual labels:  action-recognition, video-understanding, ava
STEP
STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Stars: ✭ 196 (-71.35%)
Mutual labels:  action-recognition, video-understanding, ava
Awesome Activity Prediction
Paper list of activity prediction and related area
Stars: ✭ 147 (-78.51%)
Mutual labels:  action-recognition, video-understanding
Hand pose action
Dataset and code for the paper "First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations", CVPR 2018.
Stars: ✭ 173 (-74.71%)
Mutual labels:  action-recognition, benchmark
ActionVLAD
ActionVLAD for video action classification (CVPR 2017)
Stars: ✭ 217 (-68.27%)
Mutual labels:  action-recognition, video-understanding
MovieNet Tools
Tools for movie and video research
Stars: ✭ 113 (-83.48%)
Mutual labels:  action-recognition, video-understanding
I3D Finetune
TensorFlow code for finetuning I3D model on UCF101.
Stars: ✭ 128 (-81.29%)
Mutual labels:  action-recognition, video-understanding
TSN-PyTorch
Temporal Segment Networks (TSN) in PyTorch
Stars: ✭ 895 (+30.85%)
Mutual labels:  action-recognition, video-understanding
MTL-AQA
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (-94.44%)
Mutual labels:  action-recognition, video-understanding
DIN-Group-Activity-Recognition-Benchmark
A new codebase for Group Activity Recognition. It contains code for the ICCV 2021 paper "Spatio-Temporal Dynamic Inference Network for Group Activity Recognition" and some other methods.
Stars: ✭ 26 (-96.2%)
Mutual labels:  action-recognition, video-understanding
DEAR
[ICCV 2021 Oral] Deep Evidential Action Recognition
Stars: ✭ 36 (-94.74%)
Mutual labels:  action-recognition, video-understanding
Temporal Segment Networks
Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
Stars: ✭ 1,287 (+88.16%)
Mutual labels:  action-recognition, video-understanding
TDN
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
Stars: ✭ 72 (-89.47%)
Mutual labels:  action-recognition, video-understanding
MMAction
An open-source toolbox for action understanding based on PyTorch
Stars: ✭ 1,711 (+150.15%)
Mutual labels:  action-recognition, video-understanding
Okutama Action
Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection
Stars: ✭ 36 (-94.74%)
Mutual labels:  action-recognition, benchmark
Action Detection
Temporal action detection with SSN
Stars: ✭ 597 (-12.72%)
Mutual labels:  action-recognition, video-understanding
AlphAction
Spatio-Temporal Action Localization System
Stars: ✭ 221 (-67.69%)
Mutual labels:  action-recognition, ava
Awesome Action Recognition
A curated list of action recognition and related area resources
Stars: ✭ 3,202 (+368.13%)
Mutual labels:  action-recognition, video-understanding
Video Understanding Dataset
A collection of recent video understanding datasets, under construction!
Stars: ✭ 387 (-43.42%)
Mutual labels:  action-recognition, video-understanding
Celero
C++ Benchmark Authoring Library/Framework
Stars: ✭ 593 (-13.3%)
Mutual labels:  benchmark

Introduction

English | 简体中文

MMAction2 is an open-source toolbox for video understanding based on PyTorch. It is a part of the OpenMMLab project.

The master branch works with PyTorch 1.3+.


Action Recognition Results on Kinetics-400

Spatio-Temporal Action Detection Results on AVA-2.1

Major Features

  • Modular design

    We decompose the video understanding framework into different components, so one can easily construct a customized video understanding pipeline by combining different modules (see the config sketch after this feature list).

  • Support for various datasets

    The toolbox directly supports multiple datasets, including UCF101, Kinetics-[400/600/700], Something-Something V1 & V2, Moments in Time, Multi-Moments in Time, THUMOS14, etc.

  • Support for multiple video understanding frameworks

    MMAction2 implements popular frameworks for video understanding:

    • For action recognition, various algorithms are implemented, including TSN, TSM, TIN, R(2+1)D, I3D, SlowOnly, SlowFast, CSN, Non-local, etc.

    • For temporal action localization, we implement BSN, BMN, and SSN.

    • For spatio-temporal action detection, we implement SlowOnly and SlowFast.

  • Well tested and documented

    We provide detailed documentation and API reference, as well as unittests.
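
As a minimal sketch of what this modular design looks like in practice, the hypothetical config below assembles a TSN-style recognizer from a backbone and a classification head following the MMAction2 config convention; the specific values are illustrative placeholders rather than a tuned recipe, so check the shipped configs under configs/ for real settings.

# Illustrative MMAction2-style model config (Python). Swapping the backbone
# or cls_head dict composes a different recognizer without changing code.
model = dict(
    type='Recognizer2D',                       # recognizer wrapper for 2D backbones (TSN/TSM style)
    backbone=dict(
        type='ResNet',                         # interchangeable feature-extractor module
        depth=50,
        pretrained='torchvision://resnet50'),  # assumed pretrained weight source
    cls_head=dict(
        type='TSNHead',                        # interchangeable classification head
        num_classes=400,                       # e.g. Kinetics-400
        in_channels=2048))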

Changelog

v0.12.0 was released on 28/02/2021. Please refer to changelog.md for details and release history.

Benchmark

Model | Input | IO backend | Batch size x GPUs | MMAction2 (s/iter) | MMAction (s/iter) | Temporal-Shift-Module (s/iter) | PySlowFast (s/iter)
TSN | 256p rawframes | Memcached | 32x8 | 0.32 | 0.38 | 0.42 | x
TSN | 256p dense-encoded video | Disk | 32x8 | 0.61 | x | x | TODO
I3D heavy | 256p videos | Disk | 8x8 | 0.34 | x | x | 0.44
I3D | 256p rawframes | Memcached | 8x8 | 0.43 | 0.56 | x | x
TSM | 256p rawframes | Memcached | 8x8 | 0.31 | x | 0.41 | x
SlowOnly | 256p videos | Disk | 8x8 | 0.32 | TODO | x | 0.34
SlowFast | 256p videos | Disk | 8x8 | 0.69 | x | x | 1.04
R(2+1)D | 256p videos | Disk | 8x8 | 0.45 | x | x | x

Details can be found in benchmark.
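
As a reading aid for the table above, per-iteration time can be converted into training throughput as batch size x GPUs / (s/iter); this assumes, as the benchmark appears to, that s/iter is the wall-clock time of one iteration over the whole distributed batch.

# Rough throughput estimate from the benchmark table; assumes s/iter covers
# one full iteration over the (batch size x GPUs) distributed batch.
def videos_per_second(batch_size, num_gpus, sec_per_iter):
    return batch_size * num_gpus / sec_per_iter

# Example: the TSN row (batch 32 x 8 GPUs, 0.32 s/iter in MMAction2)
# works out to 32 * 8 / 0.32 = 800 videos/s of training throughput.
print(videos_per_second(32, 8, 0.32))  # 800.0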

ModelZoo

Supported methods for Action Recognition:

Supported methods for Temporal Action Detection:

  • [x] BSN (ECCV'2018)
  • [x] BMN (ICCV'2019)
  • [x] SSN (ICCV'2017)

Supported methods for Spatio-Temporal Action Detection:

Results and models are available in the README.md of each method's config directory. A summary can be found in the model zoo page.

We will keep up with the latest progress of the community, and support more popular algorithms and frameworks. If you have any feature requests, please feel free to leave a comment in Issues.

Dataset

Supported datasets:

Supported datasets for Action Recognition:

Supported datasets for Temporal Action Detection:

Supported datasets for Spatio-Temporal Action Detection:

Installation

Please refer to install.md for installation.

Data Preparation

Please refer to data_preparation.md for an overview of data preparation. The supported datasets are listed in supported_datasets.md.
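
As a rough illustration of the file-list conventions described in data_preparation.md, the sketch below writes the two common annotation formats: one line per video ("relative/path label") for video datasets, and one line per clip ("frame_directory total_frames label") for extracted rawframes. The paths, labels, and output file names are made-up placeholders.

# Hypothetical helper writing MMAction2-style file lists; all paths, labels,
# and file names below are placeholders for illustration only.
video_annotations = [
    ('class_a/video_0001.mp4', 0),    # (relative video path, label index)
    ('class_b/video_0042.mp4', 3),
]
with open('train_list_videos.txt', 'w') as f:
    for path, label in video_annotations:
        f.write(f'{path} {label}\n')

rawframe_annotations = [
    ('class_a/video_0001', 150, 0),   # (frame directory, total frames, label index)
    ('class_b/video_0042', 300, 3),
]
with open('train_list_rawframes.txt', 'w') as f:
    for frame_dir, num_frames, label in rawframe_annotations:
        f.write(f'{frame_dir} {num_frames} {label}\n')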

Get Started

Please see getting_started.md for the basic usage of MMAction2. Tutorials on common workflows are also available in the documentation.

A Colab tutorial is also provided; you may preview the notebook or run it directly on Colab.
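
To give a flavor of the basic usage covered in getting_started.md, the hedged sketch below runs single-video inference with the high-level Python API (init_recognizer / inference_recognizer from mmaction.apis). The config, checkpoint, video, and label-map paths are placeholders, and the exact inference_recognizer signature and return format vary across MMAction2 versions, so treat this as an outline rather than copy-paste code.

# Hedged inference sketch for the 0.x-era MMAction2 API; all file paths are
# placeholders that should point at a real config, checkpoint, and video.
from mmaction.apis import init_recognizer, inference_recognizer

config_file = 'configs/recognition/tsn/tsn_r50_1x1x3_100e_kinetics400_rgb.py'
checkpoint_file = 'checkpoints/tsn_r50_kinetics400.pth'  # downloaded from the model zoo
label_file = 'path/to/label_map_k400.txt'                # class-name mapping (placeholder)

# Build the recognizer from config + weights on the first GPU.
model = init_recognizer(config_file, checkpoint_file, device='cuda:0')

# Run recognition on one video; in this API generation the result is a list
# of (class name, score) pairs for the top predictions.
results = inference_recognizer(model, 'demo/demo.mp4', label_file)
for name, score in results:
    print(name, score)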

FAQ

Please refer to FAQ for frequently asked questions.

License

This project is released under the Apache 2.0 license.

Citation

If you find this project useful in your research, please consider citing:

@misc{2020mmaction2,
    title={OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark},
    author={MMAction2 Contributors},
    howpublished = {\url{https://github.com/open-mmlab/mmaction2}},
    year={2020}
}

Contributing

We appreciate all contributions to improve MMAction2. Please refer to CONTRIBUTING.md in MMCV for more details about the contributing guideline.

Acknowledgement

MMAction2 is an open-source project contributed to by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as the users who give valuable feedback. We hope that the toolbox and benchmark can serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop their own new models.

Projects in OpenMMLab

  • MMCV: OpenMMLab foundational library for computer vision.
  • MMClassification: OpenMMLab image classification toolbox and benchmark.
  • MMDetection: OpenMMLab detection toolbox and benchmark.
  • MMDetection3D: OpenMMLab's next-generation platform for general 3D object detection.
  • MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
  • MMAction2: OpenMMLab's next-generation video understanding toolbox and benchmark.
  • MMTracking: OpenMMLab video perception toolbox and benchmark.
  • MMPose: OpenMMLab pose estimation toolbox and benchmark.
  • MMEditing: OpenMMLab image and video editing toolbox.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].