All Projects → MVIG-SJTU → Alphaction

MVIG-SJTU / Alphaction

Spatio-Temporal Action Localization System

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Alphaction

Mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Stars: ✭ 684 (+209.5%)
Mutual labels:  action-recognition, ava
torch-lrcn
An implementation of the LRCN in Torch
Stars: ✭ 85 (-61.54%)
Mutual labels:  torch, action-recognition
Pytorch gbw lm
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
Stars: ✭ 101 (-54.3%)
Mutual labels:  gpu, torch
Digits
Deep Learning GPU Training System
Stars: ✭ 4,056 (+1735.29%)
Mutual labels:  gpu, torch
Paddlevideo
Comprehensive, latest, and deployable video deep learning algorithm, including video recognition, action localization, and temporal action detection tasks. It's a high-performance, light-weight codebase provides practical models for video understanding research and application
Stars: ✭ 218 (-1.36%)
Mutual labels:  action-recognition, ava
Paperspace Python
Paperspace API for python
Stars: ✭ 38 (-82.81%)
Mutual labels:  gpu, torch
Step
STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Stars: ✭ 196 (-11.31%)
Mutual labels:  action-recognition, ava
Ig65m Pytorch
PyTorch 3D video classification models pre-trained on 65 million Instagram videos
Stars: ✭ 217 (-1.81%)
Mutual labels:  action-recognition
Terrain Builder
🏔 Procedural terrain using Three.js and perlin noise, Now Accelerated by your GPU!
Stars: ✭ 228 (+3.17%)
Mutual labels:  gpu
Genomeworks
SDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (-2.71%)
Mutual labels:  gpu
Nvidia Clerk
A cross-platform go bot that tracks for availability of stock from Nvidia's store and adds a cart to your checkout.
Stars: ✭ 214 (-3.17%)
Mutual labels:  gpu
Actionvlad
ActionVLAD for video action classification (CVPR 2017)
Stars: ✭ 217 (-1.81%)
Mutual labels:  action-recognition
Cupoch
Robotics with GPU computing
Stars: ✭ 225 (+1.81%)
Mutual labels:  gpu
Visdial
[CVPR 2017] Torch code for Visual Dialog
Stars: ✭ 215 (-2.71%)
Mutual labels:  torch
Optix Pathtracer
Simple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+4.52%)
Mutual labels:  gpu
Torch
R Interface to Torch
Stars: ✭ 214 (-3.17%)
Mutual labels:  torch
Tf Quant Finance
High-performance TensorFlow library for quantitative finance.
Stars: ✭ 2,925 (+1223.53%)
Mutual labels:  gpu
Ms G3d
[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
Stars: ✭ 225 (+1.81%)
Mutual labels:  action-recognition
Torchdata
PyTorch dataset extended with map, cache etc. (tensorflow.data like)
Stars: ✭ 226 (+2.26%)
Mutual labels:  torch
Datoviz
⚡ High-performance GPU interactive scientific data visualization with Vulkan
Stars: ✭ 222 (+0.45%)
Mutual labels:  gpu

AlphAction

AlphAction aims to detect the actions of multiple persons in videos. It is the first open-source project that achieves 30+ mAP (32.4 mAP) with single model on AVA dataset.

This project is the official implementation of paper Asynchronous Interaction Aggregation for Action Detection (ECCV 2020), authored by Jiajun Tang*, Jin Xia* (equal contribution), Xinzhi Mu, Bo Pang, Cewu Lu (corresponding author).


demo1 demo2
demo3

Demo Video

AlphAction demo video [YouTube] [BiliBili]

Installation

You need first to install this project, please check INSTALL.md

Data Preparation

To do training or inference on AVA dataset, please check DATA.md for data preparation instructions.

Model Zoo

Please see MODEL_ZOO.md for downloading models.

Training and Inference

To do training or inference with AlphAction, please refer to GETTING_STARTED.md.

Demo Program

To run the demo program on video or webcam, please check the folder demo. We select 15 common categories from the 80 action categories of AVA, and provide a practical model which achieves high accuracy (about 70 mAP) on these categories.

Acknowledgement

We thankfully acknowledge the computing resource support of Huawei Corporation for this project.

Citation

If this project helps you in your research or project, please cite this paper:

@inproceedings{tang2020asynchronous,
  title={Asynchronous Interaction Aggregation for Action Detection},
  author={Tang, Jiajun and Xia, Jin and Mu, Xinzhi and Pang, Bo and Lu, Cewu},
  booktitle={Proceedings of the European conference on computer vision (ECCV)},
  year={2020}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].