PaddlevideoComprehensive, latest, and deployable video deep learning algorithm, including video recognition, action localization, and temporal action detection tasks. It's a high-performance, light-weight codebase provides practical models for video understanding research and application
Stars: ✭ 218 (+445%)
Hidden Two StreamCaffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"
Stars: ✭ 179 (+347.5%)
bLVNet-TAMThe official Codes for NeurIPS 2019 paper. Quanfu Fan, Ricarhd Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Efficient Video Representations by Temporal Aggregation Modules"
Stars: ✭ 54 (+35%)
AlphactionSpatio-Temporal Action Localization System
Stars: ✭ 221 (+452.5%)
UntrimmednetWeakly Supervised Action Recognition and Detection
Stars: ✭ 152 (+280%)
conv3d-video-action-recognitionMy experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
Stars: ✭ 50 (+25%)
StepSTEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Stars: ✭ 196 (+390%)
tfvaegan[ECCV 2020] Official Pytorch implementation for "Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification". SOTA results for ZSL and GZSL
Stars: ✭ 107 (+167.5%)
C3d KerasC3D for Keras + TensorFlow
Stars: ✭ 171 (+327.5%)
AttentionalpoolingactionCode/Model release for NIPS 2017 paper "Attentional Pooling for Action Recognition"
Stars: ✭ 248 (+520%)
HakeHAKE: Human Activity Knowledge Engine (CVPR'18/19/20, NeurIPS'20)
Stars: ✭ 132 (+230%)
Action recognition zooCodes for popular action recognition models, verified on the something-something data set.
Stars: ✭ 227 (+467.5%)
gzsl-odOut-of-Distribution Detection for Generalized Zero-Shot Action Recognition
Stars: ✭ 47 (+17.5%)
ActionvladActionVLAD for video action classification (CVPR 2017)
Stars: ✭ 217 (+442.5%)
MUSES[CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark
Stars: ✭ 51 (+27.5%)
Pose2vecA Repository for maintaining various human skeleton preprocessing steps in numpy and tensorflow along with tensorflow model to learn pose embeddings.
Stars: ✭ 25 (-37.5%)
Hand pose actionDataset and code for the paper "First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations", CVPR 2018.
Stars: ✭ 173 (+332.5%)
MiCT-Net-PyTorchVideo Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
Stars: ✭ 48 (+20%)
Dd NetA lightweight network for body/hand action recognition
Stars: ✭ 161 (+302.5%)
Squeeze-and-Recursion-Temporal-GatesCode for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Multi-Temporal Convolutions for Human Action Recognition in Videos".
Stars: ✭ 62 (+55%)
Hoi Learning ListA list of the Human-Object Interaction Learning studies.
Stars: ✭ 145 (+262.5%)
temporal-binding-networkImplementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
Stars: ✭ 95 (+137.5%)
TA3N[ICCV 2019 Oral] TA3N: https://github.com/cmhungsteve/TA3N (Most updated repo)
Stars: ✭ 45 (+12.5%)
Action RecognitionExploration of different solutions to action recognition in video, using neural networks implemented in PyTorch.
Stars: ✭ 129 (+222.5%)
Openpose-based-GUI-for-Realtime-Pose-Estimate-and-Action-RecognitionGUI based on the python api of openpose in windows using cuda10 and cudnn7. Support body , hand, face keypoints estimation and data saving. Realtime gesture recognition is realized through two-layer neural network based on the skeleton collected from the gui.
Stars: ✭ 69 (+72.5%)
LintelA Python module to decode video frames directly, using the FFmpeg C API.
Stars: ✭ 240 (+500%)
UAV-Human[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
Stars: ✭ 122 (+205%)
Ms G3d[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
Stars: ✭ 225 (+462.5%)
ViCC[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Stars: ✭ 33 (-17.5%)
Ican[BMVC 2018] iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection
Stars: ✭ 225 (+462.5%)
Two-Stream-CNNTwo Stream CNN implemented in Keras using in skeleton-based action recognition with dataset NTU RGB+D
Stars: ✭ 75 (+87.5%)
Ta3n[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)
Stars: ✭ 217 (+442.5%)
MSAFOffical implementation of paper "MSAF: Multimodal Split Attention Fusion"
Stars: ✭ 47 (+17.5%)
Ig65m PytorchPyTorch 3D video classification models pre-trained on 65 million Instagram videos
Stars: ✭ 217 (+442.5%)
TCEThis repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).
Stars: ✭ 51 (+27.5%)
MmskeletonA OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Stars: ✭ 2,378 (+5845%)
VideoTransformer-pytorchPyTorch implementation of a collections of scalable Video Transformer Benchmarks.
Stars: ✭ 159 (+297.5%)
AmassData preparation and loader for AMASS
Stars: ✭ 180 (+350%)
adascan-publicCode for AdaScan: Adaptive Scan Pooling (CVPR 2017)
Stars: ✭ 43 (+7.5%)
VipVideo Platform for Action Recognition and Object Detection in Pytorch
Stars: ✭ 175 (+337.5%)
C3D-tensorflowAction recognition with C3D network implemented in tensorflow
Stars: ✭ 34 (-15%)
Video CaffeVideo-friendly caffe -- comes with the most recent version of Caffe (as of Jan 2019), a video reader, 3D(ND) pooling layer, and an example training script for C3D network and UCF-101 data
Stars: ✭ 172 (+330%)
dynamic-images-for-action-recognitionA public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et al.
Stars: ✭ 27 (-32.5%)
TimeceptionTimeception for Complex Action Recognition, CVPR 2019 (Oral Presentation)
Stars: ✭ 153 (+282.5%)
pushup-counter-appCount pushups from video/webcam. Tech stack: Keypoint detection, BlazePose, action recognition.
Stars: ✭ 48 (+20%)
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (-5%)
temporal-sslVideo Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
Stars: ✭ 46 (+15%)
synse-zslOfficial PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'
Stars: ✭ 14 (-65%)
pose2actionexperiments on classifying actions using poses
Stars: ✭ 24 (-40%)
ailia-modelsThe collection of pre-trained, state-of-the-art AI models for ailia SDK
Stars: ✭ 1,102 (+2655%)
sparsepropTemporal action proposals
Stars: ✭ 46 (+15%)