Video ClassificationTutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Two Stream PytorchPyTorch implementation of two-stream networks for video action recognition
Realtime Action RecognitionApply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
DEAR[ICCV 2021 Oral] Deep Evidential Action Recognition
DIN-Group-Activity-Recognition-BenchmarkA new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.
auditory-slow-fastImplementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch
GST-videoICCV 19 Grouped Spatial-Temporal Aggretation for Efficient Action Recognition
ntu-xNTU-X, which is an extended version of popular NTU dataset
cpnetLearning Video Representations from Correspondence Proposals (CVPR 2019 Oral)
TadTREnd-to-end Temporal Action Detection with Transformer. [Under review for a journal publication]
Robust-Deep-Learning-PipelineDeep Convolutional Bidirectional LSTM for Complex Activity Recognition with Missing Data. Human Activity Recognition Challenge. Springer SIST (2020)
video repres mascode for CVPR-2019 paper: Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
sanThe official PyTorch implementation of "Context Matters: Self-Attention for sign Language Recognition"
TCFPN-ISBATemporal Convolutional Feature Pyramid Network (TCFPN) & Iterative Soft Boundary Assignment (ISBA), CVPR '18
synse-zslOfficial PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'
Two-Stream-CNNTwo Stream CNN implemented in Keras using in skeleton-based action recognition with dataset NTU RGB+D
Pose2vecA Repository for maintaining various human skeleton preprocessing steps in numpy and tensorflow along with tensorflow model to learn pose embeddings.
pushup-counter-appCount pushups from video/webcam. Tech stack: Keypoint detection, BlazePose, action recognition.
pose2actionexperiments on classifying actions using poses
UAV-Human[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
gzsl-odOut-of-Distribution Detection for Generalized Zero-Shot Action Recognition
TCEThis repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).
tfvaegan[ECCV 2020] Official Pytorch implementation for "Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification". SOTA results for ZSL and GZSL
Squeeze-and-Recursion-Temporal-GatesCode for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Multi-Temporal Convolutions for Human Action Recognition in Videos".
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
ailia-modelsThe collection of pre-trained, state-of-the-art AI models for ailia SDK
ViCC[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
conv3d-video-action-recognitionMy experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
MSAFOffical implementation of paper "MSAF: Multimodal Split Attention Fusion"
MUSES[CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark
bLVNet-TAMThe official Codes for NeurIPS 2019 paper. Quanfu Fan, Ricarhd Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Efficient Video Representations by Temporal Aggregation Modules"
C3D-tensorflowAction recognition with C3D network implemented in tensorflow
MiCT-Net-PyTorchVideo Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
temporal-binding-networkImplementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
temporal-sslVideo Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.