Tdn[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
Stars: ✭ 72 (-87.94%)
ActionvladActionVLAD for video action classification (CVPR 2017)
Stars: ✭ 217 (-63.65%)
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (-93.63%)
MmactionAn open-source toolbox for action understanding based on PyTorch
Stars: ✭ 1,711 (+186.6%)
DEAR[ICCV 2021 Oral] Deep Evidential Action Recognition
Stars: ✭ 36 (-93.97%)
Mmaction2OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Stars: ✭ 684 (+14.57%)
Movienet ToolsTools for movie and video research
Stars: ✭ 113 (-81.07%)
DIN-Group-Activity-Recognition-BenchmarkA new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.
Stars: ✭ 26 (-95.64%)
I3d finetuneTensorFlow code for finetuning I3D model on UCF101.
Stars: ✭ 128 (-78.56%)
Tsn PytorchTemporal Segment Networks (TSN) in PyTorch
Stars: ✭ 895 (+49.92%)
StepSTEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Stars: ✭ 196 (-67.17%)
PaddlevideoComprehensive, latest, and deployable video deep learning algorithm, including video recognition, action localization, and temporal action detection tasks. It's a high-performance, light-weight codebase provides practical models for video understanding research and application
Stars: ✭ 218 (-63.48%)
synse-zslOfficial PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'
Stars: ✭ 14 (-97.65%)
Two-Stream-CNNTwo Stream CNN implemented in Keras using in skeleton-based action recognition with dataset NTU RGB+D
Stars: ✭ 75 (-87.44%)
Pose2vecA Repository for maintaining various human skeleton preprocessing steps in numpy and tensorflow along with tensorflow model to learn pose embeddings.
Stars: ✭ 25 (-95.81%)
pose2actionexperiments on classifying actions using poses
Stars: ✭ 24 (-95.98%)
DLCV2018SPRINGDeep Learning for Computer Vision (CommE 5052) in NTU
Stars: ✭ 38 (-93.63%)
gzsl-odOut-of-Distribution Detection for Generalized Zero-Shot Action Recognition
Stars: ✭ 47 (-92.13%)
sanThe official PyTorch implementation of "Context Matters: Self-Attention for sign Language Recognition"
Stars: ✭ 17 (-97.15%)
auditory-slow-fastImplementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch
Stars: ✭ 46 (-92.29%)
TCFPN-ISBATemporal Convolutional Feature Pyramid Network (TCFPN) & Iterative Soft Boundary Assignment (ISBA), CVPR '18
Stars: ✭ 40 (-93.3%)
adascan-publicCode for AdaScan: Adaptive Scan Pooling (CVPR 2017)
Stars: ✭ 43 (-92.8%)
Realtime Action RecognitionApply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
Stars: ✭ 417 (-30.15%)
tfvaegan[ECCV 2020] Official Pytorch implementation for "Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification". SOTA results for ZSL and GZSL
Stars: ✭ 107 (-82.08%)
pushup-counter-appCount pushups from video/webcam. Tech stack: Keypoint detection, BlazePose, action recognition.
Stars: ✭ 48 (-91.96%)
GST-videoICCV 19 Grouped Spatial-Temporal Aggretation for Efficient Action Recognition
Stars: ✭ 40 (-93.3%)
UAV-Human[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
Stars: ✭ 122 (-79.56%)
TCEThis repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).
Stars: ✭ 51 (-91.46%)
ntu-xNTU-X, which is an extended version of popular NTU dataset
Stars: ✭ 55 (-90.79%)
Gluon CvGluon CV Toolkit
Stars: ✭ 5,001 (+737.69%)
STCNetSTCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection
Stars: ✭ 29 (-95.14%)
torch-lrcnAn implementation of the LRCN in Torch
Stars: ✭ 85 (-85.76%)
dynamic-images-for-action-recognitionA public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et al.
Stars: ✭ 27 (-95.48%)
cpnetLearning Video Representations from Correspondence Proposals (CVPR 2019 Oral)
Stars: ✭ 93 (-84.42%)
Squeeze-and-Recursion-Temporal-GatesCode for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Multi-Temporal Convolutions for Human Action Recognition in Videos".
Stars: ✭ 62 (-89.61%)
ailia-modelsThe collection of pre-trained, state-of-the-art AI models for ailia SDK
Stars: ✭ 1,102 (+84.59%)
Openpose-based-GUI-for-Realtime-Pose-Estimate-and-Action-RecognitionGUI based on the python api of openpose in windows using cuda10 and cudnn7. Support body , hand, face keypoints estimation and data saving. Realtime gesture recognition is realized through two-layer neural network based on the skeleton collected from the gui.
Stars: ✭ 69 (-88.44%)
Action-LocalizationAction-Localization, Atomic Visual Actions (AVA) Dataset
Stars: ✭ 22 (-96.31%)
TadTREnd-to-end Temporal Action Detection with Transformer. [Under review for a journal publication]
Stars: ✭ 55 (-90.79%)
ViCC[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Stars: ✭ 33 (-94.47%)
NExT-QANExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
Stars: ✭ 50 (-91.62%)
PyAnomalyUseful Toolbox for Anomaly Detection
Stars: ✭ 95 (-84.09%)
just-ask[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Stars: ✭ 57 (-90.45%)
conv3d-video-action-recognitionMy experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
Stars: ✭ 50 (-91.62%)
MSAFOffical implementation of paper "MSAF: Multimodal Split Attention Fusion"
Stars: ✭ 47 (-92.13%)
Robust-Deep-Learning-PipelineDeep Convolutional Bidirectional LSTM for Complex Activity Recognition with Missing Data. Human Activity Recognition Challenge. Springer SIST (2020)
Stars: ✭ 20 (-96.65%)