synse-zslOfficial PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'
Stars: ✭ 14 (-74.55%)
MmskeletonA OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Stars: ✭ 2,378 (+4223.64%)
temporal-binding-networkImplementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
Stars: ✭ 95 (+72.73%)
MiCT-Net-PyTorchVideo Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
Stars: ✭ 48 (-12.73%)
gzsl-odOut-of-Distribution Detection for Generalized Zero-Shot Action Recognition
Stars: ✭ 47 (-14.55%)
Ms G3d[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
Stars: ✭ 225 (+309.09%)
AttentionalpoolingactionCode/Model release for NIPS 2017 paper "Attentional Pooling for Action Recognition"
Stars: ✭ 248 (+350.91%)
Squeeze-and-Recursion-Temporal-GatesCode for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Multi-Temporal Convolutions for Human Action Recognition in Videos".
Stars: ✭ 62 (+12.73%)
Action recognition zooCodes for popular action recognition models, verified on the something-something data set.
Stars: ✭ 227 (+312.73%)
ActionvladActionVLAD for video action classification (CVPR 2017)
Stars: ✭ 217 (+294.55%)
bLVNet-TAMThe official Codes for NeurIPS 2019 paper. Quanfu Fan, Ricarhd Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Efficient Video Representations by Temporal Aggregation Modules"
Stars: ✭ 54 (-1.82%)
pose2actionexperiments on classifying actions using poses
Stars: ✭ 24 (-56.36%)
sparsepropTemporal action proposals
Stars: ✭ 46 (-16.36%)
tfvaegan[ECCV 2020] Official Pytorch implementation for "Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification". SOTA results for ZSL and GZSL
Stars: ✭ 107 (+94.55%)
AlphactionSpatio-Temporal Action Localization System
Stars: ✭ 221 (+301.82%)
PaddlevideoComprehensive, latest, and deployable video deep learning algorithm, including video recognition, action localization, and temporal action detection tasks. It's a high-performance, light-weight codebase provides practical models for video understanding research and application
Stars: ✭ 218 (+296.36%)
ailia-modelsThe collection of pre-trained, state-of-the-art AI models for ailia SDK
Stars: ✭ 1,102 (+1903.64%)
StepSTEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Stars: ✭ 196 (+256.36%)
Hidden Two StreamCaffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"
Stars: ✭ 179 (+225.45%)
VideoTransformer-pytorchPyTorch implementation of a collections of scalable Video Transformer Benchmarks.
Stars: ✭ 159 (+189.09%)
pushup-counter-appCount pushups from video/webcam. Tech stack: Keypoint detection, BlazePose, action recognition.
Stars: ✭ 48 (-12.73%)
C3D-tensorflowAction recognition with C3D network implemented in tensorflow
Stars: ✭ 34 (-38.18%)
sanThe official PyTorch implementation of "Context Matters: Self-Attention for sign Language Recognition"
Stars: ✭ 17 (-69.09%)
UAV-Human[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
Stars: ✭ 122 (+121.82%)
conv3d-video-action-recognitionMy experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
Stars: ✭ 50 (-9.09%)
Hand pose actionDataset and code for the paper "First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations", CVPR 2018.
Stars: ✭ 173 (+214.55%)
Robust-Deep-Learning-PipelineDeep Convolutional Bidirectional LSTM for Complex Activity Recognition with Missing Data. Human Activity Recognition Challenge. Springer SIST (2020)
Stars: ✭ 20 (-63.64%)
temporal-sslVideo Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
Stars: ✭ 46 (-16.36%)
TCEThis repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).
Stars: ✭ 51 (-7.27%)
TA3N[ICCV 2019 Oral] TA3N: https://github.com/cmhungsteve/TA3N (Most updated repo)
Stars: ✭ 45 (-18.18%)
TCFPN-ISBATemporal Convolutional Feature Pyramid Network (TCFPN) & Iterative Soft Boundary Assignment (ISBA), CVPR '18
Stars: ✭ 40 (-27.27%)
LintelA Python module to decode video frames directly, using the FFmpeg C API.
Stars: ✭ 240 (+336.36%)
dynamic-images-for-action-recognitionA public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et al.
Stars: ✭ 27 (-50.91%)
cpnetLearning Video Representations from Correspondence Proposals (CVPR 2019 Oral)
Stars: ✭ 93 (+69.09%)
Ican[BMVC 2018] iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection
Stars: ✭ 225 (+309.09%)
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (-30.91%)
Ta3n[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)
Stars: ✭ 217 (+294.55%)
adascan-publicCode for AdaScan: Adaptive Scan Pooling (CVPR 2017)
Stars: ✭ 43 (-21.82%)
Ig65m PytorchPyTorch 3D video classification models pre-trained on 65 million Instagram videos
Stars: ✭ 217 (+294.55%)
Openpose-based-GUI-for-Realtime-Pose-Estimate-and-Action-RecognitionGUI based on the python api of openpose in windows using cuda10 and cudnn7. Support body , hand, face keypoints estimation and data saving. Realtime gesture recognition is realized through two-layer neural network based on the skeleton collected from the gui.
Stars: ✭ 69 (+25.45%)
Video CaffeVideo-friendly caffe -- comes with the most recent version of Caffe (as of Jan 2019), a video reader, 3D(ND) pooling layer, and an example training script for C3D network and UCF-101 data
Stars: ✭ 172 (+212.73%)
Dataset-REPAIRREPresentAtion bIas Removal (REPAIR) of datasets
Stars: ✭ 49 (-10.91%)
AmassData preparation and loader for AMASS
Stars: ✭ 180 (+227.27%)
ViCC[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Stars: ✭ 33 (-40%)
VipVideo Platform for Action Recognition and Object Detection in Pytorch
Stars: ✭ 175 (+218.18%)
Two-Stream-CNNTwo Stream CNN implemented in Keras using in skeleton-based action recognition with dataset NTU RGB+D
Stars: ✭ 75 (+36.36%)
MSAFOffical implementation of paper "MSAF: Multimodal Split Attention Fusion"
Stars: ✭ 47 (-14.55%)
torch-lrcnAn implementation of the LRCN in Torch
Stars: ✭ 85 (+54.55%)
TadTREnd-to-end Temporal Action Detection with Transformer. [Under review for a journal publication]
Stars: ✭ 55 (+0%)
video repres mascode for CVPR-2019 paper: Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
Stars: ✭ 63 (+14.55%)
Pose2vecA Repository for maintaining various human skeleton preprocessing steps in numpy and tensorflow along with tensorflow model to learn pose embeddings.
Stars: ✭ 25 (-54.55%)
MUSES[CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark
Stars: ✭ 51 (-7.27%)