Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → kenshohara → Video Classification 3d Cnn Pytorch

kenshohara / Video Classification 3d Cnn Pytorch

Licence: mit

Video classification tools using 3D ResNet

Programming Languages

139335 projects - #7 most used programming language

Labels

deep-learning pytorch computer-vision action-recognition

Projects that are alternatives of or similar to Video Classification 3d Cnn Pytorch

tennis action recognition

Using deep learning to perform action recognition in the sport of tennis.

Stars: ✭ 17 (-98.05%)

Mutual labels: action-recognition

Video Understanding Dataset

A collection of recent video understanding datasets, under construction!

Stars: ✭ 387 (-55.72%)

Mutual labels: action-recognition

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Stars: ✭ 684 (-21.74%)

Mutual labels: action-recognition

[ICCV 2021 Oral] Deep Evidential Action Recognition

Stars: ✭ 36 (-95.88%)

Mutual labels: action-recognition

Action Recognition Visual Attention

Action recognition using soft attention based deep recurrent neural networks

Stars: ✭ 350 (-59.95%)

Mutual labels: action-recognition

Two Stream Pytorch

PyTorch implementation of two-stream networks for video action recognition

Stars: ✭ 428 (-51.03%)

Mutual labels: action-recognition

vlog action recognition

Identifying Visible Actions in Lifestyle Vlogs

Stars: ✭ 13 (-98.51%)

Mutual labels: action-recognition

Hcn Prototypeloss Pytorch

Hierarchical Co-occurrence Network with Prototype Loss for Few-shot Learning (PyTorch)

Stars: ✭ 17 (-98.05%)

Mutual labels: action-recognition

Awesome Skeleton Based Action Recognition

Skeleton-based Action Recognition

Stars: ✭ 360 (-58.81%)

Mutual labels: action-recognition

Action Detection

temporal action detection with SSN

Stars: ✭ 597 (-31.69%)

Mutual labels: action-recognition

3d Resnets Pytorch

3D ResNets for Action Recognition (CVPR 2018)

Stars: ✭ 3,169 (+262.59%)

Mutual labels: action-recognition

Awesome Action Recognition

A curated list of action recognition and related area resources

Stars: ✭ 3,202 (+266.36%)

Mutual labels: action-recognition

Gluon CV Toolkit

Stars: ✭ 5,001 (+472.2%)

Mutual labels: action-recognition

DIN-Group-Activity-Recognition-Benchmark

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

Stars: ✭ 26 (-97.03%)

Mutual labels: action-recognition

Two Stream Action Recognition

Using two stream architecture to implement a classic action recognition method on UCF101 dataset

Stars: ✭ 705 (-19.34%)

Mutual labels: action-recognition

auditory-slow-fast

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

Stars: ✭ 46 (-94.74%)

Mutual labels: action-recognition

Realtime Action Recognition

Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)

Stars: ✭ 417 (-52.29%)

Mutual labels: action-recognition

Temporal Segment Networks (TSN) in PyTorch

Stars: ✭ 895 (+2.4%)

Mutual labels: action-recognition

Action recognition tf

Action recognition.基于C3D的视频动作识别

Stars: ✭ 16 (-98.17%)

Mutual labels: action-recognition

Video Classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

Stars: ✭ 543 (-37.87%)

Mutual labels: action-recognition

View All Similar Projects ➔

Video Classification Using 3D ResNet

This is a pytorch code for video (action) classification using 3D ResNet trained by this code.
The 3D ResNet is trained on the Kinetics dataset, which includes 400 action classes.
This code uses videos as inputs and outputs class names and predicted class scores for each 16 frames in the score mode.
In the feature mode, this code outputs features of 512 dims (after global average pooling) for each 16 frames.

Torch (Lua) version of this code is available here.

Requirements

PyTorch

conda install pytorch torchvision cuda80 -c soumith

FFmpeg, FFprobe

wget http://johnvansickle.com/ffmpeg/releases/ffmpeg-release-64bit-static.tar.xz
tar xvf ffmpeg-release-64bit-static.tar.xz
cd ./ffmpeg-3.3.3-64bit-static/; sudo cp ffmpeg ffprobe /usr/local/bin;

Python 3

Preparation

Download this code.
Download the pretrained model.
- ResNeXt-101 achieved the best performance in our experiments. (See paper in details.)

Usage

Assume input video files are located in ./videos.

To calculate class scores for each 16 frames, use --mode score.

python main.py --input ./input --video_root ./videos --output ./output.json --model ./resnet-34-kinetics.pth --mode score

To visualize the classification results, use generate_result_video/generate_result_video.py.

To calculate video features for each 16 frames, use --mode feature.

python main.py --input ./input --video_root ./videos --output ./output.json --model ./resnet-34-kinetics.pth --mode feature

Citation

If you use this code, please cite the following:

@article{hara3dcnns,
  author={Kensho Hara and Hirokatsu Kataoka and Yutaka Satoh},
  title={Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?},
  journal={arXiv preprint},
  volume={arXiv:1711.09577},
  year={2017},
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 874

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (35) 🔗