All Projects → JaywongWang → SST-Tensorflow

JaywongWang / SST-Tensorflow

Licence: other
Tensorflow Implementation of the Paper "SST: Single-Stream Temporal Action Proposals" in CVPR 2017.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to SST-Tensorflow

MTL-AQA
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (-24%)
Mutual labels:  video-processing
deepvisualminer
Deep visual mining for your photos and videos using YOLOv2 deep convolutional neural network based object detector and traditional face recognition algorithms
Stars: ✭ 21 (-58%)
Mutual labels:  video-processing
Nager.VideoStream
Get images from a network camera stream or webcam
Stars: ✭ 27 (-46%)
Mutual labels:  video-processing
VideoStitching
solve real time video stitching problem: 4 camera example by opencv surf
Stars: ✭ 42 (-16%)
Mutual labels:  video-processing
FunVideo
A series of embeddable theme into video, include animation and voice.
Stars: ✭ 19 (-62%)
Mutual labels:  video-processing
SSffmpegVideoOperation
This is a library of FFmpeg for android... 📸 🎞 🚑
Stars: ✭ 261 (+422%)
Mutual labels:  video-processing
eta
ETA: Extensible Toolkit for Analytics
Stars: ✭ 22 (-56%)
Mutual labels:  video-processing
Face-Recognition-Attendance
Face recognition using OpenCV and developed Advance high speed Attendance System on old CCTV footage
Stars: ✭ 19 (-62%)
Mutual labels:  video-processing
RFDA-PyTorch
Official Code for 'Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction' - ACM Multimedia2021 (ACMMM2021) Accepted Paper Task: Video Quality Enhancement / Video Compression Artifact Reduction
Stars: ✭ 44 (-12%)
Mutual labels:  video-processing
ffcvt
ffmpeg convert wrapper tool
Stars: ✭ 32 (-36%)
Mutual labels:  video-processing
DrawingBotV3
DrawingBotV3 is a software for creating line drawings from Images
Stars: ✭ 161 (+222%)
Mutual labels:  video-processing
slitcamera
Bash script to turn video file into slit photo
Stars: ✭ 41 (-18%)
Mutual labels:  video-processing
youtube-video-maker
📹 A tool for automatic video creation and uploading on YouTube
Stars: ✭ 134 (+168%)
Mutual labels:  video-processing
3D-Tracking-MVS
3D position tracking for soccer players with multi-camera videos
Stars: ✭ 68 (+36%)
Mutual labels:  video-processing
Implicit-Internal-Video-Inpainting
[ICCV 2021]: IIVI: Internal Video Inpainting by Implicit Long-range Propagation
Stars: ✭ 190 (+280%)
Mutual labels:  video-processing
ExpertVideoToolbox
A lightweight, versatile GUI of x264, x265. Nearly full input formats support, .mkv and .mp4 output support. Avs support will be added soon. Language: Chinese
Stars: ✭ 12 (-76%)
Mutual labels:  video-processing
aws-content-analysis
This project is a fully automated video search engine which uses AWS AI services for computer vision and speech recognition to catalog video archives.
Stars: ✭ 67 (+34%)
Mutual labels:  video-processing
pepic
Image and video proxy for my pet-projects
Stars: ✭ 35 (-30%)
Mutual labels:  video-processing
antz
ANTz immersive 3D data visualization engine
Stars: ✭ 25 (-50%)
Mutual labels:  video-processing
ShotDetection
Open source software that detects shot boundaries in video.
Stars: ✭ 42 (-16%)
Mutual labels:  video-processing

SST-Tensorflow

Tensorflow Implementation of the Paper SST: Single-Stream Temporal Action Proposals by Shyamal Buch et al. in CVPR 2017.

Data Preparation

Please download video data and annotation data from the website THUMOS14. Extract C3D features for non-overlap 16-frame snippets from the 412 videos (200 val videos + 212 test videos, I found one test video missing) for the task of temporal action proposals. Alternatively, you may download my provided C3D features (Google Drive, Baidu Drive), and put them in dataset/thumos14/features/. If you are interested in the feature extraction, I refer you to this code.

fc6 features are used in my experiment.

Please follow the script dataset/thumos14/prepare_gt_proposal_data.py to generate ground-truth proposal data for train/val/test split. I already put the generated files in dataset/thumos14/gt_proposals/.

After that, please generate anchor weights (for handling imbalance class problem) by uniformly sampling video streams (follow dataset/thumos14/anchors/get_anchor_weight.py) or just use my pre-calculated weights (weights.json).

Hyper Parameters

The best configuration (from my experiments) is given in opt.py, including model setup, training options, and testing options.

Training

Train your model using the script train.py. Run around 50 epochs and pick the best checkpoint (with the smallest val loss) for prediction.

Prediction

Follow the script test.py to make proposal predictions.

Evaluation

Follow the script eval.py to evaluate your proposal predictions.

Results

You may download my trained model in Google Drive or Baidu Drive. Please put them in checkpoints/. Change the file init_from in opt.py and run test.py !

Update: The predicted action proposals for the test set can be found in Google Drive or Baidu Drive. The result figures are put in results/1. They are slightly better than the reported ones.

Method Recall@1000 at tIoU=0.8
SST (paper) 0.672
SST (my impl) 0.696

alt text

alt text

Dependencies

tensorflow==1.0.1

python==2.7.5

Other versions may also work.

Acknowledgements

Great thanks to Shyamal Buch for really helpful discussion.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].