Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → fabienbaradel → Object_level_visual_reasoning

fabienbaradel / Object_level_visual_reasoning

Pytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018

Programming Languages

139335 projects - #7 most used programming language

Labels

computer-vision video-understanding

Projects that are alternatives of or similar to Object level visual reasoning

DIN-Group-Activity-Recognition-Benchmark

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

Stars: ✭ 26 (-84.05%)

Mutual labels: video-understanding

Temporal Segment Networks (TSN) in PyTorch

Stars: ✭ 895 (+449.08%)

Mutual labels: video-understanding

TensorFlow code for finetuning I3D model on UCF101.

Stars: ✭ 128 (-21.47%)

Mutual labels: video-understanding

[ICCV 2021 Oral] Deep Evidential Action Recognition

Stars: ✭ 36 (-77.91%)

Mutual labels: video-understanding

Action Detection

temporal action detection with SSN

Stars: ✭ 597 (+266.26%)

Mutual labels: video-understanding

Temporally Language Grounding

A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"

Stars: ✭ 73 (-55.21%)

Mutual labels: video-understanding

Awesome-Temporally-Language-Grounding

A curated list of “Temporally Language Grounding” and related area

Stars: ✭ 97 (-40.49%)

Mutual labels: video-understanding

Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training e.g. a NN in TensorFlow. This implementation allows to limit the number of frames per video to be stored in the tfrecords.

Stars: ✭ 137 (-15.95%)

Mutual labels: video-understanding

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Stars: ✭ 684 (+319.63%)

Mutual labels: video-understanding

Tools for movie and video research

Stars: ✭ 113 (-30.67%)

Mutual labels: video-understanding

Awesome Action Recognition

A curated list of action recognition and related area resources

Stars: ✭ 3,202 (+1864.42%)

Mutual labels: video-understanding

Activity Recognition With Cnn And Rnn

Temporal Segments LSTM and Temporal-Inception for Activity Recognition

Stars: ✭ 415 (+154.6%)

Mutual labels: video-understanding

Temporal Shift Module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Stars: ✭ 1,282 (+686.5%)

Mutual labels: video-understanding

Useful Toolbox for Anomaly Detection

Stars: ✭ 95 (-41.72%)

Mutual labels: video-understanding

An open-source toolbox for action understanding based on PyTorch

Stars: ✭ 1,711 (+949.69%)

Mutual labels: video-understanding

CP-360-Weakly-Supervised-Saliency

CP-360-Weakly-Supervised-Saliency

Stars: ✭ 20 (-87.73%)

Mutual labels: video-understanding

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

Stars: ✭ 72 (-55.83%)

Mutual labels: video-understanding

Awesome Activity Prediction

Paper list of activity prediction and related area

Stars: ✭ 147 (-9.82%)

Mutual labels: video-understanding

Dataset, code and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction". And for the ECCV'20 SimAug paper.

Stars: ✭ 131 (-19.63%)

Mutual labels: video-understanding

Temporal Segment Networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Stars: ✭ 1,287 (+689.57%)

Mutual labels: video-understanding

View All Similar Projects ➔

Object level Visual Reasoning in Videos

This repository contains a Pytorch implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori, In ECCV 2018.

Links: Project page | Camera-ready | Complementary Mask Data

Code

We release code for training and testing our implementation. We encourage you to follow the steps below:

preprocessing the video dataset
- rescaling an entire dataset (WxH=256x256 and fps=30)
testing the dataloader
- efficient video decoding on the fly
training/testing the model
- training procedure using precomputed masks

Masks

Please visit the following website for downloading the mask predictions.

Requirements

pytorch 0.4.0
numpy
lintel - make sure that you have already installed this library (important for decoding videos on the fly)

Citation

If you find this paper or our implementation useful for your research or if you use the precomputed masks, please cite our paper.

@InProceedings{Baradel_2018_ECCV,
author = {Baradel, Fabien and Neverova, Natalia and Wolf, Christian and Mille, Julien and Mori, Greg},
title = {Object Level Visual Reasoning in Videos},
booktitle = {ECCV},
year = {2018}
}

Acknowledgements

This work was funded by grant Deepvision (ANR-15- CE23-0029, STPGP-479356-15), a joint French/Canadian call by ANR & NSERC.

Licence

MIT License

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 163

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (4) 🔗