WhiteBox-Part1In this part, I've introduced and experimented with ways to interpret and evaluate models in the field of image. (Pytorch)
Stars: ✭ 34 (+70%)
CS231nMy solutions for Assignments of CS231n: Convolutional Neural Networks for Visual Recognition
Stars: ✭ 30 (+50%)
just-ask[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Stars: ✭ 57 (+185%)
360WebPlayerThe easiest way to stream 360 videos and pictures on your website or blog.
Stars: ✭ 31 (+55%)
DINetA dilated inception network for visual saliency prediction (TMM 2019)
Stars: ✭ 25 (+25%)
ls-psvr-encoderA simple command line tool to encode your 180 and 360 videos for sideloading with Littlstar's VR Cinema app for PSVR.
Stars: ✭ 61 (+205%)
vrview-react⭐ Virtual Reality React Component for 360º photos, videos and virtual tour visualization
Stars: ✭ 29 (+45%)
STCNetSTCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection
Stars: ✭ 29 (+45%)
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (+90%)
NExT-QANExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
Stars: ✭ 50 (+150%)
glimpse cloudsPytorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille , G.W. Taylor, CVPR 2018
Stars: ✭ 30 (+50%)
SSTDA[CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)
Stars: ✭ 150 (+650%)
SgplayerA powerful media play framework for iOS, macOS, and tvOS.
Stars: ✭ 1,974 (+9770%)
Awesome Groundingawesome grounding: A curated list of research papers in visual grounding
Stars: ✭ 247 (+1135%)
PaddlevideoComprehensive, latest, and deployable video deep learning algorithm, including video recognition, action localization, and temporal action detection tasks. It's a high-performance, light-weight codebase provides practical models for video understanding research and application
Stars: ✭ 218 (+990%)
ActionvladActionVLAD for video action classification (CVPR 2017)
Stars: ✭ 217 (+985%)
StepSTEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Stars: ✭ 196 (+880%)
Youtube 8mThe 2nd place Solution to the Youtube-8M Video Understanding Challenge by Team Monkeytyping (based on tensorflow)
Stars: ✭ 171 (+755%)
Object level visual reasoningPytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018
Stars: ✭ 163 (+715%)
Video2tfrecordEasily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training e.g. a NN in TensorFlow. This implementation allows to limit the number of frames per video to be stored in the tfrecords.
Stars: ✭ 137 (+585%)
MultiverseDataset, code and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction". And for the ECCV'20 SimAug paper.
Stars: ✭ 131 (+555%)
MmactionAn open-source toolbox for action understanding based on PyTorch
Stars: ✭ 1,711 (+8455%)
I3d finetuneTensorFlow code for finetuning I3D model on UCF101.
Stars: ✭ 128 (+540%)
Temporal Shift Module[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Stars: ✭ 1,282 (+6310%)
Temporally Language GroundingA Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"
Stars: ✭ 73 (+265%)
Tdn[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
Stars: ✭ 72 (+260%)
Tsn PytorchTemporal Segment Networks (TSN) in PyTorch
Stars: ✭ 895 (+4375%)
Mmaction2OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Stars: ✭ 684 (+3320%)
DEAR[ICCV 2021 Oral] Deep Evidential Action Recognition
Stars: ✭ 36 (+80%)
PyAnomalyUseful Toolbox for Anomaly Detection
Stars: ✭ 95 (+375%)
DIN-Group-Activity-Recognition-BenchmarkA new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.
Stars: ✭ 26 (+30%)
StellargraphStellarGraph - Machine Learning on Graphs
Stars: ✭ 2,235 (+11075%)
DeepgazeComputer Vision library for human-computer interaction. It implements Head Pose and Gaze Direction Estimation Using Convolutional Neural Networks, Skin Detection through Backprojection, Motion Detection and Tracking, Saliency Map.
Stars: ✭ 1,552 (+7660%)
U-2-Net-DemoDemonstration using Google Colab to show how U-2-NET can be used for Background Removal, Changing Backgrounds, Bounding Box Creation, Salient Feature Highlighting and Salient Object Cropping.
Stars: ✭ 132 (+560%)