All Projects → wadhwasahil → Video Classification 2 Stream Cnn

wadhwasahil / Video Classification 2 Stream Cnn

Video Classification using 2 stream CNN

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Video Classification 2 Stream Cnn

Keras Vgg16 Places365
Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365 CNNs for scene classification
Stars: ✭ 138 (-54.75%)
Mutual labels:  cnn, vgg16
Food Recipe Cnn
food image to recipe with deep convolutional neural networks.
Stars: ✭ 448 (+46.89%)
Mutual labels:  cnn, vgg16
Deep Learning With Python
Deep learning codes and projects using Python
Stars: ✭ 195 (-36.07%)
Mutual labels:  cnn, vgg16
Neural Image Captioning
Implementation of Neural Image Captioning model using Keras with Theano backend
Stars: ✭ 12 (-96.07%)
Mutual labels:  cnn, vgg16
Srgan
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Stars: ✭ 2,641 (+765.9%)
Mutual labels:  cnn, vgg16
Ffdnet
FFDNet: Toward a Fast and Flexible Solution for CNN based Image Denoising (TIP, 2018)
Stars: ✭ 274 (-10.16%)
Mutual labels:  cnn
Pytorch Cifar Models
3.41% and 17.11% error on CIFAR-10 and CIFAR-100
Stars: ✭ 286 (-6.23%)
Mutual labels:  cnn
Caffe Hrt
Heterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure framework to speed up Deep Learning on Arm-based heterogeneous embedded platform. It also retains all the features of the original Caffe architecture which users deploy their applications seamlessly.
Stars: ✭ 271 (-11.15%)
Mutual labels:  cnn
Pytorch Saltnet
Kaggle | 9th place single model solution for TGS Salt Identification Challenge
Stars: ✭ 270 (-11.48%)
Mutual labels:  cnn
Yolo V2 Pytorch
YOLO for object detection tasks
Stars: ✭ 302 (-0.98%)
Mutual labels:  cnn
Text Cnn
嵌入Word2vec词向量的CNN中文文本分类
Stars: ✭ 298 (-2.3%)
Mutual labels:  cnn
Chinese Text Classification
Chinese-Text-Classification,Tensorflow CNN(卷积神经网络)实现的中文文本分类。QQ群:522785813,微信群二维码:http://www.tensorflownews.com/
Stars: ✭ 284 (-6.89%)
Mutual labels:  cnn
Mobilenetv2
A Keras implementation of MobileNetV2.
Stars: ✭ 277 (-9.18%)
Mutual labels:  cnn
Cs291k
🎭 Sentiment Analysis of Twitter data using combined CNN and LSTM Neural Network models
Stars: ✭ 287 (-5.9%)
Mutual labels:  cnn
Pytorch Image Classification
Tutorials on how to implement a few key architectures for image classification using PyTorch and TorchVision.
Stars: ✭ 272 (-10.82%)
Mutual labels:  cnn
Computer Vision Leaderboard
Comparison of famous convolutional neural network models
Stars: ✭ 299 (-1.97%)
Mutual labels:  cnn
Resnetcam Keras
Keras implementation of a ResNet-CAM model
Stars: ✭ 269 (-11.8%)
Mutual labels:  cnn
Named Entity Recognition With Bidirectional Lstm Cnns
Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs
Stars: ✭ 283 (-7.21%)
Mutual labels:  cnn
Pytorch Srgan
A modern PyTorch implementation of SRGAN
Stars: ✭ 289 (-5.25%)
Mutual labels:  cnn
Holy Edge
Holistically-Nested Edge Detection
Stars: ✭ 277 (-9.18%)
Mutual labels:  vgg16

Video Classification using Two Stream CNNs

We use a spatial and a temporal stream with VGG-16 and CNN-M respectively for modeling video information. LSTMs are stacked on top of the CNNs for modeling long term dependencies between video frames. For more information, see these papers:

Two-Stream Convolutional Networks for Action Recognition in Videos

Fusing Multi-Stream Deep Networks for Video Classification

Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification

Towards Good Practices for Very Deep Two-Stream ConvNets


Here are the steps to run the project on CCV dataset:

Creating a virtual environment

First create a directory named env and then run the following inside the directory. This will create a virtual environment. Assuming we create a requirements.txt file to help install modules that are needed in the project. $ mkdir env $ cd env $ virtualenv venv-video-classification $ source env-video-classification\bin\activate $ cd .. $ pip install requirements.txt

Setting up the DataSet

  1. Get the YouTube data, remove broken videos and negative instances and finally create a pickle file of the dataset by running scripts from the utility_scripts folder

  2. Temporal Stream (in the temporal folder):

  3. Run temporal_vid2img to create optical flow frames and the related files

  4. Run temporal_stream_cnn to start with the temporal stream training

  5. Spatial Stream (in the spatial folder):

  6. Run the spatial_vid2img to create static frames and related files

  7. Download the vgg16_weights.h5 file from here and put it in the spatial folder

  8. Run spatial_stream_cnn to start with the spatial stream training

  9. Temporal Stream LSTM: Will soon update the code

  10. Spatial Stream LSTM: Will soon update the code


Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].