
anuragranj / Spynet

Licence: other
Spatial Pyramid Network for Optical Flow

Programming Languages

lua

Projects that are alternatives of or similar to Spynet

Back2future.pytorch
Unsupervised Learning of Multi-Frame Optical Flow with Occlusions
Stars: ✭ 104 (-34.18%)
Mutual labels:  optical-flow
Arflow
The official PyTorch implementation of the paper "Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation".
Stars: ✭ 134 (-15.19%)
Mutual labels:  optical-flow
Eye Tracker
Implemented and improved the iTracker model proposed in the paper "Eye Tracking for Everyone"
Stars: ✭ 140 (-11.39%)
Mutual labels:  convolutional-networks
Pwc Net pytorch
pytorch implementation of "PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume"
Stars: ✭ 111 (-29.75%)
Mutual labels:  optical-flow
Cn24
Convolutional (Patch) Networks for Semantic Segmentation
Stars: ✭ 125 (-20.89%)
Mutual labels:  convolutional-networks
Video2tfrecord
Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training, e.g., a NN in TensorFlow. This implementation allows you to limit the number of frames per video stored in the tfrecords.
Stars: ✭ 137 (-13.29%)
Mutual labels:  optical-flow
Ddflow
DDFlow: Learning Optical Flow with Unlabeled Data Distillation
Stars: ✭ 101 (-36.08%)
Mutual labels:  optical-flow
Tfvos
Semi-Supervised Video Object Segmentation (VOS) with Tensorflow. Includes implementation of *MaskRNN: Instance Level Video Object Segmentation (NIPS 2017)* as part of the NIPS Paper Implementation Challenge.
Stars: ✭ 151 (-4.43%)
Mutual labels:  optical-flow
Netdef models
Repository for different network models related to flow/disparity (ECCV 18)
Stars: ✭ 130 (-17.72%)
Mutual labels:  optical-flow
Deep Learning For Tracking And Detection
Collection of papers, datasets, code and other resources for object tracking and detection using deep learning
Stars: ✭ 1,920 (+1115.19%)
Mutual labels:  optical-flow
Vcn
Volumetric Correspondence Networks for Optical Flow, NeurIPS 2019.
Stars: ✭ 118 (-25.32%)
Mutual labels:  optical-flow
Densematchingbenchmark
Dense Matching Benchmark
Stars: ✭ 120 (-24.05%)
Mutual labels:  optical-flow
Flownet2 Docker
Dockerfile and runscripts for FlowNet 2.0 (estimation of optical flow)
Stars: ✭ 137 (-13.29%)
Mutual labels:  optical-flow
Convolutional Network
A convolutional neural network from scratch
Stars: ✭ 105 (-33.54%)
Mutual labels:  convolutional-networks
Livianet
This repository contains the code of LiviaNET, a 3D fully convolutional neural network that was employed in our work: "3D fully convolutional networks for subcortical segmentation in MRI: A large-scale study"
Stars: ✭ 143 (-9.49%)
Mutual labels:  convolutional-networks
Semanticsegpapercollection
Stars: ✭ 102 (-35.44%)
Mutual labels:  convolutional-networks
Keras Yolo2
Easy training on custom dataset. Various backends (MobileNet and SqueezeNet) supported. A YOLO demo to detect raccoon run entirely in brower is accessible at https://git.io/vF7vI (not on Windows).
Stars: ✭ 1,693 (+971.52%)
Mutual labels:  convolutional-networks
Frvsr
Frame-Recurrent Video Super-Resolution (official repository)
Stars: ✭ 157 (-0.63%)
Mutual labels:  optical-flow
Deep Learning Specialization Coursera
Deep Learning Specialization courses by Andrew Ng, deeplearning.ai
Stars: ✭ 146 (-7.59%)
Mutual labels:  convolutional-networks
Glasses
High-quality Neural Networks for Computer Vision 😎
Stars: ✭ 138 (-12.66%)
Mutual labels:  convolutional-networks

SPyNet: Spatial Pyramid Network for Optical Flow

This code is based on the paper Optical Flow Estimation using a Spatial Pyramid Network.

[Unofficial PyTorch version] [Unofficial TensorFlow version]

First things first

You need to have Torch.

Install other required packages

cd extras/spybhwd
luarocks make
cd ../stnbhwd
luarocks make

For Easy Usage, follow this

Set up SPyNet

spynet = require('spynet')
easyComputeFlow = spynet.easy_setup()

Load images and compute flow

im1 = image.load('samples/00001_img1.ppm')
im2 = image.load('samples/00001_img2.ppm')
flow = easyComputeFlow(im1, im2)

To save your flow fields to a .flo file, use flowExtensions.writeFLO.
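
A minimal sketch, assuming the flow computed above and an example output path:

-- Hedged sketch: 'samples/00001_flow.flo' is an assumed output path.
flowX = require 'flowExtensions'
flowX.writeFLO('samples/00001_flow.flo', flow:float())   -- :float() brings a CUDA result back to the CPU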

For Fast Performance, follow this (recommended)

Set up SPyNet

Set up SPyNet according to the image size and model. For optimal performance, resize your images so that the width and height are multiples of 32. You can also specify your favorite model. The currently supported models are the fine-tuned models sintelFinal (default), sintelClean, and kittiFinal, and the base models chairsFinal and chairsClean.

spynet = require('spynet')
computeFlow = spynet.setup(512, 384, 'sintelFinal')    -- for 384x512 images
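
To use a different model, pass its name to setup. As an illustrative sketch, assuming a KITTI-style input resized to 1216x352 (both multiples of 32):

-- Hedged sketch: the 1216x352 resolution is only an example of a valid size.
computeFlowKitti = spynet.setup(1216, 352, 'kittiFinal')   -- for 352x1216 images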

Now you can call computeFlow anytime to estimate optical flow between image pairs.

Computing flow

Load an image pair, then stack and normalize it.

im1 = image.load('samples/00001_img1.ppm')
im2 = image.load('samples/00001_img2.ppm')
im = torch.cat(im1, im2, 1)
im = spynet.normalize(im)

SPyNet works with batches of data on CUDA, so compute the flow using:

im = im:resize(1, im:size(1), im:size(2), im:size(3)):cuda()
flow = computeFlow(im)

You can also use batch mode: if your images im form a tensor of size Bx6xHxW, i.e. a batch of size B whose 6 channels hold an RGB image pair, you can directly use:

flow = computeFlow(im)
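
As a minimal sketch of building such a batch, assuming a second image pair im3/im4 of the same size has been loaded with image.load:

-- Hedged sketch: stack two normalized 6xHxW pairs into a 2x6xHxW CUDA batch.
pairA = spynet.normalize(torch.cat(im1, im2, 1))
pairB = spynet.normalize(torch.cat(im3, im4, 1))
im = torch.cat(pairA:view(1, pairA:size(1), pairA:size(2), pairA:size(3)),
               pairB:view(1, pairB:size(1), pairB:size(2), pairB:size(3)), 1):cuda()
flow = computeFlow(im)   -- one flow field per pair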

Training

Training sequentially is faster than training end-to-end, since only a small number of parameters are learned at each level. To train level N, we need the trained models at levels 1 to N-1. The model at level N is also initialized with the pretrained model from level N-1.

E.g., to train level 3, we need the trained models at levels 1 and 2, and we initialize it with modelL2_3.t7.

th main.lua -fineWidth 128 -fineHeight 96 -level 3 -netType volcon \
-cache checkpoint -data FLYING_CHAIRS_DIR \
-L1 models/modelL1_3.t7 -L2 models/modelL2_3.t7 \
-retrain models/modelL2_3.t7

End2End SPyNet

The end-to-end version of SPyNet is easily trainable and is available at anuragranj/end2end-spynet.

Optical Flow Utilities

We provide flowExtensions.lua, containing various functions to make your life easier when working with optical flow in Torch/Lua. You can just copy this file into your project directory and use it off the shelf.

flowX = require 'flowExtensions'

[flow_magnitude] flowX.computeNorm(flow_x, flow_y)

Given flow_x and flow_y of size MxN each, evaluate flow_magnitude of size MxN.

[flow_angle] flowX.computeAngle(flow_x, flow_y)

Given flow_x and flow_y of size MxN each, evaluate flow_angle of size MxN in degrees.
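
A minimal usage sketch, assuming flow is a 2xMxN flow field (e.g. loaded with flowX.loadFLO):

-- Hedged sketch: split the field into components, then compute per-pixel magnitude and angle.
flow_x, flow_y = flow[1], flow[2]
flow_magnitude = flowX.computeNorm(flow_x, flow_y)
flow_angle = flowX.computeAngle(flow_x, flow_y)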

[rgb] flowX.field2rgb(flow_magnitude, flow_angle, [max], [legend])

Given flow_magnitude and flow_angle of size MxN each, return an image of size 3xMxN for visualizing optical flow. max (optional) specifies the maximum flow magnitude, and legend (optional) is a boolean that prints a legend on the image.

[rgb] flowX.xy2rgb(flow_x, flow_y, [max])

Given flow_x and flow_y of size MxN each, return an image of size 3xMxN for visualizing optical flow. max (optional) specifies the maximum flow magnitude.
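
Continuing the sketch above (the output file name is an assumed example):

-- Hedged sketch: build a color-coded visualization and save it with the image package.
require 'image'
rgb = flowX.field2rgb(flow_magnitude, flow_angle)
-- or, directly from the x/y components:
rgb = flowX.xy2rgb(flow_x, flow_y)
image.save('flow_vis.png', rgb)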

[flow] flowX.loadFLO(filename)

Reads a .flo file. Loads the x and y components of optical flow into a 2-channel 2xMxN flow field; the first channel stores the x component and the second stores the y component.

flowX.writeFLO(filename,F)

Writes a 2xMxN flow field F, with the x and y components of the flow in its first and second channels respectively, to filename, a .flo file.

[flow] flowX.loadPFM(filename)

Reads a .pfm file. Loads the x and y components of optical flow into a 2-channel 2xMxN flow field; the first channel stores the x component and the second stores the y component.
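
A minimal round-trip sketch (both file names are assumed examples):

-- Hedged sketch: read a .flo file from disk and write a copy back out.
flow = flowX.loadFLO('samples/00001_flow.flo')
flowX.writeFLO('copy_of_flow.flo', flow)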

[flow_rotated] flowX.rotate(flow, angle)

Rotates flow of size 2xMxN by angle in radians. Uses nearest-neighbor interpolation to avoid blurring at boundaries.

[flow_scaled] flowX.scale(flow, sc, [opt])

Scales flow of size 2xMxN by a factor of sc. opt (optional) specifies the interpolation method: simple (default), bilinear, or bicubic.

[flowBatch_scaled] flowX.scaleBatch(flowBatch, sc)

Scales flowBatch of size Bx2xMxN, a batch of B flow fields, by a factor of sc. Uses nearest-neighbor interpolation.
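
A minimal sketch of these geometric helpers, assuming flow is a 2xMxN field and flowBatch is a Bx2xMxN batch:

-- Hedged sketch: rotate a flow field by 90 degrees, then upsample it 2x.
flow_rotated = flowX.rotate(flow, math.pi / 2)
flow_scaled = flowX.scale(flow, 2, 'bilinear')
flowBatch_scaled = flowX.scaleBatch(flowBatch, 2)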

Timing Benchmarks

Our timing benchmark is set up on the Flying Chairs dataset. To test it, you need to download the dataset:

wget http://lmb.informatik.uni-freiburg.de/resources/datasets/FlyingChairs/FlyingChairs.zip

Run the timing benchmark

th timing_benchmark.lua -data YOUR_FLYING_CHAIRS_DATA_DIRECTORY

References

  1. Our warping code is based on qassemoquab/stnbhwd.
  2. The images in samples are from Flying Chairs dataset: Dosovitskiy, Alexey, et al. "Flownet: Learning optical flow with convolutional networks." 2015 IEEE International Conference on Computer Vision (ICCV). IEEE, 2015.
  3. Some parts of flowExtensions.lua are adapted from marcoscoffier/optical-flow with help from fguney.
  4. The unofficial PyTorch implementation is from sniklaus.

License

Free for non-commercial and scientific research purposes. For commercial use, please contact [email protected]. Check the LICENSE file for details.

When using this code, please cite

Ranjan, Anurag, and Michael J. Black. "Optical Flow Estimation using a Spatial Pyramid Network." arXiv preprint arXiv:1611.00850 (2016).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].