All Projects → LouisFoucard → Stereoconvnet

LouisFoucard / Stereoconvnet

Stereo convolutional neural network for depth map prediction from stereo images

Projects that are alternatives of or similar to Stereoconvnet

Pyomogallery
A collection of Pyomo examples
Stars: ✭ 149 (-0.67%)
Mutual labels:  jupyter-notebook
Nlp adversarial examples
Implementation code for the paper "Generating Natural Language Adversarial Examples"
Stars: ✭ 149 (-0.67%)
Mutual labels:  jupyter-notebook
Feature Selector
Feature selector is a tool for dimensionality reduction of machine learning datasets
Stars: ✭ 1,913 (+1175.33%)
Mutual labels:  jupyter-notebook
Testovoe
Home assignments for data science positions
Stars: ✭ 149 (-0.67%)
Mutual labels:  jupyter-notebook
Slayerpytorch
PyTorch implementation of SLAYER for training Spiking Neural Networks
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Python For Financial Analysis And Algorithmic Trading
https://www.udemy.com/python-for-finance-and-trading-algorithms/
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Phonetic Similarity Vectors
Source code to accompany my paper "Poetic sound similarity vectors using phonetic features"
Stars: ✭ 148 (-1.33%)
Mutual labels:  jupyter-notebook
Learning by association
This repository contains code for the paper Learning by Association - A versatile semi-supervised training method for neural networks (CVPR 2017) and the follow-up work Associative Domain Adaptation (ICCV 2017).
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Machinelearning Watermelonbook
周志华-机器学习
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Face Depixelizer
Face Depixelizer based on "PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models" repository.
Stars: ✭ 1,912 (+1174.67%)
Mutual labels:  jupyter-notebook
Transformers Ru
A list of pretrained Transformer models for the Russian language.
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Person Reid Tiny Baseline
Open source person re-identification in Pytorch
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Computer vision
C/C++/Python based computer vision models using OpenPose, OpenCV, DLIB, Keras and Tensorflow libraries. Object Detection, Tracking, Face Recognition, Gesture, Emotion and Posture Recognition
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Www old.julialang.org
Julia Project web site (Old)
Stars: ✭ 149 (-0.67%)
Mutual labels:  jupyter-notebook
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1458%)
Mutual labels:  jupyter-notebook
Carnd Mercedes Sf Utilities
Tools for Sensor Fusion processing.
Stars: ✭ 149 (-0.67%)
Mutual labels:  jupyter-notebook
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (-0.67%)
Mutual labels:  jupyter-notebook
Ml Mooc Nptel
This repository contains the Tutorials for the NPTEL MOOC on Machine Learning.
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Deeplearning keras2
Modification of fast.ai deep learning course notebooks for usage with Keras 2 and Python 3.
Stars: ✭ 150 (+0%)
Mutual labels:  jupyter-notebook
Autonomousdrivingcookbook
Scenarios, tutorials and demos for Autonomous Driving
Stars: ✭ 1,939 (+1192.67%)
Mutual labels:  jupyter-notebook

StereoConvNet

Stereo fully convolutional neural network for depth map prediction from stereo images. The network is implemented using the nn library Lasagne, and training-time/over-fitting are reduced significantly with the help of the recently developed Batch normalization technique (http://arxiv.org/abs/1502.03167).

The network is fully convolutional, and takes a couple of grayscale stereoscopic images concatenated along the channel axis, and ouputs a single image representing the depth map. A series of convolutional and maxpooling layers followed by a series of upscalling and deconvolutional layers allow the network to extract image disparity features at the smaller scale (object edges), and generate a smooth estimate of the depth map at the larger scale (full object). The main advantage of this technique over other methods from computer vision research (based on an explicit computation of image disparity) is its robustness, in particular the fact that it is able to produce smooth estimates of the depth map even on textureless region.

The traing/validation sets are created using the random virtual 3d scene generator (see https://github.com/LouisFoucard/DepthMap_dataset). The objective function used here is Eulerian distance. Thanks to batch normalization, training only takes 10 minutes (2000 images) on a K520 (AWS GPU), or 2 hours on a mac book pro.

Below are examples of a random 3d scene, its ground truth depth map, and the predictions computed with the fully convolutional stereo neural network:

alt tag

To check whether the network is truly learning stereo features, or merely learning to associate a closer distance to whatever object it sees on top of the background, let's run the same network (trained on stereo images) on 2 identical left images, instead of the stereo couple left/right images (similar to depthmap estimation from single images, which has been explored recently). In this case, there are no disparities between the 2 images, so this should throw the depthmap estimation off:

alt tag

We see indeed that the depth map calculated from 2 left images is off (last column), and objects are simply not registered and disappear altogether. This seems to support the idea that the network is learning true stereo features based on image disparity.

The architecture is roughly as follows:

Input layer (batch_size x 2 x width x height)

conv1 (batch_size x 16 x width x height)

maxpool1 (batch_size x 16 x width/2 x height/2)

conv2 (batch_sizex 32 x width/2 x height/2)

maxpool2 (batch_size x 32 x width/4 x height/4)

conv3 (batch_size x 64 x width/4 x height/4)

maxool3 (batch_size x 64 x width/8 x height/8)

conv4 (batch_size x 128, x width/8 x height/8)

deconv1 (batch_size x 32, x width/8 x height/8)

upscale1 (batch_size x 32 x width/4 x height/4)

deconv2 (batch_size x 16, x width/4 x height/4)

upscale2 (batch_size x 16 x width/2 x height/2)

deconv3(batch_size x 8 x width/2 x height/2)

upscale3 (batch_size x 8 x width x height)

deconv4 (batch_size x 1 x width x height)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].