All Projects → zhixuanli → StereoNet

zhixuanli / StereoNet

Licence: other
A customized implementation of the paper "StereoNet: guided hierarchical refinement for real-time edge-aware depth prediction"

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to StereoNet

SUIM
Semantic Segmentation of Underwater Imagery: Dataset and Benchmark. #IROS2020
Stars: ✭ 53 (-50.47%)
Mutual labels:  semantic-segmentation
kitti-A-LOAM
Easy description to run and evaluate A-LOAM with KITTI-data
Stars: ✭ 28 (-73.83%)
Mutual labels:  kitti
adversarial-attacks
Code for our CVPR 2018 paper, "On the Robustness of Semantic Segmentation Models to Adversarial Attacks"
Stars: ✭ 90 (-15.89%)
Mutual labels:  semantic-segmentation
Tools Merge Image PointCloud
Project the PointCloud to the image & Generate the LiDAR PointCloud with color.
Stars: ✭ 39 (-63.55%)
Mutual labels:  kitti
kitti deeplab
Inference script and frozen inference graph with fine tuned weights for semantic segmentation on images from the KITTI dataset.
Stars: ✭ 26 (-75.7%)
Mutual labels:  semantic-segmentation
SPML
Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning
Stars: ✭ 81 (-24.3%)
Mutual labels:  semantic-segmentation
DeepLab-V3
Google DeepLab V3 for Image Semantic Segmentation
Stars: ✭ 103 (-3.74%)
Mutual labels:  semantic-segmentation
panoptic-forecasting
[CVPR 2021] Forecasting the panoptic segmentation of future video frames
Stars: ✭ 44 (-58.88%)
Mutual labels:  semantic-segmentation
UAV-Stereo-Vision
A program for controlling a micro-UAV for obstacle detection and collision avoidance using disparity mapping
Stars: ✭ 30 (-71.96%)
Mutual labels:  stereo
BMW-IntelOpenVINO-Segmentation-Inference-API
This is a repository for a semantic segmentation inference API using the OpenVINO toolkit
Stars: ✭ 31 (-71.03%)
Mutual labels:  semantic-segmentation
UAVI Seg Pytorch
基于语义分割的矿区无人机影像地物提取实验
Stars: ✭ 106 (-0.93%)
Mutual labels:  semantic-segmentation
Fast-SCNN pytorch
A PyTorch Implementation of Fast-SCNN: Fast Semantic Segmentation Network(PyTorch >= 1.4)
Stars: ✭ 30 (-71.96%)
Mutual labels:  semantic-segmentation
3D-Detection-Tracking-Viewer
3D detection and tracking viewer (visualization) for kitti & waymo dataset
Stars: ✭ 150 (+40.19%)
Mutual labels:  kitti
wasr network
WaSR Segmentation Network for Unmanned Surface Vehicles v0.5
Stars: ✭ 32 (-70.09%)
Mutual labels:  semantic-segmentation
squeeze-unet
Squeeze-unet Semantic Segmentation for embedded devices
Stars: ✭ 21 (-80.37%)
Mutual labels:  semantic-segmentation
CVPR2021 PLOP
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
Stars: ✭ 102 (-4.67%)
Mutual labels:  semantic-segmentation
DSIN
Deep Image Compression using Decoder Side Information (ECCV 2020)
Stars: ✭ 39 (-63.55%)
Mutual labels:  kitti
ML IDCard Segmentation-TF-Keras
Machine Learning Project to identify an ID Card on an image
Stars: ✭ 38 (-64.49%)
Mutual labels:  semantic-segmentation
atomai
Deep and Machine Learning for Microscopy
Stars: ✭ 77 (-28.04%)
Mutual labels:  semantic-segmentation
SpinNet
[CVPR 2021] SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration
Stars: ✭ 181 (+69.16%)
Mutual labels:  kitti

Pytorch StereoNet

Customized implementation of the Stereonet guided hierarchical refinement for real-time edge-aware depth prediction


I have just noticed that the share link above is not available now. Please download from this link: https://drive.google.com/file/d/1CO2hGMphCZsNcJPFAk2q4-QG_0zM0LpU/view?usp=sharing.

This project is the first one after I have learned Pytorch. And it's the homework for my CV course. I'm focusing on the segmentation field at present. Thanks for your attention.


The network archtecture of StereoNet

Attention: Not accomplished yet

  1. Stereo Matching is not my main research field, and this repo is created for a homework. So maybe it's not very completed, but I have tried to make it perfect. If you need a better version, please refer to https://github.com/meteorshowers/StereoNet.
  2. The approach of computing the cost volume in the StereoNet paper is subtracting the padding image and the other image. Here I changed it to concatenate the two images. If you want to change it to the paper's way, just set it when you initialize the net.
  3. Only training and testing on the KITTI 2015 train dataset is not enough, the best performance has achieved 74.5% (pixels with error smaller than 1). After pretraining on SceneFlow and finetune on KITTI15, the acc achieves 90.054%, not as good as the acc in paper. I have try hard to achieve the accuracy in paper, but still can't. Maybe some details are wrong.

Experiment Results till now

  1. train and test on SceneFlow datasets:
    • epoch 22 total training loss = 3.956
    • average test EPE = 3.496
  2. different finetuning on kitti 15 and result
    • 300 epochs, max 3 pixel error rate = 80.893 on kitti val
      optimizer = RMSprop(model.parameters(), lr=1e-3, weight_decay=0.0001)
      scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.9)
      
    • 300 epochs, max 3 pixel error rate = 83.527 on kitti val
      optimizer = optim.Adam(model.parameters(), lr=0.001, betas=(0.9, 0.999))
      scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.9)
      
    • 300 epochs, max 3 pixel error rate = 90.054 on kitti val
      optimizer = optim.Adam(model.parameters(), lr=0.001, betas=(0.9, 0.999))
      if epoch <= 200:
          lr = 0.001
      else:
          lr = 0.0001    
      
    • 2000 epochs, max 3 pixel error rate = 93.680 on kitti val, after 4.98 hours finetune
      optimizer = optim.Adam(model.parameters(), lr=0.001, betas=(0.9, 0.999))
      if epoch <= 200:
          lr = 0.001
      else:
          lr = 0.0001    
      

Pre-requirement

  • Pytorch 1.0.0
  • CUDA Toolkit 10
  • numpy

Datasets:

  1. Pretrain: SceneFlow
  2. KITTI 2015

You can use the anaconda virtual environment to quick start

Install Anaconda

1. wget https://repo.anaconda.com/archive/Anaconda3-5.3.1-Linux-x86_64.sh
2. bash Anaconda3-5.3.1-Linux-x86_64.sh

Please reference to Ubuntu系统下Anaconda使用方法总结 for more information about conda installation.

Create Virtual Environment according to my environment index

conda env create -n your_env_name -f environment.yaml

Training and Test

Switch to the correct python environment

conda activate your_env_name

Start training and test

Pretrain on SceneFlow dataset

cd pretrain-sceneflow
python sceneflow-pretrain.py

Finetune on KITTI 2015

cd finetune-kitti15
python finetune-kitti15.py

Coding Reference

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].