All Projects → tentone → monodepth

tentone / monodepth

Licence: GPL-3.0 license
Python ROS depth estimation from RGB image based on code from the paper "High Quality Monocular Depth Estimation via Transfer Learning"

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects
CMake
9771 projects

Projects that are alternatives of or similar to monodepth

Arc Robot Vision
MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
Stars: ✭ 224 (+446.34%)
Mutual labels:  vision, rgbd
Tsdf Fusion
Fuse multiple depth frames into a TSDF voxel volume.
Stars: ✭ 426 (+939.02%)
Mutual labels:  vision, rgbd
diode-devkit
DIODE Development Toolkit
Stars: ✭ 58 (+41.46%)
Mutual labels:  rgbd, depth-estimation
Tsdf Fusion Python
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Stars: ✭ 464 (+1031.71%)
Mutual labels:  vision, rgbd
Apc Vision Toolbox
MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.
Stars: ✭ 277 (+575.61%)
Mutual labels:  vision, rgbd
3dmatch Toolbox
3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.
Stars: ✭ 571 (+1292.68%)
Mutual labels:  vision, rgbd
Cs231a Notes
The course notes for Stanford's CS231A course on computer vision
Stars: ✭ 230 (+460.98%)
Mutual labels:  vision
autonomous-delivery-robot
Repository for Autonomous Delivery Robot project of IvLabs, VNIT
Stars: ✭ 65 (+58.54%)
Mutual labels:  vision
Simplecv
Stars: ✭ 2,522 (+6051.22%)
Mutual labels:  vision
Opticalflow visualization
Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
Stars: ✭ 183 (+346.34%)
Mutual labels:  vision
rgbd ptam
Python implementation of RGBD-PTAM algorithm
Stars: ✭ 65 (+58.54%)
Mutual labels:  rgbd
stereo.vision
planar fitting computation using stereo vision techniques
Stars: ✭ 19 (-53.66%)
Mutual labels:  vision
frc-score-detection
A program to detect FRC match scores from their livestream.
Stars: ✭ 15 (-63.41%)
Mutual labels:  vision
Amazing Arkit
ARKit相关资源汇总 群:326705018
Stars: ✭ 239 (+482.93%)
Mutual labels:  vision
RGBD-semantic-segmentation
A paper list of RGBD semantic segmentation (processing)
Stars: ✭ 264 (+543.9%)
Mutual labels:  rgbd
EfficientMORL
EfficientMORL (ICML'21)
Stars: ✭ 22 (-46.34%)
Mutual labels:  vision
React Native Text Detector
Text Detector from image for react native using firebase MLKit on android and Tesseract on iOS
Stars: ✭ 194 (+373.17%)
Mutual labels:  vision
Grocery-Product-Detection
This repository builds a product detection model to recognize products from grocery shelf images.
Stars: ✭ 73 (+78.05%)
Mutual labels:  vision
sam-textvqa
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
Stars: ✭ 51 (+24.39%)
Mutual labels:  vision
nested-transformer
Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
Stars: ✭ 174 (+324.39%)
Mutual labels:  vision

Mono Depth ROS

Configuration

  • Topics subscribed by the ROS node
    • /image/camera_raw - Input image from camera (can be changed on the parameter topic_color)
  • Topics published by the ROS node, containing depth and point cloud data generated.
    • /image/depth - Image message containing the depth image estimated (can be changed on the parameter topic_depth).
    • /pointcloud - Pointcloud2 message containing a estimated point cloud (can be changed on the parameter topic_pointcloud).
  • Parameters that can be configurated
    • frame_id - TF Frame id to be published in the output messages.
    • debug - If set true a window with the output result if displayed.
    • min_depth, max_depth - Min and max depth values considered for scaling.
    • batch_size - Batch size used when predicting the depth image using the model provided.
    • model_file - Keras model file used, relative to the monodepth package.

Setup

  • Install Python 2 and ROS dependencies
apt-get install python python-pip curl
pip install rosdep rospkg rosinstall_generator rosinstall wstool vcstools catkin_tools catkin_pkg
  • Install project dependencies
pip install tensorflow keras pillow matplotlib scikit-learn scikit-image opencv-python pydot GraphViz tk
  • Clone the project into your ROS workspace and download pretrained models
git clone https://github.com/tentone/monodepth.git
cd monodepth/models
curl –o nyu.h5 https://s3-eu-west-1.amazonaws.com/densedepth/nyu.h5

Launch

  • Example ROS launch entry provided bellow, for easier integration into your already existing ROS launch pipeline.
<node pkg="monodepth" type="monodepth.py" name="monodepth" output="screen" respawn="true">
    <param name="topic_color" value="/camera/image_raw"/>
    <param name="topic_depth" value="/camera/depth"/>
</node>

Pretrained models

Datasets for training

  • NYU Depth V2 (50K)
    • The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect.
    • Download dataset (4.1 GB)
  • KITTI Dataset (80K)
    • Datasets captured by driving around the mid-size city of Karlsruhe, in rural areas and on highways. Up to 15 cars and 30 pedestrians are visible per image.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].