All Projects → leeyevi → Mv3d_tf

leeyevi / Mv3d_tf

Licence: mit
Tensorflow implementation of Multi-View 3D Object Detection Network (in progress)

Projects that are alternatives of or similar to Mv3d tf

Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+1267.68%)
Mutual labels:  jupyter-notebook
Datascience book
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Ucl Deep Learning Ans Reinforcement Learning
Deep learning and Reinforcement learning lecture and course work
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Linear algebra with python
Lecture Notes for Linear Algebra Featuring Python
Stars: ✭ 1,355 (+1268.69%)
Mutual labels:  jupyter-notebook
Hands On Exploratory Data Analysis With Python
Hands-on Exploratory Data Analysis with Python, published by Packt
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Bds
Code and examples from Business Data Science
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Cbe20255
Introduction to Chemical Engineering Analysis
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Delf enhanced
Wrapper of DELF Tensorflow Model
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Pytorch Bert Document Classification
Enriching BERT with Knowledge Graph Embedding for Document Classification (PyTorch)
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Ml Sound Classifier
Machine Learning Sound Classifier
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Physlight
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Gp regression
A Primer on Gaussian Processes for Regression Analysis (PyData NYC 2019)
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Facedetector
A re-implementation of mtcnn. Joint training, tutorial and deployment together.
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Nab
The Numenta Anomaly Benchmark
Stars: ✭ 1,352 (+1265.66%)
Mutual labels:  jupyter-notebook
100days Ml Code
100天机器学习 (翻译+ 实操)
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Objectron
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Stars: ✭ 1,352 (+1265.66%)
Mutual labels:  jupyter-notebook
Afnet
Code for paper in CVPR2019, 'Attentive Feedback Network for Boundary-aware Salient Object Detection'
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Organic
Code repo for optimizing distributions of molecules.
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Math of machine learning
This is the code for "Mathematcs of Machine Learning" by Siraj Raval on Youtube
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Adversarially Learned Anomaly Detection
ALAD (Proceedings of IEEE ICDM 2018) official code
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook

MV3D_TF(In progress)

This is an experimental Tensorflow implementation of MV3D - a ConvNet for object detection with Lidar and Mono-camera.

For details about MV3D please refer to the paper Multi-View 3D Object Detection Network for Autonomous Driving by Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia.

Requirements: software

  1. Requirements for Tensorflow 1.0 (see: Tensorflow)

  2. Python packages you might not have: cython, python-opencv, easydict

Requirements: hardware

  1. For training the end-to-end version of Faster R-CNN with VGG16, 3G of GPU memory is sufficient (using CUDNN)

Installation

  1. Clone the Faster R-CNN repository
  # Make sure to clone with --recursive
  git clone --recursive https://github.com/RyannnG/MV3D_TF.git
  1. Build the Cython modules

     cd $MV3D/lib
     make
    
  2. Downloads KITTI object datasets.

 % Specify KITTI data path so that the structure is like

 % {kitti_dir}/object/training/image_2
 %                            /image_3
 %                            /calib
 %                            /lidar_bv
 %							 /velodyne
       

 % {kitti_dir}/object/testing/image_2
 %                           /image_3
 %                           /calib
 %                           /lidar_bv
 %							/velodyne
  1. Make Lidar Bird View data

    # edit the kitti_path in tools/read_lidar.py
    # then start make data
    python tools/read_lidar.py
    
  2. Create symlinks for the KITTI dataset

   cd $MV3D/data/KITTI
   ln -s {kitti_dir}/object object
  1. Download pre-trained ImageNet models

    Download the pre-trained ImageNet models [Google Drive] [Dropbox]

    mv VGG_imagenet.npy $MV3D/data/pretrain_model/VGG_imagenet.npy
  1. Run script to train model
 cd $MV3D
 ./experiments/scripts/mv3d.sh $DEVICE $DEVICE_ID ${.npy/ckpt.meta} kitti_train

DEVICE is either cpu/gpu

Network Structure

Key idea: Use Lidar bird view to generate anchor boxes, then project those boxes on image to do classification.

structure

Examples

Image and corresponding Lidar map

Note:

In image:

  • Boxes without regression

In Lidar:

  • white box: without regression (correspond with image)
  • purple box: with regression

figure_20

figure_20

figure_20

figure_20

figure_20

figure_20

figure_20

figure_20

Existing Errors

Mostly due to regression error

figure_20

(error in box 5,6,9)

figure_20

figure_20

(error in 8, 9, 10)

figure_20

References

Lidar Birds Eye Views

part.2: Didi Udacity Challenge 2017 — Car and pedestrian Detection using Lidar and RGB

Faster_RCNN_TF

Faster R-CNN caffe version

TFFRCNN

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].