Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → art-programmer → Planenet

art-programmer / Planenet

Licence: mit

PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image

Programming Languages

python

139335 projects - #7 most used programming language

Labels

pytorch tensorflow cvpr2018

Projects that are alternatives of or similar to Planenet

glimpse clouds

Pytorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille , G.W. Taylor, CVPR 2018

Stars: ✭ 30 (-91.02%)

Mutual labels: cvpr2018

ASNet

Salient Object Detection Driven by Fixation Prediction (CVPR2018)

Stars: ✭ 41 (-87.72%)

Mutual labels: cvpr2018

V2v Posenet release

Official Torch7 implementation of "V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map", CVPR 2018

Stars: ✭ 286 (-14.37%)

Mutual labels: cvpr2018

IDN-pytorch

paper implement : Fast and Accurate Single Image Super-Resolution via Information Distillation Network

Stars: ✭ 40 (-88.02%)

Mutual labels: cvpr2018

G2LTex

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

Stars: ✭ 104 (-68.86%)

Mutual labels: cvpr2018

Splatnet

SPLATNet: Sparse Lattice Networks for Point Cloud Processing (CVPR2018)

Stars: ✭ 259 (-22.46%)

Mutual labels: cvpr2018

Image manipulation detection

Paper: CVPR2018, Learning Rich Features for Image Manipulation Detection

Stars: ✭ 210 (-37.13%)

Mutual labels: cvpr2018

Prm

Weakly Supervised Instance Segmentation using Class Peak Response, in CVPR 2018 (Spotlight)

Stars: ✭ 322 (-3.59%)

Mutual labels: cvpr2018

DVQA dataset

DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018

Stars: ✭ 20 (-94.01%)

Mutual labels: cvpr2018

Geomapnet

Geometry-Aware Learning of Maps for Camera Localization (CVPR2018)

Stars: ✭ 281 (-15.87%)

Mutual labels: cvpr2018

text-detection-fots.pytorch

FOTS text detection branch reimplementation, hmean: 83.3%

Stars: ✭ 80 (-76.05%)

Mutual labels: cvpr2018

DisguiseNet

Code for DisguiseNet : A Contrastive Approach for Disguised Face Verification in the Wild

Stars: ✭ 20 (-94.01%)

Mutual labels: cvpr2018

Psa

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Stars: ✭ 261 (-21.86%)

Mutual labels: cvpr2018

FaceAttr

CVPR2018 Face Super-resolution with supplementary Attributes

Stars: ✭ 18 (-94.61%)

Mutual labels: cvpr2018

Tfusion

CVPR2018: Unsupervised Cross-dataset Person Re-identification by Transfer Learning of Spatio-temporal Patterns

Stars: ✭ 301 (-9.88%)

Mutual labels: cvpr2018

Dfl Cnn

This is a pytorch re-implementation of Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition

Stars: ✭ 245 (-26.65%)

Mutual labels: cvpr2018

cvpr18-caption-eval

Learning to Evaluate Image Captioning. CVPR 2018

Stars: ✭ 79 (-76.35%)

Mutual labels: cvpr2018

Textspotter

Stars: ✭ 323 (-3.29%)

Mutual labels: cvpr2018

Dcpdn

Densely Connected Pyramid Dehazing Network (CVPR'2018)

Stars: ✭ 321 (-3.89%)

Mutual labels: cvpr2018

Wshp

Code for CVPR'18 spotlight "Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer"

Stars: ✭ 273 (-18.26%)

Mutual labels: cvpr2018

View All Similar Projects ➔

PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image

By Chen Liu, Jimei Yang, Duygu Ceylan, Ersin Yumer, and Yasutaka Furukawa

Introduction

This paper presents the first end-to-end neural architecture for piece-wise planar reconstruction from a single RGB image. The proposed network, PlaneNet, learns to directly infer a set of plane parameters and corresponding plane segmentation masks. For more details, please refer to our CVPR 2018 paper or visit our project website.

Updates

We developed a better technique, PlaneRCNN, for piece-wise planar detection as described in our recent arXiv paper. Unfortunately, we cannot release the code and data yet.

We add script for extracting plane information from the original ScanNet dataset and rendering 3D planar segmentation results to 2D views. Please see the README in folder data_preparation/ for details. Note that we made some modifications to the heuristic-heavy plane fitting algorithms when cleaning up the messy codes developed over time. So the plane fitting results will be slightly different with the training data we used (provided in the .tfrecords files).

PyTorch training and testing codes are available now (still experimental and without the CRF module).

Dependencies

Python 2.7, TensorFlow (>= 1.3), numpy, opencv 3.

Getting started

Compilation

Please run the following commands to compile the library for the crfasrnn module.

cd cpp
sh compile.sh
cd ..

To train the network, you also need to run the following commands to compile the library for computing the set matching loss. You need Eigen (I am using Eigen 3.2.92) for the compilation. (Please see here for details.)

cd nndistance
make
cd ..

Data preparation

We convert ScanNet data to .tfrecords files for training and testing. The training data can be downloaded from here (or here if you cannont access the previous one), and the validation data can be downloaded from here (or here).

If you download the training data from the BOX link, please run the following command to merge downloaded files into one .tfrecords file.

cat training_data_segments/* > planes_scannet_train.tfrecords

Training

To train the network from the pretrained DeepLab network, please first download the DeepLab model here (under the Caffe to TensorFlow conversion), and then run the following command.

python train_planenet.py --restore=0 --modelPathDeepLab="path to the deep lab model" --dataFolder="folder which contains tfrecords files"

Evaluation

Please first download our trained network from here (or here) and put the uncompressed folder under ./checkpoint folder.

To evaluate the performance against existing methods, please run:

python evaluate.py --dataFolder="folder which contains tfrecords files"

Plane representation

A plane is represented by three parameters and a segmentation mask. If the plane equation is nx=d where n is the surface normal and d is the plane offset, then plane parameters are nd. The plane equation is in the camera frame, where x points to the right, y points to the front, and z points to the up.

Applications

Please first download our trained network (see [Evaluation](### Evaluation) section for details). Script predict.py predicts and visualizes custom images (if "customImageFolder" is specified) or ScanNet testing images (if "dataFolder" is specified).

python predict.py --customImageFolder="folder which contains custom images"
python predict.py --dataFolder="folder which contains tfrecords files" [--startIndex=0] [--numImages=30]

This will generate visualization images, a webpage containing all the visualization, as well as cache files under folder "predict/".

Same commands can be used for various applications by providing optional arguments, applicationType, imageIndex, textureImageFilename, and some application-specific arguments. The following commands are used to generate visualizations in the submission. (The TV application needs more manual specification for better visualization.)

python predict.py --dataFolder=/mnt/vision/Data/PlaneNet/ --textureImageFilename=texture_images/CVPR.jpg --imageIndex=118 --applicationType=logo_texture --startIndex=118 --numImages=1
python predict.py --dataFolder=/mnt/vision/Data/PlaneNet/ --textureImageFilename=texture_images/CVPR.jpg --imageIndex=118 --applicationType=logo_video --startIndex=118 --numImages=1
python predict.py --dataFolder=/mnt/vision/Data/PlaneNet/ --textureImageFilename=texture_images/checkerboard.jpg --imageIndex=72 --applicationType=wall_texture --wallIndices=7,9 --startIndex=72 --numImages=1
python predict.py --dataFolder=/mnt/vision/Data/PlaneNet/ --textureImageFilename=texture_images/checkerboard.jpg --imageIndex=72 --applicationType=wall_video --wallIndices=7,9 --startIndex=72 --numImages=1
python predict.py --customImageFolder=my_images/TV/ --textureImageFilename=texture_images/TV.mp4 --imageIndex=0 --applicationType=TV --wallIndices=2,9
python predict.py --customImageFolder=my_images/ruler --textureImageFilename=texture_images/ruler_36.png --imageIndex=0 --applicationType=ruler --startPixel=950,444 --endPixel=1120,2220

Note that, the above script generate image sequences for video applications. Please run the following command under the image sequence folder to generate a video:

ffmpeg -r 60 -f image2 -s 640x480 -i %04d.png -vcodec libx264 -crf 25 -pix_fmt yuv420p video.mp4

To check out the pool ball application, please run the following commands.

python predict.py --customImageFolder=my_images/pool --imageIndex=0 --applicationType=pool --estimateFocalLength=False
cd pool
python pool.py

Use mouse to play:)

Contact

If you have any questions, please contact me at [email protected].

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 334

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (19) 🔗