nsavinov / Semantic3dnet

License: BSD-3-Clause
Point cloud semantic segmentation via Deep 3D Convolutional Neural Network


Point cloud semantic segmentation via Deep 3D Convolutional Neural Network

This code implements a deep neural network for 3D point cloud semantic segmentation. It serves as a baseline for the benchmark http://www.semantic3d.net/ (it reproduces the DeepNet entry in the reduced-8 track). It is written in C++/Lua and is intended to make it easier to start working with the benchmark.

The code requires at least 8 GB of RAM and an Nvidia GPU with at least 6 GB of memory (tested on an Nvidia Titan X).

If you use this code or the benchmark in your research, please cite it as

@article{hackel2017semantic3d,
title={Semantic3D.net: A new Large-scale Point Cloud Classification Benchmark},
author={Hackel, Timo and Savinov, Nikolay and Ladicky, Lubor and Wegner, Jan and Schindler, Konrad and Pollefeys, Marc},
journal={arXiv preprint arXiv:1704.03847},
year={2017}
}

How it works

Each point in the point cloud is classified into one of the semantic classes (building, car, vegetation, etc.). This is done by considering a range of neighbourhoods around the point, computing an occupancy grid for each of them, and applying a 3D convolutional neural network to those grids. A more detailed description can be found at https://goo.gl/TUPqXo.

Instructions for Linux (tested on Ubuntu 16.04.2 LTS):

  1. Install Torch (tested at commit 5c1d3cfda8101123628a45e70435d545ae1bc771 from June 7, 2017), CUDA (tested with v8.0) and cuDNN (tested with v5.1). For the latter, follow the instructions at https://github.com/soumith/cudnn.torch
  2. Clone this repository.
  3. Run "cd build; ./setup.sh" to download the data and transform it into the required format.
  4. Run "./build_run.sh" to prepare small train/validation sets for tracking optimization progress.
  5. Run "cd ../src; ./launch_training.sh". You can track the progress in the nohup.out file. Wait until the train error gets close to 0 and the test error starts to oscillate around some value, then kill the process (this took 304 epochs for the baseline).
  6. Run "./launch_prediction.sh". You might want to change the GPU indices in this script depending on how many GPUs you have available; you can also tweak the number of OpenMP threads. Wait until it finishes (this might take a day or so).
  7. After prediction finishes, run "./prepare_to_submit.sh" to put the submission into the required format.
  8. Submit data/benchmark/submit.zip to the server at http://www.semantic3d.net/submit_public.php.

Parameters

The changeable constants are in lib/point_cloud_util/data_loader_constants.h and in src/point_cloud_constants.lua.

Their meaning:

lib/point_cloud_util/data_loader_constants.h:

const int kWindowSize = 16; // neighbourhood of the point is voxelized into kWindowSize ^ 3 voxels

const int kBatchSize = 100; // since each batch is constructed on the C++ side, its size needs to be specified here

const int kNumberOfClasses = 8; // integer labels 1, ..., 8 are considered

const int kDefaultNumberOfScales = 5; // each sample in the batch is constructed by stacking multiple scales; in the deep net architecture, the fully connected layer outputs for the scales are concatenated

const int kDefaultNumberOfRotations = 1; // optional implementation of TI-pooling on top of the multi-scale architecture; check the repository or read the paper for more details

const float kSpatialResolution = 0.025; // voxel side in meters

const int kBatchResamplingLimit = 100; // after this number of batches, the coordinate system is rotated by a random angle and the voxelized representations are recalculated from scratch (augmentation).

src/point_cloud_constants.lua:

opt.number_of_filters = 16 -- number of filters in the first convolutional layer; the sizes of the other layers are also proportional to this constant

opt.kLargePrintingInterval = 100 -- how often to evaluate the model and dump the solution to disk

opt.kWarmStart = false -- resume training from the saved model

opt.kModelDumpName = '../dump/model_dump' -- where the model is saved

opt.kOptimStateDumpName = '../dump/optim_state_dump' -- where the optimization progress is saved

opt.kStreamingPath = '../data/benchmark/sg28_station4_intensity_rgb_train.txt' -- from which file training data is sampled

Caveats

Data is randomly sampled from the training set, and all classes are made equally probable by this sampling. The training file is therefore required to contain at least one sample of each class.
