
chrischoy / FCGF

License: MIT
Fully Convolutional Geometric Features: fast and accurate 3D features for registration and correspondence.



Fully Convolutional Geometric Features, ICCV, 2019

Extracting geometric features from 3D scans or point clouds is the first step in applications such as registration, reconstruction, and tracking. State-of-the-art methods require computing low-level features as input or extracting patch-based features with limited receptive field. In this work, we present fully-convolutional geometric features, computed in a single pass by a 3D fully-convolutional network. We also present new metric learning losses that dramatically improve performance. Fully-convolutional geometric features are compact, capture broad spatial context, and scale to large scenes. We experimentally validate our approach on both indoor and outdoor datasets. Fully-convolutional geometric features achieve state-of-the-art accuracy without requiring preprocessing, are compact (32 dimensions), and are 600 times faster than the most accurate prior method.

ICCV'19 Paper

News

  • 2020-10-02 Measured the FCGF speedup of v0.5 on the MinkowskiEngineBenchmark. The speedup ranges from 2.7x to 7.7x depending on the batch size.
  • 2020-09-04 Updates to ME v0.5 further reduce the inference time from 13.2ms to 11.8ms. For reference, ME v0.4 takes 37ms.
  • 2020-08-18 Merged v0.5 support into master along with v0.5 installation instructions. You can now use full GPU support for the sparse tensor hi-COO representation for faster training and inference.
  • 2020-08-07 MinkowskiEngine v0.5 improves FCGF inference speed by 2.8x (280% speed-up): the feed-forward time for ResUNetBN2C on the 3DMatch kitchen point cloud ID-20 drops from 37ms (ME v0.4.3) to 13.2ms (ME v0.5.0), measured on a Titan XP with a Ryzen 3700X.
  • 2020-06-15 The source code for Deep Global Registration, CVPR'20 Oral, has been released. Please refer to that repository and paper for using FCGF for registration.

3D Feature Accuracy vs. Speed

[Figure: comparison of 3D feature accuracy vs. speed, plot and table]

Feature-match recall and speed in log scale on the 3DMatch benchmark. Our approach is the most accurate and the fastest. The gray region shows the Pareto frontier of the prior methods.

Related Works

3DMatch by Zeng et al. uses a Siamese convolutional network to learn 3D patch descriptors. CGF by Khoury et al. maps 3D oriented histograms to a low-dimensional feature space using multi-layer perceptrons. PPFNet and PPF-FoldNet by Deng et al. adapt the PointNet architecture for geometric feature description. 3DFeat by Yew and Lee uses a PointNet to extract features in outdoor scenes.

Our work addresses a number of limitations in prior work. First, all prior approaches extract a small 3D patch or a set of points and map it to a low-dimensional space. This not only limits the receptive field of the network but is also computationally inefficient, since all intermediate representations are computed separately even for overlapping 3D regions. Second, using expensive low-level geometric signatures as input can slow down feature computation. Lastly, limiting feature extraction to a subset of interest points results in lower spatial resolution for subsequent matching stages and can thus reduce registration accuracy.

Fully Convolutional Metric Learning, Hardest Contrastive, and Hardest Triplet Loss

Traditional metric learning assumes that the features are independent and identically distributed (i.i.d.) since a batch is constructed by random sampling. However, in fully-convolutional metric learning, first proposed in the Universal Correspondence Network (Choy et al. 2016), adjacent features are locally correlated, and hard-negative mining can return features adjacent to the anchor, which are false negatives. Thus, filtering out these false negatives is a crucial step, similar to how the Universal Correspondence Network used a distance threshold to filter them out.

Also, the number of features used in the fully-convolutional setting is orders of magnitude larger than in standard metric learning algorithms. For instance, FCGF generates ~40k features for a pair of scans (and this grows proportionally with the batch size), while a minibatch in traditional metric learning contains around 1k features. Thus, it is not feasible to use all pairwise distances within a batch as in standard metric learning.

Instead, we propose the hardest-contrastive loss and the hardest-triplet loss. Visually, these are simple variants that mine the hardest negative for each feature of a positive pair. One key advantage of the hardest-contrastive loss is that you do not need to keep the temporary variables used to find the hardest negatives: the loss can be reconstructed from the hardest-negative indices alone, and the intermediate results over a large number of features can be thrown away. Here, we use almost 40k features to mine the hardest negatives and discard all intermediate variables once the indices of the hardest negatives are found for each positive feature.
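
To make this concrete, below is a minimal sketch of a hardest-contrastive loss in PyTorch. It is illustrative only, not the repository's implementation; the function name, margin values, and the simplified mining over a single pair of scans are assumptions.

import torch

def hardest_contrastive_loss(feats0, feats1, pos_pairs, pos_margin=0.1, neg_margin=1.4):
    """feats0: (N0, D), feats1: (N1, D) FCGF features from two scans.
    pos_pairs: (P, 2) long tensor of ground-truth correspondences (idx0, idx1)."""
    f0 = feats0[pos_pairs[:, 0]]  # anchors in scan 0
    f1 = feats1[pos_pairs[:, 1]]  # their matches in scan 1

    # Positive term: pull matched features together.
    pos_loss = torch.relu((f0 - f1).norm(dim=1) - pos_margin).pow(2).mean()

    # Hardest-negative mining: only the argmin indices are kept, so the large
    # (P, N) distance matrices can be discarded right after this block.
    with torch.no_grad():
        d0 = torch.cdist(f0, feats1)  # (P, N1)
        d0[torch.arange(len(f0)), pos_pairs[:, 1]] = float('inf')  # exclude the true match
        # In practice, spatially adjacent points (false negatives) are also masked out.
        neg_idx0 = d0.argmin(dim=1)
        d1 = torch.cdist(f1, feats0)  # (P, N0)
        d1[torch.arange(len(f1)), pos_pairs[:, 0]] = float('inf')
        neg_idx1 = d1.argmin(dim=1)

    # Recompute only the mined pairs with gradients enabled.
    neg_loss = 0.5 * (
        torch.relu(neg_margin - (f0 - feats1[neg_idx0]).norm(dim=1)).pow(2).mean() +
        torch.relu(neg_margin - (f1 - feats0[neg_idx1]).norm(dim=1)).pow(2).mean())

    return pos_loss + neg_loss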

[Figure: sampling strategies for the contrastive, triplet, hardest-contrastive, and hardest-triplet losses]

Sampling and negative-mining strategy for each method. Blue: positives, Red: Negatives. Traditional contrastive and triplet losses use random sampling. Our hardest-contrastive and hardest-triplet losses use the hardest negatives.

Please refer to our ICCV'19 paper for more details.

Visualization of FCGF

We color-coded FCGF features for a pair of KITTI LIDAR scans taken 10m apart and for a 3DMatch benchmark pair of indoor scans. FCGF features are mapped to a scalar space using t-SNE and colorized with the Spectral color map.

[Figure: KITTI LIDAR Scan 1 and KITTI LIDAR Scan 2]
[Figure: Indoor Scan 1 and Indoor Scan 2]
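
The color mapping described above can be sketched as follows. This is an illustrative snippet, not the repository's visualization code; the helper name and the scikit-learn/matplotlib usage are assumptions.

from sklearn.manifold import TSNE
from matplotlib import cm

def fcgf_to_colors(feats):
    """feats: (N, 32) array of FCGF features -> (N, 3) RGB colors in [0, 1]."""
    # Embed the features into a 1-D scalar space with t-SNE.
    scalars = TSNE(n_components=1, perplexity=30).fit_transform(feats).squeeze()
    # Normalize to [0, 1] and look the scalars up in the Spectral colormap.
    scalars = (scalars - scalars.min()) / (scalars.max() - scalars.min() + 1e-12)
    return cm.Spectral(scalars)[:, :3]  # drop the alpha channel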

FCGF Correspondence Visualizations

Please follow the YouTube Video link or click the image to view the YouTube video of FCGF correspondence visualizations.

Requirements

  • Ubuntu 14.04 or higher
  • CUDA 11.1 or higher
  • Python v3.7 or higher
  • PyTorch v1.6 or higher
  • MinkowskiEngine v0.5 or higher

Installation & Dataset Download

We recommend conda for installation. First, create a conda environment with PyTorch 1.6 or higher:

conda create -n py3-fcgf python=3.7
conda activate py3-fcgf
conda install pytorch -c pytorch
pip install git+https://github.com/NVIDIA/MinkowskiEngine.git

Next, clone the FCGF git repository and install the requirements from the FCGF root directory:

git clone https://github.com/chrischoy/FCGF.git
cd FCGF
# Do the following inside the conda environment
pip install -r requirements.txt

For training, download the preprocessed 3DMatch benchmark dataset.

./scripts/download_datasets.sh /path/to/dataset/download/dir

For KITTI training, follow the instructions on the KITTI Odometry website to download the KITTI odometry training set.

Demo: Extracting and color coding FCGF

After installation, you can run the demo script by

python demo.py

The demo script first extracts FCGF features from a mesh file generated from a kitchen scene. Next, it maps the features to colors using t-SNE, independent of their spatial location, and visualizes the color-coded features by coloring the input point cloud.

[Figure: color-coded FCGF features on the kitchen scene]

You may have to rotate the scene to get the above visualization.
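
For reference, a minimal sketch of what the demo does under the hood is shown below. The model import path, constructor arguments, and checkpoint path are assumptions for the 32-dimensional, 2.5cm-voxel model; check demo.py and the configs in the repository for the exact values.

import numpy as np
import torch
import MinkowskiEngine as ME
from model.resunet import ResUNetBN2C  # assumed import path inside the FCGF repository

voxel_size = 0.025
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# Load a trained model (checkpoint path and keys are assumptions).
checkpoint = torch.load('checkpoint.pth', map_location=device)
model = ResUNetBN2C(1, 32, normalize_feature=True, conv1_kernel_size=7, D=3)
model.load_state_dict(checkpoint['state_dict'])
model = model.to(device).eval()

xyz = np.random.rand(10000, 3).astype(np.float32)  # stand-in for a real scan

# Voxelize the point cloud and keep one point per occupied voxel.
coords = np.floor(xyz / voxel_size).astype(np.int32)
coords, sel = ME.utils.sparse_quantize(coords, return_index=True)
coords = ME.utils.batched_coordinates([coords])  # prepend the batch index
feats = torch.ones(len(sel), 1)                  # FCGF uses a constant 1 as the input feature

with torch.no_grad():
    stensor = ME.SparseTensor(feats.to(device), coordinates=coords.to(device))
    fcgf_feats = model(stensor).F                # (M, 32) tensor, one feature per voxel
    xyz_down = xyz[np.asarray(sel)]              # the points the features correspond to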

Training and running 3DMatch benchmark

python train.py --threed_match_dir /path/to/threedmatch/

For benchmarking the trained weights on 3DMatch, download the 3DMatch Geometric Registration Benchmark dataset from here or run

bash ./scripts/download_3dmatch_test.sh /path/to/threedmatch_test/

and then run:

python -m scripts.benchmark_3dmatch \
    --source /path/to/threedmatch \
    --target ./features_tmp/ \
    --voxel_size 0.025 \
    --model ~/outputs/checkpoint.pth \
    --extract_features --evaluate_feature_match_recall --with_cuda
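
The feature-match recall reported by the benchmark follows the standard definition: a scan pair counts as recalled if more than a fraction tau2 of its feature correspondences lie within tau1 of each other after applying the ground-truth transformation. The snippet below is a hedged sketch of that computation; the commonly used thresholds tau1 = 10cm and tau2 = 5% are assumptions here, so check the benchmark script for the exact values.

import numpy as np

def feature_match_recall(residuals_per_pair, tau1=0.10, tau2=0.05):
    """residuals_per_pair: list of (K,) arrays; each holds the 3D distances (in meters)
    between matched points of one scan pair after ground-truth alignment."""
    recalled = [np.mean(d < tau1) > tau2 for d in residuals_per_pair]
    return float(np.mean(recalled))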

Training and testing on KITTI Odometry custom split

For KITTI training, follow the instructions on the KITTI Odometry website to download the KITTI odometry training set.

export KITTI_PATH=/path/to/kitti/; ./scripts/train_fcgf_kitti.sh

Registration Test on 3DMatch
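
As a hedged illustration, FCGF features can be fed to RANSAC-based feature matching in Open3D to estimate a rigid transform between two scans. The helper names and the Open3D >= 0.12 call below are assumptions, not the repository's evaluation code.

import numpy as np
import open3d as o3d

def to_o3d(xyz, feats):
    """xyz: (N, 3) points, feats: (N, 32) FCGF features -> Open3D point cloud and Feature."""
    pcd = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(xyz.astype(np.float64)))
    feat = o3d.pipelines.registration.Feature()
    feat.data = feats.astype(np.float64).T  # Open3D expects a (dim, N) array
    return pcd, feat

def ransac_register(xyz0, feats0, xyz1, feats1, voxel_size=0.025):
    pcd0, feat0 = to_o3d(xyz0, feats0)
    pcd1, feat1 = to_o3d(xyz1, feats1)
    distance_threshold = voxel_size * 2.0
    result = o3d.pipelines.registration.registration_ransac_based_on_feature_matching(
        pcd0, pcd1, feat0, feat1,
        True,  # mutual_filter (Open3D >= 0.12)
        distance_threshold,
        o3d.pipelines.registration.TransformationEstimationPointToPoint(False),
        3, [],
        o3d.pipelines.registration.RANSACConvergenceCriteria(100000, 0.999))
    return result.transformation  # 4x4 transform aligning scan 0 to scan 1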

Model Zoo

Model Normalized Feature Dataset Voxel Size Feature Dimension Performance Link
ResUNetBN2C True 3DMatch 2.5cm (0.025) 32 FMR: 0.9578 +- 0.0272 download
ResUNetBN2C True 3DMatch 2.5cm (0.025) 16 FMR: 0.9442 +- 0.0345 download
ResUNetBN2C True 3DMatch 5cm (0.05) 32 FMR: 0.9372 +- 0.0332 download
ResUNetBN2C False KITTI 20cm (0.2) 32 RTE: 0.0534m, RRE: 0.1704° download
ResUNetBN2C False KITTI 30cm (0.3) 32 RTE: 0.0607m, RRE: 0.2280° download
ResUNetBN2C True KITTI 30cm (0.3) 16 RTE: 0.0670m, RRE: 0.2295° download
ResUNetBN2C True KITTI 30cm (0.3) 32 RTE: 0.0639m, RRE: 0.2253° download

Raw Data for FCGF Figure 4

Citing FCGF

FCGF will be presented at ICCV'19: Friday, November 1, 2019, 1030–1300 Poster 4.1 (Hall B)

@inproceedings{FCGF2019,
    author = {Christopher Choy and Jaesik Park and Vladlen Koltun},
    title = {Fully Convolutional Geometric Features},
    booktitle = {ICCV},
    year = {2019},
}

Related Projects

Projects using FCGF

Acknowledgements

We want to thank all the ICCV reviewers, especially R2, for suggestions and valuable pointers.
