Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

This work is based on our paper "DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes", which appeared at the IEEE Conference On Computer Vision And Pattern Recognition (CVPR) 2020.

Stars: ✭ 75 (-46.81%)

Mutual labels: semantic-segmentation, cvpr

Unsup3d

(CVPR'20 Oral) Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Stars: ✭ 905 (+541.84%)

Mutual labels: 3d, cvpr

Suncgtoolbox

C++ based toolbox for the SUNCG dataset

Stars: ✭ 136 (-3.55%)

Mutual labels: 3d, semantic-segmentation

Contrastiveseg

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Stars: ✭ 135 (-4.26%)

Mutual labels: semantic-segmentation

Multi Task Refinenet

Multi-Task (Joint Segmentation / Depth / Surface Normas) Real-Time Light-Weight RefineNet

Stars: ✭ 139 (-1.42%)

Mutual labels: semantic-segmentation

Armorpaint

3D PBR Texture Painting Software

Stars: ✭ 2,065 (+1364.54%)

Mutual labels: 3d

Canvas Test

🎮 happy canvas

Stars: ✭ 1,722 (+1121.28%)

Mutual labels: 3d

Synthesize3dviadepthorsil

[CVPR 2017] Generation and reconstruction of 3D shapes via modeling multi-view depth maps or silhouettes

Stars: ✭ 141 (+0%)

Mutual labels: 3d

Bisenetv2 Tensorflow

Unofficial tensorflow implementation of real-time scene image segmentation model "BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation"

Stars: ✭ 139 (-1.42%)

Mutual labels: semantic-segmentation

Rg3d

3D and 2D game engine written in Rust

Stars: ✭ 2,998 (+2026.24%)

Mutual labels: 3d

Kiu Net Pytorch

Official Pytorch Code of KiU-Net for Image Segmentation - MICCAI 2020 (Oral)

Stars: ✭ 134 (-4.96%)

Mutual labels: semantic-segmentation

Pytorch Fcn Easiest Demo

PyTorch Implementation of Fully Convolutional Networks (a very simple and easy demo).

Stars: ✭ 138 (-2.13%)

Mutual labels: semantic-segmentation

3d Deep Learning

3D Deep Learning works

Stars: ✭ 135 (-4.26%)

Mutual labels: 3d

Human Video Generation

Human Video Generation Paper List

Stars: ✭ 139 (-1.42%)

Mutual labels: 3d

Dilation Tensorflow

A native Tensorflow implementation of semantic segmentation according to Multi-Scale Context Aggregation by Dilated Convolutions (2016). Optionally uses the pretrained weights by the authors.

Stars: ✭ 134 (-4.96%)

Mutual labels: semantic-segmentation

Flownet2 Docker

Dockerfile and runscripts for FlowNet 2.0 (estimation of optical flow)

Stars: ✭ 137 (-2.84%)

Mutual labels: cvpr

Building Blocks

A voxel library for real-time applications.

Stars: ✭ 140 (-0.71%)

Mutual labels: 3d

View All Similar Projects ➔

Spatio-Temporal Segmentation

This repository contains the accompanying code for 4D-SpatioTemporal ConvNets: Minkowski Convolutional Neural Networks, CVPR'19.

Change Log

2020-05-19 The latest Minkowski Engine since the commit be5c3, does not require explicit cache clear and can use the memory more efficiently.
2020-05-04: As pointed out by Thomas Chaton on Issue#30, I also found out that the training script contains bugs that models cannot reach the target performance described in the Model Zoo with the latest MinkowskiEngine. I am in the process of debugging the bugs, but I am having some difficulty finding the bugs. So, I created another git repo SpatioTemporalSegmentation-ScanNet from my other private repo that reaches the target performance. Please refer to the SpatioTemporalSegmentation-ScanNet for the ScanNet training. I'll update this repo once I find the bugs and merge SpatioTemporalSegmentation-ScanNet with this repo. Sorry for the trouble.

Requirements

Ubuntu 14.04 or higher
CUDA 10.1 or higher
pytorch 1.3 or higher
python 3.6 or higher
GCC 6 or higher

Installation

You need to install pytorch and Minkowski Engine either with pip or with anaconda.

Pip

The MinkowskiEngine is distributed via PyPI MinkowskiEngine which can be installed simply with pip. First, install pytorch following the instruction. Next, install openblas.

sudo apt install libopenblas-dev

pip install torch torchvision

pip install -U git+https://github.com/StanfordVL/MinkowskiEngine

Next, clone the repository and install the rest of the requirements

git clone https://github.com/chrischoy/SpatioTemporalSegmentation/

cd SpatioTemporalSegmentation

pip install -r requirements.txt

Troubleshooting

Please visit the MinkowskiEngine issue pages if you have difficulties installing Minkowski Engine.

ScanNet Training

Download the ScanNet dataset from the official website. You need to sign the terms of use.
Next, preprocess all scannet raw point cloud with the following command after you set the path correctly.

python -m lib.datasets.preprocessing.scannet

Train the network with

export BATCH_SIZE=N;
./scripts/train_scannet.sh 0 \
	-default \
	"--scannet_path /path/to/preprocessed/scannet"

Modify the BATCH_SIZE accordingly.

The first argument is the GPU id and the second argument is the path postfix and the last argument is the miscellaneous arguments.

mIoU vs. Overall Accuracy

The official evaluation metric for ScanNet is mIoU. OA, Overal Accuracy is not the official metric since it is not discriminative. This is the convention from the 2D semantic segmentation as the pixelwise overall accuracy does not capture the fidelity of the semantic segmentation. On 3D ScanNet semantic segmentation, OA: 89.087 -> mIOU 71.496 mAP 76.127 mAcc 79.660 on the ScanNet validation set v2.

Then why is the overall accuracy least discriminative metric? This is due to the fact that most of the scenes consist of large structures such as walls, floors, or background and scores on these will dominate the statistics if you use Overall Accuracy.

Synthia 4D Experiment

Download the dataset from download
Extract

cd /path/to/extract/synthia4d
wget http://cvgl.stanford.edu/data2/Synthia4D.tar
tar -xf Synthia4D.tar
tar -xvjf *.tar.bz2

Training

export BATCH_SIZE=N; \
./scripts/train_synthia4d.sh 0 \
	"-default" \
	"--synthia_path /path/to/extract/synthia4d"

The above script trains a network. You have to change the arguments accordingly. The first argument to the script is the GPU id. Second argument is the log directory postfix; change to mark your experimental setup. The final argument is a series of the miscellaneous aruments. You have to specify the synthia directory here. Also, you have to wrap all arguments with " ".

Stanford 3D Dataset

Download the stanford 3d dataset from the website
Preprocess

Modify the input and output directory accordingly in

lib/datasets/preprocessing/stanford.py

And run

python -m lib.datasets.preprocessing.stanford

Train

./scripts/train_stanford.sh 0 \
	"-default" \
	"--stanford3d_path /PATH/TO/PREPROCESSED/STANFORD"

Model Zoo

Model	Dataset	Voxel Size	Conv1 Kernel Size	Performance	Link
Mink16UNet34C	ScanNet train + val	2cm	3	Test set 73.6% mIoU, no sliding window	download
Mink16UNet34C	ScanNet train	2cm	5	Val 72.219% mIoU, no rotation average, no sliding window per class performance	download
Mink16UNet18	Stanford Area5 train	5cm	5	Area 5 test 65.828% mIoU, no rotation average, no sliding window per class performance	download
Mink16UNet34	Stanford Area5 train	5cm	5	Area 5 test 66.348% mIoU, no rotation average, no sliding window per class performance	download
3D Mink16UNet14A	Synthia CVPR19 train	15cm	3	CVPR19 test 81.903% mIoU, no rotation average, no sliding window per class performance	download
3D Mink16UNet18	Synthia CVPR19 train	15cm	3	CVPR19 test 82.762% mIoU, no rotation average, no sliding window per class performance	download

Note that sliding window style evaluation (cropping and stitching results) used in many related works effectively works as an ensemble (rotation averaging) which boosts the performance.

Demo

The demo code will download weights and an example scene first and then visualize prediction results.

Dataset	Scannet	Stanford
Command	`python -m demo.scannet`	`python -m demo.stanford`
Result

Citing this work

If you use the Minkowski Engine, please cite:

4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks, CVPR'19, [pdf]

@inproceedings{choy20194d,
  title={4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks},
  author={Choy, Christopher and Gwak, JunYoung and Savarese, Silvio},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={3075--3084},
  year={2019}
}

Related projects

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 141

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (21) 🔗