All Projects → yinyunie → 3d Shape Analysis Paper List

yinyunie / 3d Shape Analysis Paper List

A list of recent papers, libraries and datasets about 3D shape/geometry analysis (by topics, updating).

Projects that are alternatives of or similar to 3d Shape Analysis Paper List

Awesome 3dreconstruction list
A curated list of papers & resources linked to 3D reconstruction from images.
Stars: ✭ 3,151 (+668.54%)
Mutual labels:  3d-reconstruction
Monoport
Volumetric Human Teleportation (SIGGRAPH 2020 Real-Time Live) Monocular Real-Time Volumetric Performance Capture(ECCV 2020)
Stars: ✭ 296 (-27.8%)
Mutual labels:  3d-reconstruction
Kimera Semantics
Real-Time 3D Semantic Reconstruction from 2D data
Stars: ✭ 368 (-10.24%)
Mutual labels:  3d-reconstruction
Dynamicfusion
Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
Stars: ✭ 267 (-34.88%)
Mutual labels:  3d-reconstruction
Surfelmeshing
Real-time surfel-based mesh reconstruction from RGB-D video.
Stars: ✭ 278 (-32.2%)
Mutual labels:  3d-reconstruction
Layoutnet
Torch implementation of our CVPR 18 paper: "LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image"
Stars: ✭ 334 (-18.54%)
Mutual labels:  3d-reconstruction
3PU pytorch
pytorch implementation of >>Patch-base progressive 3D Point Set Upsampling<<
Stars: ✭ 61 (-85.12%)
Mutual labels:  3d-reconstruction
Differentiable volumetric rendering
This repository contains the code for the CVPR 2020 paper "Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision"
Stars: ✭ 380 (-7.32%)
Mutual labels:  3d-reconstruction
Intrinsic3d
Intrinsic3D - High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting (ICCV 2017)
Stars: ✭ 297 (-27.56%)
Mutual labels:  3d-reconstruction
Semiglobalmatching
SGM,立体匹配最经典应用最广泛算法,4000+引用,兼顾效率和效果。完整实现,代码规范,注释清晰,博客教学,欢迎star!
Stars: ✭ 363 (-11.46%)
Mutual labels:  3d-reconstruction
Dagsfm
Distributed and Graph-based Structure from Motion
Stars: ✭ 269 (-34.39%)
Mutual labels:  3d-reconstruction
Convolutional occupancy networks
[ECCV'20] Convolutional Occupancy Networks
Stars: ✭ 276 (-32.68%)
Mutual labels:  3d-reconstruction
Openmvg
open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.
Stars: ✭ 3,902 (+851.71%)
Mutual labels:  3d-reconstruction
Ransac Flow
(ECCV 2020) RANSAC-Flow: generic two-stage image alignment
Stars: ✭ 265 (-35.37%)
Mutual labels:  3d-reconstruction
Curated List Of Awesome 3d Morphable Model Software And Data
The idea of this list is to collect shared data and algorithms around 3D Morphable Models. You are invited to contribute to this list by adding a pull request. The original list arised from the Dagstuhl seminar on 3D Morphable Models https://www.dagstuhl.de/19102 in March 2019.
Stars: ✭ 375 (-8.54%)
Mutual labels:  3d-reconstruction
instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
Stars: ✭ 1,863 (+354.39%)
Mutual labels:  3d-reconstruction
Semantic slam
Real time semantic slam in ROS with a hand held RGB-D camera
Stars: ✭ 317 (-22.68%)
Mutual labels:  3d-reconstruction
Awesome Holistic 3d
A list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision
Stars: ✭ 387 (-5.61%)
Mutual labels:  3d-reconstruction
Densebody pytorch
PyTorch implementation of CloudWalk's recent work DenseBody
Stars: ✭ 378 (-7.8%)
Mutual labels:  3d-reconstruction
Nerf pl
NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning
Stars: ✭ 362 (-11.71%)
Mutual labels:  3d-reconstruction

3D-Shape-Analysis-Paper-List

A list of papers, libraries and datasets I recently read is collected for anyone who shows interest at



Statistics: 🔥 code is available & stars >= 100  |  ⭐️ citation >= 50

3D Detection & Segmentation

  • [Arxiv] Pseudo-labeling for Scalable 3D Object Detection
  • [Arxiv] DPointNet: A Density-Oriented PointNet for 3D Object Detection in Point Clouds
  • [Arxiv] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection [pytorch]
  • [Arxiv] Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
  • [Arxiv] CubifAE-3D: Monocular Camera Space Cubification for Auto-Encoder based 3D Object Detection
  • [Arxiv] Self-Attention Based Context-Aware 3D Object Detection [pytorch]
  • [Arxiv] Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

Before 2021

  • [Arxiv] It’s All Around You: Range-Guided Cylindrical Network for 3D Object Detection
  • [Arxiv] 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection [Project]
  • [Arxiv] Demystifying Pseudo-LiDAR for Monocular 3D Object Detection
  • [3DV2020] PanoNet3D: Combining Semantic and Geometric Understanding for LiDAR Point Cloud Detection
  • [AAAI2021] PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection
  • [Arxiv] SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation
  • [Arxiv] 3D Object Detection with Pointformer
  • [WACV2021] CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection [pytorch]
  • [Arxiv] Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation [pytorch]
  • [Arxiv] Learning to Predict the 3D Layout of a Scene
  • [Arxiv] Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes [Project]
  • [Arxiv] DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution
  • [Arxiv] Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection in Autonomous Driving
  • [NeurIPS2020] Every View Counts: Cross-View Consistency in 3D Object Detection with Hybrid-Cylindrical-Spherical Voxelization
  • [NeurIPS2020] Group Contextual Encoding for 3D Point Clouds [pytorch]
  • [Arxiv] 3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations [Project]
  • [Arxiv] A Density-Aware PointRCNN for 3D Objection Detection in Point Clouds
  • [Arxiv] Monocular 3D Detection with Geometric Constraints Embedding and Semi-supervised Training
  • [ECCV2020] Reinforced Axial Refinement Network for Monocular 3D Object Detection
  • [Arxiv] RUHSNet: 3D Object Detection Using Lidar Data in Real Time [pytorch]
  • [IROS2020] 3D Multi-Object Tracking: A Baseline and New Evaluation Metrics [Project][Code]
  • [ECCV2020] Virtual Multi-view Fusion for 3D Semantic Segmentation
  • [ACMMM2020] Weakly Supervised 3D Object Detection from Point Clouds
  • [ECCV2020] Weakly Supervised 3D Object Detection from Lidar Point Cloud [pytorch]
  • [ECCV2020] Kinematic 3D Object Detection in Monocular Video
  • [IROS2020] Object-Aware Centroid Voting for Monocular 3D Object Detection
  • [ECCV2020] Pillar-based Object Detection for Autonomous Driving
  • [Arxiv] Local Grid Rendering Networks for 3D Object Detection in Point Clouds
  • [Arxiv] Learning to Detect 3D Objects from Point Clouds in Real Time
  • [Arxiv] SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds
  • [CVPR2020] PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
  • [CVPR2020] FroDO: From Detections to 3D Objects
  • [CVPR2020] Physically Realizable Adversarial Examples for LiDAR Object Detection
  • [CVPR2020] Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection
  • [CVPR2020] End-to-end 3D Point Cloud Instance Segmentation without Detection
  • [CVPR2020] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships
  • [CVPR2020] Structure Aware Single-stage 3D Object Detection from Point Cloud
  • [CVPR2020] Learning Depth-Guided Convolutions for Monocular 3D Object Detection [pytorch] 🔥
  • [CVPR2020] What You See is What You Get: Exploiting Visibility for 3D Object Detection
  • [CVPR2020] Density Based Clustering for 3D Object Detection in Point Clouds
  • [CVPR2020] Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
  • [CVPR2020] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
  • [CVPR2020] PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
  • [CVPR2020] MLCVNet: Multi-Level Context VoteNet for 3D Object Detection
  • [CVPR2020] PointPainting: Sequential Fusion for 3D Object Detection
  • [CVPR2020] Joint 3D Instance Segmentation and Object Detection for Autonomous Driving
  • [CVPR2020] Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud [tensorflow]
  • [CVPR2020] Joint 3D Instance Segmentation and Object Detection for Autonomous Driving
  • [CVPR2020] HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection
  • [CVPR2020] A Hierarchical Graph Network for 3D Object Detection on Point Clouds
  • [Arxiv] H3DNet: 3D Object Detection Using Hybrid Geometric Primitives
  • [CVPR2020] P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
  • [Arxiv] 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection
  • [CVPR2020] Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking
  • [CVPR2020] Learning to Evaluate Perception Models Using Planner-Centric Metrics
  • [CVPR2020] Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation [pytorch]
  • [Arxiv] SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds [github]
  • [CVPR2020] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection [github]
  • [Arxiv] Finding Your (3D) Center: 3D Object Detection Using a Learned Loss
  • [CVPR2020] PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
  • [CVPR2020] 3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segm
  • [CVPR2020] Fusion-Aware Point Convolution for Online Semantic 3D Scene Segmentation
  • [CVPR2020] OccuSeg: Occupancy-aware 3D Instance Segmentation
  • [CVPR2020] Learning to Segment 3D Point Clouds in 2D Image Space
  • [CVPR2020] Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud [tensorflow]
  • [AAAI2020] ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
  • [Arxiv] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships
  • [Arxiv] HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection
  • [Arxiv] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
  • [Arxiv] 3DSSD: Point-based 3D Single Stage Object Detector
  • [Arxiv] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation
  • [CVPR2020] ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
  • [Arxiv] A Review on Object Pose Recovery: from 3D Bounding Box Detectors to Full 6D Pose Estimators
  • [Arxiv] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
  • [Arxiv] Objects as Points [github] ⭐️🔥
  • [Arxiv] RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving [github]
  • [CVPR2020] DSGN: Deep Stereo Geometry Network for 3D Object Detection [github]
  • [Arxiv] Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation
  • [Arxiv] PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
  • [Arxiv] Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots
  • [CVPR2020] SESS: Self-Ensembling Semi-Supervised 3D Object Detection
  • [NeurIPS2019] PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
  • [NeurIPS2019] Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds
  • [ICCV2019] Deep Hough Voting for 3D Object Detection in Point Clouds
  • [AAAI2020] JSNet: Joint Instance and Semantic Segmentation of 3D Point Clouds
  • [ICCV2019] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [pytorch]
  • [ICCV2019] 3D Instance Segmentation via Multi-Task Metric Learning
  • [Arxiv] Single-Stage Monocular 3D Object Detection with Virtual Cameras
  • [Arxiv] Depth Completion via Deep Basis Fitting
  • [Arxiv] Relation Graph Network for 3D Object Detection in Point Clouds
  • [CVPR2019] 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans [pytorch] 🔥
  • [ICCV2019] Rescan: Inductive Instance Segmentation for Indoor RGBD Scans [C++]
  • [ICCV2019] Transferable Semi-Supervised 3D Object Detection From RGB-D Data
  • [ICCV2019] STD: Sparse-to-Dense 3D Object Detector for Point Cloud
  • [CVPR2019] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud [pytorch]
  • [Arxiv] Fast Point R-CNN
  • [Arxiv] Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection [pytorch] 🔥
  • [ECCV2018] 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation [pytorch] 🔥

Shape Representation

  • [Arxiv] DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes with Biharmonic Coordinates [Project]

Before 2021

  • [Arxiv] Point2Skeleton: Learning Skeletal Representations from Point Clouds [pytorch]
  • [Arxiv] ParaNet: Deep Regular Representation for 3D Point Clouds
  • [Arxiv] Geometric Adversarial Attacks and Defenses on 3D Point Clouds [tensorflow]
  • [Arxiv] Learning Category-level Shape Saliency via Deep Implicit Surface Networks
  • [Arxiv] pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
  • [Arxiv] Deep Implicit Templates for 3D Shape Representation
  • [NeurIPS2020] MetaSDF: Meta-learning Signed Distance Functions [Project]
  • [Arxiv] RISA-Net: Rotation-Invariant Structure-Aware Network for Fine-Grained 3D Shape Retrieval [tensorflow]
  • [Arxiv] Overfit Neural Networks as a Compact Shape Representation
  • [Arxiv] DSM-Net: Disentangled Structured Mesh Net for Controllable Generation of Fine Geometry [Project]
  • [Arxiv] PatchNets: Patch-Based Generalizable Deep Implicit 3D Shape Representations
  • [Arxiv] CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations
  • [Arxiv] ROCNET: RECURSIVE OCTREE NETWORK FOR EFFICIENT 3D DEEP REPRESENTATION
  • [ECCV2020] GeLaTO: Generative Latent Textured Objects [Project]
  • [ECCV2020] Ladybird: Quasi-Monte Carlo Sampling for Deep Implicit Field Based 3D Reconstruction with Symmetry
  • [Arxiv] Neural Sparse Voxel Fields
  • [CVPR2020] StructEdit: Learning Structural Shape Variations [github]
  • [Arxiv] PAI-GCN: Permutable Anisotropic Graph Convolutional Networks for 3D Shape Representation Learning [github]
  • [CVPR2020] Learning Generative Models of Shape Handles [Project page]
  • [CVPR2020] DualSDF: Semantic Shape Manipulation using a Two-Level Representation [github]
  • [CVPR2020] Learning Unsupervised Hierarchical Part Decomposition of 3D Objects from a Single RGB Image [pytorch]
  • [NeurIPS2019] Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations [pytorch]
  • [Arxiv] Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions
  • [Arxiv] Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds
  • [Arxiv] Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction
  • [Arxiv] SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates
  • [CVPR2020] D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features
  • [Arxiv] Implicit Geometric Regularization for Learning Shapes
  • [Arxiv] Analytic Marching: An Analytic Meshing Solution from Deep Implicit Surface Networks
  • [Arxiv] Adversarial Generation of Continuous Implicit Shape Representations [pytorch]
  • [Arxiv] A Novel Tree-structured Point Cloud Dataset For Skeletonization Algorithm Evaluation [dataset]
  • [CVPRW2019] SkelNetOn 2019: Dataset and Challenge on Deep Learning for Geometric Shape Understanding [project]
  • [Arxiv] Skeleton Extraction from 3D Point Clouds by Decomposing the Object into Parts
  • [Arxiv] InSphereNet: a Concise Representation and Classification Method for 3D Object
  • [Arxiv] Deep Structured Implicit Functions
  • [CVIU] 3D articulated skeleton extraction using a single consumer-grade depth camera
  • [ICLR2019] Point Cloud GAN [tensorflow]
  • [ICCV2019] Learning Shape Templates with Structured Implicit Functions
  • [ICCV2019] 3D Point Cloud Generative Adversarial Network Based on Tree Structured Graph Convolutions [pytorch]
  • [ICCV2019] Implicit Surface Representations as Layers in Neural Networks
  • [CVPR2019] DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation [pytorch] 🔥 ⭐️
  • [SIGGRAPH2019] StructureNet: Hierarchical Graph Networks for 3D Shape Generation [pytorch]
  • [SIGGRAPH Asia2019] LOGAN: Unpaired Shape Transform in Latent Overcomplete Space [tensorflow]
  • [TOG] Voxel Cores: Efficient, robust, and provably good approximation of 3D medial axes
  • [SIGGRAPH2018] P2P-NET: Bidirectional Point Displacement Net for Shape Transform [tensorflow]
  • [ICML2018] Learning Representations and Generative Models for 3D Point Clouds [tensorflow] 🔥⭐️
  • [NeurIPS2018] Discovery of Latent 3D Keypoints via End-to-end Geometric Reasoning [tensorflow][project page]⭐️🔥
  • [AAAI2018] Unsupervised Articulated Skeleton Extraction from Point Set Sequences Captured by a Single Depth Camera
  • [3DV2018] Parsing Geometry Using Structure-Aware Shape Templates
  • [SIGGRAPH2017] GRASS: Generative Recursive Autoencoders for Shape Structures [pytorch] 🔥
  • [TOG] Erosion Thickness on Medial Axes of 3D Shapes
  • [Vis Comput] Distance field guided L1-median skeleton extraction
  • [CGF] Contracting Medial Surfaces Isotropically for Fast Extraction of Centred Curve Skeletons
  • [CGF] Improved Use of LOP for Curve Skeleton Extraction
  • [SIGGRAPH Asia2015] Deep Points Consolidation [C++ & Qt]
  • [SIGGRAPH2015] Burning The Medial Axis
  • [SIGGRAPH2009] Curve Skeleton Extraction from Incomplete Point Cloud [matlab] ⭐️
  • [TOG] SDM-NET: deep generative network for structured deformable mesh
  • [TOG] Robust and Accurate Skeletal Rigging from Mesh Sequences 🔥
  • [TOG] L1-medial skeleton of point cloud [C++] 🔥
  • [EUROGRAPHICS2016] 3D Skeletons: A State-of-the-Art Report 🔥
  • [SGP2012] Mean Curvature Skeletons [C++] 🔥
  • [SMIC2010] Point Cloud Skeletons via Laplacian-Based Contraction [Matlab] 🔥

Shape & Scene Completion

  • [CVPR2021] Style-based Point Generator with Adversarial Rendering for Point Cloud Completion

Before 2021

  • [Arxiv] PMP-Net: Point Cloud Completion by Learning Multi-step Point Moving Paths
  • [Arxiv] S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds
  • [Arxiv] Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data
  • [Arxiv] Learning-based 3D Occupancy Prediction for Autonomous Navigation in Occluded Environments
  • [Arxiv] PMP-Net: Point Cloud Completion by Learning Multi-step Point Moving Paths
  • [3DV2020] SCFusion: Real-time Incremental Scene Reconstruction with Semantic Completion
  • [Arxiv] Refinement of Predicted Missing Parts Enhance Point Cloud Completion [pytorch]
  • [Arxiv] Unsupervised Partial Point Set Registration via Joint Shape Completion and Registration
  • [Arxiv] LMSCNet: Lightweight Multiscale 3D Semantic Completion [Demo]
  • [ECCV2020] SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification
  • [ECCV2020] Weakly-supervised 3D Shape Completion in the Wild
  • [Arxiv] Point Cloud Completion by Learning Shape Priors
  • [Arxiv] KAPLAN: A 3D Point Descriptor for Shape Completion
  • [Arxiv] VPC-Net: Completion of 3D Vehicles from MLS Point Clouds
  • [Arxiv] SPSG: Self-Supervised Photometric Scene Generation from RGB-D Scans
  • [Arxiv] GRNet: Gridding Residual Network for Dense Point Cloud Completion
  • [Arxiv] Deep Octree-based CNNs with Output-Guided Skip Connections for 3D Shape and Scene Completion
  • [CVPR2020] Point Cloud Completion by Skip-attention Network with Hierarchical Folding
  • [CVPR2020] Cascaded Refinement Network for Point Cloud Completion [github]
  • [CVPR2020] Anisotropic Convolutional Networks for 3D Semantic Scene Completion [github]
  • [AAAI2020] Attention-based Multi-modal Fusion Network for Semantic Scene Completion
  • [CVPR2020] 3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior [github]
  • [ECCV2020] Multimodal Shape Completion via Conditional Generative Adversarial Networks [pytorch]
  • [CVPR2020] RevealNet: Seeing Behind Objects in RGB-D Scans
  • [CVPR2020] Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
  • [CVPR2020] PF-Net: Point Fractal Network for 3D Point Cloud Completion
  • [Arxiv] 3D Gated Recurrent Fusion for Semantic Scene Completion
  • [ICCVW2019] EdgeConnect: Structure Guided Image Inpainting using Edge Prediction [pytorch] 🔥⭐️
  • [ICRA2020] Depth Based Semantic Scene Completion with Position Importance Aware Loss
  • [CVPR2020] SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans
  • [Arxiv] PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes
  • [ICLR2020] Unpaired Point Cloud Completion on Real Scans using Adversarial Training [tensorflow]
  • [AAAI2020] Morphing and Sampling Network for Dense Point Cloud Completion [pytorch]
  • [ICCVW2019] Render4Completion: Synthesizing Multi-View Depth Maps for 3D Shape Completion
  • [ICCV2019] ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image [tensorflow]
  • [ICCV2019] Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion [Caffe3D]
  • [ICCV2019] Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds from Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction
  • [Arxiv] EdgeNet: Semantic Scene Completion from RGB-D images
  • [CVPR2019] TopNet: Structural Point Cloud Decoder [pytorch & tensorflow]
  • [CVPR2019] Deep Reinforcement Learning of Volume-guided Progressive View Inpainting for 3D Point Scene Completion from a Single Depth Image
  • [CVPR2019] Leveraging Shape Completion for 3D Siamese Tracking [pytorch]
  • [CVPR2019] RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion [pytorch]
  • [3DV2018] PCN: Point Completion Network [tensorflow] 🔥
  • [ECCV2018] Efficient Semantic Scene Completion Network with Spatial Group Convolution [pytorch]
  • [CVPR2018] ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans [tensorflow] 🔥⭐️
  • [CVPR2018] Learning 3D Shape Completion from Laser Scan Data with Weak Supervision [torch][torch]
  • [IJCV2018] Learning 3D Shape Completion under Weak Supervision [torch][torch]
  • [ICCV2017] High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference ⭐️
  • [ICCV2017] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [torch] 🔥⭐️
  • [CVPR2017] Semantic Scene Completion from a Single Depth Image [caffe] 🔥⭐️
  • [CVPR2016] Structured Prediction of Unobserved Voxels From a Single Depth Image [resource] ⭐️

Shape Reconstruction

  • [Arxiv] PC2WF: 3D WIREFRAME RECONSTRUCTION FROM RAW POINT CLOUDS
  • [CVPR2021] Diffusion Probabilistic Models for 3D Point Cloud Generation [Project]
  • [Arxiv] ShaRF: Shape-conditioned Radiance Fields from a Single View [Project]
  • [Arxiv] Shelf-Supervised Mesh Prediction in the Wild
  • [Arxiv] HyperPocket: Generative Point Cloud Completion
  • [Arxiv] Im2Vec: Synthesizing Vector Graphics without Vector Supervision [resource]
  • [Arxiv] Secrets of 3D Implicit Object Shape Reconstruction in the Wild
  • [Arxiv] Joint Learning of 3D Shape Retrieval and Deformation
  • [Arxiv] Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes

Before 2021

  • [Arxiv] Learning Delaunay Surface Elements for Mesh Reconstruction
  • [Arxiv] Compositionally Generalizable 3D Structure Prediction
  • [Arxiv] Online Adaptation for Consistent Mesh Reconstruction in the Wild
  • [Arxiv] Sign-Agnostic Implicit Learning of Surface Self-Similarities for Shape Modeling and Reconstruction from Raw Point Clouds
  • [Arxiv] Deep Optimized Priors for 3D Shape Modeling and Reconstruction
  • [Arxiv] DO 2D GANS KNOW 3D SHAPE? UNSUPERVISED 3D SHAPE RECONSTRUCTION FROM 2D IMAGE GANS [Project]
  • [Arxiv] DUDE: Deep Unsigned Distance Embeddings for Hi-Fidelity Representation of Complex 3D Surfaces
  • [3DV2020] Learning to Infer Semantic Parameters for 3D Shape Editing [Project]
  • [3DV2020] Cycle-Consistent Generative Rendering for 2D-3D Modality Translation [Project]
  • [3DV2020] A Divide et Impera Approach for 3D Shape Reconstruction from Multiple Views
  • [Arxiv] A Closed-Form Solution to Local Non-Rigid Structure-from-Motion
  • [Arxiv] Deformed Implicit Field: Modeling 3D Shapes with Learned Dense Correspondence
  • [Arxiv] D-NeRF: Neural Radiance Fields for Dynamic Scenes
  • [Arxiv] Modular Primitives for High-Performance Differentiable Rendering
  • [Arxiv] NeuralFusion: Online Depth Fusion in Latent Space
  • [Arxiv] Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video [Project]
  • [NeurIPS2020] Continuous Object Representation Networks: Novel View Synthesis without Target View Supervision [Project]
  • [NeurIPS2020] SDF-SRN: Learning Signed Distance 3D Object Reconstruction from Static Images [Project]
  • [NeurIPS2020] Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance [Project]
  • [NeurIPS2020] Convolutional Generation of Textured 3D Meshes [Project]
  • [Arxiv] Vid2CAD: CAD Model Alignment using Multi-View Constraints from Videos
  • [NeurIPS2020] UCLID-Net: Single View Reconstruction in Objec Space [Project]
  • [NeurIPS2020] CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations [Project]
  • [NeurIPS2020] Generative 3D Part Assembly via Dynamic Graph Learning [pytorch]
  • [NeurIPS2020] Learning Deformable Tetrahedral Meshes for 3D Reconstruction [Project]
  • [NeurIPS2020] SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds [pytorch]
  • [Arxiv] Training Data Generating Networks: Linking 3D Shapes and Few-Shot Classification
  • [Arxiv] MESHMVS: MULTI-VIEW STEREO GUIDED MESH RECONSTRUCTION
  • [Arxiv] Learning Occupancy Function from Point Clouds for Surface Reconstruction
  • [NeurIPS2020] SDF-SRN: Learning Signed Distance 3D Object Reconstruction from Static Images [Project]
  • [Arxiv] GRF: Learning a General Radiance Field for 3D Scene Representation and Rendering [github]
  • [3DV2020] A Progressive Conditional Generative Adversarial Network for Generating Dense and Colored 3D Point Clouds
  • [3DV2020] Better Patch Stitching for Parametric Surface Reconstruction
  • [NeurIPS2020] Skeleton-bridged Point Completion: From Global Inference to Local Adjustment [Project Page]
  • [Arxiv] NeRF++: Analyzing and Improving Neural Radiance Fields [pytorch]
  • [Arxiv] Improved Modeling of 3D Shapes with Multi-view Depth Maps
  • [SIGGRAPH2020] One Shot 3D Photography [Project]
  • [BMVC2020] Large Scale Photometric Bundle Adjustment
  • [ECCV2020] Interactive Annotation of 3D Object Geometry using 2D Scribbles [Project]
  • [BMVC2020] Visibility-aware Multi-view Stereo Network
  • [ECCV2020] Pix2Surf: Learning Parametric 3D Surface Models of Objects from Images
  • [ECCV2020] 3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View [Project][Pytorch]
  • [BMVC2020] 3D-GMNet: Single-View 3D Shape Recovery as A Gaussian Mixture
  • [SIGGRAPH2020] Self-Sampling for Neural Point Cloud Consolidation
  • [ECCV2020] Stochastic Bundle Adjustment for Efficient and Scalable 3D Reconstruction [github]
  • [Arxiv] NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections [Project]
  • [Arxiv] MeshODE: A Robust and Scalable Framework for Mesh Deformation
  • [Arxiv] MRGAN: Multi-Rooted 3D Shape Generation with Unsupervised Part Disentanglement
  • [ECCV2020] Meshing Point Clouds with Predicted Intrinsic-Extrinsic Ratio Guidance [pytorch]
  • [ECCV2020] Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop
  • [ECCV2020] Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking
  • [ECCV2020] Shape and Viewpoint without Keypoints
  • [Arxiv] Object-Centric Multi-View Aggregation
  • [ECCV2020] Points2Surf Learning Implicit Surfaces from Point Clouds
  • [NeurIPS2020] Neural Mesh Flow: 3D Manifold Mesh Generation via Diffeomorphic Flows [Project]
  • [Arxiv] Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images
  • [Arxiv] Neural Non-Rigid Tracking
  • [NeurIPS2020] MeshSDF: Differentiable Iso-Surface Extraction
  • [Arxiv] 3D Reconstruction of Novel Object Shapes from Single Images
  • [NeurIPS2020] ShapeFlow: Learnable Deformations Among 3D Shapes [pytorch]
  • [Arxiv] 3D Shape Reconstruction from Free-Hand Sketches
  • [Arxiv] NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
  • [Arxiv] Convolutional Occupancy Networks
  • [Siggraph2020] Point2Mesh: A Self-Prior for Deformable Meshes
  • [Arxiv] PointTriNet: Learned Triangulation of 3D Point
  • [Arxiv] A Simple and Scalable Shape Representation for 3D Reconstruction
  • [Siggraph2020] Vid2Curve: Simultaneously Camera Motion Estimation and Thin Structure Reconstruction from an RGB Video
  • [CVPR2020] From Image Collections to Point Clouds with Self-supervised Shape and Pose Networks [tensorflow]
  • [CVPR2020] Through the Looking Glass: Neural 3D Reconstruction of Transparent Shapes [github]
  • [Arxiv] PolyGen: An Autoregressive Generative Model of 3D Meshes
  • [Arxiv] Combinatorial 3D Shape Generation via Sequential Assembly
  • [Arxiv] Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors
  • [Arxiv] Neural Object Descriptors for Multi-View Shape Reconstruction
  • [CVPR2020] SPARE3D: A Dataset for SPAtial REasoning on Three-View Line Drawings [pytorch]
  • [Arxiv] Modeling 3D Shapes by Reinforcement Learning
  • [ECCV2020] ParSeNet: A Parametric Surface Fitting Network for 3D Point Clouds [pytorch]
  • [Arxiv] Self-Supervised 2D Image to 3D Shape Translation with Disentangled Representations
  • [Arxiv] Universal Differentiable Renderer for Implicit Neural Representations
  • [Arxiv] Learning 3D Part Assembly from a Single Image
  • [Arxiv] Curriculum DeepSDF
  • [Arxiv] PT2PC: Learning to Generate 3D Point Cloud Shapes from Part Tree Conditions
  • [Arxiv] Self-supervised Single-view 3D Reconstruction via Semantic Consistency
  • [Arxiv] Meta3D: Single-View 3D Object Reconstruction from Shape Priors in Memory
  • [Arxiv] STD-Net: Structure-preserving and Topology-adaptive Deformation Network for 3D Reconstruction from a Single Image
  • [Arxiv] Curvature Regularized Surface Reconstruction from Point Cloud
  • [Arxiv] Hypernetwork approach to generating point clouds
  • [Arxiv] Inverse Graphics GAN: Learning to Generate 3D Shapes from Unstructured 2D Data
  • [Arxiv] Meshlet Priors for 3D Mesh Reconstruction
  • [Arxiv] Front2Back: Single View 3D Shape Reconstruction via Front to Back Prediction
  • [Arxiv] SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization
  • [CVPR2019] Occupancy Networks: Learning 3D Reconstruction in Function Space [pytorch] 🔥⭐️
  • [NeurIPS2019] DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction [tensorflow]
  • [NeurIPS2019] Learning to Infer Implicit Surfaces without 3D Supervision
  • [CVPR2019] A Skeleton-bridged Deep Learning Approach for Generating Meshes of Complex Topologies from Single RGB Images [pytorch & tensorflow]
  • [Arxiv] Deep Level Sets: Implicit Surface Representations for 3D Shape Inference
  • [CVPR2019] Learning Implicit Fields for Generative Shape Modeling [tensorflow] 🔥
  • [ICCV2019] Point-based Multi-view Stereo Network [pytorch] ⭐️
  • [Arxiv] TSRNet: Scalable 3D Surface Reconstruction Network for Point Clouds using Tangent Convolution
  • [Arxiv] DR-KFD: A Differentiable Visual Metric for 3D Shape Reconstruction
  • [ICCV2019] GraphX-Convolution for Point Cloud Deformation in 2D-to-3D Conversion
  • [ICCV2019] Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation [pytorch]
  • [ICCV2019] Few-Shot Generalization for Single-Image 3D Reconstruction via Priors
  • [ICCV2019] Deep Mesh Reconstruction from Single RGB Images via Topology Modification Networks
  • [AAAI2018] Learning Efficient Point Cloud Generation for Dense 3D Object Reconstruction [tensorflow] ⭐️🔥
  • [NeurIPS2017] MarrNet: 3D Shape Reconstruction via 2.5D Sketches [torch]⭐️🔥

3D Scene Understanding

  • [Arxiv] House-GAN++: Generative Adversarial Layout Refinement Networks
  • [Arxiv] Weakly Supervised Learning of Rigid 3D Scene Flow
  • [ICLR2021] End-to-End Egospheric Spatial Memory
  • [Arxiv] Single-Shot Cuboids: Geodesics-based End-to-end Manhattan Aligned Layout Estimation from Spherical Panoramas [Project]
  • [Arxiv] A modular vision language navigation and manipulation framework for long horizon compositional tasks in indoor environment
  • [Arxiv] Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes
  • [Arxiv] Where2Act: From Pixels to Actions for Articulated 3D Objects [Project]

Before 2021

  • [Arxiv] AI2-THOR: An Interactive 3D Environment for Visual AI [Project]
  • [Arxiv] Audio-Visual Floorplan Reconstruction
  • [Arxiv] PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds
  • [Arxiv] RAFT-3D: Scene Flow using Rigid-Motion Embeddings
  • [Arxiv] GenScan: A Generative Method for Populating Parametric 3D Scan Datasets
  • [Arxiv] LayoutGMN: Neural Graph Matching for Structural Layout Similarity
  • [Arxiv] Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences
  • [Arxiv] P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding
  • [Arxiv] Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net
  • [Arxiv] Localising In Complex Scenes Using Balanced Adversarial Adaptation
  • [Arxiv] Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
  • [NeurIPS2020] Multi-Plane Program Induction with 3D Box Priors [Project]
  • [Arxiv] HoHoNet: 360 Indoor Holistic Understanding with Latent Horizontal Features
  • [Arxiv] Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts
  • [Arxiv] Generative Layout Modeling using Constraint Graphs
  • [NeurIPS2020] Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D [pytorch]
  • [NeurIPS2020] Learning Affordance Landscapes for Interaction Exploration in 3D Environments [Project]
  • [NeurIPS2020W] Unsupervised Domain Adaptation for Visual Navigation
  • [Arxiv] Embodied Visual Navigation with Automatic Curriculum Learningin Real Environments
  • [Arxiv] 3D Room Layout Estimation Beyond the Manhattan World Assumption
  • [Arxiv] OpenBot: Turning Smartphones into Robots [Project]
  • [Arxiv] Audio-Visual Waypoints for Navigation
  • [Arxiv] Learning Affordance Landscapes for Interaction Exploration in 3D Environments [Project]
  • [ECCV2020] Occupancy Anticipation for Efficient Exploration and Navigation [Project]
  • [Arxiv] Retargetable AR: Context-aware Augmented Reality in Indoor Scenes based on 3D Scene Graph
  • [Arxiv] Generating Person-Scene Interactions in 3D Scenes
  • [Arxiv] GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes
  • [ECCV2020] ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes
  • [Arxiv] Structural Plan of Indoor Scenes with Personalized Preferences
  • [Arxiv] HoliCity: A City-Scale Data Platform for Learning Holistic 3D Structures [Project]
  • [CVPR2020] End-to-End Optimization of Scene Layout [Project]
  • [Arxiv] Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
  • [CVPR2020] Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
  • [Arxiv] LayoutMP3D: Layout Annotation of Matterport3D
  • [CVPR2020] Local Implicit Grid Representations for 3D Scenes
  • [Arxiv] Scan2Plan: Efficient Floorplan Generation from 3D Scans of Indoor Scenes
  • [CVPR2020] RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds [tensorflow] 🔥
  • [CVPR2020] Intelligent Home 3D: Automatic 3D-House Design from Linguistic Descriptions Only
  • [ICRA2020] 3DCFS: Fast and Robust Joint 3D Semantic-Instance Segmentation via Coupled Feature Selection
  • [Arxiv] Indoor Scene Recognition in 3D
  • [Journal] Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense
  • [Arxiv] BlockGAN Learning 3D Object-aware Scene Representations from Unlabelled Images
  • [Arxiv] 3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans [Project] Related: [Arxiv] [Arxiv]
  • [ICCV2019] U4D: Unsupervised 4D Dynamic Scene Understanding
  • [ICCV2019] UprightNet: Geometry-Aware Camera Orientation Estimation from Single Images
  • [ICCV2019] Habitat: A Platform for Embodied AI Research [habitat-api] [habitat-sim] ⭐️
  • [ICCV2019] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences [project page] ⭐️
  • [ICCV2019] Neural Inverse Rendering of an Indoor Scene From a Single Image
  • [ICCV2019] SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation [pytorch]
  • [ICCV2019] RIO: 3D Object Instance Re-Localization in Changing Indoor Environments [dataset]
  • [ICCV2019] CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization
  • [ICCV2019] U4D: Unsupervised 4D Dynamic Scene Understanding
  • [NeurIPS2018] Learning to Exploit Stability for 3D Scene Parsing

3D Scene Reconstruction

  • [CVPR2021] RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction
  • [Arxiv] DONeRF: Towards Real-Time Rendering of Neural Radiance Fields using Depth Oracle Networks [Project]
  • [Arxiv] IBRNet: Learning Multi-View Image-Based Rendering [Project]
  • [Arxiv] NeRF--: Neural Radiance Fields Without Known Camera Parameters [Project]
  • [Arxiv] STaR: Self-supervised Tracking and Reconstruction of Rigid Objects in Motion with Neural Rendering [Project]

Before 2021

  • [Arxiv] pixelNeRF: Neural Radiance Fields from One or Few Images [Project]
  • [Arxiv] NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis [Project]
  • [Arxiv] MO-LTR: Multiple Object Localization, Tracking and Reconstruction from Monocular RGB Videos
  • [Arxiv] DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors
  • [Arxiv] Neural Radiance Flow for 4D View Synthesis and Video Processing [Project]
  • [3DV2020] Scene Flow from Point Clouds with or without Learning
  • [Arxiv] Stable View Synthesis
  • [Arxiv] Neural Scene Graphs for Dynamic Scenes
  • [3DV2020] RidgeSfM: Structure from Motion via Robust Pairwise Matching Under Depth Uncertainty [pytorch]
  • [Arxiv] FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation
  • [Arxiv] MoNet: Motion-based Point Cloud Prediction Network
  • [Arxiv] MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera
  • [Arxiv] Efficient Initial Pose-graph Generation for Global SfM
  • [Arxiv] Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes [Project]
  • [Arxiv] Deformable Neural Radiance Fields [Project]
  • [Arxiv] DeRF: Decomposed Radiance Fields
  • [Arxiv] RGBD-Net: Predicting color and depth images for novel views synthesis
  • [Arxiv] SSCNav: Confidence-Aware Semantic Scene Completion for Visual Semantic Navigation [Project]
  • [Arxiv] From Points to Multi-Object 3D Reconstruction
  • [Arxiv] Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image [Project]
  • [Arxiv] SceneFormer: Indoor Scene Generation with Transformers
  • [NeurIPS2020] Neural Sparse Voxel Fields [Project]
  • [Arxiv] Towards Part-Based Understanding of RGB-D Scans
  • [Arxiv] Dynamic Plane Convolutional Occupancy Networks
  • [NeurIPS2020] Neural Unsigned Distance Fields for Implicit Function Learning [Project]
  • [Arxiv] Holistic static and animated 3D scene generation from diverse text descriptions [pytorch]
  • [Arxiv] Semi-Supervised Learning of Multi-Object 3D Scene Representations
  • [ECCV2020] CAD-Deform: Deformable Fitting of CAD Models to 3D Scans
  • [ECCV2020] Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve
  • [ECCV2020] Learnable Cost Volume Using the Cayley Representation
  • [ECCV2020] Topology-Change-Aware Volumetric Fusion for Dynamic Scene Reconstruction
  • [ECCV2020] Convolutional Occupancy Networks
  • [CVPR2020] MARMVS: Matching Ambiguity Reduced Multiple View Stereo for Efficient Large Scale Scene Reconstruction
  • [ECCV2020] CoReNet: Coherent 3D scene reconstruction from a single RGB image
  • [CVPR2020] DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes
  • [ECCV2020] SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans
  • [Arxiv] Removing Dynamic Objects for Static Scene Reconstruction using Light Fields
  • [Arxiv] Atlas: End-to-End 3D Scene Reconstruction from Posed Images
  • [Arxiv] Scan2Plan: Efficient Floorplan Generation from 3D Scans of Indoor Scenes
  • [Arxiv] Plane Pair Matching for Efficient 3D View Registration
  • [CVPR2020] Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image [pytorch]
  • [Arxiv] Indoor Layout Estimation by 2D LiDAR and Camera Fusion
  • [Arxiv] General 3D Room Layout from a Single View by Render-and-Compare
  • [ICCV2019] Learning to Reconstruct 3D Manhattan Wireframes from a Single Image
  • [CVPR2019] PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image [pytorch]🔥
  • [ICCV2019] 3D Scene Reconstruction with Multi-layer Depth and Epipolar Transformers
  • [ICCV Workshop2019] Silhouette-Assisted 3D Object Instance Reconstruction from a Cluttered Scene
  • [ICCV2019] 3D-RelNet: Joint Object and Relation Network for 3D prediction [pytorch]
  • [3DV2019] Pano Popups: Indoor 3D Reconstruction with a Plane-Aware Network
  • [CVPR2018] Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene [pytorch]
  • [IROS2017] Indoor Scan2BIM: Building Information Models of House Interiors
  • [CVPR2017] 3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions [github]

About Human Body

  • [Arxiv] Single-Shot Motion Completion with Transformer [Project]
  • [EG2021] Walk2Map: Extracting Floor Plans from Indoor Walk Trajectories
  • [Arxiv] Forecasting Characteristic 3D Poses of Human Actions
  • [Arxiv] Capturing Detailed Deformations of Moving Human Bodies
  • [Arxiv] A-NeRF: Surface-free Human 3D Pose Refinement via Neural Rendering [Project]
  • [Arxiv] Learn to Dance with AIST++: Music Conditioned 3D Dance Generation [Project]
  • [Arxiv] S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling
  • [Arxiv] PandaNet : Anchor-Based Single-Shot Multi-Person 3D Pose Estimation
  • [Arxiv] Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans [Project]
  • [Arxiv] Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory
  • [3DV2020] PLACE: Proximity Learning of Articulation and Contact in 3D Environments [Project]
  • [ICCV2019] Resolving 3D Human Pose Ambiguities with 3D Scene Constraints [Project]

Before 2021

  • [Arxiv] Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes [Project]
  • [Arxiv] End-to-End Human Pose and Mesh Reconstruction with Transformers
  • [Arxiv] Human Mesh Recovery from Multiple Shots [Project]
  • [NeurIPS2020] 3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data [Project]
  • [Arxiv] Holistic 3D Human and Scene Mesh Estimation from Single View Images
  • [Arxiv] Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video
  • [Arxiv] Pose2Pose: 3D Positional Pose-Guided 3D Rotational Pose Prediction for Expressive 3D Human Pose and Mesh Estimation
  • [Arxiv] NeuralAnnot: Neural Annotator for in-the-wild Expressive 3D Human Pose and Mesh Training Sets
  • [Arxiv] 4D Human Body Capture from Egocentric Video via 3D Scene Grounding [Project]
  • [Arxiv] Populating 3D Scenes by Learning Human-Scene Interaction [Project]
  • [ECCV2020] Long-term Human Motion Prediction with Scene Context [Project]
  • [Arxiv] Vid2Actor: Free-viewpoint Animatable Person Synthesis from Video in the Wild [Project]
  • [Arxiv] ANR: Articulated Neural Rendering for Virtual Avatars
  • [Arxiv] Generating 3D People in Scenes without People [Project]
  • [ICCV2019] Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
  • [CVPR2019] Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments [Project]
  • [TOG2016] Pigraphs: learning interaction snapshots from observations [Project]

General Methods

  • [Arxiv] High-Performance Large-Scale Image Recognition Without Normalization
  • [Arxiv] Generative Models as Distributions of Functions
  • [Arxiv] Point-set Distances for Learning Representations of 3D Point Clouds
  • [Arxiv] Compressed Object Detection
  • [Arxiv] A linearized framework and a new benchmark for model selection for fine-tuning
  • [Arxiv] The Devils in the Point Clouds: Studying the Robustness of Point Cloud Convolutions
  • [Arxiv] Self-Supervised Pretraining of 3D Features on any Point-Cloud [pytorch]
  • [3DV2020] Learning Rotation-Invariant Representations of Point Clouds Using Aligned Edge Convolutional Neural Networks

Before 2021

  • [Arxiv] Diffusion is All You Need for Learning on Surfaces
  • [Arxiv] SPU-Net: Self-Supervised Point Cloud Upsampling by Coarse-to-Fine Reconstruction with Self-Projection Optimization
  • [3DV2020] Rotation-Invariant Point Convolution With Multiple Equivariant Alignments
  • [Arxiv] One Point is All You Need: Directional Attention Point for Feature Learning
  • [Arxiv] PCT: Point Cloud Transformer
  • [Arxiv] Hausdorff Point Convolution with Geometric Priors
  • [Arxiv] MARNet: Multi-Abstraction Refinement Network for 3D Point Cloud Analysis [Github]
  • [Arxiv] Point Transformer
  • [Arxiv] Learning geometry-image representation for 3D point cloud generation
  • [Arxiv] Deeper or Wider Networks of Point Clouds with Self-attention?
  • [NeurIPS2020] Primal-Dual Mesh Convolutional Neural Networks [pytorch]
  • [NeurIPS2020] Rational neural networks [tensorflow]
  • [NeurIPS2020] Exchangeable Neural ODE for Set Modeling [Project]
  • [NeurIPS2020] SE(3)-Transformers: 3D Roto-Translation Equivariant Attention Networks [Project]
  • [NeurIPS2020] NVAE: A Deep Hierarchical Variational Autoencoder [pytorch]
  • [NeurIPS2020] Implicit Graph Neural Networks [pytorch]
  • [NeurIPS2020] The Autoencoding Variational Autoencoder [pytorch]
  • [Arxiv] PointManifold: Using Manifold Learning for Point Cloud Classification
  • [Arxiv] RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoder
  • [Arxiv] Pre-Training by Completing Point Clouds [pytorch]
  • [NeurIPS2020] Rotation-Invariant Local-to-Global Representation Learning for 3D Point Cloud
  • [Arxiv] IF-Defense: 3D Adversarial Point Cloud Defense via Implicit Function based Restoration [pytorch]
  • [Arxiv] DV-ConvNet: Fully Convolutional Deep Learning on Point Clouds with Dynamic Voxelization and 3D Group Convolution
  • [Arxiv] Spatial Transformer Point Convolution
  • [Arxiv] Minimal Adversarial Examples for Deep Learning on 3D Point Clouds
  • [BMVC2020] Black Magic in Deep Learning: How Human Skill Impacts Network Training
  • [ECCV2020] PointMixup: Augmentation for Point Clouds [Code]
  • [ECCV2020] DR-KFS: A Differentiable Visual Similarity Metric for 3D Shape Reconstruction
  • [Arxiv] Unsupervised 3D Learning for Shape Analysis via Multiresolution Instance Discrimination
  • [Arxiv] Global Context Aware Convolutions for 3D Point Cloud Understanding
  • [ECCV2020] Shape Adaptor: A Learnable Resizing Module [pytorch]
  • [ACMMM2020] Differentiable Manifold Reconstruction for Point Cloud Denoising [pytorch]
  • [ECCV2020] Discrete Point Flow Networks for Efficient Point Cloud Generation
  • [Siggraph2020] Neural Subdivision
  • [Arxiv] PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding
  • [Arxiv] Accelerating 3D Deep Learning with PyTorch3D
  • [Arxiv] Natural Graph Networks
  • [ECCV2020] Progressive Point Cloud Deconvolution Generation Network [github]
  • [Arxiv] Point Set Voting for Partial Point Cloud Analysis
  • [Arxiv] PointMask: Towards Interpretable and Bias-Resilient Point Cloud Processing
  • [Arxiv] Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels
  • [Arxiv] A Closer Look at Local Aggregation Operators in Point Cloud Analysis [github]
  • [NeurIPS2020] Implicit Neural Representations with Periodic Activation Functions [pytorch] 🔥
  • [Arxiv] Rethinking Sampling in 3D Point Cloud Generative Adversarial Networks
  • [Arxiv] Local-Area-Learning Network: Meaningful Local Areas for Efficient Point Cloud Analysis
  • [Arxiv] TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations
  • [Arxiv] Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels
  • [Arxiv] Rethinking Sampling in 3D Point Cloud Generative Adversarial Networks
  • [Arxiv] MeshWalker: Deep Mesh Understanding by Random Walks
  • [Arxiv] MOPS-Net: A Matrix Optimization-driven Network for Task-Oriented 3D Point Cloud Downsampling
  • [Arxiv] DPDist : Comparing Point Clouds Using Deep Point Cloud Distance
  • [CVPR2020] PointASNL: Robust Point Clouds Processing using Nonlocal Neural Networks with Adaptive Sampling
  • [AAAI2020] Shape-Oriented Convolution Neural Network for Point Cloud Analysis
  • [Arxiv] Joint Supervised and Self-Supervised Learning for 3D Real-World Challenges
  • [Arxiv] LIGHTCONVPOINT: CONVOLUTION FOR POINTS [pytorch]
  • [Arxiv] Variational Auto-Decoder [pytorch]
  • [Arxiv] Generative PointNet: Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification
  • [CVPR2020] DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes [pytorch]
  • [CVPR2020] RPM-Net: Robust Point Matching using Learned Features [github]
  • [CVPR2020] Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds
  • [CVPR2020] PointGMM: a Neural GMM Network for Point Clouds
  • [Arxiv] Dynamic ReLU
  • [CVPR2020] SampleNet: Differentiable Point Cloud Sampling [pytorch]
  • [Arxiv] Defense-PointNet: Protecting PointNet Against Adversarial Attacks
  • [CVPR2020] FPConv: Learning Local Flattening for Point Convolution [pytorch]
  • [SIGGRAPH2019] MeshCNN: A Network with an Edge [pytorch] 🔥⭐️
  • [ICCV2019] Total Denoising: Unsupervised Learning of 3D Point Cloud Cleaning [tensorflow]
  • [ICCV2019] PU-GAN: a Point Cloud Upsampling Adversarial Network🔥
  • [CVPR2019] Relation-Shape Convolutional Neural Network for Point Cloud Analysis [pytorch] 🔥
  • [CVPR2019] Patch-based Progressive 3D Point Set Upsampling [tensorflow] [pytorch] 🔥
  • [TOG2019] Dynamic Graph CNN for Learning on Point Clouds [Project] 🔥 ⭐️
  • [ECCV2018] EC-Net: an Edge-aware Point set Consolidation Network [project page]
  • [CVPR2018] PU-Net: Point Cloud Upsampling Network ⭐️🔥
  • [Arxiv] PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
  • [ICLR2017] DEEP LEARNING WITH SETS AND POINT CLOUDS
  • [NeurIPS2017] Deep Sets
  • [Siggraph2006] Designing with Distance Fields

Others (inc. Networks in Classification, Matching, Registration, Alignment, Depth, Normal, Pose, Keypoints, etc.)

  • [CVPR2021] Self-supervised Geometric Perception
  • [Arxiv] StablePose: Learning 6D Object Poses from Geometrically Stable Patches
  • [Arxiv] A Parameterised Quantum Circuit Approach to Point Set Matching
  • [Arxiv] Adjoint Rigid Transform Network: Self-supervised Alignment of 3D Shapes
  • [Arxiv] Video Transformer Network
  • [ICLR2021] NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation [pytorch]
  • [Arxiv] NBDT: NEURAL-BACKED DECISION TREE [pytorch]
  • [Arxiv] AdaBins: Depth Estimation using Adaptive Bins [pytorch]
  • [Arxiv] Unsupervised Monocular Depth Reconstruction of Non-Rigid Scenes
  • [Arxiv] CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for 3D Point Clouds

Before 2021

  • [Arxiv] iNeRF: Inverting Neural Radiance Fields for Pose Estimation [Project]
  • [Arxiv] Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion
  • [Arxiv] 3D Registration for Self-Occluded Objects in Context
  • [Arxiv] Continuous Surface Embeddings
  • [Arxiv] SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration
  • [Arxiv] MVTN: Multi-View Transformation Network for 3D Shape Recognition
  • [Arxiv] PREDATOR: Registration of 3D Point Clouds with Low Overlap
  • [Arxiv] Deep Magnification-Arbitrary Upsampling over 3D Point Clouds
  • [Arxiv] Occlusion Guided Scene Flow Estimation on 3D Point Clouds
  • [NeurIPS2020] An Analysis of SVD for Deep Rotation Estimation
  • [EG2020W] SHREC 2020 track: 6D object pose estimation
  • [ACCV2020] Best Buddies Registration for Point Clouds
  • [3DV] A New Distributional Ranking Loss With Uncertainty: Illustrated in Relative Depth Estimation
  • [BMVC2020] View-consistent 4D Light Field Depth Estimation
  • [BMVC2020] Neighbourhood-Insensitive Point Cloud Normal Estimation Network [Project]
  • [ECCV2020] DeepGMR: Learning Latent Gaussian Mixture Models for Registration [Project]
  • [ECCV2020] Motion Capture from Internet Videos [Project]
  • [ECCV2020] Depth Completion with RGB Prior
  • [ECCV2020] 6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference
  • [Arxiv] Self-Supervised Learning of Point Clouds via Orientation Estimation
  • [SIGGRAPH2020] SymmetryNet: Learning to Predict Reflectional and Rotational Symmetries of 3D Shapes from Single-View RGB-D Images [Project]
  • [ECCV2020] Learning Stereo from Single Images [github]
  • [Arxiv] Learning Long-term Visual Dynamics with Region Proposal Interaction Networks [Project]
  • [ECCV2020] Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes [Project]
  • [ECCV2020] Unsupervised Shape and Pose Disentanglement for 3D Meshes
  • [Arxiv] PVSNet: Pixelwise Visibility-Aware Multi-View Stereo Network
  • [ECCV2020] P2Net: Patch-match and Plane-regularization for Unsupervised Indoor Depth Estimation
  • [CVPR2020] Learning multiview 3D point cloud registration [pytorch]
  • [CVPR2020] Feature-metric Registration: A Fast Semi-supervised Approach for Robust Point Cloud Registration without Correspondences
  • [Siggraph2020] Consistent Video Depth Estimation
  • [Arxiv] Deep Feature-preserving Normal Estimation for Point Cloud Filtering
  • [Arxiv] Pseudo RGB-D for Self-Improving Monocular SLAM and Depth Prediction
  • [CVPR2020] Towards Better Generalization: Joint Depth-Pose Learning without PoseNet [pytorch]
  • [Arxiv] Monocular Camera Localization in Prior LiDAR Maps with 2D-3D Line Correspondences
  • [Arxiv] Adversarial Texture Optimization from RGB-D Scans
  • [Arxiv] SAPIEN: A SimulAted Part-based Interactive ENvironment
  • [CVPR2020] G2L-Net: Global to Local Network for Real-time 6D Pose Estimation with Embedding Vector Features
  • [Arxiv] On Localizing a Camera from a Single Image
  • [Arxiv] DeepFit: 3D Surface Fitting via Neural Network Weighted Least Squares
  • [CVPR2020] KFNet: Learning Temporal Camera Relocalization using Kalman Filtering
  • [Arxiv] Neural Contours: Learning to Draw Lines from 3D Shapes
  • [Arxiv] 3dDepthNet: Point Cloud Guided Depth Completion Network for Sparse Depth and Single Color Image
  • [Arxiv] Unsupervised Learning of Category-Specific Symmetric 3D Keypoints from Point Sets
  • [CVPR2020] End-to-End Learning Local Multi-view Descriptors for 3D Point Clouds
  • [Arxiv] PnP-Net: A hybrid Perspective-n-Point Network
  • [CVPR2020] MobilePose: Real-Time Pose Estimation for Unseen Objects with Weak Shape Supervision
  • [CVPR2020] D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
  • [ICIP2020] TRIANGLE-NET: TOWARDS ROBUSTNESS IN POINT CLOUD CLASSIFICATION
  • [ICRA2020] Robust 6D Object Pose Estimation by Learning RGB-D Features
  • [Arxiv] Predicting Sharp and Accurate Occlusion Boundaries in Monocular Depth Estimation Using Displacement Fields
  • [Arxiv] Single Image Depth Estimation Trained via Depth from Defocus Cues [pytorch]
  • [Arxiv] DepthTransfer: Depth Extraction from Video Using Non-parametric Sampling
  • [Arxiv] Target-less registration of point clouds: A review
  • [Arxiv] Quaternion Equivariant Capsule Networks for 3D point clouds
  • [Arxiv] Category-Level Articulated Object Pose Estimation
  • [Arxiv] A Quantum Computational Approach to Correspondence Problems on Point Sets
  • [Arxiv] DeepSFM: Structure From Motion Via Deep Bundle Adjustment
  • [Arxiv] P2GNet: Pose-Guided Point Cloud Generating Networks for 6-DoF Object Pose Estimation
  • [ICCV2019] Learning Local RGB-to-CAD Correspondences for Object Pose Estimation
  • [ICCV2019] Joint Embedding of 3D Scan and CAD Objects [dataset]
  • [ICLR2019] BA-NET: DENSE BUNDLE ADJUSTMENT NETWORKS [tensorflow]
  • [ICCV2019] GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild
  • [ICCV2019] Closed-Form Optimal Two-View Triangulation Based on Angular Errors
  • [ICCV2019] Polarimetric Relative Pose Estimation
  • [ICCV2019] End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans
  • [ICCV2019] Deep Non-Rigid Structure from Motion
  • [CVPR2019] On the Continuity of Rotation Representations in Neural Networks [pytorch]
  • [Arxiv] Deep Interpretable Non-Rigid Structure from Motion [tensorflow]
  • [Arxiv] IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks [dataset]
  • [CVPR2019] Scan2CAD: Learning CAD Model Alignment in RGB-D Scans [pytorch] 🔥
  • [3DV2019] Location Field Descriptors: Single Image 3D Model Retrieval in the Wild
  • [CVPR2016] Marr Revisited: 2D-3D Alignment via Surface Normal Prediction [caffe]

Survey, Resources and Tools

  • [Survey] A comprehensive survey on point cloud registration
  • [Survey] Domain Generalization: A Survey
  • [Dataset] SUM: A Benchmark Dataset of Semantic Urban Meshes
  • [Survey] Attention Models for Point Clouds in Deep Learning: A Survey
  • [Benchmark] H3D: Benchmark on Semantic Segmentation of High-Resolution 3D Point Clouds and textured Meshes from UAV LiDAR and Multi-View-Stereo [Project]
  • [Survey] Dynamic Neural Networks: A Survey
  • [Survey] Online Continual Learning in Image Classification: An Empirical Survey
  • [Survey] Deep Learning for Visual Tracking: A Comprehensive Survey
  • [Survey] Occlusion Handling in Generic Object Detection: A Review
  • [Survey] Curriculum Learning: A Survey
  • [Github] Awesome Neural Radiance Fields
  • [Survey] Neural Volume Rendering: NeRF And Beyond
  • [Survey] Transformers in Vision: A Survey
  • [Survey] Efficient Transformers: A Survey
  • [Survey] Semantics for Robotic Mapping, Perception and Interaction: A Survey
  • [Survey] Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy

Before 2021

  • [Dataset] Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations [Github]
  • [Survey] Skeleton-based Approaches based on Machine Vision: A Survey
  • [Survey] Deep Learning-Based Human Pose Estimation: A Survey [Github]
  • [Dataset] Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding [Github]
  • [Survey] A Review and Comparative Study on Probabilistic Object Detection in Autonomous Driving [Github]
  • [Dataset] RELLIS-3D Dataset: Data, Benchmarks and Analysis [Github]
  • [Arxiv] Motion Prediction on Self-driving Cars: A Review
  • [Github] TESSE: Unity-based simulator to enable research in perception, mapping, learning, and robotics
  • [Survey] A Survey on Visual Transformer
  • [Survey] A Survey on Contrastive Self-supervised Learning
  • [Survey] A Survey of Surface Reconstruction from Point Clouds
  • [Dataset] Torch-Points3D: A Modular Multi-Task Framework for Reproducible Deep Learning on 3D Point Clouds [Project]
  • [Thesis] Learning to Reconstruct and Segment 3D Objects
  • [Survey] An Overview Of 3D Object Detection
  • [Survey] A Brief Review of Domain Adaptation
  • [Dataset] Announcing the Objectron Dataset
  • [Tutorial] Video Action Understanding: A Tutorial
  • [Arxiv] Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Reconstruction [Page]
  • [Survey] Multi-Task Learning with Deep Neural Networks: A Survey
  • [Survey] Deep Learning for 3D Point Cloud Understanding: A Survey
  • [Thesis] COMPUTATIONAL ANALYSIS OF DEFORMABLE MANIFOLDS: FROM GEOMETRIC MODELING TO DEEP LEARNING
  • [Arxiv] F*: An Interpretable Transformation of the F-measure
  • [Dataset] Gibson Database of 3D Spaces
  • [BMVC2020] Black Magic in Deep Learning: How Human Skill Impacts Network Training
  • [Arxiv] PyTorch Metric Learning
  • [Arxiv] RGB-D Salient Object Detection: A Survey [Project]
  • [Arxiv] AiRound and CV-BrCT: Novel Multi-View Datasets for Scene Classification [Project]
  • [CVPR2020] OASIS: A Large-Scale Dataset for Single Image 3D in the Wild [Project]
  • [Arxiv] 3D-FUTURE: 3D FUrniture shape with TextURE
  • [Arxiv] 3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics [Project][Link]
  • [Arxiv] Differentiable Rendering: A Survey
  • [Arxiv] Visual Relationship Detection using Scene Graphs: A Survey
  • [Arxiv] Polarization Human Shape and Pose Dataset
  • [Arxiv] IDDA: a large-scale multi-domain dataset for autonomous driving [Project page]
  • [CVPR2020] RoboTHOR: An Open Simulation-to-Real Embodied AI Platform [Project page]
  • [EG2020] State of the Art on Neural Rendering
  • [IJCAI-PRICAI2020] 3D-FUTURE: 3D FUrniture shape with TextURE
  • [Arxiv] Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways
  • [Arxiv] KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations
  • [Arxiv] A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications
  • [Arxiv] From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN)
  • [Arxiv] DIODE: A Dense Indoor and Outdoor DEpth Dataset [dataset]
  • [Github] Various GANs with Pytorch.
  • [Arxiv] SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances [dataset]
  • [CVM] A Survey on Deep Geometry Learning: From a Representation Perspective
  • [Arxiv] A survey on Semi-, Self- and Unsupervised Techniques in Image Classification
  • [Arxiv] fastai: A Layered API for Deep Learning
  • [Arxiv] AU-AIR: A Multi-modal Unmanned Aerial Vehicle Dataset for Low Altitude Traffic Surveillance [dataset]
  • [Arxiv] VIRTUAL KITTI 2 [dataset]
  • [Arxiv] Tutorial on Variational Autoencoders
  • [Arxiv] Review: deep learning on 3D point clouds
  • [Arxiv] Image Segmentation Using Deep Learning: A Survey
  • [CVPR2018] Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction
  • [Arxiv] Evolution of Image Segmentation using Deep Convolutional Neural Network: A Survey
  • [Arxiv] MCMLSD: A Probabilistic Algorithm and Evaluation Framework for Line Segment Detection
  • [Arxiv] Deep Learning for 3D Point Clouds: A Survey
  • [Arxiv] A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images
  • [Arxiv] A Survey on Deep Learning Architectures for Image-based Depth Reconstruction
  • [Arxiv] secml: A Python Library for Secure and Explainable Machine Learning
  • [Arxiv] Bundle Adjustment Revisited
  • [ICCV2019] Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement
  • [Arxiv] SIFT Meets CNN: A Decade Survey of Instance Retrieval
  • [ICCV2019] Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data [tensorflow]
  • [Arxiv] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks [dataset]
  • [Arxiv] Imbalance Problems in Object Detection: A Review [repository]
  • [IJCV] Deep Learning for Generic Object Detection: A Survey
  • [Arxiv] Differentiable Visual Computing (Ph.D thesis)
  • [BMVC2018] InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset [dataset]
  • [ICCV2017] The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes [dataset] [script] ⭐️
  • [Arxiv] SynthCity: A large scale synthetic point cloud [dataset]
  • [Github] Mesh Voxelization (SDFs or Occupancy grids)
  • [Github] SDFGen (to generate grid-based signed distance field (level set))
  • [Github] Blender renderer for python
  • [Github] Blender renderer for python
  • [Github] Volumetric TSDF Fusion of RGB-D Images in Python
  • [Github] Volumetric TSDF Fusion of Multiple Depth Maps
  • [Github] PyFusion
  • [Github] PyRender
  • [Github] PyMCubes
  • [Github] Watertight and Simplified Meshes through TSDF Fusion (Python tool for obtaining watertight meshes using TSDF fusion.)
  • [Github] Several tools about SDF functions.
  • [Github] 3DMatch Toolbox
  • [stackoverflow] Computing truncated signed distance function(TSDF) from a point cloud
  • [Github] voxblox: A library for flexible voxel-based mapping, mainly focusing on truncated and Euclidean signed distance fields.
  • [Github] Discregrid: A static C++ library for the generation of discrete functions on a box-shaped domain. This is especially suited for the generation of signed distance fields.
  • [Github] awesome-voxel: Voxel resources for coders
  • [Github] gvdb-voxels: Sparse volume compute and rendering on NVIDIA GPUs
  • [Github] pyntcloud is a Python library for working with 3D point clouds.
  • [Github] Open3D: A Modern Library for 3D Data Processing
  • [Github] mesh_to_sdf: Calculate signed distance fields for arbitrary meshes
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].