All Projects → referit3d → referit3d

referit3d / referit3d

Licence: MIT license
Code accompanying our ECCV-2020 paper on 3D Neural Listeners.

Programming Languages

C++
36643 projects - #6 most used programming language
python
139335 projects - #7 most used programming language
c
50402 projects - #5 most used programming language
Cuda
1817 projects
shell
77523 projects
Makefile
30231 projects

Projects that are alternatives of or similar to referit3d

Scannet
Stars: ✭ 860 (+1357.63%)
Mutual labels:  computer-graphics, rgbd
Shader-Playgrounds
A WebGL shaders editor for beginners and otherwise.
Stars: ✭ 28 (-52.54%)
Mutual labels:  computer-graphics
Pulp-Fiction-ARKit
An experiment using Volume to reconstruct Pulp Fiction's dance scene in Augmented Reality
Stars: ✭ 46 (-22.03%)
Mutual labels:  computer-graphics
svbrdf-estimation
SVBRDF Estimation using a Physically-based Differentiable Renderer
Stars: ✭ 16 (-72.88%)
Mutual labels:  computer-graphics
Metu-CENG
All the homeworks, studies and projects I've done at Metu-CENG
Stars: ✭ 32 (-45.76%)
Mutual labels:  computer-graphics
racket-vulkan
Racket integration with all things Vulkan 💥
Stars: ✭ 40 (-32.2%)
Mutual labels:  computer-graphics
visionmagic
Collection of vision & graphics algorithms
Stars: ✭ 39 (-33.9%)
Mutual labels:  computer-graphics
GeobitNonrigidDescriptor ICCV 2019
C++ implementation of the nonrigid descriptor Geobit presented at ICCV 2019 "GEOBIT: A Geodesic-Based Binary Descriptor Invariant to Non-Rigid Deformations for RGB-D Images"
Stars: ✭ 11 (-81.36%)
Mutual labels:  rgbd
SynergyNet
3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry
Stars: ✭ 179 (+203.39%)
Mutual labels:  computer-graphics
-
图形学书单
Stars: ✭ 84 (+42.37%)
Mutual labels:  computer-graphics
creative-coding-notebooks
🎨 An authorial collection of fundamental recipes on Creative Coding and Recreational Programming.
Stars: ✭ 17 (-71.19%)
Mutual labels:  computer-graphics
DepthKit-for-Max
A patch for using DepthKit volumetric videos in Max/MSP/Jitter
Stars: ✭ 21 (-64.41%)
Mutual labels:  rgbd
Teapong
A two-player game for fans of Pong and the Utah teapot!
Stars: ✭ 24 (-59.32%)
Mutual labels:  computer-graphics
pytorch-translm
An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (-62.71%)
Mutual labels:  language-modeling
CGvsPhoto
Computer Graphics vs Real Photographic Images : A Deep-learning approach
Stars: ✭ 24 (-59.32%)
Mutual labels:  computer-graphics
ESANet
ESANet: Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
Stars: ✭ 154 (+161.02%)
Mutual labels:  rgbd
multimodal-deep-learning-for-disaster-response
Damage Identification in Social Media Posts using Multimodal Deep Learning: code and dataset
Stars: ✭ 43 (-27.12%)
Mutual labels:  multimodal-deep-learning
iMIX
A framework for Multimodal Intelligence research from Inspur HSSLAB.
Stars: ✭ 21 (-64.41%)
Mutual labels:  multimodal-deep-learning
AwesomeMLForDigitalMedia
A curated list of awesome machine learning resources in the context of digital media and (interactive) computer graphics.
Stars: ✭ 17 (-71.19%)
Mutual labels:  computer-graphics
NIvsCG-keras
Distinguishing Between Natural and Computer-Generated Images Using Convolutional Neural Networks in Keras.
Stars: ✭ 20 (-66.1%)
Mutual labels:  computer-graphics

ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes [ECCV 2020 (Oral)]

Website Badge License: MIT

Created by: Panos Achlioptas, Ahmed Abdelreheem, Fei Xia, Mohamed Elhoseiny, Leonidas Guibas

Introduction

This work is based on our ECCV-2020 paper. There, we proposed the novel task of identifying a 3D object in a real-world scene given discriminative language, created two relevant datasets (Nr3D and Sr3D) and proposed a 3D neural listener (ReferIt3DNet) for solving this task. The bulk of the provided code serves the training & evaluation of ReferIt3DNet in our data. For more information please visit our project's webpage.

ReferIt3DNet

ReferIt3DNet

Code-Dependencies

  1. Python 3.x with numpy, pandas, matplotlib (and a few more common packages - please see setup.py)
  2. Pytorch 1.x

Our code is tested with Python 3.6.9, Pytorch 1.4 and CUDA 10.0, on Ubuntu 14.04.

Installation

  • (recommended) you are advised to create a new anaconda environment, please use the following commands to create a new one.
    conda create -n referit3d_env python=3.6.9 cudatoolkit=10.0
    conda activate referit3d_env
    conda install pytorch torchvision -c pytorch
  • Install the referit3d python package using
    cd referit3d
    pip install -e .
  • To use a PointNet++ visual-encoder you need to compile its CUDA layers for PointNet++: Note: To do this compilation also need: gcc5.4 or later.
    cd external_tools/pointnet2
    python setup.py install

Dataset

ScanNet

First you must download the train/val scans of ScanNet if you do not have them locally. To do so, please refer to the ScanNet Dataset for more details.

Our Linguistic Data

  • Nr3D you can dowloaded Nr3D here (10.7MB)
  • Sr3D / Sr3D+ you can dowloaded Sr3D/Sr3D+ here (19MB / 20MB)

Since Sr3d is a synthetic dataset, you can change the hyper-parameters to create a version customized to your needs. please see referit3d/data_generation/sr3d/

Training

  • To train on either Nr3d or Sr3d dataset, use the following commands
    cd referit3d/scripts/
    python train_referit3d.py -scannet-file the_processed_scannet_file -referit3D-file dataset_file.csv --log-dir dir_to_log --n-workers 4

feel free to change the number of workers to match your #CPUs and RAM size.

  • To train nr3d in joint with sr3d, add the following argument
    --augment-with-sr3d sr3d_dataset_file.csv

Evaluation

  • To evaluate on either Nr3d or Sr3d dataset, use the following commands
    cd referit3d/scripts/
    python train_referit3d.py --mode evaluate -scannet-file the_processed_scannet_file -referit3D-file dataset_file.csv --resume-path the_path_to_the_best_model.pth  --n-workers 4 --batch-size 64 
  • To evaluate on joint trained model, add the following argument to the above command
    --augment-with-sr3d sr3d_dataset_file.csv

Pretrained model

you can download a pretrained ReferIt3DNet models on Nr3D and Sr3D here. please extract the zip file and then copy the extracted folder to referit3d/log folder. you can run the following the command to evaluate:

cd referit3d/scripts
python train_referit3d.py --mode evaluate -scannet-file path_to_keep_all_points_00_view_with_global_scan_alignment.pkl  -referit3D-file path_to_corresponding_csv.csv  --resume-path checkpoints/best_model.pth

ReferIt3D Benchmark Challenges

We wish to aggregate and highlight results from different approaches tackling the problem of fine-grained 3D object identification via language. If you use either of our datasets with a new method, please let us know! so we can add your method and attained results in our benchmark-aggregating page.

Citation

@article{achlioptas2020referit_3d,
    title={ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes},
    author={Achlioptas, Panos and Abdelreheem, Ahmed and Xia, Fei and Elhoseiny, Mohamed and Guibas, Leonidas},
    journal={16th European Conference on Computer Vision (ECCV)},
    year={2020}
}

License

The code is licensed under MIT license (see LICENSE.md for details).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].