
feixh / Visma

Licence: other
Visual-Inertial-Semantic-MApping Dataset and tools

Projects that are alternatives to or similar to Visma

Eao Slam
[IROS 2020] EAO-SLAM: Monocular Semi-Dense Object SLAM Based on Ensemble Data Association
Stars: ✭ 95 (+75.93%)
Mutual labels:  slam, mapping, semantic
Visual slam related research
Tracking of visual (semantic) SLAM-related research
Stars: ✭ 708 (+1211.11%)
Mutual labels:  slam, mapping, semantic
UrbanLoco
UrbanLoco: A Full Sensor Suite Dataset for Mapping and Localization in Urban Scenes
Stars: ✭ 147 (+172.22%)
Mutual labels:  mapping, slam
StrayVisualizer
Visualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html
Stars: ✭ 30 (-44.44%)
Mutual labels:  mapping, slam
Comma2k19
A driving dataset for the development and validation of fused pose estimators and mapping algorithms
Stars: ✭ 391 (+624.07%)
Mutual labels:  slam, mapping
slamkit
SLAM Kit
Stars: ✭ 28 (-48.15%)
Mutual labels:  mapping, slam
direct lidar odometry
Direct LiDAR Odometry: Fast Localization with Dense Point Clouds
Stars: ✭ 202 (+274.07%)
Mutual labels:  mapping, slam
Sc Lego Loam
LiDAR SLAM: Scan Context + LeGO-LOAM
Stars: ✭ 332 (+514.81%)
Mutual labels:  slam, mapping
ros-vrep-slam
ROS and V-REP for Robot Mapping and Localization
Stars: ✭ 39 (-27.78%)
Mutual labels:  mapping, slam
Multi Human Parsing
🔥🔥Official Repository for Multi-Human-Parsing (MHP)🔥🔥
Stars: ✭ 507 (+838.89%)
Mutual labels:  detection, semantic
Loam noted
LOAM code annotated in Chinese
Stars: ✭ 455 (+742.59%)
Mutual labels:  slam, mapping
Lio Mapping
Implementation of Tightly Coupled 3D Lidar Inertial Odometry and Mapping (LIO-mapping)
Stars: ✭ 520 (+862.96%)
Mutual labels:  slam, mapping
CarND-VehicleDetection
vehicle detection with deep learning
Stars: ✭ 34 (-37.04%)
Mutual labels:  object, detection
LVIO-SAM
A Multi-sensor Fusion Odometry via Smoothing and Mapping.
Stars: ✭ 143 (+164.81%)
Mutual labels:  mapping, slam
Slam-Dunk-Android
Android implementation of "Fusion of inertial and visual measurements for rgb-d slam on mobile devices"
Stars: ✭ 25 (-53.7%)
Mutual labels:  mapping, slam
GA SLAM
🚀 SLAM for autonomous planetary rovers with global localization
Stars: ✭ 40 (-25.93%)
Mutual labels:  mapping, slam
Deep Learning Localization Mapping
A collection of deep learning based localization models
Stars: ✭ 300 (+455.56%)
Mutual labels:  slam, mapping
Perception-of-Autonomous-mobile-robot
Perception for an autonomous mobile robot using ROS and RS-LiDAR-16, with SLAM and object detection via a YOLOv5-based DNN
Stars: ✭ 40 (-25.93%)
Mutual labels:  detection, slam
python-graphslam
Graph SLAM solver in Python
Stars: ✭ 118 (+118.52%)
Mutual labels:  mapping, slam
Semantic suma
SuMa++: Efficient LiDAR-based Semantic SLAM (Chen et al IROS 2019)
Stars: ✭ 431 (+698.15%)
Mutual labels:  slam, semantic

VISMA dataset tools

VISMA stands for Visual Inertial Semantic MApping and contains both RGB videos and inertial measurements for developing object-level (semantic) mapping systems.

We gave a live demo of our system at CVPR 2016, followed by a CVPR 2017 paper in which objects are modeled as 3D bounding boxes with semantic labels attached.

In the follow-up ECCV 2018 paper, the system was further improved to model fine-grained object shapes as polygon meshes.

If you find VISMA or this repo useful and use them in your work, please cite the following papers:

@inproceedings{feiS18,
    title = {Visual-Inertial Object Detection and Mapping},
    author = {Fei, X. and Soatto, S.},
    booktitle = {Proceedings of the European Conference on Computer Vision},
    year = {2018}
}

[paper]-[poster]-[video]

@inproceedings{dongFS17,
    author = {Dong, J. and Fei, X. and Soatto, S.},
    title = {Visual Inertial Semantic Scene Representation for 3D Object Detection},
    booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    year = {2017}
}

[paper]-[poster]-[video]

Data

Data is available on Dropbox here.

Install

Dependencies

  • OpenCV: Image I/O and processing. The easiest way is to install OpenCV via your favorite package manager.
  • Eigen: Linear algebra and matrix manipulation. Install via package manager or build from source.
  • Protobuf: Utilities for protocol buffers. Install via package manager.
  • abseil-cpp: Utilities from Google. No need to build this manually, since the repo is add_subdirectory-ed into the main build script.
  • jsoncpp: I/O for JSON files. No need to build this manually, since the repo is add_subdirectory-ed into the main build script.

Once all the requirements are met, make a build directory, enter it, and run cmake .. followed by make (a sketch is shown below).
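
A typical out-of-source build, assuming the default CMake options, looks like this:

mkdir build
cd build
cmake ..
make -j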

To build the evaluation code, you need the following extra dependencies:

  • Open3D: For point cloud manipulation and visualization.
  • libigl: Mesh I/O and geometry processing. This is a header-only library; clone the repo into the thirdparty directory as libigl.

We provide a version of Open3D in the thirdparty directory. First, go to thirdparty/Open3D and follow the instructions at http://www.open3d.org/docs/getting_started.html#ubuntu to build, i.e.:

util/scripts/install-deps-ubuntu.sh
mkdir build
cd build
cmake .. -DBUILD_SHARED_LIBS=ON -DCMAKE_INSTALL_PREFIX=../
make -j

Then, set build_evaluation to True in CMakeLists.txt and build.
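
The exact form depends on how build_evaluation is declared; as a sketch, assuming it is a plain CMake variable near the top of CMakeLists.txt, the change would be

set(build_evaluation True)

followed by re-running cmake .. and make in the build directory.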

Raw and preprocessed data

Raw data (RGB video and inertial measurements with timestamps) are stored in rosbags. You can run your favorite visual-inertial or visual SLAM system to get camera poses.

In addition to the raw data, we also provide the following preprocessed data:

  • RGB images extracted from the rosbags (XXX.png)
  • Associated camera poses at each time instant from our SLAM system (dataset)
  • Edge maps (XXX.edge)
  • Object bounding boxes (XXX.bbox)

Except for the RGB images, all other data are encoded according to protocols/vlslam.proto to avoid custom I/O. Although the provided data-loading example is written in C++, it is not hard to parse the data in other languages, e.g., Python; a sketch follows.
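
As an illustration, a minimal Python sketch for reading one of the protobuf-encoded files might look like the following. It assumes vlslam_pb2.py has been generated as described in the Python example further below; the message name ObjectProposals and the file name 000000.bbox are placeholders, so substitute the actual message types defined in protocols/vlslam.proto and the actual file names in the dataset.

import vlslam_pb2  # generated from protocols/vlslam.proto with protoc

def load_message(path, message_type):
    # Generic protobuf deserialization: read the raw bytes and parse them
    # into the given message type.
    msg = message_type()
    with open(path, 'rb') as f:
        msg.ParseFromString(f.read())
    return msg

# Hypothetical message/file names -- replace with the ones actually defined
# in vlslam.proto and present in the preprocessed data.
bboxes = load_message('000000.bbox', vlslam_pb2.ObjectProposals)
print(bboxes)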

RGB-D (pseudo) ground truth

We ran ElasticFusion on RGB-D data collected by a Kinect to obtain a (pseudo) ground-truth reconstruction against which our semantic reconstruction is compared.

The RGBD folder contains all the data needed to evaluate the semantic reconstruction for each sequence. For instance, clutter1 contains the data to evaluate results on the clutter sequence. clutter/test.klg.ply is the point cloud reconstruction of the scene from ElasticFusion. clutter/fragments contains the following items:

  • objects.json contains a complete list of objects in the scene. Each object is named XXX_N.ply, where XXX is the object name in the CAD database and N is the instance index (there might be multiple identical objects in the scene).
  • For each object listed in objects.json, we provide its point cloud segmented out of the RGB-D reconstruction. This is used to find the alignment between the semantic reconstruction and the RGB-D reconstruction via ICP, as described in our paper.
  • alignment.json contains the ground-truth object poses in the scene. The poses are found by the orientation-constrained ICP described in our paper.
  • augmented_scene.ply contains the RGB-D point cloud together with points sampled from the CAD models aligned to the scene. This is a point cloud file because the RGB-D reconstruction is a point cloud, even though the CAD models are provided as meshes.
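
For orientation, a minimal Python sketch for inspecting these files with Open3D could look like the following. Nothing is assumed about the JSON structure beyond what is described above (inspect the actual files before relying on any keys), and depending on the Open3D version the I/O functions may live directly under open3d rather than open3d.io.

import json
import open3d as o3d

# ElasticFusion reconstruction of the scene and the scene augmented with
# points sampled from the aligned CAD models.
scene = o3d.io.read_point_cloud('clutter/test.klg.ply')
augmented = o3d.io.read_point_cloud('clutter/fragments/augmented_scene.ply')

# Complete list of objects in the scene; check the actual structure of this
# file before relying on any particular keys.
with open('clutter/fragments/objects.json') as f:
    objects = json.load(f)
print(objects)

# Visualize the augmented scene.
o3d.visualization.draw_geometries([augmented])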

Examples

Data loading (C++)

For example usage of the data loader, see example/example_load.cpp and run example_load DATASET_DIRECTORY in the example sub-directory. This loads the input RGB images, the pre-computed edge maps and object proposals, and the camera poses from SLAM. To load the sparse reconstruction, 2D tracklets of feature points, and other information from SLAM, see the protocol buffer definition in protocols/vlslam.proto and modify the dataset loader accordingly.

Data loading (Python)

VISMA is designed for, but not limited to, developing visual-inertial semantic SLAM. If you want to use it to train deep neural networks, e.g., for unsupervised depth prediction from monocular videos, we provide an example Python script to load and process the data.

First, go to protocols and run

protoc vlslam.proto --python_out=../scripts

which generates vlslam_pb2.py in the scripts folder.

Go to the project root directory and run the following:

python scripts/example_load.py --dataroot YOUR_DIRECTORY_OF_PREPROCESSED_DATA

For more command line options, see the script.

TODO

  • Complete the Python script for loading bounding boxes, sparse features, etc.
  • Finalize the example code for evaluation.