Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → jimmyyhwu → Pose Interpreter Networks

jimmyyhwu / Pose Interpreter Networks

Licence: mit

Real-Time Object Pose Estimation with Pose Interpreter Networks (IROS 2018)

Labels

jupyter-notebook deep-learning pytorch computer-vision artificial-intelligence 3d robotics pose-estimation

Projects that are alternatives of or similar to Pose Interpreter Networks

Pytorch Dense Correspondence

Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"

Stars: ✭ 445 (+327.88%)

Mutual labels: artificial-intelligence, robotics, 3d

Epipolar Transformers

Epipolar Transformers (CVPR 2020)

Stars: ✭ 245 (+135.58%)

Mutual labels: jupyter-notebook, 3d, pose-estimation

Visual Pushing Grasping

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

Stars: ✭ 516 (+396.15%)

Mutual labels: artificial-intelligence, robotics, 3d

Iros20 6d Pose Tracking

[IROS 2020] se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains

Stars: ✭ 113 (+8.65%)

Mutual labels: robotics, 3d, pose-estimation

Particle Filter Prototype

Particle Filter Implementations in Python and C++, with lecture notes and visualizations

Stars: ✭ 29 (-72.12%)

Mutual labels: artificial-intelligence, robotics, jupyter-notebook

Mit Deep Learning

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Stars: ✭ 8,912 (+8469.23%)

Mutual labels: artificial-intelligence, jupyter-notebook

Stag

STag: A Stable Fiducial Marker System

Stars: ✭ 75 (-27.88%)

Mutual labels: robotics, pose-estimation

Phormatics

Using A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)

Stars: ✭ 79 (-24.04%)

Mutual labels: artificial-intelligence, jupyter-notebook

Ai Dl Enthusiasts Meetup

AI & Deep Learning Enthusiasts Meetup Project & Study Sessions

Stars: ✭ 90 (-13.46%)

Mutual labels: artificial-intelligence, jupyter-notebook

Notebooks

Some notebooks

Stars: ✭ 53 (-49.04%)

Mutual labels: artificial-intelligence, jupyter-notebook

Ros Openpose

CMU's OpenPose for ROS

Stars: ✭ 81 (-22.12%)

Mutual labels: robotics, pose-estimation

60 days rl challenge

60_Days_RL_Challenge中文版

Stars: ✭ 92 (-11.54%)

Mutual labels: artificial-intelligence, jupyter-notebook

Votenet

Deep Hough Voting for 3D Object Detection in Point Clouds

Stars: ✭ 1,183 (+1037.5%)

Mutual labels: robotics, 3d

Frustum Pointnets

Frustum PointNets for 3D Object Detection from RGB-D Data

Stars: ✭ 1,154 (+1009.62%)

Mutual labels: robotics, 3d

Pose Estimation tutorials

Tools and tutorials of pose estimation and deep learning

Stars: ✭ 79 (-24.04%)

Mutual labels: jupyter-notebook, pose-estimation

Awesome Decision Making Reinforcement Learning

A selection of state-of-the-art research materials on decision making and motion planning.

Stars: ✭ 68 (-34.62%)

Mutual labels: artificial-intelligence, robotics

Simulator

A ROS/ROS2 Multi-robot Simulator for Autonomous Vehicles

Stars: ✭ 1,260 (+1111.54%)

Mutual labels: artificial-intelligence, 3d

Rlai Exercises

Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]

Stars: ✭ 97 (-6.73%)

Mutual labels: artificial-intelligence, jupyter-notebook

Person remover

People removal in images using Pix2Pix and YOLO.

Stars: ✭ 96 (-7.69%)

Mutual labels: artificial-intelligence, jupyter-notebook

Objectron

Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes

Stars: ✭ 1,352 (+1200%)

Mutual labels: jupyter-notebook, 3d

View All Similar Projects ➔

pose-interpreter-networks

This code release accompanies the following paper:

Real-Time Object Pose Estimation with Pose Interpreter Networks [arXiv] [video]

Jimmy Wu, Bolei Zhou, Rebecca Russell, Vincent Kee, Syler Wagner, Mitchell Hebert, Antonio Torralba, and David M.S. Johnson

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018)

Abstract: In this work, we introduce pose interpreter networks for 6-DoF object pose estimation. In contrast to other CNN-based approaches to pose estimation that require expensively annotated object pose data, our pose interpreter network is trained entirely on synthetic pose data. We use object masks as an intermediate representation to bridge real and synthetic. We show that when combined with a segmentation model trained on RGB images, our synthetically trained pose interpreter network is able to generalize to real data. Our end-to-end system for object pose estimation runs in real-time (20 Hz) on live RGB data, without using depth information or ICP refinement.

Overview

File or Directory	Purpose
`segmentation`	Training, evaluating, and visualizing segmentation models on real RGB data
`pose_estimation`	Training, evaluating, and visualizing pose estimation models (pose interpreter networks) on synthetic data
`ros-package`	ROS package for real-time object pose estimation on live RGB data
`end_to_end_eval.ipynb`	Evaluation of end-to-end model on real RGB data
`end_to_end_visualize.ipynb`	Demonstration of end-to-end model on real RGB data

Requirements

These are the basic dependencies (tested on Ubuntu 16.04.4 LTS) for training and evaluating models. Note that some components, such as the ROS package, may have additional/conflicting dependencies. Please see the corresponding READMEs for the specific requirements.

Python

python 3.6
pytorch 0.4 and torchvision 0.2 (conda install pytorch=0.4 torchvision=0.2 -c pytorch)
pyyaml
munch
COCO API
pypcd (see here for instructions)
pillow
scikit-image==0.14.2
matplotlib
tqdm
jupyter
tensorboardX
tensorflow (for running TensorBoard)

Blender

Please install Blender 2.79. There is no need to build from source, you can simply download the prebuilt binary and link it at /usr/local/bin/blender.

Getting Started

Dataset

Download the Oil Change dataset (about 15GB) using the following script:

./download_data.sh

Pretrained Models

We provide our pretrained models for segmentation and pose estimation. The segmentation model is trained on real RGB data while the pose estimation model (pose interpreter network) is trained entirely on synthetic data. The segmentation and pose estimation models are separately trained and combined into an end-to-end model for pose estimation on real RGB images.

The pretrained segmentation models can be downloaded using these commands:

cd segmentation
./download_pretrained.sh

The pretrained pose estimation models can be similarly downloaded:

cd pose_estimation
./download_pretrained.sh

Demonstrations

After downloading the pretrained models, please see the following notebooks for demonstrations. In order to run these notebooks, you will need to have all of the requirements listed above installed, including Blender.

Notebook	Purpose
end_to_end_visualize.ipynb	Visualize end-to-end model on real RGB data
segmentation/visualize.ipynb	Visualize pretrained segmentation model on real RGB data
pose_estimation/demo.ipynb	Visualize pretrained pose estimation model (pose interpreter network) on synthetic data that is randomly generated on the fly

Training the System for a New Environment

Our system and pretrained models are tailored towards our own lab testing environment, set of physical objects, and specific Kinect1 camera. To train the system for a new environment, please follow these steps:

Create a RGB segmentation dataset for your environment
Train a segmentation model on the RGB dataset
Calibrate your camera to get the intrinsics
Create .stl and .pcd 3D mesh models for the objects in your dataset
Use the camera intrinsics and .stl 3D mesh models to render a synthetic mask image dataset
Use the synthetic dataset and the .pcd mesh models to train an object mask pose interpreter network
Combine the segmentation model and pose interpreter network into an end-to-end model

Apart from creation of the RGB segmentation dataset, camera calibration, and 3D model creation, you should be able to adapt the existing code to do all of the other steps.

Citation

If you find this work useful for your research, please consider citing:

@inproceedings{wu2018pose,
  title = {Real-Time Object Pose Estimation with Pose Interpreter Networks},
  author = {Wu, Jimmy and Zhou, Bolei and Russell, Rebecca and Kee, Vincent and Wagner, Syler and Hebert, Mitchell and Torralba, Antonio and Johnson, David M.S.},
  booktitle = {IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
  year = {2018},
  doi = {10.1109/IROS.2018.8593662}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 104

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗