
m5823779 / motion-planner-reinforcement-learning

Licence: other
End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo

Programming Languages

python
CMake

Projects that are alternatives of or similar to motion-planner-reinforcement-learning

DDPG
End to End Mobile Robot Navigation using DDPG (Continuous Control with Deep Reinforcement Learning) based on Tensorflow + Gazebo
Stars: ✭ 41 (-58.59%)
Mutual labels:  navigation, deep-reinforcement-learning, gazebo, ddpg
deeprl-continuous-control
Learning Continuous Control in Deep Reinforcement Learning
Stars: ✭ 14 (-85.86%)
Mutual labels:  deep-reinforcement-learning, ddpg, continuous-control
LWDRLC
Lightweight deep RL library for continuous control.
Stars: ✭ 14 (-85.86%)
Mutual labels:  deep-reinforcement-learning, ddpg, continuous-control
ddpg biped
Repository for Planar Bipedal walking robot in Gazebo environment using Deep Deterministic Policy Gradient(DDPG) using TensorFlow.
Stars: ✭ 65 (-34.34%)
Mutual labels:  ddpg, ros-kinetic, deep-deterministic-policy-gradient
deep-rl-quadcopter
Implementation of Deep Deterministic Policy Gradients (DDPG) to teach a Quadcopter How to Fly!
Stars: ✭ 17 (-82.83%)
Mutual labels:  deep-reinforcement-learning, ddpg, deep-deterministic-policy-gradient
Deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
Stars: ✭ 2,640 (+2566.67%)
Mutual labels:  deep-reinforcement-learning, ddpg
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+135.35%)
Mutual labels:  deep-reinforcement-learning, ddpg
AI
Using deep reinforcement learning to solve visual tracking and visual navigation problems
Stars: ✭ 16 (-83.84%)
Mutual labels:  navigation, deep-reinforcement-learning
wpr simulation
No description or website provided.
Stars: ✭ 24 (-75.76%)
Mutual labels:  navigation, gazebo
Object-Goal-Navigation
Pytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"
Stars: ✭ 107 (+8.08%)
Mutual labels:  navigation, deep-reinforcement-learning
neonavigation
A 2-D/3-DOF seamless global/local mobile robot motion planner package for ROS
Stars: ✭ 199 (+101.01%)
Mutual labels:  navigation, motion-planner
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+2558.59%)
Mutual labels:  deep-reinforcement-learning, continuous-control
Machine Learning Is All You Need
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Stars: ✭ 173 (+74.75%)
Mutual labels:  deep-reinforcement-learning, ddpg
Deep Reinforcement Learning Algorithms
31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Stars: ✭ 167 (+68.69%)
Mutual labels:  deep-reinforcement-learning, ddpg
pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
Stars: ✭ 98 (-1.01%)
Mutual labels:  deep-reinforcement-learning, ddpg
Minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Stars: ✭ 2,051 (+1971.72%)
Mutual labels:  deep-reinforcement-learning, ddpg
open manipulator simulations
ROS Simulation for OpenManipulator
Stars: ✭ 15 (-84.85%)
Mutual labels:  gazebo, turtlebot3
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+124.24%)
Mutual labels:  deep-reinforcement-learning, ddpg
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (-60.61%)
Mutual labels:  deep-reinforcement-learning, ddpg
Easy Rl
Chinese-language reinforcement learning tutorial; read online at: https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+2934.34%)
Mutual labels:  deep-reinforcement-learning, ddpg

End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo

The goal is to use a deep reinforcement learning algorithm, Deep Deterministic Policy Gradient (DDPG), to control a mobile robot (turtlebot) so that it avoids obstacles while trying to reach a target.

Goal: Let the robot (turtlebot) navigate to the target (enter the green circle)

Demo video (sped up ten times)

Introduction

More and more service robots are appearing in our daily lives. The key technologies of service robots span many fields, including navigation, system control, mechanism modules, vision modules, voice modules, and artificial intelligence. In this research, we present a learning-based mapless motion planner. The robot can navigate without first building a map with traditional methods such as SLAM.

Input (State):

  1. Laser range readings (10 dimensions)
  2. Past action (linear velocity and angular velocity) (2 dimensions)
  3. Target position in the robot frame, in polar coordinates (2 dimensions)
    • Relative distance
    • Relative angle
  4. Robot yaw angle (1 dimension)
  5. The angle the robot must turn to face the target, i.e. |Yaw - Relative angle| (1 dimension)

  Total: 16 dimensions
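The state layout above can be sketched as a single vector. This is only an illustration, not the code from the repository: the function names `facing_error_deg` and `build_state` are hypothetical, angles are assumed to be in degrees, and wrapping the facing error into [0, 180] is an assumption suggested by the later division by 180.

```python
import numpy as np

def facing_error_deg(yaw_deg, rel_angle_deg):
    """|Yaw - Relative angle|, wrapped into [0, 180] degrees (wrapping is an assumption)."""
    diff = abs(yaw_deg - rel_angle_deg) % 360.0
    return 360.0 - diff if diff > 180.0 else diff

def build_state(laser, past_action, rel_dist, rel_angle_deg, yaw_deg):
    """Concatenate the five input groups into one 16-dimensional vector.

    Layout: [0:10] laser, [10:12] past action, [12] relative distance,
    [13] relative angle, [14] yaw, [15] angle needed to face the target.
    """
    laser = np.asarray(laser, dtype=np.float32)
    past_action = np.asarray(past_action, dtype=np.float32)
    if laser.shape != (10,) or past_action.shape != (2,):
        raise ValueError("expected 10 laser readings and a 2-element past action")
    tail = np.array(
        [rel_dist, rel_angle_deg, yaw_deg, facing_error_deg(yaw_deg, rel_angle_deg)],
        dtype=np.float32,
    )
    return np.concatenate([laser, past_action, tail])

state = build_state([1.0] * 10, (0.1, -0.2), 2.0, 350.0, 10.0)
```

Here the yaw (10°) and the relative angle (350°) differ by 340° numerically, but the robot only needs to turn 20°, which is what the wrapped value stores.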

Normalized input (state):

  1. Laser range readings / maximum laser range
  2. Past action (original values, not rescaled)
  3. Target position in the robot frame
    • Relative distance / diagonal length of the map
    • Relative angle / 360
  4. Robot yaw angle / 360
  5. The angle to face the target / 180
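A minimal sketch of this normalization, assuming the 16 values are stored in the order they are listed (10 laser readings, 2 past-action values, relative distance, relative angle, yaw, angle to face the target). The function name and the example values for the maximum laser range and map diagonal are hypothetical; the source does not state them.

```python
import numpy as np

def normalize_state(state, max_laser_range, map_diagonal):
    """Scale each group of the 16-dimensional state as described above."""
    s = np.asarray(state, dtype=np.float32).copy()
    if s.shape != (16,):
        raise ValueError("expected a 16-dimensional state")
    s[0:10] /= max_laser_range   # laser readings
    # s[10:12] is the past action and is left unchanged
    s[12] /= map_diagonal        # relative distance
    s[13] /= 360.0               # relative angle (degrees)
    s[14] /= 360.0               # yaw (degrees)
    s[15] /= 180.0               # angle to face the target (degrees)
    return s

# Example values only: 3.5 m laser range and 4.0 m map diagonal are assumptions.
raw = [3.5] * 10 + [0.25, -0.5, 2.0, 90.0, 180.0, 90.0]
normalized = normalize_state(raw, max_laser_range=3.5, map_diagonal=4.0)
```

After scaling, every component other than the past action lies in [0, 1], provided the raw values stay within the stated ranges.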

Output (Action):

  1. Linear velocity (0 to 0.25 m/s) (1 dimension)
  2. Angular velocity (-0.5 to 0.5 rad/s) (1 dimension)
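One way to map raw network outputs onto these ranges is sketched below. It assumes the actor emits a linear-velocity output in [0, 1] (for example from a sigmoid) and an angular-velocity output in [-1, 1] (for example from a tanh); the source does not say which activations are used, and `scale_action` is a hypothetical helper.

```python
import numpy as np

MAX_LINEAR = 0.25   # m/s, upper bound from the action description
MAX_ANGULAR = 0.5   # rad/s, symmetric bound from the action description

def scale_action(raw_linear, raw_angular):
    """Clip raw actor outputs to their assumed ranges, then scale to velocities."""
    linear = float(np.clip(raw_linear, 0.0, 1.0)) * MAX_LINEAR
    angular = float(np.clip(raw_angular, -1.0, 1.0)) * MAX_ANGULAR
    return linear, angular

fastest_right_turn = scale_action(1.0, -1.0)
```

Clipping before scaling keeps the command inside the stated limits even when exploration noise pushes the raw output out of range.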

Reward:

  • Arrive at the target: +120
  • Hit the wall: -100
  • Otherwise: 500 * (past relative distance - current relative distance)
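The three reward cases can be written as one small function. This is a sketch: the name `compute_reward`, the boolean flags, and the choice to check arrival before collision are assumptions, since the source only lists the three values.

```python
def compute_reward(past_dist, current_dist, arrived, collided):
    """Return the step reward from the three cases listed above."""
    if arrived:
        return 120.0
    if collided:
        return -100.0
    # Positive when the robot moved closer to the target, negative otherwise.
    return 500.0 * (past_dist - current_dist)

step_reward = compute_reward(1.00, 0.98, arrived=False, collided=False)
```

With this shaping, closing 2 cm on the target in one step is worth about +10, so the terminal rewards of +120 and -100 are comparable to the shaping reward for a few tenths of a metre of progress.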

Algorithm: DDPG (actor with batch normalization, critic without batch normalization)
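For readers new to DDPG, the two update rules at its core are sketched below in NumPy, independent of any network library. This is generic DDPG rather than the repository's implementation; the discount `gamma` and the soft-update rate `tau` shown are commonly used values and are assumptions here.

```python
import numpy as np

def td_target(reward, next_q, done, gamma=0.99):
    """Critic target: y = r + gamma * (1 - done) * Q'(s', mu'(s'))."""
    return reward + gamma * (1.0 - done) * next_q

def soft_update(target_params, online_params, tau=0.001):
    """Move each target-network parameter a small step toward the online network."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target_params, online_params)]

# Two transitions: an ordinary step and a terminal step (target reached).
y = td_target(np.array([1.0, 120.0]), np.array([10.0, 5.0]), np.array([0.0, 1.0]))
new_target = soft_update([np.zeros(2)], [np.ones(2)], tau=0.1)
```

The critic is regressed toward `y`, the actor is updated along the critic's gradient with respect to the action, and both target networks are refreshed with `soft_update` after each training step.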

Training env: gazebo

Installation Dependencies:

  1. Python3

  2. Tensorflow: pip3 install tensorflow-gpu

  3. ROS Kinetic

http://wiki.ros.org/kinetic/Installation/Ubuntu

  4. Gazebo7 (installing ROS Kinetic also installs Gazebo7)

http://gazebosim.org/tutorials?cat=install&tut=install_ubuntu&ver=7.0

How to Run?

cd
mkdir catkin_ws && mkdir catkin_ws/src
cd catkin_ws/src
git clone https://github.com/m5823779/MotionPlannerUsingDDPG.git project
git clone https://github.com/m5823779/turtlebot3
git clone https://github.com/m5823779/turtlebot3_msgs
git clone https://github.com/m5823779/turtlebot3_simulations
cd ..
catkin_make

Then add the following lines to ~/.bashrc

export TURTLEBOT3_MODEL=burger
source /home/"Enter your user name"/catkin_ws/devel/setup.bash

Then enter the following command in a terminal

source ~/.bashrc

Demo: First, run:

roslaunch turtlebot3_gazebo turtlebot3_stage_1.launch

Then, in another terminal, run:

roslaunch project ddpg_stage_1.launch

Train: If you want to retrain the model yourself, change this setting:

is_training = True   # In: project/src/ddpg_stage_1.py

Reference:

Idea:

https://arxiv.org/pdf/1703.00420.pdf

Network structure:

https://github.com/floodsung/DDPG

ROS workspace:

https://github.com/ROBOTIS-GIT/turtlebot3

https://github.com/ROBOTIS-GIT/turtlebot3_msgs

https://github.com/ROBOTIS-GIT/turtlebot3_simulations

https://github.com/dranaju/project
