All Projects → nav74neet → ddpg_biped

nav74neet / ddpg_biped

Licence: other
Repository for Planar Bipedal walking robot in Gazebo environment using Deep Deterministic Policy Gradient(DDPG) using TensorFlow.

Programming Languages

python
139335 projects - #7 most used programming language
CMake
9771 projects

Projects that are alternatives of or similar to ddpg biped

motion-planner-reinforcement-learning
End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo
Stars: ✭ 99 (+52.31%)
Mutual labels:  ddpg, ros-kinetic, deep-deterministic-policy-gradient
pytorch-rl
Pytorch Implementation of RL algorithms
Stars: ✭ 15 (-76.92%)
Mutual labels:  openai-gym, ddpg
Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Ensemble-Strategy-ICAIF-2020
Live Trading. Please star.
Stars: ✭ 1,251 (+1824.62%)
Mutual labels:  openai-gym, ddpg
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+6072.31%)
Mutual labels:  openai-gym, ddpg
deep rl acrobot
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
Stars: ✭ 32 (-50.77%)
Mutual labels:  openai-gym, ddpg
deep-rl-quadcopter
Implementation of Deep Deterministic Policy Gradients (DDPG) to teach a Quadcopter How to Fly!
Stars: ✭ 17 (-73.85%)
Mutual labels:  ddpg, deep-deterministic-policy-gradient
Pytorch Ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
Stars: ✭ 272 (+318.46%)
Mutual labels:  openai-gym, ddpg
TF2-RL
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]
Stars: ✭ 160 (+146.15%)
Mutual labels:  openai-gym, ddpg
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+580%)
Mutual labels:  openai-gym, ddpg
Deep Reinforcement Learning For Automated Stock Trading Ensemble Strategy Icaif 2020
Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy. ICAIF 2020. Please star.
Stars: ✭ 518 (+696.92%)
Mutual labels:  openai-gym, ddpg
deeprl-continuous-control
Learning Continuous Control in Deep Reinforcement Learning
Stars: ✭ 14 (-78.46%)
Mutual labels:  ddpg, ddpg-algorithm
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+241.54%)
Mutual labels:  openai-gym, ddpg
summit xl sim
Packages for the simulation of the Summit XL, Summit XL HL and Summit-X (including X-WAM) robots
Stars: ✭ 32 (-50.77%)
Mutual labels:  ros-kinetic, gazebo-simulator
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+10589.23%)
Mutual labels:  ddpg, deep-deterministic-policy-gradient
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+506.15%)
Mutual labels:  openai-gym, ddpg
Openaigym
Solving OpenAI Gym problems.
Stars: ✭ 98 (+50.77%)
Mutual labels:  openai-gym, ddpg
UAV-DDPG
Code for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"
Stars: ✭ 133 (+104.62%)
Mutual labels:  ddpg, deep-deterministic-policy-gradient
Obstacle Avoidance ROS
Autonomous Obstacle Avoidance Robot ROS + GAZEBO
Stars: ✭ 52 (-20%)
Mutual labels:  ros-kinetic
gym-mtsim
A general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)
Stars: ✭ 196 (+201.54%)
Mutual labels:  openai-gym
pong-from-pixels
Training a Neural Network to play Pong from pixels
Stars: ✭ 25 (-61.54%)
Mutual labels:  openai-gym

Reinforcement Learning for Bipedal walking robot.

This repository contains the simulation architecture based in Gazebo environment for implementing reinforcement learning algorithm, DDPG for generating bipedal walking patterns for the robot.

Planar Bipedal walking robot in Gazebo environment using Deep Deterministic Policy Gradient(DDPG).

The autonomous walking of the bipedal walking robot is achieved using reinforcement learning algorithm called Deep Deterministic Policy Gradient(DDPG)1. DDPG utilises the actor-critic learning framework for learning controls in continuous action spaces.

The project details & the results of the experiment have been documented in the research manuscript, Bipedal walking robot using Deep Deterministic Policy Gradient

This project was developed at the Computational Intelligence Laboratory, IISc, Bangalore.

What you need before starting (Dependencies & Packages):

File setup:

  • walker_gazebo contains the robot model(both .stl files & .urdf file) and also the gazebo launch file.

  • walker_controller contains the reinforcement learning implementation of DDPG algorithm for control of the bipedal walking robot.

Learning to walk, initial baby steps

Stable bipedal walking

[Project video]

Note: A stable bipedal walking was acheived after training the model using a Nvidia GeForce GTX 1050 Ti GPU enabled system for over 41 hours. The visualization for the horizontal boom(attached to the waist) is turned off.

Sources:

  1. Lillicrap, Timothy P., et al. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015).
  2. Silver, David, et al. Deterministic Policy Gradient Algorithms. ICML (2014).

Project Collaborator(s):

Arun Kumar ([email protected]) & Dr. S N Omkar ([email protected])

Future work

Implement state of the art RL algorithms(TRPO & PPO) for the same. Hopefully lead to faster training and less convergence time.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].