All Projects → TianhongDai → Hindsight Experience Replay

TianhongDai / Hindsight Experience Replay

Licence: mit
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Hindsight Experience Replay

Deep Rl Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Stars: ✭ 395 (+194.78%)
Mutual labels:  reinforcement-learning, ddpg
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+5085.07%)
Mutual labels:  reinforcement-learning, ddpg
Autonomous Learning Library
A PyTorch library for building deep reinforcement learning agents.
Stars: ✭ 425 (+217.16%)
Mutual labels:  reinforcement-learning, ddpg
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+171.64%)
Mutual labels:  reinforcement-learning, ddpg
Torchrl
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Stars: ✭ 90 (-32.84%)
Mutual labels:  reinforcement-learning, ddpg
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+2894.03%)
Mutual labels:  reinforcement-learning, ddpg
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (+329.1%)
Mutual labels:  reinforcement-learning, ddpg
Train Robot Arm From Scratch
Build environment and train a robot arm from scratch (Reinforcement Learning)
Stars: ✭ 242 (+80.6%)
Mutual labels:  reinforcement-learning, ddpg
Reinforcement Learning Wechat Jump
Reinforcement Learning for WeChat Jump
Stars: ✭ 85 (-36.57%)
Mutual labels:  reinforcement-learning, ddpg
Run Skeleton Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
Stars: ✭ 83 (-38.06%)
Mutual labels:  reinforcement-learning, ddpg
Deeprl Tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (+138.06%)
Mutual labels:  reinforcement-learning, ddpg
Reinforcement Learning
🤖 Implements of Reinforcement Learning algorithms.
Stars: ✭ 104 (-22.39%)
Mutual labels:  reinforcement-learning, ddpg
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (+133.58%)
Mutual labels:  reinforcement-learning, ddpg
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+194.03%)
Mutual labels:  reinforcement-learning, ddpg
Pytorch Ddpg Naf
Implementation of algorithms for continuous control (DDPG and NAF).
Stars: ✭ 254 (+89.55%)
Mutual labels:  reinforcement-learning, ddpg
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+229.85%)
Mutual labels:  reinforcement-learning, ddpg
Rlcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
Stars: ✭ 184 (+37.31%)
Mutual labels:  reinforcement-learning, ddpg
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+73.88%)
Mutual labels:  reinforcement-learning, ddpg
Policy Gradient Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Stars: ✭ 54 (-59.7%)
Mutual labels:  reinforcement-learning, ddpg
Openaigym
Solving OpenAI Gym problems.
Stars: ✭ 98 (-26.87%)
Mutual labels:  reinforcement-learning, ddpg

Hindsight Experience Replay (HER)

This is a pytorch implementation of Hindsight Experience Replay.

Acknowledgement:

Requirements

  • python=3.5.2
  • openai-gym=0.12.5 (mujoco200 is supported, but you need to use gym >= 0.12.5, it has a bug in the previous version.)
  • mujoco-py=1.50.1.56 (Please use this version, if you use mujoco200, you may failed in the FetchSlide-v1)
  • pytorch=1.0.0 (If you use pytorch-0.4.1, you may have data type errors. I will fix it later.)
  • mpi4py

TODO List

  • [x] support GPU acceleration - although I have added GPU support, but I still not recommend if you don't have a powerful machine.
  • [x] add multi-env per MPI.
  • [x] add the plot and demo of the FetchSlide-v1.

Instruction to run the code

If you want to use GPU, just add the flag --cuda (Not Recommended, Better Use CPU).

  1. train the FetchReach-v1:
mpirun -np 1 python -u train.py --env-name='FetchReach-v1' --n-cycles=10 2>&1 | tee reach.log
  1. train the FetchPush-v1:
mpirun -np 8 python -u train.py --env-name='FetchPush-v1' 2>&1 | tee push.log
  1. train the FetchPickAndPlace-v1:
mpirun -np 16 python -u train.py --env-name='FetchPickAndPlace-v1' 2>&1 | tee pick.log
  1. train the FetchSlide-v1:
mpirun -np 8 python -u train.py --env-name='FetchSlide-v1' --n-epochs=200 2>&1 | tee slide.log

Play Demo

python demo.py --env-name=<environment name>

Download the Pre-trained Model

Please download them from the Google Driver, then put the saved_models under the current folder.

Results

Training Performance

It was plotted by using 5 different seeds, the solid line is the median value. Training_Curve

Demo:

Tips: when you watch the demo, you can press TAB to switch the camera in the mujoco.

FetchPush-v1 FetchPickAndPlace-v1 FetchSlide-v1
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].