Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → TianhongDai → Hindsight Experience Replay

TianhongDai / Hindsight Experience Replay

Licence: mit

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Programming Languages

139335 projects - #7 most used programming language

Labels

reinforcement-learning ddpg

Projects that are alternatives of or similar to Hindsight Experience Replay

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Stars: ✭ 395 (+194.78%)

Mutual labels: reinforcement-learning, ddpg

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Stars: ✭ 6,948 (+5085.07%)

Mutual labels: reinforcement-learning, ddpg

Autonomous Learning Library

A PyTorch library for building deep reinforcement learning agents.

Stars: ✭ 425 (+217.16%)

Mutual labels: reinforcement-learning, ddpg

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (+171.64%)

Mutual labels: reinforcement-learning, ddpg

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

Stars: ✭ 90 (-32.84%)

Mutual labels: reinforcement-learning, ddpg

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+2894.03%)

Mutual labels: reinforcement-learning, ddpg

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.

Stars: ✭ 575 (+329.1%)

Mutual labels: reinforcement-learning, ddpg

Train Robot Arm From Scratch

Build environment and train a robot arm from scratch (Reinforcement Learning)

Stars: ✭ 242 (+80.6%)

Mutual labels: reinforcement-learning, ddpg

Reinforcement Learning Wechat Jump

Reinforcement Learning for WeChat Jump

Stars: ✭ 85 (-36.57%)

Mutual labels: reinforcement-learning, ddpg

Run Skeleton Run

Reason8.ai PyTorch solution for NIPS RL 2017 challenge

Stars: ✭ 83 (-38.06%)

Mutual labels: reinforcement-learning, ddpg

Deeprl Tensorflow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Stars: ✭ 319 (+138.06%)

Mutual labels: reinforcement-learning, ddpg

Reinforcement Learning

🤖 Implements of Reinforcement Learning algorithms.

Stars: ✭ 104 (-22.39%)

Mutual labels: reinforcement-learning, ddpg

An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

Stars: ✭ 313 (+133.58%)

Mutual labels: reinforcement-learning, ddpg

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (+194.03%)

Mutual labels: reinforcement-learning, ddpg

Pytorch Ddpg Naf

Implementation of algorithms for continuous control (DDPG and NAF).

Stars: ✭ 254 (+89.55%)

Mutual labels: reinforcement-learning, ddpg

Python library for Reinforcement Learning.

Stars: ✭ 442 (+229.85%)

Mutual labels: reinforcement-learning, ddpg

A library for ready-made reinforcement learning agents and reusable components for neat prototyping

Stars: ✭ 184 (+37.31%)

Mutual labels: reinforcement-learning, ddpg

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Stars: ✭ 233 (+73.88%)

Mutual labels: reinforcement-learning, ddpg

Policy Gradient Methods

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Stars: ✭ 54 (-59.7%)

Mutual labels: reinforcement-learning, ddpg

Solving OpenAI Gym problems.

Stars: ✭ 98 (-26.87%)

Mutual labels: reinforcement-learning, ddpg

View All Similar Projects ➔

Hindsight Experience Replay (HER)

This is a pytorch implementation of Hindsight Experience Replay.

Acknowledgement:

Openai Baselines

Requirements

python=3.5.2
openai-gym=0.12.5 (mujoco200 is supported, but you need to use gym >= 0.12.5, it has a bug in the previous version.)
mujoco-py=1.50.1.56 (~~Please use this version, if you use mujoco200, you may failed in the FetchSlide-v1~~)
pytorch=1.0.0 (If you use pytorch-0.4.1, you may have data type errors. I will fix it later.)
mpi4py

TODO List

[x] support GPU acceleration - although I have added GPU support, but I still not recommend if you don't have a powerful machine.
[x] add multi-env per MPI.
[x] add the plot and demo of the FetchSlide-v1.

Instruction to run the code

If you want to use GPU, just add the flag --cuda (Not Recommended, Better Use CPU).

train the FetchReach-v1:

mpirun -np 1 python -u train.py --env-name='FetchReach-v1' --n-cycles=10 2>&1 | tee reach.log

train the FetchPush-v1:

mpirun -np 8 python -u train.py --env-name='FetchPush-v1' 2>&1 | tee push.log

train the FetchPickAndPlace-v1:

mpirun -np 16 python -u train.py --env-name='FetchPickAndPlace-v1' 2>&1 | tee pick.log

train the FetchSlide-v1:

mpirun -np 8 python -u train.py --env-name='FetchSlide-v1' --n-epochs=200 2>&1 | tee slide.log

Play Demo

python demo.py --env-name=<environment name>

Download the Pre-trained Model

Please download them from the Google Driver, then put the saved_models under the current folder.

Results

Training Performance

It was plotted by using 5 different seeds, the solid line is the median value.

Demo:

Tips: when you watch the demo, you can press TAB to switch the camera in the mujoco.

FetchPush-v1	FetchPickAndPlace-v1	FetchSlide-v1

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 134

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗