All Projects → Khrylx → Pytorch Rl

Khrylx / Pytorch Rl

Licence: mit
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Pytorch Rl

Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (-2.74%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient, trpo
Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+104.41%)
Mutual labels:  deep-reinforcement-learning, ppo, policy-gradient, trpo
Deeprl Tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (-51.52%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, trpo
Deeprl algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (-85.26%)
Mutual labels:  deep-reinforcement-learning, ppo, policy-gradient, trpo
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+37.39%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-44.68%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient
Easy Rl
强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+356.53%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (-66.26%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, trpo, ppo
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (-40.12%)
Mutual labels:  reinforcement-learning, generative-adversarial-network, deep-reinforcement-learning, policy-gradient
Rlgraph
RLgraph: Modular computation graphs for deep reinforcement learning
Stars: ✭ 272 (-58.66%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo
Pytorch Trpo
PyTorch implementation of Trust Region Policy Optimization
Stars: ✭ 303 (-53.95%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, trpo
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (-32.83%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, trpo
Rad
RAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-59.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-52.43%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
td-reg
TD-Regularized Actor-Critic Methods
Stars: ✭ 28 (-95.74%)
Mutual labels:  policy-gradient, trpo, ppo
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-32.83%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (-27.05%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Stars: ✭ 54 (-91.79%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ppo
Reinforcement Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Stars: ✭ 3,329 (+405.93%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (-12.61%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo

PyTorch implementation of reinforcement learning algorithms

This repository contains:

  1. policy gradient methods (TRPO, PPO, A2C)
  2. Generative Adversarial Imitation Learning (GAIL)

Important notes

  • The code now works for PyTorch 0.4. For PyTorch 0.3, please check out the 0.3 branch.
  • To run mujoco environments, first install mujoco-py and gym.
  • If you have a GPU, I recommend setting the OMP_NUM_THREADS to 1 (PyTorch will create additional threads when performing computations which can damage the performance of multiprocessing. This problem is most serious with Linux, where multiprocessing can be even slower than a single thread):
export OMP_NUM_THREADS=1

Features

  • Support discrete and continous action space.
  • Support multiprocessing for agent to collect samples in multiple environments simultaneously. (x8 faster than single thread)
  • Fast Fisher vector product calculation. For this part, Ankur kindly wrote a blog explaining the implementation details.

Policy gradient methods

Example

  • python examples/ppo_gym.py --env-name Hopper-v2

Reference

Generative Adversarial Imitation Learning (GAIL)

To save trajectory

  • python gail/save_expert_traj.py --model-path assets/learned_models/Hopper-v2_ppo.p

To do imitation learning

  • python gail/gail_gym.py --env-name Hopper-v2 --expert-traj-path assets/expert_traj/Hopper-v2_expert_traj.p
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].