Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

Khrylx / Pytorch Rl

Licence: mit

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Programming Languages

python

139335 projects - #7 most used programming language

Labels

pytorch reinforcement-learning generative-adversarial-network deep-reinforcement-learning ppo policy-gradient trpo

Projects that are alternatives of or similar to Pytorch Rl

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (-2.74%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient, trpo

Deep Reinforcement Learning With Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Stars: ✭ 1,345 (+104.41%)

Mutual labels: deep-reinforcement-learning, ppo, policy-gradient, trpo

Deeprl Tensorflow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Stars: ✭ 319 (-51.52%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, trpo

Deeprl algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Stars: ✭ 97 (-85.26%)

Mutual labels: deep-reinforcement-learning, ppo, policy-gradient, trpo

Slm Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Stars: ✭ 904 (+37.39%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (-44.68%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient

Easy Rl

强化学习中文教程，在线阅读地址：https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+356.53%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, policy-gradient

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (-66.26%)

Mutual labels: deep-reinforcement-learning, policy-gradient, trpo, ppo

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (-40.12%)

Mutual labels: reinforcement-learning, generative-adversarial-network, deep-reinforcement-learning, policy-gradient

Rlgraph

RLgraph: Modular computation graphs for deep reinforcement learning

Stars: ✭ 272 (-58.66%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo

Pytorch Trpo

PyTorch implementation of Trust Region Policy Optimization

Stars: ✭ 303 (-53.95%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, trpo

Mushroom Rl

Python library for Reinforcement Learning.

Stars: ✭ 442 (-32.83%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, trpo

Rad

RAD: Reinforcement Learning with Augmented Data

Stars: ✭ 268 (-59.27%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo

Openai lab

An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

Stars: ✭ 313 (-52.43%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient

td-reg

TD-Regularized Actor-Critic Methods

Stars: ✭ 28 (-95.74%)

Mutual labels: policy-gradient, trpo, ppo

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (-32.83%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient

Tensorflow Reinforce

Implementations of Reinforcement Learning Models in Tensorflow

Stars: ✭ 480 (-27.05%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient

Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Stars: ✭ 54 (-91.79%)

Mutual labels: deep-reinforcement-learning, policy-gradient, ppo

Reinforcement Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Stars: ✭ 3,329 (+405.93%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo

Elegantrl

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.

Stars: ✭ 575 (-12.61%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo

View All Similar Projects ➔

PyTorch implementation of reinforcement learning algorithms

This repository contains:

policy gradient methods (TRPO, PPO, A2C)
Generative Adversarial Imitation Learning (GAIL)

Important notes

The code now works for PyTorch 0.4. For PyTorch 0.3, please check out the 0.3 branch.
To run mujoco environments, first install mujoco-py and gym.
If you have a GPU, I recommend setting the OMP_NUM_THREADS to 1 (PyTorch will create additional threads when performing computations which can damage the performance of multiprocessing. This problem is most serious with Linux, where multiprocessing can be even slower than a single thread):

export OMP_NUM_THREADS=1

Features

Support discrete and continous action space.
Support multiprocessing for agent to collect samples in multiple environments simultaneously. (x8 faster than single thread)
Fast Fisher vector product calculation. For this part, Ankur kindly wrote a blog explaining the implementation details.

Policy gradient methods

Example

python examples/ppo_gym.py --env-name Hopper-v2

Reference

Generative Adversarial Imitation Learning (GAIL)

To save trajectory

python gail/save_expert_traj.py --model-path assets/learned_models/Hopper-v2_ppo.p

To do imitation learning

python gail/gail_gym.py --env-name Hopper-v2 --expert-traj-path assets/expert_traj/Hopper-v2_expert_traj.p

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 658

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (6) 🔗