mjacar / Pytorch Trpo
Licence: mit
PyTorch Implementation of Trust Region Policy Optimization (TRPO)
Stars: ✭ 123
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Pytorch Trpo
Top Deep Learning
Top 200 deep learning Github repositories sorted by the number of stars.
Stars: ✭ 1,365 (+1009.76%)
Mutual labels: deep-reinforcement-learning
Deeptraffic
DeepTraffic is a deep reinforcement learning competition, part of the MIT Deep Learning series.
Stars: ✭ 1,528 (+1142.28%)
Mutual labels: deep-reinforcement-learning
Drl Portfolio Management
CSCI 599 deep learning and its applications final project
Stars: ✭ 121 (-1.63%)
Mutual labels: deep-reinforcement-learning
Torchrl
Highly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (-17.07%)
Mutual labels: deep-reinforcement-learning
Aws Robomaker Sample Application Deepracer
Use AWS RoboMaker and demonstrate running a simulation which trains a reinforcement learning (RL) model to drive a car around a track
Stars: ✭ 105 (-14.63%)
Mutual labels: deep-reinforcement-learning
Tetris Ai
A deep reinforcement learning bot that plays tetris
Stars: ✭ 109 (-11.38%)
Mutual labels: deep-reinforcement-learning
Samsung Drl Code
Repository for codes of Deep Reinforcement Learning (DRL) lectured at Samsung
Stars: ✭ 99 (-19.51%)
Mutual labels: deep-reinforcement-learning
Rl Medical
Deep Reinforcement Learning (DRL) agents applied to medical images
Stars: ✭ 123 (+0%)
Mutual labels: deep-reinforcement-learning
Easy Rl
强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+2342.28%)
Mutual labels: deep-reinforcement-learning
Deep reinforcement learning
Resources, papers, tutorials
Stars: ✭ 119 (-3.25%)
Mutual labels: deep-reinforcement-learning
Intro To Deep Learning
A collection of materials to help you learn about deep learning
Stars: ✭ 103 (-16.26%)
Mutual labels: deep-reinforcement-learning
Macad Gym
Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:
Stars: ✭ 106 (-13.82%)
Mutual labels: deep-reinforcement-learning
Hierarchical Actor Critic Hac Pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Stars: ✭ 116 (-5.69%)
Mutual labels: deep-reinforcement-learning
Deep Reinforcement Learning Notes
Deep Reinforcement Learning Notes
Stars: ✭ 101 (-17.89%)
Mutual labels: deep-reinforcement-learning
Deep Rl Tensorflow
TensorFlow implementation of Deep Reinforcement Learning papers
Stars: ✭ 1,552 (+1161.79%)
Mutual labels: deep-reinforcement-learning
Exploration By Disagreement
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
Stars: ✭ 99 (-19.51%)
Mutual labels: deep-reinforcement-learning
A3c Pytorch
PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch
Stars: ✭ 108 (-12.2%)
Mutual labels: deep-reinforcement-learning
Rl Quadcopter
Teach a Quadcopter How to Fly!
Stars: ✭ 124 (+0.81%)
Mutual labels: deep-reinforcement-learning
Advanced Deep Learning And Reinforcement Learning Deepmind
🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉
Stars: ✭ 121 (-1.63%)
Mutual labels: deep-reinforcement-learning
Reinforcementlearning Atarigame
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
Stars: ✭ 118 (-4.07%)
Mutual labels: deep-reinforcement-learning
PyTorch implementation of TRPO
This repo contains a PyTorch implementation of a Trust Region Policy Optimization agent for an environment with a discrete action space.
Environment Setup
-
Install conda for Python 2.7.
conda create --name trpo --file requirements/conda_requirements.txt
source activate trpo
pip install -r requirements/pip_requirements.txt
- Install PyTorch from source at commit eff5b8b.
Usage
python run_trpo.py --env=GYM_ENV_ID
where GYM_ENV_ID is the environment ID of the gym environment you which to train on.
Results
A game of Pong as played using the policy model learned from a TRPO agent
Plot of total reward per episode of Pong vs. episode number
Related Repos
OpenAI's Baseline implementation of parallel TRPO in TensorFlow
Ilya Kostrikov's implementation of TRPO for continuous control in PyTorch
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].