Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → mjacar → Pytorch Trpo

mjacar / Pytorch Trpo

Licence: mit

PyTorch Implementation of Trust Region Policy Optimization (TRPO)

Programming Languages

139335 projects - #7 most used programming language

Labels

pytorch deep-reinforcement-learning

Projects that are alternatives of or similar to Pytorch Trpo

Top Deep Learning

Top 200 deep learning Github repositories sorted by the number of stars.

Stars: ✭ 1,365 (+1009.76%)

Mutual labels: deep-reinforcement-learning

DeepTraffic is a deep reinforcement learning competition, part of the MIT Deep Learning series.

Stars: ✭ 1,528 (+1142.28%)

Mutual labels: deep-reinforcement-learning

Drl Portfolio Management

CSCI 599 deep learning and its applications final project

Stars: ✭ 121 (-1.63%)

Mutual labels: deep-reinforcement-learning

Highly Modular and Scalable Reinforcement Learning

Stars: ✭ 102 (-17.07%)

Mutual labels: deep-reinforcement-learning

Aws Robomaker Sample Application Deepracer

Use AWS RoboMaker and demonstrate running a simulation which trains a reinforcement learning (RL) model to drive a car around a track

Stars: ✭ 105 (-14.63%)

Mutual labels: deep-reinforcement-learning

A deep reinforcement learning bot that plays tetris

Stars: ✭ 109 (-11.38%)

Mutual labels: deep-reinforcement-learning

Samsung Drl Code

Repository for codes of Deep Reinforcement Learning (DRL) lectured at Samsung

Stars: ✭ 99 (-19.51%)

Mutual labels: deep-reinforcement-learning

Deep Reinforcement Learning (DRL) agents applied to medical images

Stars: ✭ 123 (+0%)

Mutual labels: deep-reinforcement-learning

强化学习中文教程，在线阅读地址：https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+2342.28%)

Mutual labels: deep-reinforcement-learning

Deep reinforcement learning

Resources, papers, tutorials

Stars: ✭ 119 (-3.25%)

Mutual labels: deep-reinforcement-learning

Intro To Deep Learning

A collection of materials to help you learn about deep learning

Stars: ✭ 103 (-16.26%)

Mutual labels: deep-reinforcement-learning

Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:

Stars: ✭ 106 (-13.82%)

Mutual labels: deep-reinforcement-learning

Hierarchical Actor Critic Hac Pytorch

PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments

Stars: ✭ 116 (-5.69%)

Mutual labels: deep-reinforcement-learning

Deep Reinforcement Learning Notes

Deep Reinforcement Learning Notes

Stars: ✭ 101 (-17.89%)

Mutual labels: deep-reinforcement-learning

Deep Rl Tensorflow

TensorFlow implementation of Deep Reinforcement Learning papers

Stars: ✭ 1,552 (+1161.79%)

Mutual labels: deep-reinforcement-learning

Exploration By Disagreement

[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement

Stars: ✭ 99 (-19.51%)

Mutual labels: deep-reinforcement-learning

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Stars: ✭ 108 (-12.2%)

Mutual labels: deep-reinforcement-learning

Teach a Quadcopter How to Fly!

Stars: ✭ 124 (+0.81%)

Mutual labels: deep-reinforcement-learning

Advanced Deep Learning And Reinforcement Learning Deepmind

🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉

Stars: ✭ 121 (-1.63%)

Mutual labels: deep-reinforcement-learning

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

Stars: ✭ 118 (-4.07%)

Mutual labels: deep-reinforcement-learning

View All Similar Projects ➔

PyTorch implementation of TRPO

This repo contains a PyTorch implementation of a Trust Region Policy Optimization agent for an environment with a discrete action space.

Environment Setup

Install conda for Python 2.7.

conda create --name trpo --file requirements/conda_requirements.txt
source activate trpo
pip install -r requirements/pip_requirements.txt

Install PyTorch from source at commit eff5b8b.

Usage

python run_trpo.py --env=GYM_ENV_ID

where GYM_ENV_ID is the environment ID of the gym environment you which to train on.

Results

A game of Pong as played using the policy model learned from a TRPO agent

Plot of total reward per episode of Pong vs. episode number

Related Repos

OpenAI's Baseline implementation of parallel TRPO in TensorFlow

Ilya Kostrikov's implementation of TRPO for continuous control in PyTorch

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 123

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗