arnomoonens / yarll

Licence: MIT license

Combining deep learning and reinforcement learning.

Programming Languages

python

139335 projects - #7 most used programming language

Jupyter Notebook

11667 projects

Projects that are alternatives of or similar to yarll

Easy Rl

Chinese-language reinforcement learning tutorial; read online at https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+3476.19%)

Mutual labels: deep-reinforcement-learning, policy-gradient, sarsa, a3c

deep rl acrobot

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

Stars: ✭ 32 (-61.9%)

Mutual labels: deep-reinforcement-learning, openai-gym, policy-gradient, a3c

Deep-Reinforcement-Learning-Notebooks

This repository contains a series of Google Colab notebooks which I created to help people dive into deep reinforcement learning. These notebooks contain both theory and implementation of different algorithms.

Stars: ✭ 15 (-82.14%)

Mutual labels: deep-reinforcement-learning, sarsa, a3c, soft-actor-critic

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (+164.29%)

Mutual labels: deep-reinforcement-learning, openai-gym, policy-gradient, a3c

Btgym

Scalable, event-driven, deep-learning-friendly backtesting library

Stars: ✭ 765 (+810.71%)

Mutual labels: deep-reinforcement-learning, openai-gym, policy-gradient, a3c

Master-Thesis

Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex

Stars: ✭ 33 (-60.71%)

Mutual labels: deep-reinforcement-learning, reinforcement-learning-algorithms, a3c

Deep-Reinforcement-Learning-CS285-Pytorch

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Stars: ✭ 104 (+23.81%)

Mutual labels: deep-reinforcement-learning, openai-gym, policy-gradient

Deep Reinforcement Learning With Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Stars: ✭ 1,345 (+1501.19%)

Mutual labels: deep-reinforcement-learning, policy-gradient, a3c

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm. This is far superior to and more efficient than DQN and obsoletes it. Can play many games.

Stars: ✭ 118 (+40.48%)

Mutual labels: deep-reinforcement-learning, openai-gym, a3c

Reinforcement Learning With Tensorflow

Simple reinforcement learning tutorials; Morvan Python (莫烦Python) Chinese-language AI teaching

Stars: ✭ 6,948 (+8171.43%)

Mutual labels: policy-gradient, sarsa, a3c

Deep-rl-mxnet

Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO

Stars: ✭ 26 (-69.05%)

Mutual labels: deep-reinforcement-learning, policy-gradient, reinforcement-learning-algorithms

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+4676.19%)

Mutual labels: deep-reinforcement-learning, openai-gym, reinforcement-learning-algorithms

a3c-super-mario-pytorch

Reinforcement Learning for Super Mario Bros using A3C on GPU

Stars: ✭ 35 (-58.33%)

Mutual labels: deep-reinforcement-learning, openai-gym, a3c

Rl a3c pytorch

A3C LSTM Atari with Pytorch plus A3G design

Stars: ✭ 482 (+473.81%)

Mutual labels: deep-reinforcement-learning, openai-gym, a3c

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.

Stars: ✭ 442 (+426.19%)

Mutual labels: deep-reinforcement-learning, policy-gradient, a3c

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+661.9%)

Mutual labels: deep-reinforcement-learning, openai-gym, policy-gradient

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (+369.05%)

Mutual labels: deep-reinforcement-learning, openai-gym, policy-gradient

Slm Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Stars: ✭ 904 (+976.19%)

Mutual labels: deep-reinforcement-learning, policy-gradient, a3c

Reinforcement Learning

Minimal and Clean Reinforcement Learning Examples

Stars: ✭ 2,863 (+3308.33%)

Mutual labels: deep-reinforcement-learning, policy-gradient, a3c

Cs234 Reinforcement Learning Winter 2019

My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019

Stars: ✭ 93 (+10.71%)

Mutual labels: deep-reinforcement-learning, openai-gym


Yet Another Reinforcement Learning Library (YARLL)

Update 14/05/2021: Added PyTorch implementation of REINFORCE.
Update 11/05/2021: Added PyTorch implementation of SAC.
Update 13/04/2021: Converted DDPG to TensorFlow 2.

Status

The following algorithms are currently implemented (in no particular order):

Advantage Actor Critic
Asynchronous Advantage Actor Critic (A3C)
Deep Deterministic Policy Gradient (DDPG)
Proximal Policy Optimization (PPO)
Soft Actor-Critic (SAC) (TF2, PyTorch)
Trust Region Policy Optimization (TRPO)
REINFORCE (TF2, PyTorch) (convolutional neural network part has not been tested yet)
Cross-Entropy Method
Q-Learning
Deep Q-Learning
Fitted Q Iteration
Sarsa with function approximation and eligibility traces
(Sequential) knowledge transfer
Asynchronous knowledge transfer

Asynchronous Advantage Actor Critic (A3C)

The code for this algorithm can be found here. Example run after training using 16 threads for a total of 5 million timesteps on the PongDeterministic-v4 environment:

How to run

First, install the library using pip (you can first remove OpenCV from the setup.py file if it is already installed):

pip install yarll

To use the library on a specific branch or to use it while changing the code, you can add the path to the library to your $PYTHONPATH (e.g., in your .bashrc or .zshrc file):

export PYTHONPATH=/path/to/yarll:$PYTHONPATH

Alternatively, you can add a symlink from your site-packages to the yarll directory.
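
With either approach, it can help to confirm which copy of yarll Python actually picks up. The following is a minimal sketch using only the standard library; the helper name resolve_package_dir is hypothetical and not part of yarll.

```python
import importlib.util
from pathlib import Path
from typing import Optional


def resolve_package_dir(package: str) -> Optional[Path]:
    """Return the directory Python would import `package` from, or None.

    Hypothetical helper for illustration; it is not part of yarll.
    """
    try:
        spec = importlib.util.find_spec(package)
    except (ImportError, ValueError):
        return None
    # Namespace packages have no single origin file, so treat them as unresolved.
    if spec is None or spec.origin is None:
        return None
    return Path(spec.origin).resolve().parent


if __name__ == "__main__":
    location = resolve_package_dir("yarll")
    if location is None:
        print("yarll is not importable; check $PYTHONPATH or the symlink")
    else:
        print(f"yarll will be imported from {location}")
```

If the printed directory is not your working copy, an installed version is probably shadowing it.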

Algorithms/experiments

You can run algorithms by passing the path to an experiment specification (a file in JSON format) to main.py:

python yarll/main.py <path_to_experiment_specification>

You can see all the possible arguments by running python yarll/main.py -h.

Examples of experiment specifications can be found in the experiment_specs folder.
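
When launching several experiments from a script, the command above can be wrapped in a small helper. This is a sketch under assumptions: the function names are hypothetical, the default script path assumes you run from the repository root, and the only check on the specification is that it parses as JSON (its fields are not inspected).

```python
import json
import subprocess
import sys
from pathlib import Path
from typing import List


def build_experiment_command(spec_path: str, main_script: str = "yarll/main.py") -> List[str]:
    """Check that the specification is valid JSON and return the command to run it.

    Hypothetical helper; `main_script` assumes the repository root as working directory.
    """
    spec = Path(spec_path)
    if not spec.is_file():
        raise FileNotFoundError(f"No experiment specification at {spec}")
    # Fail early on malformed JSON instead of after the library has started up.
    with spec.open(encoding="utf-8") as handle:
        json.load(handle)
    return [sys.executable, main_script, str(spec)]


def run_experiment(spec_path: str) -> None:
    """Run one experiment and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_experiment_command(spec_path), check=True)
```

Calling run_experiment with the path to one of the files in experiment_specs is then equivalent to typing the command by hand.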

Statistics

Statistics can be plotted using:

python -m yarll.misc.plot_statistics <path_to_stats>

<path_to_stats> can be one of 2 things:

A JSON file generated using gym.wrappers.Monitor, in which case the episode lengths and total reward per episode are plotted.
A directory containing TensorFlow scalar summaries for different tasks, in which case all of the found scalars are plotted.
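
The two cases can be told apart before plotting, for example in a wrapper script. The sketch below is only an illustration: how plot_statistics itself distinguishes them is not stated here, and the function name, the return labels, and the .json suffix check are assumptions.

```python
from pathlib import Path


def classify_stats_path(path_to_stats: str) -> str:
    """Label a statistics path as 'summary_dir' or 'monitor_json'.

    Hypothetical helper; the labels and the suffix check are assumptions
    made for illustration, not yarll behaviour.
    """
    path = Path(path_to_stats)
    if path.is_dir():
        # A directory is expected to hold TensorFlow scalar summaries.
        return "summary_dir"
    if path.is_file() and path.suffix.lower() == ".json":
        # A single JSON file is expected to come from gym.wrappers.Monitor.
        return "monitor_json"
    raise ValueError(f"{path} is neither a directory nor a .json file")
```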

Help about other arguments (e.g. for using smoothing) can be found by executing python -m yarll.misc.plot_statistics -h.
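
To give an idea of what smoothing does to a noisy curve of episode rewards, here is a minimal sketch of a trailing moving average. The smoothing method that plot_statistics actually applies is not described here, so both the technique and the function name are assumptions.

```python
from typing import List, Sequence


def moving_average(values: Sequence[float], window: int) -> List[float]:
    """Trailing moving average; early points average over the values seen so far.

    Illustrative only; not necessarily the smoothing used by yarll.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    smoothed: List[float] = []
    running_sum = 0.0
    for index, value in enumerate(values):
        running_sum += value
        if index >= window:
            # Drop the value that just fell out of the window.
            running_sum -= values[index - window]
        count = min(index + 1, window)
        smoothed.append(running_sum / count)
    return smoothed


rewards = [1.0, 2.0, 3.0, 4.0]
print(moving_average(rewards, window=2))  # [1.0, 1.5, 2.5, 3.5]
```

A larger window gives a smoother curve but reacts more slowly to real changes in performance.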

Alternatively, you can use TensorBoard to show statistics in the browser by passing the directory with the scalar summaries as the --logdir argument.
