rlgraph / Rlgraph

Licence: apache-2.0

RLgraph: Modular computation graphs for deep reinforcement learning

RLgraph

Modular computation graphs for deep reinforcement learning.

RLgraph is a framework to quickly prototype, define and execute reinforcement learning algorithms both in research and practice. RLgraph differs from most other libraries in that it supports both static-graph execution (TensorFlow) and eager/define-by-run execution (PyTorch) through a single component interface. An introductory blog post can also be found here: link.

RLgraph exposes a well-defined API for using agents, and offers a novel component concept for testing and assembling machine learning models. Because graph definition, compilation and execution are separated, multiple distributed backends and device execution strategies can be accessed without modifying agent definitions. This makes RLgraph especially suited for a smooth transition from applied use-case prototypes to large-scale distributed training.

RLgraph is currently at version 0.4.0 and in alpha state. The core engine is substantially complete and works for TensorFlow and PyTorch (1.0). Distributed execution on Ray is exemplified via Distributed Prioritized Experience Replay (Ape-X), which also supports multi-GPU mode and solves e.g. Atari Pong in ~1 hour on a single node. Algorithms like Ape-X or PPO can be used with both PyTorch and TensorFlow. Distributed TensorFlow can be tested via the IMPALA agent. Please create an issue to discuss improvements or contributions.

RLgraph currently implements the following algorithms:

DQN - dqn_agent - paper
Double-DQN - dqn_agent - via double_dqn flag - paper
Dueling-DQN - dqn_agent - via dueling_dqn flag - paper
Prioritized experience replay - via memory_spec option prioritized_replay - paper
Deep-Q learning from demonstration - dqfd_agent - paper
Distributed prioritized experience replay (Ape-X) on Ray - via apex_executor - paper
Importance-weighted actor-learner architecture (IMPALA) on distributed TF/multi-threaded single-node - impala_agents - paper
Proximal policy optimization with generalized advantage estimation - ppo_agent - paper
Soft Actor-Critic (SAC) - sac_agent - paper
Simple actor-critic for REINFORCE/A2C/A3C - actor_critic_agent - paper
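As a rough illustration of how the DQN variants in this list might be switched on, the sketch below builds a configuration as a Python dict and serialises it to JSON. Only the names double_dqn, dueling_dqn, memory_spec and prioritized_replay come from the list; the "type" keys, the nesting and the capacity value are assumptions, so check the agent API reference for the exact schema.

```python
import json

# Hypothetical DQN agent configuration. Only `double_dqn`, `dueling_dqn`,
# `memory_spec` and `prioritized_replay` are named in the algorithm list;
# the "type" keys and `capacity` are assumptions made for illustration.
dqn_config = {
    "type": "dqn",                     # assumed key
    "double_dqn": True,                # enable Double-DQN
    "dueling_dqn": True,               # enable Dueling-DQN
    "memory_spec": {
        "type": "prioritized_replay",  # assumed nesting
        "capacity": 10000,             # assumed parameter
    },
}

# Serialise so the dict could be stored as a .json config file.
config_json = json.dumps(dqn_config, indent=2)
print(config_json)
```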

The SingleThreadedWorker implements high-performance environment vectorisation, and a RayWorker can execute Ray actor tasks in conjunction with a RayExecutor. The examples folder contains simple scripts to test these agents. There is also an extensive test package that includes tests for virtually every component. Note that we run tests on TensorFlow and have not yet reached full coverage/test compatibility with PyTorch.
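To make the idea of environment vectorisation concrete, here is a minimal, library-independent sketch: several environments advance in lockstep and the policy is queried once per batch of states. This is not the SingleThreadedWorker implementation; ToyEnv, step_vectorised and random_policy are hypothetical names used only for illustration.

```python
import random


class ToyEnv:
    """Hypothetical stand-in environment: an episode ends after `horizon` steps."""

    def __init__(self, horizon):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return float(self.t)

    def step(self, action):
        self.t += 1
        terminal = self.t >= self.horizon
        return float(self.t), 1.0, terminal, {}


def step_vectorised(envs, states, policy):
    """Advance all environments by one step using one batched policy call."""
    actions = policy(states)  # a single call for the whole batch
    next_states, rewards, terminals = [], [], []
    for env, action in zip(envs, actions):
        state, reward, terminal, _ = env.step(action)
        if terminal:
            state = env.reset()  # restart finished episodes immediately
        next_states.append(state)
        rewards.append(reward)
        terminals.append(terminal)
    return next_states, rewards, terminals


def random_policy(batch):
    """Pick a random binary action for every state in the batch."""
    return [random.choice([0, 1]) for _ in batch]


envs = [ToyEnv(horizon=h) for h in (2, 3, 4)]
states = [env.reset() for env in envs]

total_reward = 0.0
for _ in range(6):
    states, rewards, terminals = step_vectorised(envs, states, random_policy)
    total_reward += sum(rewards)
```

Resetting a finished environment inside the step keeps the batch size constant, which is what allows a single batched policy call per step.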

For more detailed documentation on RLgraph and its API-reference, please visit our readthedocs page here.

Below we show some training results on gym tasks:

Left: Soft Actor Critic on Pendulum-v0 (10 seeds). Right: Multi-GPU Ape-X on Pong-v0 (10 seeds).

Install

The simplest way to install RLgraph is from pip:

pip install rlgraph

Note that some backends (e.g. ray) need additional dependencies (see setup.py). For example, to install dependencies for the distributed backend ray, enter:

pip install rlgraph[ray]

To successfully run tests, please also install OpenAI gym, e.g.

pip install gym[all]

Upon calling RLgraph, a config JSON is created under ~/.rlgraph/rlgraph.json, which can be used to change backend settings. The current default stable backend is TensorFlow ("tf"). The PyTorch backend ("pytorch") does not yet support all utilities available in TF. In particular, device handling for PyTorch is incomplete, and we will likely wait for a stable PyTorch 1.0 release in the coming weeks.
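If you prefer to write this file from Python, a small helper along the following lines could be used. The keys BACKEND and DISTRIBUTED_BACKEND are taken from the echo command in the Quickstart section; write_rlgraph_config is a hypothetical helper, not part of RLgraph, and the sketch writes to a temporary directory so that no real configuration is overwritten.

```python
import json
import tempfile
from pathlib import Path


def write_rlgraph_config(path, backend="tf", distributed_backend=None):
    """Write backend settings as JSON, creating the parent directory if needed."""
    config = {"BACKEND": backend}
    if distributed_backend is not None:
        config["DISTRIBUTED_BACKEND"] = distributed_backend
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(config))
    return config


# Demonstrated on a temporary directory; the real target would be
# Path.home() / ".rlgraph" / "rlgraph.json".
demo_path = Path(tempfile.mkdtemp()) / ".rlgraph" / "rlgraph.json"
written = write_rlgraph_config(demo_path, backend="tf", distributed_backend="ray")
loaded = json.loads(demo_path.read_text())
```

Creating the parent directory first avoids a failure when the file is written before the directory exists.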

Quickstart / example usage

We provide an example script for training the Ape-X algorithm on ALE using Ray in the examples folder.

First, you have to ensure that Ray is used as the distributed backend. RLgraph checks the file ~/.rlgraph/rlgraph.json for this configuration. You can use this command to configure RLgraph to use TensorFlow as the backend and Ray as the distributed backend:

echo '{"BACKEND":"tf","DISTRIBUTED_BACKEND":"ray"}' > $HOME/.rlgraph/rlgraph.json

Then you can run our Ape-X example:

# Start ray on the head machine
ray start --head --redis-port 6379
# Optionally join to this cluster from other machines with ray start --redis-address=...

# Run script
python apex_pong.py

You can also train a simple DQN agent locally on OpenAI gym environments such as CartPole (this doesn't require Ray). The following example script also contains a simple tf-summary switch for adding neural net variables to your TensorBoard reports (use a Perl-style regular expression to specify the Components whose variables you would like to see):

python dqn_cartpole_with_tf_summaries.py
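The regular-expression selection mentioned above can be pictured with the following sketch, which filters a list of component names with Python's re module. The component names are hypothetical, and whether RLgraph applies the expression with search or match semantics is an assumption; Python's regex syntax is Perl-like but not identical.

```python
import re

# Hypothetical component scope names; real names depend on the agent's graph.
component_names = [
    "dqn-agent/policy/neural-network/dense-layer-0",
    "dqn-agent/policy/neural-network/dense-layer-1",
    "dqn-agent/target-policy/neural-network/dense-layer-0",
    "dqn-agent/memory",
]

# Select only the dense layers of the main policy network.
pattern = re.compile(r"^dqn-agent/policy/.*dense-layer-\d+$")
selected = [name for name in component_names if pattern.search(name)]
```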

Import and use agents

Agents can be imported and used as follows:

from rlgraph.agents import DQNAgent
from rlgraph.environments import OpenAIGymEnv

environment = OpenAIGymEnv('CartPole-v0')

# Create from .json file or dict, see agent API for all
# possible configuration parameters.
agent = DQNAgent.from_file(
    "configs/dqn_cartpole.json",
    state_space=environment.state_space,
    action_space=environment.action_space
)

# Get an action, take a step, observe reward.
state = environment.reset()
action, preprocessed_state = agent.get_action(
    states=state,
    extra_returns="preprocessed_states"
)

# Execute step in environment.
next_state, reward, terminal, info = environment.step(action)

# Observe result.
agent.observe(
    preprocessed_states=preprocessed_state,
    actions=action,
    internals=[],
    next_states=next_state,
    rewards=reward,
    terminals=terminal
)

# Call update when desired:
loss = agent.update()
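The single step above would typically sit inside an episode loop. The sketch below shows one way such a loop could look, using stand-in classes so that it runs without RLgraph installed. StubEnv, StubAgent, run_episodes and the update_every parameter are hypothetical; only the method names and keyword arguments mirror the snippet above.

```python
import random


class StubEnv:
    """Hypothetical environment; episodes end after five steps."""

    def __init__(self):
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0]

    def step(self, action):
        self.t += 1
        return [float(self.t)], 1.0, self.t >= 5, {}


class StubAgent:
    """Hypothetical agent exposing the three methods used above."""

    def __init__(self):
        self.buffer = []

    def get_action(self, states, extra_returns=None):
        action = random.choice([0, 1])
        # Mirror the snippet: also return the (here unmodified) state.
        return (action, states) if extra_returns else action

    def observe(self, **record):
        self.buffer.append(record)

    def update(self):
        return 0.0  # placeholder loss


def run_episodes(agent, env, num_episodes, update_every=4):
    """Run full episodes, observing every step and updating periodically."""
    steps, losses = 0, []
    for _ in range(num_episodes):
        state, terminal = env.reset(), False
        while not terminal:
            action, preprocessed = agent.get_action(
                states=state, extra_returns="preprocessed_states"
            )
            next_state, reward, terminal, _ = env.step(action)
            agent.observe(
                preprocessed_states=preprocessed,
                actions=action,
                internals=[],
                next_states=next_state,
                rewards=reward,
                terminals=terminal,
            )
            state = next_state
            steps += 1
            if steps % update_every == 0:
                losses.append(agent.update())
    return steps, losses


agent = StubAgent()
steps, losses = run_episodes(agent, StubEnv(), num_episodes=3)
```

With a real agent and environment, the two stub classes would be replaced by the DQNAgent and OpenAIGymEnv objects created above; how often to call update is a tuning choice.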

Full examples can be found in the examples folder.

Cite

If you use RLgraph in your research, please cite the following paper: link

@InProceedings{Schaarschmidt2019,
  author    = {Schaarschmidt, Michael and Mika, Sven and Fricke, Kai and Yoneki, Eiko},
  title     = {{RLgraph: Modular Computation Graphs for Deep Reinforcement Learning}},
  booktitle = {{Proceedings of the 2nd Conference on Systems and Machine Learning (SysML)}},
  year      = {2019},
  month     = apr,
}
