jingweiz / pytorch-distributed

Licence: MIT license
Ape-X DQN & DDPG with pytorch & tensorboard

Projects that are alternatives of or similar to pytorch-distributed

Machine Learning Is All You Need
🔥🌟 "Machine Learning 格物志": basic ML + DL + RL code and notes using sklearn, PyTorch, TensorFlow, Keras and, most importantly, from scratch! 💪 This repository is ALL You Need!
Stars: ✭ 173 (+76.53%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+302.04%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Deeprl Tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (+225.51%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Deep-rl-mxnet
Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Stars: ✭ 26 (-73.47%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (+486.73%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
rl implementations
No description or website provided.
Stars: ✭ 40 (-59.18%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
Stars: ✭ 2,640 (+2593.88%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Machin
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Stars: ✭ 145 (+47.96%)
Mutual labels:  distributed, dqn, ddpg
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+351.02%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Autonomous Learning Library
A PyTorch library for building deep reinforcement learning agents.
Stars: ✭ 425 (+333.67%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Stars: ✭ 2,051 (+1992.86%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Easy Rl
Chinese-language reinforcement learning tutorial; read online at: https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+2965.31%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (-60.2%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Deep Reinforcement Learning Algorithms
31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Stars: ✭ 167 (+70.41%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+126.53%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+3993.88%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+4092.86%)
Mutual labels:  dqn, ddpg, drl
Reinforcement Learning Algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Stars: ✭ 426 (+334.69%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Reinforcement Learning
🤖 Implements of Reinforcement Learning algorithms.
Stars: ✭ 104 (+6.12%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+137.76%)
Mutual labels:  deep-reinforcement-learning, dqn, ddpg

Distributed Deep Reinforcement Learning with pytorch & tensorboard


  • Sample online plots from training a Distributed DQN agent on Pong (nstep is the number of steps to look ahead when bootstrapping the target Q-values):
    • blue: num_actors=2, nstep=1
    • orange: num_actors=8, nstep=1
    • grey: num_actors=8, nstep=5

[Figure: dqn_pong]
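
To make the nstep setting concrete, here is a minimal sketch of an n-step bootstrapped target. It is not taken from this repo: the function name `nstep_target`, its parameters, and the example values are assumptions chosen for illustration. With nstep=1 it reduces to the usual one-step target.

```python
def nstep_target(rewards, bootstrap_q, gamma=0.99, done=False):
    """Compute r_0 + gamma*r_1 + ... + gamma^(n-1)*r_(n-1) + gamma^n * bootstrap_q.

    rewards:     the n rewards collected while looking ahead (n = len(rewards))
    bootstrap_q: estimated Q-value at the state reached after n steps
    done:        if True, the episode ended, so nothing is bootstrapped
    (All names here are hypothetical, not the repo's API.)
    """
    target = 0.0 if done else bootstrap_q
    # Fold the rewards from the last step back to the first.
    for r in reversed(rewards):
        target = r + gamma * target
    return target


one_step = nstep_target([1.0], bootstrap_q=4.0, gamma=0.5)        # nstep=1
five_step = nstep_target([1.0] * 5, bootstrap_q=4.0, gamma=0.5)   # nstep=5
```

A larger nstep puts more weight on observed rewards and less on the bootstrapped estimate, which is discounted by gamma^n.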


What is included?

This repo currently contains the following agents:

  • Distributed DQN [1]
  • Distributed DDPG [2]

Code structure:

NOTE: we follow the same code structure as pytorch-rl & pytorch-dnc.

  • ./utils/factory.py

We suggest starting from ./utils/factory.py, which registers every integrated Env, Model, Memory and Agent in a Dict. All four core classes are implemented in ./core/. The factory pattern in ./utils/factory.py keeps the code clean: whichever type of Agent you want to train, and whichever type of Env you want to train on, you only need to modify some parameters in ./utils/options.py, and ./main.py will do the rest (NOTE: ./main.py never needs to be modified).
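
As a rough illustration of this dict-based factory pattern, consider the sketch below. Every class name, dict name and key in it (`GymEnv`, `ENV_DICT`, `"dqn-cnn"`, `build_agent`, ...) is a hypothetical stand-in, not the actual content of ./utils/factory.py.

```python
from dataclasses import dataclass


# Hypothetical stand-ins for the four core class families.
class GymEnv:
    def __init__(self, game):
        self.game = game


class DqnModel:
    pass


class SharedMemory:
    pass


class DqnAgent:
    def __init__(self, env, model, memory):
        self.env, self.model, self.memory = env, model, memory


# Registries: string keys from the options map to classes.
ENV_DICT = {"gym": GymEnv}
MODEL_DICT = {"dqn-cnn": DqnModel}
MEMORY_DICT = {"shared": SharedMemory}
AGENT_DICT = {"dqn": DqnAgent}


@dataclass
class Options:
    agent_type: str
    env_type: str
    game: str
    memory_type: str
    model_type: str


def build_agent(opt):
    """Look up each component by name, so the entry point never changes."""
    env = ENV_DICT[opt.env_type](opt.game)
    model = MODEL_DICT[opt.model_type]()
    memory = MEMORY_DICT[opt.memory_type]()
    return AGENT_DICT[opt.agent_type](env, model, memory)


agent = build_agent(Options("dqn", "gym", "pong", "shared", "dqn-cnn"))
```

Under this design, supporting a new Agent or Env would mean adding one class and one dict entry, while the code that calls `build_agent` stays untouched.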

  • ./core/single_processes/.

Each agent consists of 4 types of single_process:

  • Logger: plots Global/Actor/Learner/EvaluatorLogs onto tensorboard
  • Actor: collects experiences from the Env and pushes them to a global shared Memory
  • Learner: samples from the global shared Memory and performs DRL updates on the Model
  • Evaluator: evaluates the Model during training
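
The Actor/Learner split can be sketched roughly as follows. This is an assumption-laden toy, not the repo's code: threads and a `queue.Queue` stand in for the separate processes and the global shared Memory, the transitions are dummies, and the Logger and Evaluator are left out for brevity.

```python
import queue
import threading


def actor(memory, n_steps):
    """Hypothetical actor: pushes dummy (state, next_state, reward) transitions."""
    for step in range(n_steps):
        memory.put((step, step + 1, 1.0))


def learner(memory, n_updates, stats):
    """Hypothetical learner: consumes transitions; a real one would update the Model."""
    for _ in range(n_updates):
        _state, _next_state, reward = memory.get()  # blocks until data arrives
        stats["updates"] += 1
        stats["reward_sum"] += reward


def run(num_actors=2, steps_per_actor=5):
    memory = queue.Queue()  # stand-in for the global shared Memory
    stats = {"updates": 0, "reward_sum": 0.0}
    workers = [
        threading.Thread(target=actor, args=(memory, steps_per_actor))
        for _ in range(num_actors)
    ]
    workers.append(
        threading.Thread(
            target=learner, args=(memory, num_actors * steps_per_actor, stats)
        )
    )
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    return stats


result = run(num_actors=2, steps_per_actor=5)
```

Several actors feed one learner through the shared buffer, so raising num_actors increases how quickly experience is collected without changing the learner.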

How to run:

You only need to modify some parameters in ./utils/options.py to train a new configuration.

  • Configure your training in ./utils/options.py:
  • line 13: add an entry into CONFIGS to define your training (agent_type, env_type, game, memory_type, model_type)
  • line 23: choose the entry ID you just added
  • line 19-20: fill in your machine/cluster ID (MACHINE) and timestamp (TIMESTAMP) to define your training signature (MACHINE_TIMESTAMP). The model file of this training is saved under this signature (./models/MACHINE_TIMESTAMP.pth), and the tensorboard visualization is displayed under it as well (first start the tensorboard server by running in bash: tensorboard --logdir logs/, then open this address in your browser: http://localhost:6006/)
  • line 22: to train a model, set mode=1 (the training visualization will be available at http://localhost:6006/); to test the model of the current training, set mode=2.
  • Run:

python main.py
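
For orientation, the settings described above might look something like the sketch below. The layout, the variable names `CONFIG` and `MODE`, the entry values and the helper functions are all assumptions; only the CONFIGS field order, MACHINE, TIMESTAMP, the mode values and the ./models/MACHINE_TIMESTAMP.pth path come from the description above. Check ./utils/options.py itself for the real layout.

```python
# Hypothetical excerpt in the spirit of ./utils/options.py (not the real file).
CONFIGS = [
    # agent_type, env_type, game,   memory_type, model_type
    ["dqn",       "atari",  "pong", "shared",    "dqn-cnn"],  # 0 (made-up entry)
]

CONFIG = 0              # ID of the CONFIGS entry to train
MACHINE = "mymachine"   # your machine/cluster ID (placeholder value)
TIMESTAMP = "20240101"  # placeholder value
MODE = 1                # 1: train, 2: test the model of this training


def training_signature(machine, timestamp):
    """Signature under which the model and tensorboard logs are grouped."""
    return f"{machine}_{timestamp}"


def model_path(machine, timestamp):
    """Where the model file of this training would be saved."""
    return f"./models/{training_signature(machine, timestamp)}.pth"


agent_type, env_type, game, memory_type, model_type = CONFIGS[CONFIG]
saved_to = model_path(MACHINE, TIMESTAMP)
```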


Dependencies:


Repos we referred to during the development of this repo:

This repo is developed together with @onlytailei.

