All Projects → anita-hu → TF2-RL

anita-hu / TF2-RL

Licence: other
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to TF2-RL

Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+38.75%)
Mutual labels:  openai-gym, dqn, ddpg, sac, ppo
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+2468.13%)
Mutual labels:  dqn, ddpg, sac, ppo
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+2407.5%)
Mutual labels:  openai-gym, dqn, ddpg, ppo
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (-75.62%)
Mutual labels:  dqn, ddpg, sac, ppo
ReinforcementLearningZoo.jl
juliareinforcementlearning.org/
Stars: ✭ 46 (-71.25%)
Mutual labels:  dqn, ddpg, sac, ppo
Minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Stars: ✭ 2,051 (+1181.88%)
Mutual labels:  dqn, ddpg, sac, ppo
ElegantRL
Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥
Stars: ✭ 2,074 (+1196.25%)
Mutual labels:  dqn, ddpg, sac, ppo
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+45.63%)
Mutual labels:  dqn, ddpg, ppo
Deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
Stars: ✭ 2,640 (+1550%)
Mutual labels:  dqn, ddpg, ppo
Paddle-RLBooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (-29.37%)
Mutual labels:  dqn, ddpg, sac
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+146.25%)
Mutual labels:  openai-gym, dqn, ddpg
Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Ensemble-Strategy-ICAIF-2020
Live Trading. Please star.
Stars: ✭ 1,251 (+681.88%)
Mutual labels:  openai-gym, ddpg, ppo
LWDRLC
Lightweight deep RL Libraray for continuous control.
Stars: ✭ 14 (-91.25%)
Mutual labels:  ddpg, sac, ppo
pytorch-rl
Pytorch Implementation of RL algorithms
Stars: ✭ 15 (-90.62%)
Mutual labels:  openai-gym, dqn, ddpg
Deep Reinforcement Learning For Automated Stock Trading Ensemble Strategy Icaif 2020
Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy. ICAIF 2020. Please star.
Stars: ✭ 518 (+223.75%)
Mutual labels:  openai-gym, ddpg, ppo
Openaigym
Solving OpenAI Gym problems.
Stars: ✭ 98 (-38.75%)
Mutual labels:  openai-gym, dqn, ddpg
Machine Learning Is All You Need
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Stars: ✭ 173 (+8.13%)
Mutual labels:  dqn, ddpg, ppo
Deep Reinforcement Learning Algorithms
31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Stars: ✭ 167 (+4.38%)
Mutual labels:  dqn, ddpg, ppo
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+176.25%)
Mutual labels:  openai-gym, dqn, ddpg
mujoco-benchmark
Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library
Stars: ✭ 101 (-36.87%)
Mutual labels:  ddpg, sac, ppo

Reinforcement Learning Agents

Implemented for Tensorflow 2.0+

New Updates!

  • DDPG with prioritized replay
  • Primal-Dual DDPG for CMDP

Future Plans

  • SAC Discrete

Usage

  • Install dependancies imported (my tf2 conda env as reference)
  • Each file contains example code that runs training on CartPole env
  • Training: python3 TF2_DDPG_LSTM.py
  • Tensorboard: tensorboard --logdir=DDPG/logs

Hyperparameter tuning

Agents

Agents tested using CartPole env.

Name On/off policy Model Action space support
DQN off-policy Dense, LSTM discrete
DDPG off-policy Dense, LSTM discrete, continuous
AE-DDPG off-policy Dense discrete, continuous
SAC🐛 off-policy Dense continuous
PPO on-policy Dense discrete, continuous

Contrained MDP

Name On/off policy Model Action space support
Primal-Dual DDPG off-policy Dense discrete, continuous

Models

Models used to generate the demos are included in the repo, you can also find q value, reward and/or loss graphs

Demos

DQN Basic, time step = 4, 500 reward DQN LSTM, time step = 4, 500 reward
DDPG Basic, 500 reward DDPG LSTM, time step = 5, 500 reward
AE-DDPG Basic, 500 reward PPO Basic, 500 reward
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].