🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!

Stars: ✭ 173 (+8.13%)

Mutual labels: dqn, ddpg, ppo

Deep Reinforcement Learning Algorithms

31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Stars: ✭ 167 (+4.38%)

Mutual labels: dqn, ddpg, ppo

Mushroom Rl

Python library for Reinforcement Learning.

Stars: ✭ 442 (+176.25%)

Mutual labels: openai-gym, dqn, ddpg

mujoco-benchmark

Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library

Stars: ✭ 101 (-36.87%)

Mutual labels: ddpg, sac, ppo

View All Similar Projects ➔

Reinforcement Learning Agents

Implemented for Tensorflow 2.0+

New Updates!

DDPG with prioritized replay
Primal-Dual DDPG for CMDP

Future Plans

SAC Discrete

Usage

Install dependancies imported (my tf2 conda env as reference)
Each file contains example code that runs training on CartPole env
Training: python3 TF2_DDPG_LSTM.py
Tensorboard: tensorboard --logdir=DDPG/logs

Hyperparameter tuning

Install hyperopt https://github.com/hyperopt/hyperopt
Optional: switch agent used and configure param space in hyperparam_tune.py
Run: python3 hyperparam_tune.py

Agents

Agents tested using CartPole env.

Name	On/off policy	Model	Action space support
DQN	off-policy	Dense, LSTM	discrete
DDPG	off-policy	Dense, LSTM	discrete, continuous
AE-DDPG	off-policy	Dense	discrete, continuous
SAC🐛	off-policy	Dense	continuous
PPO	on-policy	Dense	discrete, continuous

Contrained MDP

Name	On/off policy	Model	Action space support
Primal-Dual DDPG	off-policy	Dense	discrete, continuous

Models

Models used to generate the demos are included in the repo, you can also find q value, reward and/or loss graphs

Demos

DQN Basic, time step = 4, 500 reward	DQN LSTM, time step = 4, 500 reward

DDPG Basic, 500 reward	DDPG LSTM, time step = 5, 500 reward

AE-DDPG Basic, 500 reward	PPO Basic, 500 reward

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

anita-hu / TF2-RL

Programming Languages

Labels

Projects that are alternatives of or similar to TF2-RL