All Projects → LQNew → LWDRLC

LQNew / LWDRLC

Licence: MIT license
Lightweight deep RL Libraray for continuous control.

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to LWDRLC

Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+1485.71%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ddpg, sac, ppo, td3
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (+178.57%)
Mutual labels:  deep-reinforcement-learning, ddpg, sac, ppo, td3
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+29250%)
Mutual labels:  policy-gradient, ddpg, sac, ppo, td3
Deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
Stars: ✭ 2,640 (+18757.14%)
Mutual labels:  deep-reinforcement-learning, ddpg, ppo, td3
Easy Rl
强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+21357.14%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ddpg, ppo
ElegantRL
Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥
Stars: ✭ 2,074 (+14714.29%)
Mutual labels:  ddpg, sac, ppo, td3
model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
Stars: ✭ 56 (+300%)
Mutual labels:  ddpg, sac, ppo, td3
Minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Stars: ✭ 2,051 (+14550%)
Mutual labels:  deep-reinforcement-learning, ddpg, sac, ppo
ReinforcementLearningZoo.jl
juliareinforcementlearning.org/
Stars: ✭ 46 (+228.57%)
Mutual labels:  ddpg, sac, ppo, td3
Paddle-RLBooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (+707.14%)
Mutual labels:  policy-gradient, ddpg, sac, td3
Deep-rl-mxnet
Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Stars: ✭ 26 (+85.71%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ddpg, td3
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+2500%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ddpg, ppo
Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+9507.14%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ppo
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+1564.29%)
Mutual labels:  deep-reinforcement-learning, ddpg, ppo
Deeprl algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+592.86%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ppo
mujoco-benchmark
Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library
Stars: ✭ 101 (+621.43%)
Mutual labels:  ddpg, sac, ppo
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
Stars: ✭ 162 (+1057.14%)
Mutual labels:  deep-reinforcement-learning, sac, td3
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+6357.14%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, ppo
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+18700%)
Mutual labels:  deep-reinforcement-learning, continuous-control, ppo
Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Ensemble-Strategy-ICAIF-2020
Live Trading. Please star.
Stars: ✭ 1,251 (+8835.71%)
Mutual labels:  deep-reinforcement-learning, ddpg, ppo

LWDRLC: LightWeight Deep Reinforcement Learning libraray for Continous control

LWDRLC is a deep reinforcement learning (RL) library which is inspired by some other deep RL code bases (i.e., Spinning Up repository, Stable-baselines3 , Fujimoto TD3 repository, and Tonic repository).

🚀 Beyond State-Of-The-Art

LWDRL provides further tricks to improve performance of state-of-the-art algorithms potentially beyond their original papers. Therefore, LWDRL enables every user to achieve professional-level performance just in a few lines of codes.

Supported algorithms

algorithm continuous control on-policy / off-policy
Vanilla Policy Gradient (VPG) on-policy
Proximal Policy Optimization (PPO) on-policy
Deep Deterministic Policy Gradients (DDPG) off-policy
Twin Delayed Deep Deterministic Policy Gradients (TD3) off-policy
Soft Actor-Critic (SAC) off-policy

Instructions

Recommend: Run with Docker

# python        3.6    (apt)
# pytorch       1.4.0  (pip)
# tensorflow    1.14.0 (pip)
# DMC Control Suite and MuJoCo
cd dockerfiles
docker build . -t lwdrl

For other dockerfiles, you can go to RL Dockefiles.

Launch experiments

Run with the scripts batch_off_policy_mujoco_cuda.sh / batch_off_policy_dmc_cuda.sh / batch_on_policy_mujoco_cuda.sh / batch_on_policy_dmc_cuda.sh:

# eg.
bash batch_off_policy_mujoco_cuda.sh Hopper-v2 TD3 0  # env_name: Hopper-v2, algorithm: TD3, CUDA_Num : 0

Plot results

# eg. Notice: `-l` denotes labels, `data/DDPG-Hopper-v2/` represents the collecting dataset, 
# and `-s` represents smoothing value.
python spinupUtils/plot.py \
    data/DDPG-Hopper-v2/ \
    -l DDPG -s 10

Visualization of the environments

Run with the scripts render_dmc.py / render_mujoco.py:

# eg.
python render_dmc.py --env swimmer-swimmer6  # env_name: swimmer-swimmer6

Performance on MuJoCo

Including Ant-v2, HalfCheetah-v2, Hopper-v2, Humanoid-v2, Swimmer-v2, Walker2d-v2.

Citation

@misc{QingLi2021lwdrl,
  author = {Qing Li},
  title = {LWDRL: LightWeight Deep Reinforcement Learning Library},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/LQNew/LWDRL}}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].