TianshouAn elegant PyTorch deep reinforcement learning library.
Pytorch ReinforcePyTorch Implementation of REINFORCE for both discrete & continuous control
Pytorch A2c Ppo Acktr GailPyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Dm controlDeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Pytorch sacPyTorch implementation of Soft Actor-Critic (SAC)
CoachReinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
MjrlReinforcement learning algorithms for MuJoCo tasks
Deeprl algorithmsDeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Pytorch sac aePyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
TorchrlPytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
MetaworldAn open source robotics benchmark for meta- and multi-task reinforcement learning
Pytorch RlThis repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Lagomlagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
TrpoTrust Region Policy Optimization with TensorFlow and OpenAI Gym
Pytorch TrpoPyTorch implementation of Trust Region Policy Optimization
DrqDrQ: Data regularized Q
Meta-SACAuto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
kinpySimple kinematics calculation toolkit for robotics
trajoptTrajectory optimization algorithms for robotic control.
citoA Contact-Implicit Trajectory Optimization Package
MuJoCo RL UR5A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.
ddrlDeep Developmental Reinforcement Learning
mujocoPython wrapper for MuJoCo physics simulation.
protoProto-RL: Reinforcement Learning with Prototypical Representations
mujocoMulti-Joint dynamics with Contact. A general purpose physics simulator.
Pytorch-RL-CPPA Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
jax-rlJAX implementations of core Deep RL algorithms
mujoco-benchmarkProvide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library