Top 68 ppo open source projects

An elegant PyTorch deep reinforcement learning library.

✭ 4,109

python Makefile pytorch dqn ppo policy-gradient imitation-learning ddpg mujoco benchmark library rl cql atari sac drl npg double-dqn trpo a2c td3 bcq

Pytorch Drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

✭ 233

python pytorch reinforcement-learning deep-reinforcement-learning dqn ppo rl actor-critic ddpg deep-q-network

Deeprl

Modularized Implementation of Deep RL Algorithms in PyTorch

✭ 2,640

python Dockerfile shell pytorch deep-reinforcement-learning dqn ppo ddpg rainbow double-dqn dueling-network-architecture quantile-regression option-critic-architecture deeprl categorical-dqn a2c prioritized-experience-replay option-critic td3

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Machine Learning Is All You Need

🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!

✭ 173

python pytorch tensorflow keras convolutional-neural-networks gan lstm deep-reinforcement-learning resnet dqn logistic-regression random-forest ppo actor-critic ddpg decision-trees trpo

Deep Reinforcement Learning Algorithms

31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

✭ 167

jupyter-notebook deep-reinforcement-learning dqn ppo ddpg

Minimalrl

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

✭ 2,051

python deep-learning machine-learning pytorch reinforcement-learning deep-reinforcement-learning simple dqn ppo a3c ddpg reinforce sac acer a2c policy-gradients

Machin

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

✭ 145

python deep-learning pytorch reinforcement-learning distributed dqn ppo a3c ddpg

Tf deep rl trader

Trading Environment(OpenAI Gym) + PPO(TensorForce)

✭ 139

python tensorflow trading stock-market ppo

Rl Collision Avoidance

Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"

✭ 125

python reinforcement-learning ros ppo

Ros2learn

ROS 2 enabled Machine Learning algorithms

✭ 119

python deep-learning machine-learning reinforcement-learning robotics ros ml dqn ppo rl trpo

Doom Net Pytorch

Reinforcement learning models in ViZDoom environment

✭ 113

python pytorch reinforcement-learning learning agent ppo doom

Easy Rl

强化学习中文教程，在线阅读地址：https://datawhalechina.github.io/easy-rl/

✭ 3,004

python Jupyter Notebook reinforcement-learning deep-reinforcement-learning dqn ppo policy-gradient q-learning a3c imitation-learning sarsa ddpg

Gail ppo tf

Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action

✭ 99

python machine-learning tensorflow ppo imitation-learning

Deep Reinforcement Learning With Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

✭ 1,345

python deep-learning pytorch algorithm deep-reinforcement-learning resnet dqn ppo policy-gradient actor-critic a3c trpo

Deeprl algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

✭ 97

python deep-reinforcement-learning dqn ppo policy-gradient mujoco trpo

Torchrl

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

✭ 90

python pytorch reinforcement-learning algorithm dqn gym ppo ddpg mujoco trpo

Reinforcement learning

Reinforcement learning tutorials

✭ 82

python reinforcement-learning dqn ppo policy-gradient a3c

Sc2aibot

Implementing reinforcement-learning algorithms for pysc2 -environment

✭ 83

python tensorflow reinforcement-learning ppo deepmind

Run Skeleton Run

Reason8.ai PyTorch solution for NIPS RL 2017 challenge

✭ 83

python pytorch tensorflow reinforcement-learning ppo actor-critic ddpg nips-2017 trpo

Torch Ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

✭ 70

python pytorch reinforcement-learning deep-reinforcement-learning recurrent-neural-networks ppo actor-critic a3c

On Policy

This is the official implementation of Multi-Agent PPO.

✭ 63

python algorithms ppo

Mario rl

✭ 60

python deep-learning pytorch reinforcement-learning ppo actor-critic

Learning2run

Our NIPS 2017: Learning to Run source code

✭ 57

python deep-learning machine-learning tensorflow artificial-intelligence reinforcement-learning ppo nips-2017

Gym Continuousdoubleauction

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

✭ 50

jupyter-notebook lstm quantitative-finance ppo quantitative-trading

Slm Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

✭ 904

python pytorch reinforcement-learning benchmark deep-reinforcement-learning dqn ppo policy-gradient a3c

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Deeprl Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

✭ 748

python3 jupyter-notebook pytorch reinforcement-learning deep-reinforcement-learning ppo actor-critic deep-q-network

Pytorch Rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

✭ 658

python pytorch reinforcement-learning generative-adversarial-network deep-reinforcement-learning ppo policy-gradient trpo

Super Mario Bros Ppo Pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

✭ 649

python python3 deep-learning pytorch reinforcement-learning ai openai-gym gym ppo

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

✭ 640

jupyter-notebook reinforcement-learning deep-reinforcement-learning openai-gym ppo policy-gradient q-learning deep-learning-algorithms deep-q-network trpo

Elegantrl

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.

✭ 575

python pytorch reinforcement-learning deep-reinforcement-learning lightweight dqn ppo ddpg stable efficient

Deep Reinforcement Learning For Automated Stock Trading Ensemble Strategy Icaif 2020

Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy. ICAIF 2020. Please star.

✭ 518

jupyter-notebook deep-reinforcement-learning openai-gym ppo ddpg

Autonomous Learning Library

A PyTorch library for building deep reinforcement learning agents.

✭ 425

python reinforcement-learning deep-reinforcement-learning dqn ppo ddpg

Reinforcement Learning Algorithms

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

✭ 426

python deep-learning pytorch algorithm deep-reinforcement-learning dqn ppo actor-critic ddpg trpo

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

✭ 4,012

Jupyter Notebook python TeX pytorch reinforcement-learning neural-networks deep-reinforcement-learning dqn openai-gym ppo dynamic-programming ddpg reinforcement-learning-algorithms hill-climbing cross-entropy openai-gym-solutions pytorch-rl ml-agents rl-algorithms

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

✭ 364

python jupyter-notebook deep-learning machine-learning pytorch artificial-intelligence reinforcement-learning deep-reinforcement-learning research ppo policy-gradient ddpg mujoco

Pytorch Cpp Rl

PyTorch C++ Reinforcement Learning

✭ 353

cpp cplusplus pytorch reinforcement-learning ppo actor-critic

Rl Starter Files

RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code

✭ 325

python pytorch ppo a3c

Ppo Pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

✭ 325

python pytorch deep-reinforcement-learning pytorch-tutorial ppo policy-gradient

Deeprl Tensorflow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

✭ 319

python deep-learning machine-learning tensorflow reinforcement-learning deep-reinforcement-learning dqn ppo a3c ddpg trpo

Reinforcement Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

✭ 3,329

Jupyter Notebook python deep-learning machine-learning artificial-intelligence reinforcement-learning deep-reinforcement-learning dqn ppo deepmind qlearning evolution-strategies a2c policy-gradients

Deep reinforcement learning course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

✭ 3,232

Jupyter Notebook python deep-learning pytorch tensorflow unity deep-reinforcement-learning tensorflow-tutorials ppo actor-critic deep-q-network qlearning deep-q-learning a2c

Rlgraph

RLgraph: Modular computation graphs for deep reinforcement learning

✭ 272

python deep-learning machine-learning pytorch tensorflow reinforcement-learning neural-networks deep-reinforcement-learning dqn ppo

Rad

RAD: Reinforcement Learning with Augmented Data

✭ 268

jupyter-notebook deep-learning reinforcement-learning deep-neural-networks deep-reinforcement-learning ppo rl deep-learning-algorithms deep-q-network rad

ppo-pytorch

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

✭ 83

python reinforcement-learning deep-learning pytorch icm proximal-policy-optimization ppo mountaincar-v0 cartpole-v1 intrinsic-curiosity-module generalized-advantage-estimation pendulum-v0

Deep-Reinforcement-Learning-Notebooks

This Repository contains a series of google colab notebooks which I created to help people dive into deep reinforcement learning.This notebooks contain both theory and implementation of different algorithms.

Deep RL with pytorch

A pytorch tutorial for DRL(Deep Reinforcement Learning)

✭ 160

Jupyter Notebook python deep-reinforcement-learning pytorch dqn mcts uct c51 iqn hedge ppo a2c gail counterfactual-regret-minimization qr-dqn random-network-distillation soft-actor-critic self-imitation-learning

stadium

A graphical interface for reinforcement learning and gym-based environments. Integrates tensorboard and various configuration utilities for ease of usage.

✭ 26

python HTML gui gym-environment ppo open-ai-gym stable-baselines rl-environments

xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

✭ 229

python Jupyter Notebook shell impala dqn reinforcement-learning-algorithms ppo muzero qmix

TF2-RL

Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]

✭ 160

python reinforcement-learning openai-gym dqn tensorboard ddpg sac ppo tensorflow2 ae-ddpg

ReinforcementLearningZoo.jl

juliareinforcementlearning.org/