Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → qfettes → Deeprl Tutorials

qfettes / Deeprl Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Programming Languages

python3

1442 projects

Labels

jupyter-notebook pytorch reinforcement-learning deep-reinforcement-learning ppo actor-critic deep-q-network

Projects that are alternatives of or similar to Deeprl Tutorials

Rad

RAD: Reinforcement Learning with Augmented Data

Stars: ✭ 268 (-64.17%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo, deep-q-network

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (-14.44%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo, deep-q-network

Pytorch Drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Stars: ✭ 233 (-68.85%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, actor-critic, deep-q-network

Deep reinforcement learning course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

Stars: ✭ 3,232 (+332.09%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, ppo, actor-critic, deep-q-network

Reinforcement Learning

Minimal and Clean Reinforcement Learning Examples

Stars: ✭ 2,863 (+282.75%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (-51.34%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+436.36%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

Stars: ✭ 118 (-84.22%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic

Rl Course Experiments

Stars: ✭ 73 (-90.24%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, deep-q-network

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (-40.91%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic

Pytorch sac

PyTorch implementation of Soft Actor-Critic (SAC)

Stars: ✭ 174 (-76.74%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic

Tensorflow Reinforce

Implementations of Reinforcement Learning Models in Tensorflow

Stars: ✭ 480 (-35.83%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+251.87%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, actor-critic

Torch Ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Stars: ✭ 70 (-90.64%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo, actor-critic

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Stars: ✭ 6,948 (+828.88%)

Mutual labels: reinforcement-learning, ppo, actor-critic, deep-q-network

2048 Deep Reinforcement Learning

Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning

Stars: ✭ 169 (-77.41%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, deep-q-network

Drq

DrQ: Data regularized Q

Stars: ✭ 268 (-64.17%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic

Reinforcement Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Stars: ✭ 3,329 (+345.05%)

Mutual labels: jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo

Pytorch Cpp Rl

PyTorch C++ Reinforcement Learning

Stars: ✭ 353 (-52.81%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Elegantrl

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.

Stars: ✭ 575 (-23.13%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, ppo

View All Similar Projects ➔

DeepRL-Tutorials

The intent of these IPython Notebooks are mostly to help me practice and understand the papers I read; thus, I will opt for readability over efficiency in some cases. First the implementation will be uploaded, followed by markup to explain each portion of code. I'll be assigning credit for any code which is borrowed in the Acknowledgements section of this README.

Relevant Papers:

Human Level Control Through Deep Reinforement Learning [Publication] [code]
Multi-Step Learning (from Reinforcement Learning: An Introduction, Chapter 7) [Publication][code]
Deep Reinforcement Learning with Double Q-learning [Publication][code]
Dueling Network Architectures for Deep Reinforcement Learning [Publication][code]
Noisy Networks for Exploration [Publication][code]
Prioritized Experience Replay [Publication][code]
A Distributional Perspective on Reinforcement Learning [Publication][code]
Rainbow: Combining Improvements in Deep Reinforcement Learning [Publication][code]
Distributional Reinforcement Learning with Quantile Regression [Publication][code]
Rainbow with Quantile Regression [code]
Deep Recurrent Q-Learning for Partially Observable MDPs [Publication][code]
Advantage Actor Critic (A2C) [Publication1][Publication2][code]
High-Dimensional Continuous Control Using Generalized Advantage Estimation [Publication][code]
Proximal Policy Optimization Algorithms [Publication][code]

Requirements:

Python 3.6
Numpy
Gym
Pytorch 0.4.0
Matplotlib
OpenCV
Baslines

Acknowledgements:

Credit to @baselines for the environment wrappers and inspiration for the prioritized replay code used only in the development code
Credit to @higgsfield for the plotting code, epsilon annealing code, and inspiration for the prioritized replay implementation in the IPython notebook
Credit to @Kaixhin for factorized Noisy Linear Layer implementation and the projection_distribution function found in Categorical-DQN.ipynb
Credit to @ikostrikov for A2C, GAE, PPO and visdom plotting code implementation reference

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 748

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗