All Projects → qfettes → Deeprl Tutorials

qfettes / Deeprl Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Programming Languages

python3
1442 projects

Projects that are alternatives of or similar to Deeprl Tutorials

Rad
RAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-64.17%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo, deep-q-network
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (-14.44%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo, deep-q-network
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (-68.85%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, actor-critic, deep-q-network
Deep reinforcement learning course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Stars: ✭ 3,232 (+332.09%)
Mutual labels:  jupyter-notebook, deep-reinforcement-learning, ppo, actor-critic, deep-q-network
Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+282.75%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-51.34%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+436.36%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo
Reinforcementlearning Atarigame
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
Stars: ✭ 118 (-84.22%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic
Rl Course Experiments
Stars: ✭ 73 (-90.24%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, deep-q-network
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-40.91%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (-76.74%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (-35.83%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+251.87%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, actor-critic
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-90.64%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo, actor-critic
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+828.88%)
Mutual labels:  reinforcement-learning, ppo, actor-critic, deep-q-network
2048 Deep Reinforcement Learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
Stars: ✭ 169 (-77.41%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, deep-q-network
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-64.17%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, actor-critic
Reinforcement Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Stars: ✭ 3,329 (+345.05%)
Mutual labels:  jupyter-notebook, reinforcement-learning, deep-reinforcement-learning, ppo
Pytorch Cpp Rl
PyTorch C++ Reinforcement Learning
Stars: ✭ 353 (-52.81%)
Mutual labels:  reinforcement-learning, ppo, actor-critic
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (-23.13%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, ppo

DeepRL-Tutorials

The intent of these IPython Notebooks are mostly to help me practice and understand the papers I read; thus, I will opt for readability over efficiency in some cases. First the implementation will be uploaded, followed by markup to explain each portion of code. I'll be assigning credit for any code which is borrowed in the Acknowledgements section of this README.

Relevant Papers:

  1. Human Level Control Through Deep Reinforement Learning [Publication] [code]
  2. Multi-Step Learning (from Reinforcement Learning: An Introduction, Chapter 7) [Publication][code]
  3. Deep Reinforcement Learning with Double Q-learning [Publication][code]
  4. Dueling Network Architectures for Deep Reinforcement Learning [Publication][code]
  5. Noisy Networks for Exploration [Publication][code]
  6. Prioritized Experience Replay [Publication][code]
  7. A Distributional Perspective on Reinforcement Learning [Publication][code]
  8. Rainbow: Combining Improvements in Deep Reinforcement Learning [Publication][code]
  9. Distributional Reinforcement Learning with Quantile Regression [Publication][code]
  10. Rainbow with Quantile Regression [code]
  11. Deep Recurrent Q-Learning for Partially Observable MDPs [Publication][code]
  12. Advantage Actor Critic (A2C) [Publication1][Publication2][code]
  13. High-Dimensional Continuous Control Using Generalized Advantage Estimation [Publication][code]
  14. Proximal Policy Optimization Algorithms [Publication][code]

Requirements:

  • Python 3.6
  • Numpy
  • Gym
  • Pytorch 0.4.0
  • Matplotlib
  • OpenCV
  • Baslines

Acknowledgements:

  • Credit to @baselines for the environment wrappers and inspiration for the prioritized replay code used only in the development code
  • Credit to @higgsfield for the plotting code, epsilon annealing code, and inspiration for the prioritized replay implementation in the IPython notebook
  • Credit to @Kaixhin for factorized Noisy Linear Layer implementation and the projection_distribution function found in Categorical-DQN.ipynb
  • Credit to @ikostrikov for A2C, GAE, PPO and visdom plotting code implementation reference
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].