
kekmodel / rl_pytorch

Licence: MIT license
Deep Reinforcement Learning Algorithms Implementation in PyTorch

Programming Languages

Jupyter Notebook
Python

Projects that are alternatives of or similar to rl pytorch

dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
Stars: ✭ 20 (-13.04%)
Mutual labels:  deep-reinforcement-learning, openai-gym
a3c-super-mario-pytorch
Reinforcement Learning for Super Mario Bros using A3C on GPU
Stars: ✭ 35 (+52.17%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Deep Reinforcement Learning Gym
Deep reinforcement learning model implementation in Tensorflow + OpenAI gym
Stars: ✭ 200 (+769.57%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Finrl Library
FinRL: Financial Reinforcement Learning Framework. Please star. 🔥
Stars: ✭ 3,037 (+13104.35%)
Mutual labels:  deep-reinforcement-learning, openai-gym
FinRL Podracer
Cloud-native Financial Reinforcement Learning
Stars: ✭ 179 (+678.26%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Hands On Intelligent Agents With Openai Gym
Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch
Stars: ✭ 189 (+721.74%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+865.22%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Treeqn
Stars: ✭ 77 (+234.78%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Ensemble-Strategy-ICAIF-2020
Live Trading. Please star.
Stars: ✭ 1,251 (+5339.13%)
Mutual labels:  deep-reinforcement-learning, openai-gym
reinforcement learning ppo rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
Stars: ✭ 33 (+43.48%)
Mutual labels:  deep-reinforcement-learning, random-network-distillation
Reinforcementlearning Atarigame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. It also uses Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm, which the authors describe as superior to and more efficient than DQN. It can play many games.
Stars: ✭ 118 (+413.04%)
Mutual labels:  deep-reinforcement-learning, openai-gym
ddrl
Deep Developmental Reinforcement Learning
Stars: ✭ 27 (+17.39%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Hierarchical Actor Critic Hac Pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Stars: ✭ 116 (+404.35%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Deep-Reinforcement-Learning-CS285-Pytorch
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Stars: ✭ 104 (+352.17%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Cs234 Reinforcement Learning Winter 2019
My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019
Stars: ✭ 93 (+304.35%)
Mutual labels:  deep-reinforcement-learning, openai-gym
yarll
Combining deep learning and reinforcement learning.
Stars: ✭ 84 (+265.22%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Async Deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
Stars: ✭ 44 (+91.3%)
Mutual labels:  deep-reinforcement-learning, openai-gym
Noreward Rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Stars: ✭ 1,176 (+5013.04%)
Mutual labels:  deep-reinforcement-learning, openai-gym
deep rl acrobot
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
Stars: ✭ 32 (+39.13%)
Mutual labels:  deep-reinforcement-learning, openai-gym
drl grasping
Deep Reinforcement Learning for Robotic Grasping from Octrees
Stars: ✭ 160 (+595.65%)
Mutual labels:  deep-reinforcement-learning, openai-gym

Deep RL Algorithms in PyTorch

Models

  • DQN
  • Dueling Double DQN
  • Categorical DQN (C51)
  • Categorical Dueling Double DQN
  • Proximal Policy Optimization (PPO)
    • discrete (episodic, n-step)
  • Soft Actor-Critic (SAC)
    • debugging in progress
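
To make the "Dueling Double DQN" entry above concrete, here is a minimal sketch of the two ideas it combines. It is written in plain Python rather than PyTorch so the arithmetic is visible, and it is not taken from this repository: the function names, the mean-subtracted aggregation, and the one-step target are assumptions based on the standard formulations of dueling networks and Double DQN.

```python
from typing import List


def dueling_q_values(value: float, advantages: List[float]) -> List[float]:
    """Combine a state value and per-action advantages into Q-values.

    Assumed form: Q(s, a) = V(s) + A(s, a) - mean(A(s, .)).
    """
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def double_dqn_target(
    reward: float,
    done: bool,
    next_q_online: List[float],
    next_q_target: List[float],
    gamma: float = 0.99,
) -> float:
    """One-step Double DQN target for a single transition.

    The online network selects the next action; the target network
    supplies the value of that action.
    """
    if done:
        # No bootstrapping past a terminal state.
        return reward
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    return reward + gamma * next_q_target[best_action]


# Hypothetical numbers, chosen only to show the mechanics.
q = dueling_q_values(value=1.0, advantages=[0.5, -0.5])  # [1.5, 0.5]
target = double_dqn_target(
    reward=1.0,
    done=False,
    next_q_online=[0.2, 0.9],   # online net prefers action 1
    next_q_target=[5.0, 2.0],   # target net scores action 1 as 2.0
    gamma=0.5,
)  # 1.0 + 0.5 * 2.0 = 2.0
```

In this example a plain DQN target would bootstrap from the target network's own maximum (5.0) and give 3.5; decoupling action selection from evaluation is what reduces that overestimate.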

Exploration

  • Random Network Distillation (RND)
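
As a rough illustration of how Random Network Distillation produces an exploration bonus, the sketch below uses a fixed random target map and a trainable predictor map, and treats the predictor's squared error as the intrinsic reward. Both maps are linear and written in plain Python purely to keep the example short; the class name `LinearRND` and its parameters are hypothetical, and the repository's own RND presumably uses neural networks in PyTorch.

```python
import random
from typing import List

Matrix = List[List[float]]


def random_matrix(rows: int, cols: int, rng: random.Random) -> Matrix:
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def apply(weights: Matrix, x: List[float]) -> List[float]:
    """Matrix-vector product."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


class LinearRND:
    """Toy RND: fixed random target, trainable predictor (both linear)."""

    def __init__(self, obs_dim: int, feat_dim: int, lr: float = 0.1, seed: int = 0):
        rng = random.Random(seed)
        self.target = random_matrix(feat_dim, obs_dim, rng)     # never updated
        self.predictor = random_matrix(feat_dim, obs_dim, rng)  # trained
        self.lr = lr

    def intrinsic_reward(self, obs: List[float]) -> float:
        """Squared prediction error; large for rarely seen observations."""
        t = apply(self.target, obs)
        p = apply(self.predictor, obs)
        return sum((pi - ti) ** 2 for pi, ti in zip(p, t))

    def update(self, obs: List[float]) -> None:
        """One gradient step on the squared error, predictor weights only."""
        t = apply(self.target, obs)
        p = apply(self.predictor, obs)
        for i, row in enumerate(self.predictor):
            err = p[i] - t[i]
            for j in range(len(row)):
                row[j] -= self.lr * 2.0 * err * obs[j]


rnd = LinearRND(obs_dim=3, feat_dim=4)
seen, novel = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]
reward_before = rnd.intrinsic_reward(seen)
for _ in range(50):
    rnd.update(seen)          # the agent keeps visiting the same observation
reward_after = rnd.intrinsic_reward(seen)
reward_novel = rnd.intrinsic_reward(novel)
```

After the loop, the bonus for the repeatedly visited observation has shrunk toward zero, while the bonus for the unvisited one is unchanged, which is the behaviour that steers the agent toward unfamiliar states.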

Experiments

Results from runs that met each environment's own "solving" criterion.

  • Dueling Double DQN
    • Only one hyperparameter "UP_COEF" was adjusted.
    • CartPole-v0
    • CartPole-v1
    • MountainCar-v0
    • LunarLander-v2
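
The "solving" criterion is not spelled out here, so the following is only a sketch of how such a check is commonly done with OpenAI Gym: an environment counts as solved once the mean return over a window of consecutive episodes reaches the environment's reward threshold. The thresholds below are the values usually listed in Gym's registry (readable via `gym.spec(env_id).reward_threshold`) and should be verified against the installed version; the 100-episode window and the `SolvedTracker` class are assumptions, not part of this repository.

```python
from collections import deque
from typing import Deque

# Assumed reward thresholds; check them against your Gym version.
REWARD_THRESHOLDS = {
    "CartPole-v0": 195.0,
    "CartPole-v1": 475.0,
    "MountainCar-v0": -110.0,
    "LunarLander-v2": 200.0,
}


class SolvedTracker:
    """Tracks recent episode returns and reports when the criterion is met."""

    def __init__(self, env_id: str, window: int = 100):
        self.threshold = REWARD_THRESHOLDS[env_id]
        self.returns: Deque[float] = deque(maxlen=window)

    def add(self, episode_return: float) -> bool:
        """Record one episode; return True once the windowed mean qualifies."""
        self.returns.append(episode_return)
        window_full = len(self.returns) == self.returns.maxlen
        mean_return = sum(self.returns) / len(self.returns)
        return window_full and mean_return >= self.threshold


# Hypothetical run: 100 episodes that each reach a return of 200.
tracker = SolvedTracker("CartPole-v0")
results = [tracker.add(200.0) for _ in range(100)]
```

Here `results` stays `False` for the first 99 episodes because the window is not yet full, and becomes `True` on the 100th. In a training loop, `tracker.add(...)` would be called once per finished episode and training stopped when it returns `True`.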

TODO

  • Quantile Regression DQN (QR DQN)
  • Implicit Quantile Networks (IQN)
  • Intrinsic Curiosity Module (ICM)
  • Rainbow
  • Parametric DQN
  • Proximal Policy Optimization (PPO)
    • continuous
  • Deep Deterministic Policy Gradient (DDPG)
  • MCTS Net
  • Parallel Models
    • Ape-X
    • R2D2
    • PAAC