All Projects → floringogianu → Categorical Dqn

floringogianu / Categorical Dqn

Licence: mit
A working implementation of the Categorical DQN (Distributional RL).

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Categorical Dqn

Pytorch Rl
Deep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (+727.78%)
Mutual labels:  reinforcement-learning, dqn
Reinforcement learning
Reinforcement learning tutorials
Stars: ✭ 82 (-8.89%)
Mutual labels:  reinforcement-learning, dqn
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+7620%)
Mutual labels:  reinforcement-learning, dqn
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+391.11%)
Mutual labels:  reinforcement-learning, dqn
Ml In Tf
Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!
Stars: ✭ 45 (-50%)
Mutual labels:  reinforcement-learning, dqn
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (+538.89%)
Mutual labels:  reinforcement-learning, dqn
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+904.44%)
Mutual labels:  reinforcement-learning, dqn
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+337.78%)
Mutual labels:  reinforcement-learning, dqn
Deep Q Learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Stars: ✭ 1,013 (+1025.56%)
Mutual labels:  reinforcement-learning, dqn
Rainbow Is All You Need
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
Stars: ✭ 938 (+942.22%)
Mutual labels:  reinforcement-learning, dqn
Autonomous Learning Library
A PyTorch library for building deep reinforcement learning agents.
Stars: ✭ 425 (+372.22%)
Mutual labels:  reinforcement-learning, dqn
Reinforcepy
Collection of reinforcement learners implemented in python. Mainly including DQN and its variants
Stars: ✭ 54 (-40%)
Mutual labels:  reinforcement-learning, dqn
Aigames
use AI to play some games.
Stars: ✭ 422 (+368.89%)
Mutual labels:  reinforcement-learning, dqn
Gym Anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
Stars: ✭ 627 (+596.67%)
Mutual labels:  reinforcement-learning, dqn
Deep Rl Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Stars: ✭ 395 (+338.89%)
Mutual labels:  reinforcement-learning, dqn
Tensorlayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
Stars: ✭ 6,796 (+7451.11%)
Mutual labels:  reinforcement-learning, dqn
Rl algorithms
Structural implementation of RL key algorithms
Stars: ✭ 352 (+291.11%)
Mutual labels:  reinforcement-learning, dqn
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+4357.78%)
Mutual labels:  reinforcement-learning, dqn
Chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Stars: ✭ 931 (+934.44%)
Mutual labels:  reinforcement-learning, dqn
Deep traffic
MIT DeepTraffic top 2% solution (75.01 mph) 🚗.
Stars: ✭ 47 (-47.78%)
Mutual labels:  reinforcement-learning, dqn

Categorical DQN.

Implementation of the Categorical DQN as described in A distributional Perspective on Reinforcement Learning.

Thanks to @tudor-berariu for optimisation and training tricks and for catching two nasty bugs.

Dependencies

You can take a look in the env export file for the full list of dependencies.

Install the game of Catch:

git clone https://github.com/floringogianu/gym_fast_envs
cd gym_fast_envs

pip install -r requirements.txt
pip install -e .

Install visdom for reporting: pip install visdom.

Training

First start the visdom server: python -m visdom.server. If you don't want to install or use visdom make sure you deactivate the display_plots option in the configs.

Train the Categorical DQN with python main.py -cf configs/catch_categorical.yaml.

Train a DQN baseline with python main.py -cf configs/catch_dqn.yaml.

To Do

  • [x] Migrate to Pytorch 0.2.0. Breaks compatibility with 0.1.12.
  • [x] Add some training curves.
  • [x] Run on Atari.
  • [x] Add proper evaluation.

Results

First row is with batch size of 64, the second with 32. Will run on more seeds and average for a better comparison. Working on adding Atari results.

Catch Learning Curves

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].