All Projects → JohanSamir → revisiting_rainbow

JohanSamir / revisiting_rainbow

Licence: Apache-2.0 license
Revisiting Rainbow

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to revisiting rainbow

Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+228.17%)
Mutual labels:  deep-reinforcement-learning, rl
Ai Economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
Stars: ✭ 537 (+656.34%)
Mutual labels:  ml, deep-reinforcement-learning
Learning To Communicate Pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Stars: ✭ 236 (+232.39%)
Mutual labels:  deep-reinforcement-learning, rl
Exploration By Disagreement
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
Stars: ✭ 99 (+39.44%)
Mutual labels:  deep-reinforcement-learning, rl
jax-rl
JAX implementations of core Deep RL algorithms
Stars: ✭ 61 (-14.08%)
Mutual labels:  deep-reinforcement-learning, jax
Aws Robomaker Sample Application Deepracer
Use AWS RoboMaker and demonstrate running a simulation which trains a reinforcement learning (RL) model to drive a car around a track
Stars: ✭ 105 (+47.89%)
Mutual labels:  deep-reinforcement-learning, rl
Irl Imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Stars: ✭ 333 (+369.01%)
Mutual labels:  ml, rl
Trax
Trax — Deep Learning with Clear Code and Speed
Stars: ✭ 6,666 (+9288.73%)
Mutual labels:  deep-reinforcement-learning, jax
Real Time Ml Project
A curated list of applied machine learning and data science notebooks and libraries across different industries.
Stars: ✭ 143 (+101.41%)
Mutual labels:  ml, rl
Ros2learn
ROS 2 enabled Machine Learning algorithms
Stars: ✭ 119 (+67.61%)
Mutual labels:  ml, rl
Rlenv.directory
Explore and find reinforcement learning environments in a list of 150+ open source environments.
Stars: ✭ 79 (+11.27%)
Mutual labels:  deep-reinforcement-learning, rl
gym-rs
OpenAI's Gym written in pure Rust for blazingly fast performance
Stars: ✭ 34 (-52.11%)
Mutual labels:  ml, rl
Muzero General
MuZero
Stars: ✭ 1,187 (+1571.83%)
Mutual labels:  deep-reinforcement-learning, rl
Deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
Stars: ✭ 2,640 (+3618.31%)
Mutual labels:  deep-reinforcement-learning, rainbow
Noreward Rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Stars: ✭ 1,176 (+1556.34%)
Mutual labels:  deep-reinforcement-learning, rl
neptune-client
📒 Experiment tracking tool and model registry
Stars: ✭ 348 (+390.14%)
Mutual labels:  ml, rl
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (+277.46%)
Mutual labels:  deep-reinforcement-learning, rl
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+522.54%)
Mutual labels:  deep-reinforcement-learning, rl
Dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Stars: ✭ 9,681 (+13535.21%)
Mutual labels:  ml, rl
Atari
AI research environment for the Atari 2600 games 🤖.
Stars: ✭ 174 (+145.07%)
Mutual labels:  ml, rl

Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research

In this work we argue that, despite the community’s emphasis on large-scale environments, the traditional small-scale environments can still yield valuable scientific insights and can help reduce the barriers to entry for underprivileged communities. To substantiate our claims, we empirically revisit the paper which introduced the Rainbow algorithm Hessel et al., 2018 and present some new insights into the algorithms used by Rainbow.

Our rainbow agent implements three addittional components to the rainbow agent proposed by Dopamine. (Pablo Samuel Castro et al., 2018)

if you are interested to know more about Revisiting Rainbow, considering check the following resources:

Quick Start

To use the algorithms proposed in the Revisiting Rainbow paper, you need python3 installed, make sure pip is also up to date. If you want to run the MinAtar experiments you should install it. To install MinAtar, please check the following paper (Young et al., 2019) and repositore (github):

  1. Clone the repo:
https://github.com/JohanSamir/revisiting_rainbow

If you prefer running the algorithms in a virtualenv, you can do the following before step 2:

python3 -m venv venv
source venv/bin/activate
# Upgrade Pip
pip install --upgrade pip
  1. Finally setup the environment and install Revisiting Rainbow's dependencies
pip install -U pip
pip install -r revisiting_rainbow/requirements.txt

Running tests

Check the following colab file revisiting_rainbow/test_main.ipynb to run the basic DQN agent.

References

Hado van Hasselt, Arthur Guez, and David Silver. Deep reinforcement learning with double q-learning. In Proceedings of the Thirthieth AAAI Conference On Artificial Intelligence (AAAI), 2016.

Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, and David Silver. Rainbow: Combining Improvements in Deep Reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence, 2018.

Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alexander Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, and Shane Legg. Noisy networks for exploration. In Proceedings of the International Conference on Representation Learning (ICLR 2018), Vancouver (Canada), 2018.

Pablo Samuel Castro, Subhodeep Moitra, Carles Gelada, Saurabh Kumar, and Marc G. Bellemare. Dopamine: A Research Framework for Deep Reinforcement Learning, 2018.

Kenny Young and Tian Tian. Minatar: An atari-inspired testbed for thorough and reproducible reinforcement learning experiments, 2019.

Ziyu Wang, Tom Schaul, Matteo Hessel, Hado Hasselt, Marc Lanctot, and Nando Freitas. Dueling network architectures for deep reinforcement learning. In Proceedings of the 33rd International Conference on Machine Learning, volume 48, pages 1995–2003, 2016.

Vieillard, N., Pietquin, O., and Geist, M. Munchausen Reinforcement Learning. In Advances in Neural Information Processing Systems (NeurIPS), 2020.

Giving credit

If you use Revisiting Rainbow in your research please cite the following:

Johan S Obando-Ceron, & Pablo Samuel Castro (2020). Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research. Proceedings of the 38th International Conference on Machine Learning, ICML 2021. arXiv preprint:

In BibTeX format:

@inproceedings{obando2020revisiting,
  title={Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research},
  author={Obando-Ceron, Johan S and Castro, Pablo Samuel},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning},
  year = {2021},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].