
jcwleo / Mario_rl

License: MIT


Super Mario Bros RL


1. Setup

Requirements


2. How to Train

Modify the parameters in mario_a2c.py as you like.

python3 mario_a2c.py

or

python3 mario_ppo.py
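The second script is named after PPO (reference [4]). As a reminder of what that objective looks like, here is a minimal pure-Python sketch of the clipped surrogate from the PPO paper. It is not taken from mario_ppo.py: the function name, the clip_eps default, and the sample numbers are all illustrative assumptions.

```python
def ppo_clip_objective(ratios, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective (to be maximized).

    ratios:     pi_new(a|s) / pi_old(a|s) for each sampled action
    advantages: advantage estimate for each sampled action
    clip_eps:   clipping range (assumed value; check the script's parameters)
    """
    total = 0.0
    for ratio, adv in zip(ratios, advantages):
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(ratios)


# Hypothetical batch of three transitions.
ratios = [1.5, 0.5, 1.0]
advantages = [1.0, -1.0, 2.0]
objective = ppo_clip_objective(ratios, advantages)
print(objective)
```

The clipping removes any incentive to push the new policy far from the old one in a single update, which is the main practical difference from the plain A2C update.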

3. How to Eval

Modify the is_load_model and is_render parameters in mario_a2c.py as you like.

python3 mario_a2c.py

or

python3 mario_ppo.py

4. Loss/Reward Graph

A2C (PAAC) only: [image: loss/reward graph]
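For readers unfamiliar with A2C, the quantities behind such a loss curve can be sketched as follows. This is a generic pure-Python illustration of the advantage actor-critic loss, not the code in mario_a2c.py: the function names, gamma, value_coef, and the sample rollout are assumptions, and the entropy bonus that many implementations add is omitted. PAAC (reference [2]) applies the same kind of update to a batch collected synchronously from many parallel environments.

```python
def discounted_returns(rewards, bootstrap_value, gamma=0.99):
    """n-step returns, computed backwards from a bootstrap value estimate."""
    returns = []
    running = bootstrap_value
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def a2c_loss(log_probs, values, returns, value_coef=0.5):
    """Policy-gradient loss plus weighted value loss (to be minimized).

    In a real implementation the advantage is treated as a constant
    in the policy term, so no gradient flows through it there.
    """
    n = len(returns)
    advantages = [ret - v for ret, v in zip(returns, values)]
    policy_loss = -sum(lp * adv for lp, adv in zip(log_probs, advantages)) / n
    value_loss = sum(adv ** 2 for adv in advantages) / n
    return policy_loss + value_coef * value_loss


# Hypothetical 3-step rollout from one environment.
rewards = [1.0, 0.0, 1.0]
returns = discounted_returns(rewards, bootstrap_value=0.0, gamma=0.5)
log_probs = [-0.5, -1.0, -0.25]   # log pi(a_t | s_t) of the taken actions
values = [1.0, 0.5, 0.5]          # critic estimates V(s_t)
loss = a2c_loss(log_probs, values, returns)
print(returns, loss)
```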


ICM only, with no extrinsic reward (curiosity-driven): [image: loss/reward graph]
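In the curiosity-driven setup of reference [5], the agent's reward is the prediction error of a learned forward model in feature space. A minimal sketch of that intrinsic reward follows. The function name, the eta value, and the feature vectors are illustrative assumptions, and the feature encoder and forward model that produce the vectors are left out.

```python
def intrinsic_reward(predicted_next_features, actual_next_features, eta=0.01):
    """Curiosity reward: (eta / 2) * squared L2 distance between the
    forward model's predicted features and the encoded next state."""
    squared_error = sum(
        (pred - actual) ** 2
        for pred, actual in zip(predicted_next_features, actual_next_features)
    )
    return 0.5 * eta * squared_error


# Hypothetical feature vectors. A poorly predicted transition earns reward,
# a perfectly predicted one earns none.
novel = intrinsic_reward([0.0, 1.0], [1.0, 1.0], eta=2.0)
familiar = intrinsic_reward([1.0, 1.0], [1.0, 1.0], eta=2.0)
print(novel, familiar)
```

With no extrinsic reward, this value is the only training signal, so the agent is rewarded for reaching states its forward model cannot yet predict.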

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Noisy Networks for Exploration
[4] Proximal Policy Optimization Algorithms
[5] Curiosity-driven Exploration by Self-supervised Prediction
