jcwleo / Mario_rl
Licence: mit
Stars: ✭ 60
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Mario rl
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+4286.67%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (+16.67%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Pytorch Cpp Rl
PyTorch C++ Reinforcement Learning
Stars: ✭ 353 (+488.33%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Run Skeleton Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
Stars: ✭ 83 (+38.33%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+11480%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+288.33%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Deeprl Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Stars: ✭ 748 (+1146.67%)
Mutual labels: reinforcement-learning, ppo, actor-critic
Reaver
Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
Stars: ✭ 499 (+731.67%)
Mutual labels: reinforcement-learning, actor-critic
Dissecting Reinforcement Learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Stars: ✭ 512 (+753.33%)
Mutual labels: reinforcement-learning, actor-critic
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+966.67%)
Mutual labels: reinforcement-learning, ppo
Rlseq2seq
Deep Reinforcement Learning For Sequence to Sequence Models
Stars: ✭ 683 (+1038.33%)
Mutual labels: reinforcement-learning, actor-critic
Rl a3c pytorch
A3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (+703.33%)
Mutual labels: reinforcement-learning, actor-critic
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (+700%)
Mutual labels: reinforcement-learning, actor-critic
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (+858.33%)
Mutual labels: reinforcement-learning, ppo
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+636.67%)
Mutual labels: reinforcement-learning, actor-critic
Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+996.67%)
Mutual labels: reinforcement-learning, ppo
Autonomous Learning Library
A PyTorch library for building deep reinforcement learning agents.
Stars: ✭ 425 (+608.33%)
Mutual labels: reinforcement-learning, ppo
Super Mario Bros Ppo Pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Stars: ✭ 649 (+981.67%)
Mutual labels: reinforcement-learning, ppo
Pytorch Rl
Deep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (+1141.67%)
Mutual labels: reinforcement-learning, actor-critic
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+1406.67%)
Mutual labels: reinforcement-learning, ppo
Super Mario Bros RL
-
[x] Advantage Actor critic [1]
-
[x] Parallel Advantage Actor critic [2]
-
[x] Noisy Networks for Exploration [3]
-
[x] Proximal Policy Optimization Algorithms [4]
-
[x] Curiosity-driven Exploration by Self-supervised Prediction [5] (WIP)
1. Setup
Requirements
- python3.6
- gym-super-mario-bros
- OpenCV Python
- PyTorch
- tensorboardX
2. How to Train
Modify the parameters in mario_a2c.py
as you like.
python3 mario_a2c.py
or
python3 mario_ppo.py
3. How to Eval
Modify the is_load_model
, is_render
parameters in mario_a2c.py
as you like.
python3 mario_a2c.py
or
python3 mario_ppo.py
4. Loss/Reward Graph
It use just ICM and no ext reward.(Curiosity-driven)
References
[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Noisy Networks for Exploration
[4] Proximal Policy Optimization Algorithms
[5] Curiosity-driven Exploration by Self-supervised Prediction
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].