Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → jcwleo → Mario_rl

jcwleo / Mario_rl

Licence: mit

Programming Languages

python

139335 projects - #7 most used programming language

Labels

deep-learning pytorch reinforcement-learning ppo actor-critic

Projects that are alternatives of or similar to Mario rl

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+4286.67%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Torch Ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Stars: ✭ 70 (+16.67%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Pytorch Cpp Rl

PyTorch C++ Reinforcement Learning

Stars: ✭ 353 (+488.33%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Run Skeleton Run

Reason8.ai PyTorch solution for NIPS RL 2017 challenge

Stars: ✭ 83 (+38.33%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Stars: ✭ 6,948 (+11480%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Pytorch Drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Stars: ✭ 233 (+288.33%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Deeprl Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Stars: ✭ 748 (+1146.67%)

Mutual labels: reinforcement-learning, ppo, actor-critic

Reaver

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.

Stars: ✭ 499 (+731.67%)

Mutual labels: reinforcement-learning, actor-critic

Dissecting Reinforcement Learning

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Stars: ✭ 512 (+753.33%)

Mutual labels: reinforcement-learning, actor-critic

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+966.67%)

Mutual labels: reinforcement-learning, ppo

Rlseq2seq

Deep Reinforcement Learning For Sequence to Sequence Models

Stars: ✭ 683 (+1038.33%)

Mutual labels: reinforcement-learning, actor-critic

Rl a3c pytorch

A3C LSTM Atari with Pytorch plus A3G design

Stars: ✭ 482 (+703.33%)

Mutual labels: reinforcement-learning, actor-critic

Tensorflow Reinforce

Implementations of Reinforcement Learning Models in Tensorflow

Stars: ✭ 480 (+700%)

Mutual labels: reinforcement-learning, actor-critic

Elegantrl

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.

Stars: ✭ 575 (+858.33%)

Mutual labels: reinforcement-learning, ppo

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (+636.67%)

Mutual labels: reinforcement-learning, actor-critic

Pytorch Rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Stars: ✭ 658 (+996.67%)

Mutual labels: reinforcement-learning, ppo

Autonomous Learning Library

A PyTorch library for building deep reinforcement learning agents.

Stars: ✭ 425 (+608.33%)

Mutual labels: reinforcement-learning, ppo

Super Mario Bros Ppo Pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

Stars: ✭ 649 (+981.67%)

Mutual labels: reinforcement-learning, ppo

Pytorch Rl

Deep Reinforcement Learning with pytorch & visdom

Stars: ✭ 745 (+1141.67%)

Mutual labels: reinforcement-learning, actor-critic

Slm Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Stars: ✭ 904 (+1406.67%)

Mutual labels: reinforcement-learning, ppo

View All Similar Projects ➔

Super Mario Bros RL

[x] Advantage Actor critic [1]
[x] Parallel Advantage Actor critic [2]
[x] Noisy Networks for Exploration [3]
[x] Proximal Policy Optimization Algorithms [4]
[x] Curiosity-driven Exploration by Self-supervised Prediction [5] (WIP)
'Random Network Distillation' pytorch model
'Curiosity-driven Exploration' pytorch model

1. Setup

Requirements

2. How to Train

Modify the parameters in mario_a2c.py as you like.

python3 mario_a2c.py

python3 mario_ppo.py

3. How to Eval

Modify the is_load_model, is_render parameters in mario_a2c.py as you like.

python3 mario_a2c.py

python3 mario_ppo.py

4. Loss/Reward Graph

It use just A2C(PAAC)

It use just ICM and no ext reward.(Curiosity-driven)

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Noisy Networks for Exploration
[4] Proximal Policy Optimization Algorithms
[5] Curiosity-driven Exploration by Self-supervised Prediction

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 60

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (9) 🔗