All Projects → wenkesj → mario

wenkesj / mario

Licence: other
Super Mario Reinforcement Learning from Demonstration

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to mario

rl trading
No description or website provided.
Stars: ✭ 14 (-44%)
Mutual labels:  openai-gym, tensorforce
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+788%)
Mutual labels:  openai-gym
Finrl Library
FinRL: Financial Reinforcement Learning Framework. Please star. 🔥
Stars: ✭ 3,037 (+12048%)
Mutual labels:  openai-gym
A3c continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
Stars: ✭ 223 (+792%)
Mutual labels:  openai-gym
Coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Stars: ✭ 2,085 (+8240%)
Mutual labels:  openai-gym
awesome-isaac-gym
A curated list of awesome NVIDIA Issac Gym frameworks, papers, software, and resources
Stars: ✭ 373 (+1392%)
Mutual labels:  openai-gym
Ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Stars: ✭ 133 (+432%)
Mutual labels:  openai-gym
gym-rs
OpenAI's Gym written in pure Rust for blazingly fast performance
Stars: ✭ 34 (+36%)
Mutual labels:  openai-gym
RLGC
An open-source platform for applying Reinforcement Learning for Grid Control (RLGC)
Stars: ✭ 85 (+240%)
Mutual labels:  openai-gym
Ns3 Gym
ns3-gym - The Playground for Reinforcement Learning in Networking Research
Stars: ✭ 221 (+784%)
Mutual labels:  openai-gym
Gymfc
A universal flight control tuning framework
Stars: ✭ 210 (+740%)
Mutual labels:  openai-gym
Tensorflow Rl
Implementations of deep RL papers and random experimentation
Stars: ✭ 176 (+604%)
Mutual labels:  openai-gym
yarll
Combining deep learning and reinforcement learning.
Stars: ✭ 84 (+236%)
Mutual labels:  openai-gym
Gym Fx
Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (+504%)
Mutual labels:  openai-gym
deep-rl-docker
Docker image with OpenAI Gym, Baselines, MuJoCo and Roboschool, utilizing TensorFlow and JupyterLab.
Stars: ✭ 44 (+76%)
Mutual labels:  openai-gym
Holdem
🃏 OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning
Stars: ✭ 135 (+440%)
Mutual labels:  openai-gym
Deep Reinforcement Learning Gym
Deep reinforcement learning model implementation in Tensorflow + OpenAI gym
Stars: ✭ 200 (+700%)
Mutual labels:  openai-gym
Ma Gym
A collection of multi agent environments based on OpenAI gym.
Stars: ✭ 226 (+804%)
Mutual labels:  openai-gym
a3c-super-mario-pytorch
Reinforcement Learning for Super Mario Bros using A3C on GPU
Stars: ✭ 35 (+40%)
Mutual labels:  openai-gym
ddp-gym
Differential Dynamic Programming controller operating in OpenAI Gym environment.
Stars: ✭ 70 (+180%)
Mutual labels:  openai-gym

teaching mario to play mario

This is a very simple example using reinforceio/tensorforce, tensorflow and openai/gym. This was a small weekend project when DQfD was first published and I thought others might use it as a positive-slope for their experiments getting started. I don't have the access to computing power to let this little guy run for long periods of time. Most of the agents I've trained here make use of my own demonstrations playing mario and they are sparse. I must say, I think the agent is just about as good as I am (that's definitely not saying much). I eventually want to combine this technique with evolution strategies, since both frameworks are very straight-forward, scalable, and easy to implement. It would be interesting to combine the weighted ranking algorithm of evolution strategies with replay prioritization to make the next generation of parameters a function of the demonstrations of past generations (if that makes sense, more on that later).

cnn

cnn+lstm

If you want specific access to more documentation and details, please send me an email and I may be able to help!

NOTE: that this will probably not run on your system because it required a few changes to openai/gym and ppaquette/gym-super-mario, as well as reinforceio/tensorforce :(. I basically steem-rolled the errors that came up (openai/gym changes were due to a depreciation of an Env Wrapper that ppaquette/gym-super-mario used. I chose to remedy this issue by cherrypicking the changes and placing it into ppaquette/gym-super-mario, since it was removed without depreciation warning.)

The reinforceio/tensorforce changes are a little more complicated, since they involved a small DQFDAgent change and a few code fixes - I will most likely submit a pull request for most of these libraries so everyone can use this.

attempt to install

# install/upgrade tensorforce
pip install --upgrade tensorforce
# install gym extensions
pip install gym_pull
# install mario environment
pip install ppaquette_gym_super_mario
python -c "import gym_pull; gym_pull.pull('github.com/ppaquette/gym-super-mario')"

attempt to teach/train and validate

# if your starting from scratch
mkdir agents demos monitors
# now run this, change if necessary
python mario.py ppaquette/meta-SuperMarioBros-v0 \
  -a DQFDAgent -c mario_agent.json -pt \
  -ld ./demos/ -s ./agents/ -m ./monitors/ -mv 100 -D

recording how bad you are at playing the game

python demo.py ppaquette/meta-SuperMarioBros-v0 -s .
# start playing... it loops over when you die and starts a new demo file.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].