All Projects → happywu → A3c

happywu / A3c

MXNET + OpenAI Gym implementation of A3C from "Asynchronous Methods for Deep Reinforcement Learning"

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to A3c

Bombora
My experimentations with Reinforcement Learning in Pytorch
Stars: ✭ 18 (+100%)
Mutual labels:  reinforcement-learning, a3c
Rl a3c pytorch
A3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (+5255.56%)
Mutual labels:  reinforcement-learning, a3c
Ai Blog
Accompanying repository for Let's make a DQN / A3C series.
Stars: ✭ 351 (+3800%)
Mutual labels:  reinforcement-learning, a3c
Pysc2 Agents
This is a simple implementation of DeepMind's PySC2 RL agents.
Stars: ✭ 262 (+2811.11%)
Mutual labels:  reinforcement-learning, a3c
Machine Learning Curriculum
💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+8355.56%)
Mutual labels:  reinforcement-learning, mxnet
Deeprl Tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (+3444.44%)
Mutual labels:  reinforcement-learning, a3c
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+4811.11%)
Mutual labels:  reinforcement-learning, a3c
Rlcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
Stars: ✭ 184 (+1944.44%)
Mutual labels:  reinforcement-learning, a3c
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+77100%)
Mutual labels:  reinforcement-learning, a3c
Pytorch Rl
Deep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (+8177.78%)
Mutual labels:  reinforcement-learning, a3c
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+9944.44%)
Mutual labels:  reinforcement-learning, a3c
Super Mario Bros A3c Pytorch
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Stars: ✭ 775 (+8511.11%)
Mutual labels:  reinforcement-learning, a3c
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+32855.56%)
Mutual labels:  reinforcement-learning, mxnet
Rl4j
Deep Reinforcement Learning for the JVM (Deep-Q, A3C)
Stars: ✭ 330 (+3566.67%)
Mutual labels:  reinforcement-learning, a3c
Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+31711.11%)
Mutual labels:  reinforcement-learning, a3c
Deep Rl Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Stars: ✭ 395 (+4288.89%)
Mutual labels:  reinforcement-learning, a3c
Coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Stars: ✭ 2,085 (+23066.67%)
Mutual labels:  reinforcement-learning, mxnet
Tensorflow Rl
Implementations of deep RL papers and random experimentation
Stars: ✭ 176 (+1855.56%)
Mutual labels:  reinforcement-learning, a3c
Async deep reinforce
Asynchronous Methods for Deep Reinforcement Learning
Stars: ✭ 565 (+6177.78%)
Mutual labels:  reinforcement-learning, a3c
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+8400%)
Mutual labels:  reinforcement-learning, a3c

Still in progress.

A3C

This is a MXNET implementation of A3C as described in "Asynchronous Methods for Deep Reinforcement Learning.

Requirement

  • openai gym
  • mxnet

Flappy Bird

Game source from Using Deep Q-Network to Learn How To Play Flappy Bird.

If you don't want to run FlappyBird, you can ignore this.

To run experiment:

python a3c.py --game-source=flappybird --num-threads=16 --save-model-prefix=a3c-flappybird --save-every=1000

To eval, I have upload a checkpoint of mine, you could try your own parameters.

python a3c.py --test --model-prefix=a3ce-8 --load-epoch=305000 --game-source=flappybird

Notice

If you train on computer without GPUS, please change "devs = gpu(1)" to "devs = cpu()"

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].