Still in progress.

A3C

This is an MXNet implementation of A3C as described in "Asynchronous Methods for Deep Reinforcement Learning".
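As a rough illustration of the quantity A3C workers compute before each gradient step, here is a minimal sketch of n-step discounted returns and advantages in plain Python. It is not taken from this repository: the function names, the toy rewards, and the value estimates are assumptions made for the example.

```python
def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted returns for one rollout, computed backwards.

    bootstrap_value is the critic's estimate for the state after the
    last step (0.0 if the episode ended there).
    """
    returns = []
    running = bootstrap_value
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage = n-step return minus the critic's value estimate."""
    return [ret - val for ret, val in zip(returns, values)]


# Toy rollout of three steps (made-up numbers).
rewards = [0.0, 0.0, 1.0]
values = [0.5, 0.6, 0.7]
returns = n_step_returns(rewards, bootstrap_value=0.0, gamma=0.5)
advs = advantages(returns, values)
print(returns)
print(advs)
```

In the paper's formulation, the advantage scales the policy-gradient term for the actor, while the squared difference between return and value drives the critic update.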

Requirements

OpenAI Gym
MXNet

Flappy Bird

The game source comes from "Using Deep Q-Network to Learn How To Play Flappy Bird".

If you don't plan to run Flappy Bird, you can skip this section.

To run an experiment:

python a3c.py --game-source=flappybird --num-threads=16 --save-model-prefix=a3c-flappybird --save-every=1000

To evaluate, I have uploaded one of my checkpoints; you can also try your own parameters.

python a3c.py --test --model-prefix=a3ce-8 --load-epoch=305000 --game-source=flappybird
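For readers who want to see how the flags in the two commands above fit together, here is a hypothetical argparse sketch. Only the flag names come from the commands; the types, defaults, and the `build_parser` helper are assumptions, and the real parser in a3c.py may differ.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags used in the commands above."""
    parser = argparse.ArgumentParser(description="A3C launcher (sketch)")
    # Training flags (types and defaults are assumptions).
    parser.add_argument("--game-source", default=None)
    parser.add_argument("--num-threads", type=int, default=1)
    parser.add_argument("--save-model-prefix", default=None)
    parser.add_argument("--save-every", type=int, default=1000)
    # Evaluation flags.
    parser.add_argument("--test", action="store_true")
    parser.add_argument("--model-prefix", default=None)
    parser.add_argument("--load-epoch", type=int, default=None)
    return parser


train_args = build_parser().parse_args([
    "--game-source=flappybird",
    "--num-threads=16",
    "--save-model-prefix=a3c-flappybird",
    "--save-every=1000",
])
eval_args = build_parser().parse_args([
    "--test",
    "--model-prefix=a3ce-8",
    "--load-epoch=305000",
    "--game-source=flappybird",
])
print(train_args.num_threads, eval_args.load_epoch)
```

Note that argparse exposes dashed flags with underscores, so `--num-threads` is read as `num_threads`.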

Notice

If you train on a computer without GPUs, please change "devs = gpu(1)" to "devs = cpu()".
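One way to avoid editing the source by hand is to choose the device from the number of available GPUs. The sketch below is an assumption about how that could look, not code from this repository: `pick_device_spec` and `to_mxnet_context` are hypothetical helpers, and the GPU count is passed in by the caller. `mx.gpu(i)` and `mx.cpu()` are the standard MXNet context constructors.

```python
def pick_device_spec(num_gpus, preferred_index=1):
    """Choose a device as a (kind, index) pair.

    preferred_index=1 mirrors the "gpu(1)" setting mentioned above,
    which refers to the second GPU.
    """
    if num_gpus > preferred_index:
        return ("gpu", preferred_index)
    if num_gpus > 0:
        return ("gpu", 0)
    return ("cpu", 0)


def to_mxnet_context(spec):
    """Turn a (kind, index) pair into an MXNet context."""
    import mxnet as mx  # imported lazily so the helper above works without MXNet

    kind, index = spec
    return mx.gpu(index) if kind == "gpu" else mx.cpu()


# Example decisions for machines with 0, 1, and 2 GPUs.
print(pick_device_spec(0))
print(pick_device_spec(1))
print(pick_device_spec(2))
```

With MXNet installed, `devs = to_mxnet_context(pick_device_spec(num_gpus))` would replace the hard-coded assignment.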
