All Projects → dbobrenko → Async Deeprl

dbobrenko / Async Deeprl

Licence: mit
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Async Deeprl

Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+1354.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym, q-learning
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+904.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, q-learning
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+795.45%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Basic reinforcement learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Stars: ✭ 826 (+1777.27%)
Mutual labels:  reinforcement-learning, openai-gym, q-learning
Learningx
Deep & Classical Reinforcement Learning + Machine Learning Examples in Python
Stars: ✭ 241 (+447.73%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, q-learning
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+404.55%)
Mutual labels:  deep-reinforcement-learning, openai-gym, q-learning
Rl Book
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (+954.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Gym Fx
Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (+243.18%)
Mutual labels:  reinforcement-learning, openai-gym, q-learning
Dissecting Reinforcement Learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Stars: ✭ 512 (+1063.64%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, q-learning
Rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Stars: ✭ 980 (+2127.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (+0%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Deep Rl Trading
playing idealized trading games with deep reinforcement learning
Stars: ✭ 228 (+418.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, q-learning
2048 Deep Reinforcement Learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
Stars: ✭ 169 (+284.09%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, q-learning
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+9018.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Accel Brain Code
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
Stars: ✭ 166 (+277.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, q-learning
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+904.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Hierarchical Actor Critic Hac Pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Stars: ✭ 116 (+163.64%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Reinforcementlearning Atarigame
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
Stars: ✭ 118 (+168.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Rl a3c pytorch
A3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (+995.45%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Gym Anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
Stars: ✭ 627 (+1325%)
Mutual labels:  reinforcement-learning, openai-gym, q-learning

Asynchronous Deep Reinforcement Learning

Intuition, implementation description and training results can be found here.

An attempt to implement asynchronous one-step Q-Learning from Google DeepMind's paper "Asynchronous Methods for Deep Reinforcement Learning", Mnih et al., 2016.

async-qlearning-spaceinvaders

Benchmarks for current implementation of Asynchronous one-step Q-Learning:

Device Input shape FPS (skipped frames was not counted)
GPU GTX 980 Ti 84x84x4 ~530
CPU Core i7-3770 @ 3.40GHz (4 cores, 8 threads) 84x84x4 ~300

Requirements

  1. Linux based OS or Mac OS X;

  2. Anaconda package (recommended);

    OR manually install python (both 2.7+ and 3.5+ versions are supported), and run in terminal:

    pip install six
    pip install future
    pip install scipy
    pip install pillow
    
  3. TensorFlow;

  4. OpenAI Gym.

Usage

To train your own model on 'Atari 2600 SpaceInvaders', simply run:

python run_dqn.py

To specify another environment, use --env flag, e.g:

python run_dqn.py --env Pong-v0

All available environments you can check here. Note, that current implementation supports environments only with raw pixels observations. Tested OpenAI Gym environments:

  • SpaceInvaders-v0
  • Pong-v0

To change amount of spawned threads, use --threads flag (by default = 8).

To use GPU instead of cpu, pass --gpu flag.

All available flags can be checked by: python run_dqn.py --help

To read TensorBoard logs, use: tensorboard --logdir=path/to/logdir

Trained models

To use pretrained agent, or change log folder, just use --logdir flag:

python run_dqn.py --logdir path/to/checkpoint/folder/

Model, trained on SpaceInvaders, over 80 millions of frames, can be downloaded from here.

Evaluation

To evaluate trained agent, use:

python run_dqn.py --eval --eval_dir folder/for/evaluation/write --logdir path/to/checkpoint/folder/
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].