Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → dbobrenko → Async Deeprl

dbobrenko / Async Deeprl

Licence: mit

Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning

Programming Languages

python

139335 projects - #7 most used programming language

Labels

deep-learning tensorflow reinforcement-learning asynchronous deep-reinforcement-learning openai-gym q-learning

Projects that are alternatives of or similar to Async Deeprl

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+1354.55%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym, q-learning

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (+904.55%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, q-learning

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (+795.45%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Basic reinforcement learning

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

Stars: ✭ 826 (+1777.27%)

Mutual labels: reinforcement-learning, openai-gym, q-learning

Learningx

Deep & Classical Reinforcement Learning + Machine Learning Examples in Python

Stars: ✭ 241 (+447.73%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, q-learning

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (+404.55%)

Mutual labels: deep-reinforcement-learning, openai-gym, q-learning

Rl Book

Source codes for the book "Reinforcement Learning: Theory and Python Implementation"

Stars: ✭ 464 (+954.55%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Gym Fx

Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress

Stars: ✭ 151 (+243.18%)

Mutual labels: reinforcement-learning, openai-gym, q-learning

Dissecting Reinforcement Learning

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Stars: ✭ 512 (+1063.64%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, q-learning

Rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Stars: ✭ 980 (+2127.27%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Deterministic Gail Pytorch

PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning

Stars: ✭ 44 (+0%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Deep Rl Trading

playing idealized trading games with deep reinforcement learning

Stars: ✭ 228 (+418.18%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, q-learning

2048 Deep Reinforcement Learning

Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning

Stars: ✭ 169 (+284.09%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, q-learning

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+9018.18%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Accel Brain Code

The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.

Stars: ✭ 166 (+277.27%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, q-learning

Mushroom Rl

Python library for Reinforcement Learning.

Stars: ✭ 442 (+904.55%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Hierarchical Actor Critic Hac Pytorch

PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments

Stars: ✭ 116 (+163.64%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

Stars: ✭ 118 (+168.18%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Rl a3c pytorch

A3C LSTM Atari with Pytorch plus A3G design

Stars: ✭ 482 (+995.45%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, openai-gym

Gym Anytrading

The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

Stars: ✭ 627 (+1325%)

Mutual labels: reinforcement-learning, openai-gym, q-learning

View All Similar Projects ➔

Asynchronous Deep Reinforcement Learning

Intuition, implementation description and training results can be found here.

An attempt to implement asynchronous one-step Q-Learning from Google DeepMind's paper "Asynchronous Methods for Deep Reinforcement Learning", Mnih et al., 2016.

Benchmarks for current implementation of Asynchronous one-step Q-Learning:

Device	Input shape	FPS (skipped frames was not counted)
GPU GTX 980 Ti	84x84x4	~530
CPU Core i7-3770 @ 3.40GHz (4 cores, 8 threads)	84x84x4	~300

Requirements

Linux based OS or Mac OS X;
Anaconda package (recommended);

OR manually install python (both 2.7+ and 3.5+ versions are supported), and run in terminal:
```
pip install six
pip install future
pip install scipy
pip install pillow
```
TensorFlow;
OpenAI Gym.

Usage

To train your own model on 'Atari 2600 SpaceInvaders', simply run:

python run_dqn.py

To specify another environment, use --env flag, e.g:

python run_dqn.py --env Pong-v0

All available environments you can check here. Note, that current implementation supports environments only with raw pixels observations. Tested OpenAI Gym environments:

SpaceInvaders-v0
Pong-v0

To change amount of spawned threads, use --threads flag (by default = 8).

To use GPU instead of cpu, pass --gpu flag.

All available flags can be checked by: python run_dqn.py --help

To read TensorBoard logs, use: tensorboard --logdir=path/to/logdir

Trained models

To use pretrained agent, or change log folder, just use --logdir flag:

python run_dqn.py --logdir path/to/checkpoint/folder/

Model, trained on SpaceInvaders, over 80 millions of frames, can be downloaded from here.

Evaluation

To evaluate trained agent, use:

python run_dqn.py --eval --eval_dir folder/for/evaluation/write --logdir path/to/checkpoint/folder/

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 44

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗