miyosuda / Unreal

Licence: other
Reinforcement learning with unsupervised auxiliary tasks

UNREAL

About

A replication of the UNREAL algorithm described in Google DeepMind's paper "Reinforcement Learning with Unsupervised Auxiliary Tasks".

https://arxiv.org/pdf/1611.05397.pdf

Implemented with TensorFlow and the DeepMind Lab environment.

Preview

seekavoid_arena_01

[Preview image: seekavoid_arena_01]

stairway_to_melon

[Preview image: stairway_to_melon]

nav_maze_static_01

[Preview image: nav_maze_static_01]

Network

[Figure: network architecture]

All weights of the convolution layers and the LSTM layer are shared between the base agent and the auxiliary tasks.
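
As a rough illustration of what weight sharing means here, the toy sketch below lets several task heads read from one trunk object, so a change to the trunk's weight is seen by every head at once. It is not the repository's TensorFlow code: `SharedTrunk`, `TaskHead`, the scalar "weight", and the scale values are hypothetical stand-ins, and the task names are taken from the UNREAL paper rather than from this README. Which layers each task actually reads from is not stated here.

```python
class SharedTrunk:
    """Toy stand-in for the shared convolution + LSTM layers (one scalar weight)."""

    def __init__(self, weight):
        self.weight = weight

    def features(self, observation):
        return self.weight * observation


class TaskHead:
    """Toy task-specific output layer that reads its features from a trunk."""

    def __init__(self, trunk, scale):
        self.trunk = trunk  # a reference to the trunk, not a copy
        self.scale = scale

    def output(self, observation):
        return self.scale * self.trunk.features(observation)


trunk = SharedTrunk(weight=2.0)

# Task names follow the UNREAL paper; the scales are arbitrary (assumption).
heads = {
    "base_a3c": TaskHead(trunk, 1.0),
    "pixel_control": TaskHead(trunk, 0.5),
    "value_replay": TaskHead(trunk, 3.0),
    "reward_prediction": TaskHead(trunk, -1.0),
}


def outputs(observation):
    """Evaluate every head on the same observation."""
    return {name: head.output(observation) for name, head in heads.items()}
```

Because every head holds a reference to the same `trunk`, assigning a new value to `trunk.weight` changes the result of `outputs(...)` for all four heads, which is the effect that sharing layer weights has during training.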

Requirements

  • TensorFlow (Tested with r1.0)
  • DeepMind Lab
  • numpy
  • cv2
  • pygame
  • matplotlib
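
Before training, it can help to check which of these packages are importable. The following is a minimal sketch using only the standard library; the import names in `REQUIRED_MODULES` are assumptions about how each requirement is imported (`deepmind_lab` in particular is typically only importable when the script is launched through Bazel, so a "missing" report outside Bazel is not necessarily an error).

```python
import importlib.util

# Assumed import name for each requirement listed above.
REQUIRED_MODULES = {
    "TensorFlow": "tensorflow",
    "DeepMind Lab": "deepmind_lab",
    "numpy": "numpy",
    "cv2": "cv2",
    "pygame": "pygame",
    "matplotlib": "matplotlib",
}


def find_missing(modules=REQUIRED_MODULES):
    """Return the display names of requirements that cannot be located."""
    missing = []
    for display_name, import_name in modules.items():
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(import_name) is None:
            missing.append(display_name)
    return missing


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing requirements: " + ", ".join(missing))
    else:
        print("All requirements are importable.")
```

This only checks that the modules can be found; it does not verify versions such as the TensorFlow r1.0 release the project was tested with.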

Result

"seekavoid_arena_01" Level

[Score plot: seekavoid_01_score]

"nav_maze_static_01" Level

[Score plot: nav_maze_static_01_score]

How to train

First, download and install DeepMind Lab:

$ git clone https://github.com/deepmind/lab.git

Then build it following the build instructions: https://github.com/deepmind/lab/blob/master/docs/build.md

Clone this repo into the lab directory:

$ cd lab
$ git clone https://github.com/miyosuda/unreal.git

Add this Bazel instruction at the end of the lab/BUILD file:

package(default_visibility = ["//visibility:public"])
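
If you prefer to script this step, the sketch below appends the line only when it is not already present, so running it twice does not duplicate the instruction. The helper name `ensure_visibility` is hypothetical, and the path you pass should point at the BUILD file of your own DeepMind Lab checkout.

```python
from pathlib import Path

# The exact line this README asks you to add to lab/BUILD.
VISIBILITY_LINE = 'package(default_visibility = ["//visibility:public"])'


def ensure_visibility(build_file):
    """Append VISIBILITY_LINE to build_file unless it is already there.

    Returns True if the file was modified, False otherwise.
    """
    path = Path(build_file)
    text = path.read_text()
    # Compare whole lines so a partial match elsewhere does not count.
    if any(line.strip() == VISIBILITY_LINE for line in text.splitlines()):
        return False
    # Make sure the existing content ends with a newline before appending.
    if text and not text.endswith("\n"):
        text += "\n"
    path.write_text(text + VISIBILITY_LINE + "\n")
    return True
```

For example, `ensure_visibility("lab/BUILD")` run from the directory that contains the `lab` checkout returns `True` the first time and `False` afterwards.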

Then run this Bazel command to start training:

$ bazel run //unreal:train --define headless=glx

--define headless=glx uses GPU rendering, which requires that the display does not go to sleep, so disable display sleep before training.

If you have any trouble with GPU rendering, use software rendering with the --define headless=osmesa option instead.

How to show result

To display the results after training, run this command:

$ bazel run //unreal:display --define headless=glx
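
The training and display commands differ only in the Bazel target, and the rendering mode is selected by the `headless` define. The sketch below assembles either command as an argument list; `bazel_command`, `TARGETS`, and `RENDERERS` are hypothetical names, and using `osmesa` with the display target is an assumption, since this README only shows it as a fallback for training.

```python
# Bazel targets and rendering modes mentioned in this README.
TARGETS = {"train": "//unreal:train", "display": "//unreal:display"}
RENDERERS = ("glx", "osmesa")  # glx: GPU rendering, osmesa: software rendering


def bazel_command(action, renderer="glx"):
    """Build the argument list for the given action and rendering mode."""
    if action not in TARGETS:
        raise ValueError("unknown action: %r" % (action,))
    if renderer not in RENDERERS:
        raise ValueError("unknown renderer: %r" % (renderer,))
    return ["bazel", "run", TARGETS[action], "--define", "headless=" + renderer]


if __name__ == "__main__":
    # Print each command instead of running it.
    for action in TARGETS:
        for renderer in RENDERERS:
            print(" ".join(bazel_command(action, renderer)))
```

The returned list can be passed to `subprocess.run(..., check=True, cwd=...)` with `cwd` set to your lab checkout if you want to launch Bazel from Python.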