Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

HoME: a Household Multimodal Environment is a platform for artificial agents to learn from vision, audio, semantics, physics, and interaction with objects and other agents, all within a realistic context.

Stars: ✭ 370 (-5.13%)

Mutual labels: reinforcement-learning

Cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features

Stars: ✭ 349 (-10.51%)

Mutual labels: reinforcement-learning

Pypokerengine

Poker engine for poker AI development in Python

Stars: ✭ 388 (-0.51%)

Mutual labels: reinforcement-learning

Gym Duckietown

Self-driving car simulator for the Duckietown universe

Stars: ✭ 379 (-2.82%)

Mutual labels: reinforcement-learning

Qtrader

Reinforcement Learning for Portfolio Management

Stars: ✭ 363 (-6.92%)

Mutual labels: reinforcement-learning

Recnn

Reinforced Recommendation toolkit built around pytorch 1.7

Stars: ✭ 362 (-7.18%)

Mutual labels: reinforcement-learning

Genrl

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL

Stars: ✭ 356 (-8.72%)

Mutual labels: reinforcement-learning

Ark

ARK is a lightweight, agility, elastic, distributed plugin framework written in C++，make it easier and faster to create your own application service.

Stars: ✭ 370 (-5.13%)

Mutual labels: unreal

Meta Rl

Implementation of Meta-RL A3C algorithm

Stars: ✭ 355 (-8.97%)

Mutual labels: reinforcement-learning

Maro

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Stars: ✭ 378 (-3.08%)

Mutual labels: reinforcement-learning

Rl algorithms

Structural implementation of RL key algorithms

Stars: ✭ 352 (-9.74%)

Mutual labels: reinforcement-learning

Text summurization abstractive methods

Multiple implementations for abstractive text summurization , using google colab

Stars: ✭ 359 (-7.95%)

Mutual labels: reinforcement-learning

Reinforcement Learning In Portfolio Management

In this paper, we implement three state-of-art continuous reinforcement learning algorithms, Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO) and Policy Gradient (PG)in portfolio management.

Stars: ✭ 367 (-5.9%)

Mutual labels: reinforcement-learning

Learning Deep Learning

Paper reading notes on Deep Learning and Machine Learning

Stars: ✭ 388 (-0.51%)

Mutual labels: reinforcement-learning

Awesome Monte Carlo Tree Search Papers

A curated list of Monte Carlo tree search papers with implementations.

Stars: ✭ 387 (-0.77%)

Mutual labels: reinforcement-learning

View All Similar Projects ➔

UNREAL

About

Replicating UNREAL algorithm described in Google Deep Mind's paper "Reinforcement learning with unsupervised auxiliary tasks."

https://arxiv.org/pdf/1611.05397.pdf

Implemented with TensorFlow and DeepMind Lab environment.

Preview

seekavoid_arena_01

stairway_to_melon

nav_maze_static_01

Network

All weights of convolution layers and LSTM layer are shared.

Requirements

TensorFlow (Tested with r1.0)
DeepMind Lab
numpy
cv2
pygame
matplotlib

Result

"seekavoid_arena_01" Level

"nav_maze_static_01" Level

How to train

First, download and install DeepMind Lab

$ git clone https://github.com/deepmind/lab.git

Then build it following the build instruction. https://github.com/deepmind/lab/blob/master/docs/build.md

Clone this repo in lab directory.

$ cd lab
$ git clone https://github.com/miyosuda/unreal.git

Add this bazel instruction at the end of lab/BUILD file

package(default_visibility = ["//visibility:public"])

Then run bazel command to run training.

bazel run //unreal:train --define headless=glx

--define headlesss=glx uses GPU rendering and it requires display not to sleep. (We need to disable display sleep.)

If you have any trouble with GPU rendering, please use software rendering with --define headless=osmesa option.

How to show result

To show result after training, run this command.

bazel run //unreal:display --define headless=glx

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 390

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (8) 🔗