rmst / rtrl

License: MIT
PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)

Programming Languages

python
Dockerfile
shell

Projects that are alternatives to or similar to rtrl

Meta-Learning-for-StarCraft-II-Minigames
We reproduced DeepMind's results and implemented a meta-learning (MLSH) agent that can generalize across minigames.
Stars: ✭ 26 (-54.39%)
Mutual labels:  deep-reinforcement-learning
CrowdNav DSRNN
[ICRA 2021] Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning
Stars: ✭ 43 (-24.56%)
Mutual labels:  deep-reinforcement-learning
datascience-mashup
In this repo I will try to gather all of the projects related to data science with clean datasets and high accuracy models to solve real world problems.
Stars: ✭ 36 (-36.84%)
Mutual labels:  deep-reinforcement-learning
minerva
An out-of-the-box GUI tool for offline deep reinforcement learning
Stars: ✭ 80 (+40.35%)
Mutual labels:  deep-reinforcement-learning
code summarization public
source code for 'Improving automatic source code summarization via deep reinforcement learning'
Stars: ✭ 71 (+24.56%)
Mutual labels:  deep-reinforcement-learning
muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Stars: ✭ 126 (+121.05%)
Mutual labels:  deep-reinforcement-learning
drift drl
High-speed Autonomous Drifting with Deep Reinforcement Learning
Stars: ✭ 82 (+43.86%)
Mutual labels:  deep-reinforcement-learning
pokeai
Develop ultimate AI Pokémon trainer
Stars: ✭ 18 (-68.42%)
Mutual labels:  deep-reinforcement-learning
MP-DQN
Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"
Stars: ✭ 99 (+73.68%)
Mutual labels:  deep-reinforcement-learning
DDPG
End to End Mobile Robot Navigation using DDPG (Continuous Control with Deep Reinforcement Learning) based on Tensorflow + Gazebo
Stars: ✭ 41 (-28.07%)
Mutual labels:  deep-reinforcement-learning
alpha sigma
A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Stars: ✭ 134 (+135.09%)
Mutual labels:  deep-reinforcement-learning
Carla-ppo
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Stars: ✭ 122 (+114.04%)
Mutual labels:  deep-reinforcement-learning
neural-mpc
No description or website provided.
Stars: ✭ 54 (-5.26%)
Mutual labels:  deep-reinforcement-learning
AI
Deep reinforcement learning for visual tracking and visual navigation.
Stars: ✭ 16 (-71.93%)
Mutual labels:  deep-reinforcement-learning
drl grasping
Deep Reinforcement Learning for Robotic Grasping from Octrees
Stars: ✭ 160 (+180.7%)
Mutual labels:  deep-reinforcement-learning
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+63.16%)
Mutual labels:  deep-reinforcement-learning
RL
Reinforcement Learning Demos
Stars: ✭ 66 (+15.79%)
Mutual labels:  deep-reinforcement-learning
AutoPentest-DRL
AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning
Stars: ✭ 196 (+243.86%)
Mutual labels:  deep-reinforcement-learning
godpaper
🐵 An AI chess-board-game framework(by many programming languages) implementations.
Stars: ✭ 40 (-29.82%)
Mutual labels:  deep-reinforcement-learning
AI booklet CE-AUT
Booklet and exams of the Artificial Intelligence Master's degree at Amirkabir University of Technology.
Stars: ✭ 14 (-75.44%)
Mutual labels:  deep-reinforcement-learning

Real-Time Reinforcement Learning

This repo accompanies our paper "Real-Time Reinforcement Learning" (https://arxiv.org/abs/1911.04448).

(Figure: Traditional Reinforcement Learning vs. Real-Time Reinforcement Learning interaction loops)

Getting Started

This repo can be pip-installed via

pip install git+https://github.com/rmst/rtrl.git

To train an RTAC agent on the basic Pendulum-v0 task, run

python -m rtrl run rtrl:RtacTraining Env.id=Pendulum-v0
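
The positional argument uses Python's module:attribute notation to name the training spec, and trailing Key=value pairs override nested configuration fields (here the environment id). For intuition, here is a minimal sketch of how such a spec string can be resolved; this is purely illustrative and not necessarily how the repo parses it:

import importlib

def resolve(spec):
    # Resolve a 'module:attribute' string, e.g. 'rtrl:RtacTraining',
    # to the Python object it names.
    module_name, attribute = spec.split(':')
    return getattr(importlib.import_module(module_name), attribute)

print(resolve('collections:OrderedDict'))  # works for any installed module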

MuJoCo Experiments

To install MuJoCo, follow the instructions at openai/gym or have a look at our Dockerfile. The following environments were used in the paper.

(Image: the MuJoCo environments used in the paper)

To train an RTAC agent on HalfCheetah-v2, run

python -m rtrl run rtrl:RtacTraining Env.id=HalfCheetah-v2

To train a SAC agent on Ant-v2 with a real-time wrapper (i.e. the RTMDP construction from the paper), run

python -m rtrl run rtrl:SacTraining Env.id=Ant-v2 Env.real_time=True
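
For intuition about what Env.real_time=True does: in the paper's RTMDP construction, the action selected at one step only takes effect at the next, and the currently pending action becomes part of the state. Below is a minimal gym-style sketch of that idea, illustrative rather than the repo's actual wrapper:

import gym
import numpy as np

class RealTimeWrapper(gym.Wrapper):
    # Illustrative RTMDP wrapper: actions are applied with a one-step
    # delay and the pending action is appended to the observation.

    def reset(self, **kwargs):
        obs = self.env.reset(**kwargs)
        self.pending_action = np.zeros(self.env.action_space.shape)
        return obs, self.pending_action

    def step(self, action):
        # The environment executes the previously selected action; the
        # action passed in now will only be applied on the next step.
        obs, reward, done, info = self.env.step(self.pending_action)
        self.pending_action = action
        return (obs, action), reward, done, info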

Avenue Experiments

Avenue (Ibrahim et al., 2019) can be pip-installed via

pip install git+https://github.com/elementai/avenue.git

To train an RTAC agent to drive on a race track, run

python -m rtrl run rtrl:RtacAvenueTraining Env.id=RaceSolo-v0

Note that this requires a lot of resources, especially memory (16GB+).

Storing Stats

python -m rtrl run just prints stats to stdout. To save stats, use the following instead.

python -m rtrl run-fs experiment-1 rtrl:RtacTraining Env.id=Pendulum-v0

Stats are generated and printed every round but only saved to disk every epoch. The stats will be saved as pickled pandas dataframes in experiment-1/stats.
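
Because the stats are plain pickled pandas dataframes, they can be inspected with pandas directly. A minimal sketch; the exact file names inside experiment-1/stats depend on the run, so the path below is only illustrative:

import pandas as pd

# Load one saved stats dataframe (illustrative file name).
stats = pd.read_pickle('experiment-1/stats/0')
print(stats.tail())  # metrics from the most recent rounds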

Checkpointing

This repo supports checkpointing: every epoch, the whole run object (e.g. an instance of rtrl.training:Training) is pickled to disk and reloaded, which ensures reproducibility.

You can manually load and inspect pickled run instances with the standard pickle:load or the more convenient rtrl:load. For example, to look at the first transition in a SAC agent's replay memory, run

import rtrl

# Load the pickled run object saved at the experiment's state path
# and print the first transition stored in the agent's replay memory.
run = rtrl.load('experiment-1/state')
print(run.agent.memory[0])