All Projects → qbx2 → Paac.pytorch

qbx2 / Paac.pytorch

Pytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/abs/1705.04862

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Paac.pytorch

Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (+100%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Rlenv.directory
Explore and find reinforcement learning environments in a list of 150+ open source environments.
Stars: ✭ 79 (+259.09%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Drlkit
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (+31.82%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
Stars: ✭ 628 (+2754.55%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Gym Gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Stars: ✭ 257 (+1068.18%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Rl algos
Reinforcement Learning Algorithms
Stars: ✭ 14 (-36.36%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Muzero General
MuZero
Stars: ✭ 1,187 (+5295.45%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Pytorch sac ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
Stars: ✭ 94 (+327.27%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Naf Tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
Stars: ✭ 192 (+772.73%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (+690.91%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+1690.91%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (+1118.18%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Rl Book
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (+2009.09%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Habitat Lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
Stars: ✭ 587 (+2568.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+3377.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Osim Rl
Reinforcement learning environments with musculoskeletal models
Stars: ✭ 763 (+3368.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Elegantrl
Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.
Stars: ✭ 575 (+2513.64%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+2809.09%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Super Mario Bros Ppo Pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Stars: ✭ 649 (+2850%)
Mutual labels:  gym, reinforcement-learning
Gibsonenv
Gibson Environments: Real-World Perception for Embodied Agents
Stars: ✭ 666 (+2927.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning

PAAC.pytorch

Pytorch implementation of the PAAC algorithm presented in "Efficient Parallel Methods for Deep Reinforcement Learning". PAAC is the abbreviation of Parallel Advantage Actor-Critic.

Currently, because the PAAC network is not using LSTM, the evaluation result is not very good. I'm working on the LSTM version of PAAC (waiting for a new graphic card due to lack of current gpu's memory.)

The original paper is here: https://arxiv.org/abs/1705.04862

Requirements

PAAC.pytorch requires torch, torchvision, PIL, gym.

Libraries used in this project:

Result (BreakoutDeterministic-v4 training log)

log

https://www.youtube.com/watch?v=6FMzNaL88wQ

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].