qbx2 / Paac.pytorch

PyTorch implementation of the PAAC algorithm presented in "Efficient Parallel Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1705.04862

Labels

pytorch reinforcement-learning deep-reinforcement-learning gym

PAAC.pytorch

PyTorch implementation of the PAAC algorithm presented in "Efficient Parallel Methods for Deep Reinforcement Learning". PAAC stands for Parallel Advantage Actor-Critic.
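For readers new to the idea, here is a minimal sketch of the kind of return and advantage computation a parallel advantage actor-critic method relies on. It is not taken from this repository: the function names, the list-of-lists layout (`rewards[t][e]` for time step `t` and environment `e`), and the `gamma` default are all assumptions made for illustration, and it uses plain Python rather than torch so it runs without any dependencies.

```python
from typing import List, Sequence


def n_step_returns(
    rewards: Sequence[Sequence[float]],
    dones: Sequence[Sequence[bool]],
    bootstrap_values: Sequence[float],
    gamma: float = 0.99,
) -> List[List[float]]:
    """Discounted returns for a batch of environments stepped in lockstep.

    rewards[t][e] and dones[t][e] describe step t of environment e.
    bootstrap_values[e] is the critic's value estimate for the state
    reached after the last step in environment e (hypothetical layout).
    """
    n_envs = len(bootstrap_values)
    running = list(bootstrap_values)
    returns = [[0.0] * n_envs for _ in rewards]
    # Walk backwards in time, accumulating the discounted sum per environment.
    for t in reversed(range(len(rewards))):
        for e in range(n_envs):
            if dones[t][e]:
                # The episode ended at this step: nothing to bootstrap from.
                running[e] = 0.0
            running[e] = rewards[t][e] + gamma * running[e]
            returns[t][e] = running[e]
    return returns


def advantages(
    returns: Sequence[Sequence[float]],
    values: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Advantage = return minus the critic's value estimate, element-wise."""
    return [[r - v for r, v in zip(r_row, v_row)]
            for r_row, v_row in zip(returns, values)]


if __name__ == "__main__":
    demo = n_step_returns(
        rewards=[[1.0, 0.0], [1.0, 1.0]],
        dones=[[False, False], [False, True]],
        bootstrap_values=[2.0, 5.0],
        gamma=0.5,
    )
    print(demo)
```

In an actual training loop the advantages would weight the policy-gradient term and the returns would serve as targets for the value head; those parts are left out here.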

Because the PAAC network does not currently use an LSTM, the evaluation results are not very good. I'm working on an LSTM version of PAAC (waiting for a new graphics card, since the current GPU does not have enough memory).

The original paper is here: https://arxiv.org/abs/1705.04862

Requirements

PAAC.pytorch requires torch, torchvision, PIL (Pillow), and gym.

Libraries used in this project:

torch==0.1.12+32e6665
torchvision==0.1.8
Pillow==4.1.1
[email protected]
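A quick way to check whether the required packages are importable before running anything is sketched below. The helper name `missing_modules` is hypothetical, and the import names (`torch`, `torchvision`, `PIL`, `gym`) are assumed from the requirements above; the script only checks that each module can be found, not that its version matches the ones listed.

```python
import importlib.util
from typing import List, Sequence

# Import names assumed from the requirements list (Pillow is imported as PIL).
REQUIRED_MODULES = ("torch", "torchvision", "PIL", "gym")


def missing_modules(names: Sequence[str] = REQUIRED_MODULES) -> List[str]:
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All required modules are importable.")
```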

Result (BreakoutDeterministic-v4 training log)

https://www.youtube.com/watch?v=6FMzNaL88wQ

Stars: ✭ 22