Alfredvc / Paac

Licence: other
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning

Programming Languages

python

Projects that are alternatives of or similar to Paac

Resources I Like
📚💯 Collection of learning resources I like
Stars: ✭ 280 (+42.86%)
Mutual labels:  learning, open-source
Amazon Sagemaker Examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Stars: ✭ 6,346 (+3137.76%)
Mutual labels:  learning, reinforcement-learning
Awesome Monte Carlo Tree Search Papers
A curated list of Monte Carlo tree search papers with implementations.
Stars: ✭ 387 (+97.45%)
Mutual labels:  learning, reinforcement-learning
Spot mini mini
Dynamics and Domain Randomized Gait Modulation with Bezier Curves for Sim-to-Real Legged Locomotion.
Stars: ✭ 426 (+117.35%)
Mutual labels:  open-source, reinforcement-learning
Artificialintelligenceengines
Computer code collated for use with Artificial Intelligence Engines book by JV Stone
Stars: ✭ 35 (-82.14%)
Mutual labels:  learning, reinforcement-learning
Learningx
Deep & Classical Reinforcement Learning + Machine Learning Examples in Python
Stars: ✭ 241 (+22.96%)
Mutual labels:  learning, reinforcement-learning
Hypatia
A JavaScript open source LMS (eLearning platform) for MOOCs and online courses
Stars: ✭ 478 (+143.88%)
Mutual labels:  learning, open-source
Opensourceresources
Free open-source learning resources related to web development, A to Z 🔥❤
Stars: ✭ 210 (+7.14%)
Mutual labels:  learning, open-source
Notebooks
Learn Python for free using open-source notebooks in Hebrew.
Stars: ✭ 877 (+347.45%)
Mutual labels:  learning, open-source
Awesome Ai Books
Some awesome AI-related books and PDFs for learning and downloading, along with some playground models for learning
Stars: ✭ 855 (+336.22%)
Mutual labels:  learning, reinforcement-learning
Contribute To Open Source
Learn the GitHub workflow by contributing code in a fun simulation project
Stars: ✭ 684 (+248.98%)
Mutual labels:  learning, open-source
Rhisis
Rhisis is an experimental FlyFF MMORPG emulator built with C# 9 and .NET 5
Stars: ✭ 132 (-32.65%)
Mutual labels:  learning, open-source
Doom Net Pytorch
Reinforcement learning models in ViZDoom environment
Stars: ✭ 113 (-42.35%)
Mutual labels:  learning, reinforcement-learning
Cherry
A PyTorch Library for Reinforcement Learning Research
Stars: ✭ 143 (-27.04%)
Mutual labels:  learning, reinforcement-learning
Naf Tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
Stars: ✭ 192 (-2.04%)
Mutual labels:  reinforcement-learning
Richdocuments
📔 Collabora Online for Nextcloud
Stars: ✭ 193 (-1.53%)
Mutual labels:  open-source
Uci Ml Api
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Stars: ✭ 190 (-3.06%)
Mutual labels:  learning
Elixir agent
New Relic's Open Source Elixir Agent
Stars: ✭ 191 (-2.55%)
Mutual labels:  open-source
Tagspaces
TagSpaces is an offline, open source, document manager with tagging support
Stars: ✭ 2,451 (+1150.51%)
Mutual labels:  open-source
Iblog
🖋 (issues blog) You're welcome to subscribe (watch 👀) and bookmark (star)
Stars: ✭ 194 (-1.02%)
Mutual labels:  learning

Efficient Parallel Methods for Deep Reinforcement Learning

This repository contains an open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning.

PAAC is a conceptually simple advantage actor-critic algorithm designed to run efficiently on a GPU, offering A3C-like performance in under 12 hours of training.
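PAAC steps a batch of environments in lock-step and computes n-step returns for all of them at once. Below is a minimal NumPy sketch of that batched return/advantage computation; the variable names and shapes are illustrative assumptions, not the repository's actual code.

```python
import numpy as np

# Batched n-step returns over parallel environments, the core quantity an
# advantage actor-critic method like PAAC needs. Names/shapes are assumptions.
n_envs, t_max, gamma = 4, 5, 0.99
rewards = np.random.rand(t_max, n_envs)      # r_t from each environment
values  = np.random.rand(t_max + 1, n_envs)  # V(s_t), plus a bootstrap V(s_T)
dones   = np.zeros((t_max, n_envs))          # 1.0 where an episode ended

returns = np.zeros((t_max, n_envs))
R = values[-1].copy()                        # bootstrap from the final state
for t in reversed(range(t_max)):
    R = rewards[t] + gamma * R * (1.0 - dones[t])
    returns[t] = R

advantages = returns - values[:-1]           # scales the policy-gradient term
```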

[GIFs: trained agents playing Breakout, Pong, Q*bert, and Space Invaders]

Running via Docker (recommended)

  1. Follow the instructions to install nvidia-docker
  2. Clone this repository
  3. Run the container with nvidia-docker run -it -v <absolute-path>/paac:/root/paac -p 6006:6006 alfredvc/tf1-ale.

A CPU version of the docker container is also provided and can be run with docker run -it -v <absolute-path>/paac:/root/paac -p 6006:6006 alfredvc/tf1-ale:cpu. When running on the CPU, pass the device flag -d '/cpu:0' to the training script.
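For context, this is roughly what a device string like '/cpu:0' maps to in TF1-style code (an illustrative sketch, not the repository's actual wiring):

```python
import tensorflow as tf  # TF1-style API, as in the tf1-ale image

device = '/cpu:0'  # e.g. the value passed with -d
with tf.device(device):      # pins the ops below to the chosen device
    x = tf.constant([1.0, 2.0])
    y = x * 2.0

with tf.Session() as sess:   # TF1 execution model
    print(sess.run(y))       # [2. 4.]
```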

Running locally

Requirements

Training the agent

To train an agent to play Pong, for example, run

  • python3 train.py -g pong -df logs/

For Pong, the agent begins to learn after about 5 million frames and reaches an optimal policy after about 15 million frames.

Training can be stopped, for example with Ctrl+C, and then resumed by running python3 train.py -g pong -df logs/ again.

On a setup with an Intel i7-4790K CPU and an Nvidia GTX 980 Ti GPU, the default settings yield around 3,000 timesteps (global steps) per second, so training for 80 million timesteps takes under 8 hours.
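As a quick sanity check of that figure:

```python
# Back-of-the-envelope check of the timing claim above.
steps_per_second = 3000        # reported throughput (global steps/s)
total_steps = 80_000_000       # training budget in timesteps
hours = total_steps / steps_per_second / 3600
print(f"~{hours:.1f} hours")   # ~7.4 hours, i.e. under 8 hours
```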

Qbert

[Figure: Q*bert learning curve (score vs. global training steps)]

Visualizing training

  1. Open a new terminal
  2. Attach to the running docker container with docker exec -it CONTAINER_NAME bash
  3. Run tensorboard --logdir=<absolute-path>/paac/logs/tf.
  4. In your browser navigate to localhost:6006/

If running locally, skip step 2.

Testing the agent

To test the performance of a trained agent, run python3 test.py -f logs/ -tc 5. Output:

Performed 5 tests for seaquest.
Mean: 1704.00
Min: 1680.00
Max: 1720.00
Std: 14.97
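
The reported statistics are straightforward to derive from per-episode scores. The sketch below uses hypothetical scores chosen to reproduce the sample output; test.py's internals may differ.

```python
import numpy as np

# Hypothetical per-episode scores consistent with the sample output above.
scores = np.array([1680.0, 1700.0, 1700.0, 1720.0, 1720.0])
print(f"Mean: {scores.mean():.2f}")  # 1704.00
print(f"Min: {scores.min():.2f}")    # 1680.00
print(f"Max: {scores.max():.2f}")    # 1720.00
print(f"Std: {scores.std():.2f}")    # 14.97
```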

Generating GIFs

GIFs can be generated from stored network weights. For example, a GIF of the agent playing Breakout can be generated with

python3 test.py -f pretrained/breakout/ -gn breakout

This may take a few minutes.

Pretrained models

Pretrained models for some games can be found here. These models can be used as starting points for training on the same game or on other games, or to generate GIFs.

Adapting the code

This codebase was designed to be easily adapted to new environments and new neural network architectures.

Adapting to a new environment

The codebase currently contains a single environment, namely atari_emulator.py. To train on a new environment, simply create a new class that inherits from BaseEnvironment, and modify environment_creator.py to create an instance of your new environment. A toy sketch of this pattern follows.
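The import path and method names below are assumptions about the BaseEnvironment interface, not its verified API; mirror whatever atari_emulator.py actually implements.

```python
import numpy as np
from environment import BaseEnvironment  # assumed import path; check the repo

class GridWorldEnvironment(BaseEnvironment):
    """Toy walk along a grid diagonal; method names are assumptions."""

    def __init__(self, actor_id, args):
        self.size = 8
        self.pos = 0

    def get_initial_state(self):
        # Return the first observation of a new episode.
        self.pos = 0
        return np.zeros((self.size, self.size), dtype=np.uint8)

    def next(self, action):
        # Advance one step; return (observation, reward, episode_over).
        self.pos = min(self.pos + 1, self.size - 1)
        obs = np.zeros((self.size, self.size), dtype=np.uint8)
        obs[self.pos, self.pos] = 255
        done = self.pos == self.size - 1
        return obs, (1.0 if done else 0.0), done

    def get_legal_actions(self):
        return [0, 1, 2, 3]
```

You would then register GridWorldEnvironment in environment_creator.py so train.py can instantiate it.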

Adapting to new neural network architectures

The codebase currently contains two neural network architectures: the architecture used in Playing Atari with Deep Reinforcement Learning and the architecture from Human-level control through deep reinforcement learning, both adapted to an actor-critic setting. To create a new architecture, follow the pattern demonstrated in NatureNetwork and NIPSNetwork, then create a new class that inherits from both PolicyVNetwork and YourNetwork, for example NewArchitecturePolicyVNetwork(PolicyVNetwork, YourNetwork), and use this class in train.py, as in the sketch below.
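A minimal sketch of that multiple-inheritance pattern (the import path is an assumption; copy the structure of NIPSNetwork or NatureNetwork for the actual feature layers):

```python
from policy_v_network import PolicyVNetwork  # assumed import path; check the repo

class YourNetwork(object):
    # Stand-in for your feature extractor; in practice this defines the
    # conv/fc stack, following the NIPSNetwork / NatureNetwork pattern.
    pass

class NewArchitecturePolicyVNetwork(PolicyVNetwork, YourNetwork):
    # Combines the policy/value heads from PolicyVNetwork with your
    # feature layers; point train.py at this class.
    pass
```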

Citing PAAC

If you use PAAC in your research, we ask that you please cite our paper:

@ARTICLE{2017arXiv170504862C,
  author        = {{Clemente}, A.~V. and {Castej{\'o}n}, H.~N. and {Chandra}, A.},
  title         = "{Efficient Parallel Methods for Deep Reinforcement Learning}",
  journal       = {ArXiv e-prints},
  archivePrefix = "arXiv",
  eprint        = {1705.04862},
  primaryClass  = "cs.LG",
  keywords      = {Computer Science - Learning},
  year          = 2017,
  month         = may,
  adsurl        = {http://adsabs.harvard.edu/abs/2017arXiv170504862C},
  adsnote       = {Provided by the SAO/NASA Astrophysics Data System}
}

The paper has been accepted as a poster to The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (citation to come).

Disclaimer

The code in this repository is not the code used to generate the results from the paper, but should give similar results. Some changes have been made:

  • Gradient clipping default value changed from 40.0 to 3.0.
  • Entropy regularization constant default changed from 0.01 to 0.02.
  • Using OpenAI Gym increases training time by about 33%, because converting the image from RGB to grayscale in Python is slow.