Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → Kaixhin → Dist-A3C

Kaixhin / Dist-A3C

Licence: MIT License

Distributed A3C

Programming Languages

139335 projects - #7 most used programming language

Labels

deep-learning deep-reinforcement-learning

Projects that are alternatives of or similar to Dist-A3C

playing-mario-with-deep-reinforcement-learning

An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.

Stars: ✭ 55 (+48.65%)

Mutual labels: deep-reinforcement-learning

Deep Developmental Reinforcement Learning

Stars: ✭ 27 (-27.03%)

Mutual labels: deep-reinforcement-learning

wolpertinger ddpg

Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.

Stars: ✭ 44 (+18.92%)

Mutual labels: deep-reinforcement-learning

Machine- and Reinforcement Learning ExtensioN for (game) Engines

Stars: ✭ 47 (+27.03%)

Mutual labels: deep-reinforcement-learning

DRL graph exploration

Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs

Stars: ✭ 53 (+43.24%)

Mutual labels: deep-reinforcement-learning

Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO

Stars: ✭ 26 (-29.73%)

Mutual labels: deep-reinforcement-learning

Communicative Multiagent Deep Reinforcement Learning for Anatomical Landmark Detection using PyTorch.

Stars: ✭ 36 (-2.7%)

Mutual labels: deep-reinforcement-learning

reinforcement-learning-papers

My notes on reinforcement learning papers

Stars: ✭ 13 (-64.86%)

Mutual labels: deep-reinforcement-learning

DRL DeliveryDuel

Deep Reinforcement Learning applied to a modern 3D video-game environment called Delivery Duel.

Stars: ✭ 30 (-18.92%)

Mutual labels: deep-reinforcement-learning

snake-reinforcement-DNN

Developing a deep neural network to play a snake game

Stars: ✭ 12 (-67.57%)

Mutual labels: deep-reinforcement-learning

Deep-Q-Networks

Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym

Stars: ✭ 38 (+2.7%)

Mutual labels: deep-reinforcement-learning

NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.

Stars: ✭ 20 (-45.95%)

Mutual labels: deep-reinforcement-learning

Object-Goal-Navigation

Pytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"

Stars: ✭ 107 (+189.19%)

Mutual labels: deep-reinforcement-learning

deep-rl-quadcopter

Implementation of Deep Deterministic Policy Gradients (DDPG) to teach a Quadcopter How to Fly!

Stars: ✭ 17 (-54.05%)

Mutual labels: deep-reinforcement-learning

catalyst-examples

Examples

Stars: ✭ 54 (+45.95%)

Mutual labels: deep-reinforcement-learning

Fitness landscape exploration sandbox for biological sequence design.

Stars: ✭ 92 (+148.65%)

Mutual labels: deep-reinforcement-learning

Deep Reinforcement Learning Algorithms Implementation in PyTorch

Stars: ✭ 23 (-37.84%)

Mutual labels: deep-reinforcement-learning

DeepBeerInventory-RL

The code for the SRDQN algorithm to train an agent for the beer game problem

Stars: ✭ 27 (-27.03%)

Mutual labels: deep-reinforcement-learning

Deep-Reinforcement-Learning

Introduction to Deep Reinforcement Learning

Stars: ✭ 71 (+91.89%)

Mutual labels: deep-reinforcement-learning

Reinforcement Learning

Research repo of RL

Stars: ✭ 20 (-45.95%)

Mutual labels: deep-reinforcement-learning

View All Similar Projects ➔

Dist-A3C

TODO: Have server use mp - one thread for server, one for testing. Keep counter to know once finished. Also be able to send push notifications to kill running clients once counter done.

Distributed asynchronous advantage actor-critic (A3C) [1] with generalised advantage estimation (GAE) [2]. Run python server.py <options> to start the server and python client.py <options> for as many clients as wanted.

Requirements

To install all dependencies with Anaconda run conda env create -f environment.yml and use source activate dista3c to activate the environment.

Acknowledgements

@ikostrikov for pytorch-a3c

References

[1] Asynchronous Methods for Deep Reinforcement Learning
[2] High-Dimensional Continuous Control Using Generalized Advantage Estimation

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 37

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗