All Projects → Kaixhin → Dist-A3C

Kaixhin / Dist-A3C

Licence: MIT License
Distributed A3C

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Dist-A3C

playing-mario-with-deep-reinforcement-learning
An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.
Stars: ✭ 55 (+48.65%)
Mutual labels:  deep-reinforcement-learning
ddrl
Deep Developmental Reinforcement Learning
Stars: ✭ 27 (-27.03%)
Mutual labels:  deep-reinforcement-learning
wolpertinger ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.
Stars: ✭ 44 (+18.92%)
Mutual labels:  deep-reinforcement-learning
MaRLEnE
Machine- and Reinforcement Learning ExtensioN for (game) Engines
Stars: ✭ 47 (+27.03%)
Mutual labels:  deep-reinforcement-learning
DRL graph exploration
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs
Stars: ✭ 53 (+43.24%)
Mutual labels:  deep-reinforcement-learning
Deep-rl-mxnet
Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Stars: ✭ 26 (-29.73%)
Mutual labels:  deep-reinforcement-learning
rl-medical
Communicative Multiagent Deep Reinforcement Learning for Anatomical Landmark Detection using PyTorch.
Stars: ✭ 36 (-2.7%)
Mutual labels:  deep-reinforcement-learning
reinforcement-learning-papers
My notes on reinforcement learning papers
Stars: ✭ 13 (-64.86%)
Mutual labels:  deep-reinforcement-learning
DRL DeliveryDuel
Deep Reinforcement Learning applied to a modern 3D video-game environment called Delivery Duel.
Stars: ✭ 30 (-18.92%)
Mutual labels:  deep-reinforcement-learning
snake-reinforcement-DNN
Developing a deep neural network to play a snake game
Stars: ✭ 12 (-67.57%)
Mutual labels:  deep-reinforcement-learning
Deep-Q-Networks
Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym
Stars: ✭ 38 (+2.7%)
Mutual labels:  deep-reinforcement-learning
dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
Stars: ✭ 20 (-45.95%)
Mutual labels:  deep-reinforcement-learning
Object-Goal-Navigation
Pytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"
Stars: ✭ 107 (+189.19%)
Mutual labels:  deep-reinforcement-learning
deep-rl-quadcopter
Implementation of Deep Deterministic Policy Gradients (DDPG) to teach a Quadcopter How to Fly!
Stars: ✭ 17 (-54.05%)
Mutual labels:  deep-reinforcement-learning
catalyst-examples
Examples
Stars: ✭ 54 (+45.95%)
Mutual labels:  deep-reinforcement-learning
FLEXS
Fitness landscape exploration sandbox for biological sequence design.
Stars: ✭ 92 (+148.65%)
Mutual labels:  deep-reinforcement-learning
rl pytorch
Deep Reinforcement Learning Algorithms Implementation in PyTorch
Stars: ✭ 23 (-37.84%)
Mutual labels:  deep-reinforcement-learning
DeepBeerInventory-RL
The code for the SRDQN algorithm to train an agent for the beer game problem
Stars: ✭ 27 (-27.03%)
Mutual labels:  deep-reinforcement-learning
Deep-Reinforcement-Learning
Introduction to Deep Reinforcement Learning
Stars: ✭ 71 (+91.89%)
Mutual labels:  deep-reinforcement-learning
Reinforcement Learning
Research repo of RL
Stars: ✭ 20 (-45.95%)
Mutual labels:  deep-reinforcement-learning

Dist-A3C

MIT License

TODO: Have server use mp - one thread for server, one for testing. Keep counter to know once finished. Also be able to send push notifications to kill running clients once counter done.

Distributed asynchronous advantage actor-critic (A3C) [1] with generalised advantage estimation (GAE) [2]. Run python server.py <options> to start the server and python client.py <options> for as many clients as wanted.

Requirements

To install all dependencies with Anaconda run conda env create -f environment.yml and use source activate dista3c to activate the environment.

Acknowledgements

References

[1] Asynchronous Methods for Deep Reinforcement Learning
[2] High-Dimensional Continuous Control Using Generalized Advantage Estimation

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].