Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → xhujoy → Pysc2 Agents

xhujoy / Pysc2 Agents

This is a simple implementation of DeepMind's PySC2 RL agents.

Programming Languages

139335 projects - #7 most used programming language

Labels

reinforcement-learning a3c

Projects that are alternatives of or similar to Pysc2 Agents

MXNET + OpenAI Gym implementation of A3C from "Asynchronous Methods for Deep Reinforcement Learning"

Stars: ✭ 9 (-96.56%)

Mutual labels: reinforcement-learning, a3c

Reinforcement learning

Reinforcement learning tutorials

Stars: ✭ 82 (-68.7%)

Mutual labels: reinforcement-learning, a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Stars: ✭ 879 (+235.5%)

Mutual labels: reinforcement-learning, a3c

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Stars: ✭ 6,796 (+2493.89%)

Mutual labels: reinforcement-learning, a3c

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Stars: ✭ 2,051 (+682.82%)

Mutual labels: reinforcement-learning, a3c

My experimentations with Reinforcement Learning in Pytorch

Stars: ✭ 18 (-93.13%)

Mutual labels: reinforcement-learning, a3c

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Stars: ✭ 70 (-73.28%)

Mutual labels: reinforcement-learning, a3c

Deep Reinforcement Learning with pytorch & visdom

Stars: ✭ 745 (+184.35%)

Mutual labels: reinforcement-learning, a3c

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Stars: ✭ 145 (-44.66%)

Mutual labels: reinforcement-learning, a3c

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

Stars: ✭ 118 (-54.96%)

Mutual labels: reinforcement-learning, a3c

Super Mario Bros A3c Pytorch

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Stars: ✭ 775 (+195.8%)

Mutual labels: reinforcement-learning, a3c

A library for ready-made reinforcement learning agents and reusable components for neat prototyping

Stars: ✭ 184 (-29.77%)

Mutual labels: reinforcement-learning, a3c

Scalable, event-driven, deep-learning-friendly backtesting library

Stars: ✭ 765 (+191.98%)

Mutual labels: reinforcement-learning, a3c

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Stars: ✭ 904 (+245.04%)

Mutual labels: reinforcement-learning, a3c

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Stars: ✭ 6,948 (+2551.91%)

Mutual labels: reinforcement-learning, a3c

Policy Gradient Methods

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Stars: ✭ 54 (-79.39%)

Mutual labels: reinforcement-learning, a3c

A3C LSTM Atari with Pytorch plus A3G design

Stars: ✭ 482 (+83.97%)

Mutual labels: reinforcement-learning, a3c

Async deep reinforce

Asynchronous Methods for Deep Reinforcement Learning

Stars: ✭ 565 (+115.65%)

Mutual labels: reinforcement-learning, a3c

强化学习中文教程，在线阅读地址：https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+1046.56%)

Mutual labels: reinforcement-learning, a3c

Implementations of deep RL papers and random experimentation

Stars: ✭ 176 (-32.82%)

Mutual labels: reinforcement-learning, a3c

View All Similar Projects ➔

PySC2 agents

This is a simple implementation of DeepMind's PySC2 RL agents. In this project, the agents are defined according to the original paper, which use all feature maps and structured information to predict both actions and arguments via an A3C algorithm.

Requirements

PySC2 is a learning environment of StarCraft II provided by DeepMind. It provides an interface for RL agents to interact with StarCraft II, getting observations and sending actions. You can follow the tutorial in PySC2 repo to install it.

pip install s2clientprotocol==1.1
pip install pysc2==1.1

Python packages might miss: tensorflow and absl-py. If pip is set up on your system, it can be easily installed by running

pip install absl-py
pip install tensorflow-gpu

Getting Started

Clone this repo:

git clone https://github.com/xhujoy/pysc2-agents
cd pysc2-agents

Testing

Download the pretrained model from here and extract them to ./snapshot/.
Test the pretrained model:

python -m main --map=MoveToBeacon --training=False

You will get the following results for different maps.

	MoveToBeacon	CollectMineralShards	DefeatRoaches
Mean Score	~25	~62	~87
Max Score	31	97	371

Training

Train a model by yourself:

python -m main --map=MoveToBeacon

Notations

Different from the original A3C algorithm, we replace the policy penalty term with epsilon greedy exploration.
When train a model by yourself, you'd better to run several times and choose the best one. If you get better results than ours, it's grateful to share with us.

Licensed under The MIT License.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 262

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (6) 🔗