All Projects → chris-chris → Pysc2 Examples

chris-chris / Pysc2 Examples

Licence: apache-2.0
StarCraft II - pysc2 Deep Reinforcement Learning Examples

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Pysc2 Examples

Learning To Communicate Pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Stars: ✭ 236 (-67.31%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deepmind
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (-33.52%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+296.54%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
2048 Deep Reinforcement Learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
Stars: ✭ 169 (-76.59%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Habitat Lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
Stars: ✭ 587 (-18.7%)
Mutual labels:  ai, reinforcement-learning, deep-reinforcement-learning
Reinforcementlearning.jl
A reinforcement learning package for Julia
Stars: ✭ 192 (-73.41%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Animalai Olympics
Code repository for the Animal AI Olympics competition
Stars: ✭ 544 (-24.65%)
Mutual labels:  ai, reinforcement-learning, deep-reinforcement-learning
Mujocounity
Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).
Stars: ✭ 47 (-93.49%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deepmind
Rad
RAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-62.88%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
reinforce-js
[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
Stars: ✭ 20 (-97.23%)
Mutual labels:  deep-reinforcement-learning, deepmind, deep-q-network
Accel Brain Code
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
Stars: ✭ 166 (-77.01%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Stars: ✭ 346 (-52.08%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Advanced Deep Learning And Reinforcement Learning Deepmind
🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉
Stars: ✭ 121 (-83.24%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deepmind
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (-67.73%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Rl Course Experiments
Stars: ✭ 73 (-89.89%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Pygame Learning Environment
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
Stars: ✭ 828 (+14.68%)
Mutual labels:  ai, reinforcement-learning, deep-reinforcement-learning
Deep Q Learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Stars: ✭ 1,013 (+40.3%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Ml In Tf
Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!
Stars: ✭ 45 (-93.77%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deepmind
Rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Stars: ✭ 980 (+35.73%)
Mutual labels:  ai, reinforcement-learning, deep-reinforcement-learning
Reinforcement Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Stars: ✭ 3,329 (+361.08%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deepmind

StartCraft II Reinforcement Learning Examples

This example program was built on

  • pysc2 (Deepmind) [https://github.com/deepmind/pysc2]
  • baselines (OpenAI) [https://github.com/openai/baselines]
  • s2client-proto (Blizzard) [https://github.com/Blizzard/s2client-proto]
  • Tensorflow 1.3 (Google) [https://github.com/tensorflow/tensorflow]

Current examples

Minimaps

  • CollectMineralShards with Deep Q Network

CollectMineralShards

Quick Start Guide

1. Get PySC2

PyPI

The easiest way to get PySC2 is to use pip:

$ pip install git+https://github.com/deepmind/pysc2

Also, you have to install baselines library.

$ pip install git+https://github.com/openai/baselines

2. Install StarCraft II

Mac / Win

You have to purchase StarCraft II and install it. Or even the Starter Edition will work.

http://us.battle.net/sc2/en/legacy-of-the-void/

Linux Packages

Follow Blizzard's documentation to get the linux version. By default, PySC2 expects the game to live in ~/StarCraftII/.

3. Download Maps

Download the ladder maps and the mini games and extract them to your StarcraftII/Maps/ directory.

4. Train it!

$ python train_mineral_shards.py --algorithm=a2c

5. Enjoy it!

$ python enjoy_mineral_shards.py

4-1. Train it with DQN

$ python train_mineral_shards.py --algorithm=deepq --prioritized=True --dueling=True --timesteps=2000000 --exploration_fraction=0.2

4-2. Train it with A2C(A3C)

$ python train_mineral_shards.py --algorithm=a2c --num_agents=2 --num_scripts=2 --timesteps=2000000
Description Default Parameter Type
map Gym Environment CollectMineralShards string
log logging type : tensorboard, stdout tensorboard string
algorithm Currently, support 2 algorithms : deepq, a2c a2c string
timesteps Total training steps 2000000 int
exploration_fraction exploration fraction 0.5 float
prioritized Whether using prioritized replay for DQN False boolean
dueling Whether using dueling network for DQN False boolean
lr learning rate (if 0 set random e-5 ~ e-3) 0.0005 float
num_agents number of agents for A2C 4 int
num_scripts number of scripted agents for A2C 4 int
nsteps number of steps for update policy 20 int
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].