datamllab / Rlcard

License: MIT
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Programming Languages

python

Projects that are alternatives of or similar to Rlcard

Pygame Learning Environment
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
Stars: ✭ 828 (-15.51%)
Mutual labels:  ai, game, reinforcement-learning, deep-reinforcement-learning
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+309.39%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Neuron poker
Texas Hold'em OpenAI Gym poker environment with reinforcement learning based on keras-rl. Includes virtual rendering and Monte Carlo equity calculation.
Stars: ✭ 299 (-69.49%)
Mutual labels:  poker, reinforcement-learning, openai-gym
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (-54.9%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Basic reinforcement learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Stars: ✭ 826 (-15.71%)
Mutual labels:  ai, reinforcement-learning, openai-gym
Snake Ai Reinforcement
AI for Snake game trained from pixels using Deep Reinforcement Learning (DQN).
Stars: ✭ 123 (-87.45%)
Mutual labels:  ai, game, reinforcement-learning
Aigames
use AI to play some games.
Stars: ✭ 422 (-56.94%)
Mutual labels:  ai, game, reinforcement-learning
Awesome Deep Rl
For deep RL and the future of AI.
Stars: ✭ 985 (+0.51%)
Mutual labels:  game, reinforcement-learning, deep-reinforcement-learning
Rl a3c pytorch
A3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (-50.82%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Animalai Olympics
Code repository for the Animal AI Olympics competition
Stars: ✭ 544 (-44.49%)
Mutual labels:  ai, reinforcement-learning, deep-reinforcement-learning
Habitat Lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
Stars: ✭ 587 (-40.1%)
Mutual labels:  ai, reinforcement-learning, deep-reinforcement-learning
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (-21.94%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Cartpole
OpenAI's cartpole env solver.
Stars: ✭ 107 (-89.08%)
Mutual labels:  ai, reinforcement-learning, openai-gym
Snake
Artificial intelligence for the Snake game.
Stars: ✭ 1,241 (+26.63%)
Mutual labels:  game, reinforcement-learning, deep-reinforcement-learning
Super Mario Bros Ppo Pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Stars: ✭ 649 (-33.78%)
Mutual labels:  ai, reinforcement-learning, openai-gym
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (-59.8%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Hierarchical Actor Critic Hac Pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Stars: ✭ 116 (-88.16%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Reinforcementlearning Atarigame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. It also uses Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm, which is much more efficient than DQN and supersedes it. Can play many games.
Stars: ✭ 118 (-87.96%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Rl Book
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (-52.65%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (-34.69%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym

RLCard: A Toolkit for Reinforcement Learning in Card Games

Logo


RLCard is a toolkit for Reinforcement Learning (RL) in card games. It supports multiple card environments with easy-to-use interfaces. The goal of RLCard is to bridge reinforcement learning and imperfect information games. RLCard is developed by DATA Lab at Texas A&M University and community contributors.

Community:

  • Slack: Discuss in our #rlcard-project slack channel.
  • QQ Group: Join our QQ group 665647450. Password: rlcardqqgroup

News:

  • Our package is used in PettingZoo. Please check it out!
  • We have released RLCard-Showdown, a GUI demo for RLCard. Please check it out here!
  • Jupyter Notebook tutorial available! We have also added some examples in R that call the Python interfaces of RLCard with reticulate. See here.
  • Thanks to @Clarit7 for contributing support for a configurable number of players in Blackjack. We welcome contributions that gradually make the games more configurable. See here for more details.
  • Thanks to @Clarit7 for contributing the Blackjack and Limit Hold'em human interfaces.
  • RLCard now supports environment-local seeding and multiprocessing. Thanks to @weepingwillowben for providing the testing scripts.
  • A human interface for No-Limit Hold'em is available, and its action space has been abstracted. Thanks to @AdrianP- for the contribution.
  • The new game Gin Rummy and its human GUI are available. Thanks to @billh0420 for the contribution.
  • A PyTorch implementation is available. Thanks to @mjudell for the contribution.

Cite this work

If you find this repo useful, you may cite:

@article{zha2019rlcard,
  title={RLCard: A Toolkit for Reinforcement Learning in Card Games},
  author={Zha, Daochen and Lai, Kwei-Herng and Cao, Yuanpu and Huang, Songyi and Wei, Ruzhe and Guo, Junyu and Hu, Xia},
  journal={arXiv preprint arXiv:1910.04376},
  year={2019}
}

Installation

Make sure that you have Python 3.5+ and pip installed. We recommend installing the latest version of rlcard from source:

git clone https://github.com/datamllab/rlcard.git
cd rlcard
pip install -e .

Alternatively, you can install the latest stable version with:

pip install rlcard

The default installation only includes the card environments. To use the TensorFlow implementations of the example algorithms, install the supported version of TensorFlow with:

pip install rlcard[tensorflow]

To try PyTorch implementations, please run:

pip install rlcard[torch]

If you meet any problems when installing PyTorch with the command above, you may follow the instructions on PyTorch official website to manually install PyTorch.

We also provide a conda installation method:

conda install -c toubun rlcard

The conda installation only includes the card environments; you need to manually install TensorFlow or PyTorch depending on your needs.

Examples

Please refer to examples/. A short example is shown below.

import rlcard
from rlcard.agents import RandomAgent

env = rlcard.make('blackjack')
env.set_agents([RandomAgent(action_num=env.action_num)])

trajectories, payoffs = env.run()
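
For games with more than one player, the same pattern applies. Below is a minimal sketch (not one of the repository's examples) that uses only the interfaces shown above, with Leduc Hold'em as an arbitrary choice and one RandomAgent per player:

import rlcard
from rlcard.agents import RandomAgent

# Leduc Hold'em is a two-player game; create one random agent per player.
env = rlcard.make('leduc-holdem')
env.set_agents([RandomAgent(action_num=env.action_num)
                for _ in range(env.player_num)])

# Run one complete game and inspect the outcome.
trajectories, payoffs = env.run()
print('Payoffs:', payoffs)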

We also recommend the toy examples in Python under examples/.

R examples can be found here.

Demo

Run examples/leduc_holdem_human.py to play with the pre-trained Leduc Hold'em model. Leduc Hold'em is a simplified version of Texas Hold'em. Rules can be found here.

>> Leduc Hold'em pre-trained model

>> Start a new game!
>> Agent 1 chooses raise

=============== Community Card ===============
┌─────────┐
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
└─────────┘
===============   Your Hand    ===============
┌─────────┐
│J        │
│         │
│         │
│    ♥    │
│         │
│         │
│        J│
└─────────┘
===============     Chips      ===============
Yours:   +
Agent 1: +++
=========== Actions You Can Choose ===========
0: call, 1: raise, 2: fold

>> You choose action (integer):

We also provide a GUI for easy debugging. Please check here. Some demos:

doudizhu-replay leduc-replay

Available Environments

We provide a complexity estimation for the games along several dimensions. InfoSet Number: the number of information sets; InfoSet Size: the average number of states in a single information set; Action Size: the size of the action space; Name: the name that should be passed to rlcard.make to create the game environment. We also provide links to the documentation and to a random-agent example for each game.

Game                                 | InfoSet Number | InfoSet Size | Action Size | Name            | Usage
Blackjack (wiki, baike)              | 10^3           | 10^1         | 10^0        | blackjack       | doc, example
Leduc Hold'em (paper)                | 10^2           | 10^2         | 10^0        | leduc-holdem    | doc, example
Limit Texas Hold'em (wiki, baike)    | 10^14          | 10^3         | 10^0        | limit-holdem    | doc, example
Dou Dizhu (wiki, baike)              | 10^53 ~ 10^83  | 10^23        | 10^4        | doudizhu        | doc, example
Simple Dou Dizhu (wiki, baike)       | -              | -            | -           | simple-doudizhu | doc, example
Mahjong (wiki, baike)                | 10^121         | 10^48        | 10^2        | mahjong         | doc, example
No-limit Texas Hold'em (wiki, baike) | 10^162         | 10^3         | 10^4        | no-limit-holdem | doc, example
UNO (wiki, baike)                    | 10^163         | 10^10        | 10^1        | uno             | doc, example
Gin Rummy (wiki, baike)              | 10^52          | -            | -           | gin-rummy       | doc, example

API Cheat Sheet

How to create an environment

You can use the following interface to make an environment. You may optionally specify some configurations with a dictionary; a short example follows the list below.

  • env = rlcard.make(env_id, config={}): Make an environment. env_id is a string identifying the environment; config is a dictionary that specifies some environment configurations, which are as follows.
    • seed: Default None. Set an environment-local random seed for reproducing results.
    • env_num: Default 1. Specifies how many environments run in parallel. If the number is larger than 1, the tasks will be assigned to multiple processes for acceleration.
    • allow_step_back: Default False. If True, the step_back function can be used to traverse backward in the game tree.
    • allow_raw_data: Default False. If True, raw data is included in the state.
    • single_agent_mode: Default False. If True, the environment runs in single-agent mode, i.e., a Gym-style interface where the other players are pretrained or rule-based models.
    • active_player: Default 0. If single_agent_mode is True, active_player specifies which player the single agent controls.
    • record_action: Default False. If True, a field action_record is included in the state to record the historical actions. This can be used for human-agent play.
    • Game-specific configurations: These fields start with game_. Currently, we only support game_player_num in Blackjack.
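
As an illustration only (the option values below are arbitrary, not recommendations), an environment could be created with several of these fields set:

import rlcard

# Each key below is one of the documented configuration fields.
config = {
    'seed': 42,                # environment-local random seed
    'allow_step_back': True,   # enable env.step_back()
    'allow_raw_data': True,    # include raw observations and raw legal actions in the state
    'record_action': True,     # keep a record of historical actions in the state
}
env = rlcard.make('leduc-holdem', config=config)

# Game-specific configuration: currently only game_player_num in Blackjack.
blackjack_env = rlcard.make('blackjack', config={'game_player_num': 2})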

Once the environment is made, we can access some information about the game (a short example follows the list below).

  • env.action_num: The number of actions.
  • env.player_num: The number of players.
  • env.state_space: The state space of the observations.
  • env.timestep: The number of timesteps stepped by the environment.
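
For example, these attributes can be read directly once the environment is created:

import rlcard

env = rlcard.make('blackjack')
print(env.action_num)   # number of actions
print(env.player_num)   # number of players
print(env.state_space)  # state space of the observations
print(env.timestep)     # number of timesteps stepped so far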

What is state in RLCard

State is a Python dictionary. It will always have observation state['obs'] and legal actions state['legal_actions']. If allow_raw_data is True, state will also have raw observation state['raw_obs'] and raw legal actions state['raw_legal_actions'].
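
A quick way to see these fields is to reset an environment and inspect the returned state. The sketch below assumes allow_raw_data is enabled so that the raw fields are present:

import rlcard

env = rlcard.make('leduc-holdem', config={'allow_raw_data': True})
state, player_id = env.reset()

print(sorted(state.keys()))        # should include 'obs', 'legal_actions', 'raw_obs', 'raw_legal_actions'
print(state['legal_actions'])      # legal actions encoded as integers
print(state['raw_legal_actions'])  # the same actions as human-readable strings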

Basic interfaces

The following interfaces provide basic usage. They are easy to use, but they make assumptions about the agent: the agent must follow the agent template (a rough sketch of such an agent appears after the list below).

  • env.set_agents(agents): agents is a list of Agent objects. The length of the list should be equal to the number of players in the game.
  • env.run(is_training=False): Run a complete game and return trajectories and payoffs. This function can only be used after set_agents is called. If is_training is True, the agents' step functions will be used to play the game; if is_training is False, eval_step will be called instead.
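
The authoritative agent template is defined in the repository documentation. As a rough sketch only (the use_raw flag and the exact return signatures are assumptions here), an agent that plays a random legal action could look like this:

import random

class ToyRandomAgent:
    ''' A toy agent in the spirit of the agent template: it exposes
    step() for training and eval_step() for evaluation.
    '''
    use_raw = False  # assumption: flags whether the agent consumes raw states

    def step(self, state):
        # Used when is_training=True: pick any legal action.
        return random.choice(state['legal_actions'])

    def eval_step(self, state):
        # Used when is_training=False: a real agent would act greedily here.
        # In the repository's template, eval_step may also return auxiliary
        # information such as action probabilities.
        return self.step(state)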

Advanced interfaces

For advanced usage, the following interfaces allow flexible operations on the game tree. These interfaces do not make any assumptions about the agent (a sketch of a manual game loop built from them follows the list below).

  • env.reset(): Initialize a game. Return the state and the first player ID.
  • env.step(action, raw_action=False): Take one step in the environment. action can be a raw action (string) or an integer; raw_action should be True if the action is a raw action.
  • env.step_back(): Available only when allow_step_back is True. Take one step backward. This can be used by algorithms that operate on the game tree, such as CFR (chance sampling).
  • env.is_over(): Return True if the current game is over; otherwise, return False.
  • env.get_player_id(): Return the player ID of the current player.
  • env.get_state(player_id): Return the state that corresponds to player_id.
  • env.get_payoffs(): At the end of the game, return a list of payoffs for all the players.
  • env.get_perfect_information(): (Currently only supported for some of the games) Obtain the perfect information of the current state.
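
As a sketch of how these calls fit together, the loop below steps through one game manually instead of calling env.run(); the random choice stands in for a real policy:

import random
import rlcard

env = rlcard.make('leduc-holdem')
state, player_id = env.reset()

while not env.is_over():
    action = random.choice(state['legal_actions'])  # stand-in for a real policy
    state, player_id = env.step(action)             # next state and the next player's ID

print(env.get_payoffs())  # payoffs for all players once the game is over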

Running with multiple processes

RLCard now supports acceleration with multiple processes. Simply set env_num when making the environment to indicate how many processes should be used. Currently, only the run() function supports multiple processes. An example is DQN on Blackjack.
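
A minimal sketch under that description: request several processes via env_num and then call run() as usual (the DQN example in the repository shows a full training setup).

import rlcard
from rlcard.agents import RandomAgent

# Ask for 4 parallel processes; currently only run() supports this.
env = rlcard.make('blackjack', config={'env_num': 4})
env.set_agents([RandomAgent(action_num=env.action_num)])

# run() collects trajectories and payoffs from all of the processes.
trajectories, payoffs = env.run(is_training=False)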

Library Structure

The purposes of the main modules are listed below:

Evaluation

The performance is measured by winning rates through tournaments. Example outputs: learning curves.

For your information, there is a nice online evaluation platform, pokerwars, that can be connected to RLCard with some modifications.

More Documents

For more documentation, please refer to the Documents for general introductions. API documents are available at our website.

Contributing

Contribution to this project is greatly appreciated! Please create an issue for feedback or bug reports. If you want to contribute code, please refer to the Contributing Guide.

Acknowledgements

We would like to thank JJ World Network Technology Co., Ltd for their generous support, and all of the community contributors for their contributions.
