thuxugang / Doudizhu

An AI for Dou Dizhu (斗地主, "Fight the Landlord")

Programming Languages

python

Labels

reinforcement-learning ai dqn

Projects that are alternatives of or similar to Doudizhu

Snake Ai Reinforcement

AI for Snake game trained from pixels using Deep Reinforcement Learning (DQN).

Stars: ✭ 123 (-17.45%)

Mutual labels: ai, reinforcement-learning, dqn

Aigames

use AI to play some games.

Stars: ✭ 422 (+183.22%)

Mutual labels: ai, reinforcement-learning, dqn

Atari

AI research environment for the Atari 2600 games 🤖.

Stars: ✭ 174 (+16.78%)

Mutual labels: ai, reinforcement-learning, dqn

Cartpole

OpenAI's cartpole env solver.

Stars: ✭ 107 (-28.19%)

Mutual labels: ai, reinforcement-learning, dqn

Openaigym

Solving OpenAI Gym problems.

Stars: ✭ 98 (-34.23%)

Mutual labels: reinforcement-learning, dqn

Machin

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Stars: ✭ 145 (-2.68%)

Mutual labels: reinforcement-learning, dqn

Torchrl

Highly Modular and Scalable Reinforcement Learning

Stars: ✭ 102 (-31.54%)

Mutual labels: reinforcement-learning, dqn

Allenact

An open source framework for research in Embodied-AI from AI2.

Stars: ✭ 144 (-3.36%)

Mutual labels: ai, reinforcement-learning

Reinforcement learning

Reinforcement learning tutorials

Stars: ✭ 82 (-44.97%)

Mutual labels: reinforcement-learning, dqn

Reinforcement Learning

🤖 Implementations of Reinforcement Learning algorithms.

Stars: ✭ 104 (-30.2%)

Mutual labels: reinforcement-learning, dqn

Minimalrl

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Stars: ✭ 2,051 (+1276.51%)

Mutual labels: reinforcement-learning, dqn

Categorical Dqn

A working implementation of the Categorical DQN (Distributional RL).

Stars: ✭ 90 (-39.6%)

Mutual labels: reinforcement-learning, dqn

Torchrl

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

Stars: ✭ 90 (-39.6%)

Mutual labels: reinforcement-learning, dqn

Reinforcement learning

Implementations of basic reinforcement learning algorithms

Stars: ✭ 100 (-32.89%)

Mutual labels: reinforcement-learning, dqn

Reinforcement Learning Wechat Jump

Reinforcement Learning for WeChat Jump

Stars: ✭ 85 (-42.95%)

Mutual labels: ai, reinforcement-learning

Easy Rl

A Chinese-language reinforcement learning tutorial; read it online at https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+1916.11%)

Mutual labels: reinforcement-learning, dqn

Deep Neuroevolution

Stars: ✭ 1,526 (+924.16%)

Mutual labels: ai, reinforcement-learning

Ctc Executioner

Master Thesis: Limit order placement with Reinforcement Learning

Stars: ✭ 112 (-24.83%)

Mutual labels: reinforcement-learning, dqn

Rainbow

A PyTorch implementation of Rainbow DQN agent

Stars: ✭ 147 (-1.34%)

Mutual labels: reinforcement-learning, dqn

Holodeck Engine

High Fidelity Simulator for Reinforcement Learning and Robotics Research.

Stars: ✭ 48 (-67.79%)

Mutual labels: ai, reinforcement-learning

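Dou Dizhu

DeeCamp Dou Dizhu

master branch

Provides a game engine that can be combined with an AI. next_moves lists every play that is legal under the rules; you have to implement the step that selects one play from next_moves yourself (the choose method in myutil). The default is random selection.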
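
As a rough illustration of that hook, the sketch below shows a random selector and one simple alternative. The signature of choose, the representation of a move as a list of card ranks, and the choose_fewest_cards helper are all assumptions made for this example; check myutil in the repository for the real interface.

```python
import random


def choose(next_moves, rng=random):
    """Pick one legal move at random (assumed signature, not the repo's).

    next_moves is assumed to be a list of moves, each move a list of cards.
    Returns None when there is nothing to play.
    """
    if not next_moves:
        return None
    return rng.choice(next_moves)


def choose_fewest_cards(next_moves):
    """Hypothetical heuristic: prefer the move that uses the fewest cards."""
    if not next_moves:
        return None
    return min(next_moves, key=len)


if __name__ == "__main__":
    moves = [[3], [4, 4], [5, 5, 5]]
    print(choose(moves))
    print(choose_fewest_cards(moves))
```

Swapping the random pick for a learned policy would mean replacing only the body of choose, which is presumably why the engine exposes the selection step separately.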

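web branch

1. Renders the game in a web page, which gives you a visual way to debug

2. Lets you play against a human

Usage

1. Start server.py

2. Visit http://127.0.0.1:5000/ddz

rl_pdqn branch

Modeled on OpenAI, this branch provides a game engine that can be combined with RL. The opponent can be random, Chen Xiao's rules (cxgz), or the agent itself (self), but only one player can be trained at a time, and it must be player 1. The RL model on this branch is prioritized_dqn; for details of the model, see https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow .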
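
For readers unfamiliar with prioritized replay, here is a minimal sketch of the idea: transitions with a larger TD error are sampled more often. It is a simplified, list-based version written for this note, not the code on this branch; the class name, the alpha and eps defaults, and the omission of a sum tree and importance-sampling weights are all assumptions.

```python
import random


class PrioritizedReplay:
    """Simplified proportional prioritized replay (illustrative only)."""

    def __init__(self, capacity, alpha=0.6, eps=0.01):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priorities skew sampling
        self.eps = eps          # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once full

    def add(self, transition):
        # New transitions get the current max priority so they are seen soon.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, rng=random):
        # Sample indices with probability proportional to priority ** alpha.
        weights = [p ** self.alpha for p in self.priorities]
        idxs = rng.choices(range(len(self.data)), weights=weights, k=batch_size)
        return idxs, [self.data[i] for i in idxs]

    def update(self, idxs, td_errors):
        # After a learning step, refresh priorities from the new TD errors.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

A training loop would call add after each step, sample to build a batch, and update with the absolute TD errors computed for that batch.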

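Current win rate: 90% vs random, 44% vs cxgz

multi-rl branch

Modeled on OpenAI, this branch provides a game engine that can be combined with RL and can train several RL players at the same time

mcts branch

A brute-force solution using MCTS (TODO: restoring the game state with deepcopy is slow, 1000/6s)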
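
To show where the deepcopy cost comes from, the sketch below runs a flat Monte Carlo search (random rollouts per candidate move, not a full tree search) on a toy take-away game. ToyState, rollout, and flat_monte_carlo are hypothetical names for this example and do not reflect the engine on this branch; the point is only that every simulation starts from a fresh copy of the state.

```python
import copy
import random


class ToyState:
    """Toy game: players alternately take 1-3 stones; taking the last wins."""

    def __init__(self, pile, player=0):
        self.pile = pile
        self.player = player  # whose turn it is

    def next_moves(self):
        return [k for k in (1, 2, 3) if k <= self.pile]

    def play(self, k):
        self.pile -= k
        self.player = 1 - self.player

    def winner(self):
        # The player who just moved took the last stone.
        return None if self.pile > 0 else 1 - self.player


def rollout(state, rng):
    """Play random moves until the game ends; return the winner."""
    while state.winner() is None:
        state.play(rng.choice(state.next_moves()))
    return state.winner()


def flat_monte_carlo(state, n_rollouts=200, rng=random):
    """Return the move with the best win rate over random rollouts."""
    me = state.player
    best_move, best_rate = None, -1.0
    for move in state.next_moves():
        wins = 0
        for _ in range(n_rollouts):
            sim = copy.deepcopy(state)  # one copy per simulation: the slow part
            sim.play(move)
            wins += rollout(sim, rng) == me
        rate = wins / n_rollouts
        if rate > best_rate:
            best_move, best_rate = move, rate
    return best_move
```

With n_rollouts simulations per move, the number of deepcopy calls grows as moves times rollouts, so a cheaper way to snapshot or undo the state is the usual route to faster simulation.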

Contributor

DeeCamp Group 5

Stars: ✭ 149