Alphazero gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (+1817.91%)
Mutual labels: gomoku, monte-carlo-tree-search, alphazero
Alpha Zero General
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+1852.99%)
Mutual labels: gomoku, monte-carlo-tree-search, alphazero
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+2894.03%)
Mutual labels: deep-reinforcement-learning, pytorch-rl
AnimalChess
Animal Fight Chess game (斗兽棋) written in Rust.
Stars: ✭ 76 (-43.28%)
Mutual labels: monte-carlo-tree-search, alphazero
alphaFive
An AlphaGo-style version of Five in a Row (gobang, gomoku)
Stars: ✭ 51 (-61.94%)
Mutual labels: gomoku, alphazero
AlphaZero Gobang
Major course assignment for Deep Learning at UCAS
Stars: ✭ 29 (-78.36%)
Mutual labels: gomoku, alphazero
gobang
A Gomoku AI developed in vanilla JavaScript
Stars: ✭ 22 (-83.58%)
Mutual labels: gomoku, gomoku-game
muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and pit both algorithms against each other, and investigate the reliability of learned MuZero MDP models.
Stars: ✭ 126 (-5.97%)
Mutual labels: deep-reinforcement-learning, alphazero
alphastone
Using self-play, MCTS, and a deep neural network to create a Hearthstone AI player
Stars: ✭ 24 (-82.09%)
Mutual labels: deep-reinforcement-learning, monte-carlo-tree-search
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
Stars: ✭ 162 (+20.9%)
Mutual labels: deep-reinforcement-learning
FinRL Podracer
Cloud-native Financial Reinforcement Learning
Stars: ✭ 179 (+33.58%)
Mutual labels: deep-reinforcement-learning
Master-Thesis
Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, TensorFlow, TensorBoard, NumPy, gym-torcs, Ubuntu, LaTeX
Stars: ✭ 33 (-75.37%)
Mutual labels: deep-reinforcement-learning
decentralized-rl
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)
Stars: ✭ 40 (-70.15%)
Mutual labels: deep-reinforcement-learning
LWDRLC
Lightweight deep RL library for continuous control.
Stars: ✭ 14 (-89.55%)
Mutual labels: deep-reinforcement-learning
Meta-Learning-for-StarCraft-II-Minigames
We reproduced DeepMind's results and implemented a meta-learning (MLSH) agent that can generalize across minigames.
Stars: ✭ 26 (-80.6%)
Mutual labels: deep-reinforcement-learning
pytorch-noreward-rl
PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
Stars: ✭ 79 (-41.04%)
Mutual labels: deep-reinforcement-learning
minerva
An out-of-the-box GUI tool for offline deep reinforcement learning
Stars: ✭ 80 (-40.3%)
Mutual labels: deep-reinforcement-learning
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (-30.6%)
Mutual labels: deep-reinforcement-learning