130 open source projects that are alternatives to or similar to connect4

Master-Thesis
Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
Stars: ✭ 33 (+0%)
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+1093.94%)
Mutual labels:  policy-gradient
KKAlphaGoZero
Implementation of the AlphaGo Zero paper
Stars: ✭ 35 (+6.06%)
Mutual labels:  alphago-zero
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+1003.03%)
Mutual labels:  policy-gradient
alpha sigma
A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Stars: ✭ 134 (+306.06%)
Mutual labels:  monte-carlo-tree-search
Rl algorithms
Structural implementation of RL key algorithms
Stars: ✭ 352 (+966.67%)
Mutual labels:  policy-gradient
agentmodels.org
Modeling agents with probabilistic programs
Stars: ✭ 66 (+100%)
Ppo Pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: ✭ 325 (+884.85%)
Mutual labels:  policy-gradient
alphastone
Using self-play, MCTS, and a deep neural network to create a Hearthstone AI player
Stars: ✭ 24 (-27.27%)
Mutual labels:  monte-carlo-tree-search
Reinforcement Learning Kr
Examples for the book [Reinforcement Learning with Python and Keras]
Stars: ✭ 282 (+754.55%)
Mutual labels:  policy-gradient
marltoolbox
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
Stars: ✭ 25 (-24.24%)
policy-gradient-pong
TensorFlow implementation of Andrej Karpathy's blog post on reinforcement learning: http://karpathy.github.io/2016/05/31/rl/
Stars: ✭ 29 (-12.12%)
Mutual labels:  policy-gradient
pytorch-rl
Pytorch Implementation of RL algorithms
Stars: ✭ 15 (-54.55%)
rl implementations
No description or website provided.
Stars: ✭ 40 (+21.21%)
Mutual labels:  policy-gradient
ReZero-ResNet
Unofficial pytorch implementation of ReZero in ResNet
Stars: ✭ 23 (-30.3%)
Mutual labels:  residual-networks
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+2639.39%)
Mutual labels:  policy-gradient
quoridor-ai
Quoridor AI based on Monte Carlo tree search
Stars: ✭ 23 (-30.3%)
Mutual labels:  monte-carlo-tree-search
TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Stars: ✭ 17 (-48.48%)
Mutual labels:  policy-gradient
Deep Algotrading
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+424.24%)
Mutual labels:  policy-gradient
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+181.82%)
Mutual labels:  policy-gradient
deep-active-inference-mc
Deep active inference agents using Monte-Carlo methods
Stars: ✭ 41 (+24.24%)
Mutual labels:  monte-carlo-tree-search
HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Stars: ✭ 228 (+590.91%)
Mutual labels:  policy-gradient
Show Adapt And Tell
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (+342.42%)
Mutual labels:  policy-gradient
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+12057.58%)
breakout-Deep-Q-Network
Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout
Stars: ✭ 69 (+109.09%)
Mutual labels:  dueling-dqn
AnimalChess
Animal Fight Chess game (斗兽棋) written in Rust.
Stars: ✭ 76 (+130.3%)
Mutual labels:  monte-carlo-tree-search
UCThello
UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Stars: ✭ 26 (-21.21%)
Mutual labels:  monte-carlo-tree-search
Reinforcement-Learning-CheatSheet
Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition)
Stars: ✭ 22 (-33.33%)
Mlds2018spring
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (+275.76%)
Mutual labels:  policy-gradient
Neural-Fictitous-Self-Play
Scalable implementation of Neural Fictitious Self-Play
Stars: ✭ 52 (+57.58%)
ludorum.js
A board game framework, focused not on graphics or user interfaces, but on artificial players design, implementation and testing.
Stars: ✭ 13 (-60.61%)
Mutual labels:  monte-carlo-tree-search
Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
Stars: ✭ 52 (+57.58%)
Easy Rl
Chinese-language reinforcement learning tutorial; read online at https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+9003.03%)
Mutual labels:  policy-gradient
xingtian
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Stars: ✭ 229 (+593.94%)
rpg
Ranking Policy Gradient
Stars: ✭ 22 (-33.33%)
Mutual labels:  policy-gradient
RL-code-resources
A collection of Reinforcement Learning GitHub code resources divided by frameworks and environments
Stars: ✭ 51 (+54.55%)
Reinforcement learning
Implementations of basic reinforcement learning algorithms
Stars: ✭ 100 (+203.03%)
Mutual labels:  policy-gradient
TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
Stars: ✭ 41 (+24.24%)
RL
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
Stars: ✭ 22 (-33.33%)
Mutual labels:  policy-gradient
Bender
Easily craft fast Neural Networks on iOS! Use TensorFlow models. Metal under the hood.
Stars: ✭ 1,728 (+5136.36%)
Mutual labels:  residual-networks
Deeprl algorithms
Easy-to-read implementations of DeepRL algorithms in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+193.94%)
Mutual labels:  policy-gradient
AlphaZero Gobang
Deep Learning big homework of UCAS
Stars: ✭ 29 (-12.12%)
Mutual labels:  residual-networks
Hypernets
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Stars: ✭ 221 (+569.7%)
Mutual labels:  monte-carlo-tree-search
wideresnet-tensorlayer
Wide Residual Networks implemented in TensorLayer and TensorFlow.
Stars: ✭ 44 (+33.33%)
Mutual labels:  residual-networks
Codegan
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Stars: ✭ 73 (+121.21%)
Mutual labels:  policy-gradient
resnet-cifar10
ResNet for Cifar10
Stars: ✭ 21 (-36.36%)
Mutual labels:  residual-networks
caffe-wrn-generator
Caffe Wide-Residual-Network (WRN) Generator
Stars: ✭ 19 (-42.42%)
Mutual labels:  residual-networks
Parl Sample
Deep reinforcement learning using Baidu PARL (maze, Flappy Bird, and so on)
Stars: ✭ 37 (+12.12%)
Mutual labels:  policy-gradient
TicTacToeUI-Android
Check out the new style for App Design aims for Tic Tac Toe Game...😉😀😁😎
Stars: ✭ 40 (+21.21%)
Mutual labels:  tictactoe-game
MCTS-agent-python
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games …
Stars: ✭ 22 (-33.33%)
Mutual labels:  monte-carlo-tree-search
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+2218.18%)
Mutual labels:  policy-gradient
l2rpn-baselines
L2RPN Baselines: a repository to host baselines for L2RPN competitions.
Stars: ✭ 57 (+72.73%)
Reinforcement-Learning-on-google-colab
Reinforcement learning algorithms using Google Colab
Stars: ✭ 33 (+0%)
Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+1893.94%)
Mutual labels:  policy-gradient
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+1839.39%)
Mutual labels:  policy-gradient
Rlseq2seq
Deep Reinforcement Learning For Sequence to Sequence Models
Stars: ✭ 683 (+1969.7%)
Mutual labels:  policy-gradient
alphaFive
AlphaGo-style version of five-in-a-row (gobang, gomoku)
Stars: ✭ 51 (+54.55%)
Mutual labels:  alphago-zero
VREP-RL-bot
Reinforcement Learning in Vrep
Stars: ✭ 14 (-57.58%)
Pytorch-RL-CPP
A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
Stars: ✭ 73 (+121.21%)
TAA-PG
Usage of policy gradient reinforcement learning to solve portfolio optimization problems (Tactical Asset Allocation).
Stars: ✭ 26 (-21.21%)
Mutual labels:  policy-gradient