All Projects → connect4 → Similar Projects or Alternatives

130 Open source projects that are alternatives of or similar to connect4

Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex

Stars: ✭ 33 (+0%)

Mutual labels: reinforcement-learning-algorithms

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (+1093.94%)

Mutual labels: policy-gradient

KKAlphaGoZero

alphaGoZero论文的实现

Stars: ✭ 35 (+6.06%)

Mutual labels: alphago-zero

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (+1003.03%)

Mutual labels: policy-gradient

alpha sigma

A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.

Stars: ✭ 134 (+306.06%)

Mutual labels: monte-carlo-tree-search

Rl algorithms

Structural implementation of RL key algorithms

Stars: ✭ 352 (+966.67%)

Mutual labels: policy-gradient

agentmodels.org

Modeling agents with probabilistic programs

Stars: ✭ 66 (+100%)

Mutual labels: reinforcement-learning-algorithms

Ppo Pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Stars: ✭ 325 (+884.85%)

Mutual labels: policy-gradient

alphastone

Using self-play, MCTS, and a deep neural network to create a hearthstone ai player

Stars: ✭ 24 (-27.27%)

Mutual labels: monte-carlo-tree-search

Reinforcement Learning Kr

[파이썬과 케라스로 배우는 강화학습] 예제

Stars: ✭ 282 (+754.55%)

Mutual labels: policy-gradient

marltoolbox

A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).

Stars: ✭ 25 (-24.24%)

Mutual labels: reinforcement-learning-algorithms

policy-gradient-pong

tensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/

Stars: ✭ 29 (-12.12%)

Mutual labels: policy-gradient

pytorch-rl

Pytorch Implementation of RL algorithms

Stars: ✭ 15 (-54.55%)

Mutual labels: reinforcement-learning-algorithms

rl implementations

No description or website provided.

Stars: ✭ 40 (+21.21%)

Mutual labels: policy-gradient

ReZero-ResNet

Unofficial pytorch implementation of ReZero in ResNet

Stars: ✭ 23 (-30.3%)

Mutual labels: residual-networks

Slm Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Stars: ✭ 904 (+2639.39%)

Mutual labels: policy-gradient

quoridor-ai

Quoridor AI based on Monte Carlo tree search

Stars: ✭ 23 (-30.3%)

Mutual labels: monte-carlo-tree-search

TRPO-TensorFlow

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Stars: ✭ 17 (-48.48%)

Mutual labels: policy-gradient

Deep Algotrading

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Stars: ✭ 173 (+424.24%)

Mutual labels: policy-gradient

imitation learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Stars: ✭ 93 (+181.82%)

Mutual labels: policy-gradient

deep-active-inference-mc

Deep active inference agents using Monte-Carlo methods

Stars: ✭ 41 (+24.24%)

Mutual labels: monte-carlo-tree-search

HandyRL

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Stars: ✭ 228 (+590.91%)

Mutual labels: policy-gradient

Show Adapt And Tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Stars: ✭ 146 (+342.42%)

Mutual labels: policy-gradient

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+12057.58%)

Mutual labels: reinforcement-learning-algorithms

breakout-Deep-Q-Network

Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout

Stars: ✭ 69 (+109.09%)

Mutual labels: dueling-dqn

AnimalChess

Animal Fight Chess Game（斗兽棋） written in rust.

Stars: ✭ 76 (+130.3%)

Mutual labels: monte-carlo-tree-search

UCThello

UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)

Stars: ✭ 26 (-21.21%)

Mutual labels: monte-carlo-tree-search

Reinforcement-Learning-CheatSheet

Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition)

Stars: ✭ 22 (-33.33%)

Mutual labels: reinforcement-learning-algorithms

Mlds2018spring

Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

Stars: ✭ 124 (+275.76%)

Mutual labels: policy-gradient

Neural-Fictitous-Self-Play

Scalable Implementation of Neural Fictitous Self-Play

Stars: ✭ 52 (+57.58%)

Mutual labels: reinforcement-learning-algorithms

ludorum.js

A board game framework, focused not on graphics or user interfaces, but on artificial players design, implementation and testing.

Stars: ✭ 13 (-60.61%)

Mutual labels: monte-carlo-tree-search

Recurrent-Deep-Q-Learning

Solving POMDP using Recurrent networks

Stars: ✭ 52 (+57.58%)

Mutual labels: reinforcement-learning-algorithms

Easy Rl

强化学习中文教程，在线阅读地址：https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+9003.03%)

Mutual labels: policy-gradient

xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Stars: ✭ 229 (+593.94%)

Mutual labels: reinforcement-learning-algorithms

rpg

Ranking Policy Gradient

Stars: ✭ 22 (-33.33%)

Mutual labels: policy-gradient

RL-code-resources

A collection of Reinforcement Learning GitHub code resources divided by frameworks and environments

Stars: ✭ 51 (+54.55%)

Mutual labels: reinforcement-learning-algorithms

Reinforcement learning

강화학습에 대한 기본적인 알고리즘 구현

Stars: ✭ 100 (+203.03%)

Mutual labels: policy-gradient

TD3-BipedalWalkerHardcore-v2

Solve BipedalWalkerHardcore-v2 with TD3

Stars: ✭ 41 (+24.24%)

Mutual labels: reinforcement-learning-algorithms

A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm

Stars: ✭ 22 (-33.33%)

Mutual labels: policy-gradient

Bender

Easily craft fast Neural Networks on iOS! Use TensorFlow models. Metal under the hood.

Stars: ✭ 1,728 (+5136.36%)

Mutual labels: residual-networks

Deeprl algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Stars: ✭ 97 (+193.94%)

Mutual labels: policy-gradient

AlphaZero Gobang

Deep Learning big homework of UCAS

Stars: ✭ 29 (-12.12%)

Mutual labels: residual-networks

Hypernets

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

Stars: ✭ 221 (+569.7%)

Mutual labels: monte-carlo-tree-search

wideresnet-tensorlayer

Wide Residual Networks implemented in TensorLayer and TensorFlow.

Stars: ✭ 44 (+33.33%)

Mutual labels: residual-networks

Codegan

[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks

Stars: ✭ 73 (+121.21%)

Mutual labels: policy-gradient

resnet-cifar10

ResNet for Cifar10

Stars: ✭ 21 (-36.36%)

Mutual labels: residual-networks

caffe-wrn-generator

Caffe Wide-Residual-Network (WRN) Generator

Stars: ✭ 19 (-42.42%)

Mutual labels: residual-networks

Parl Sample

Deep reinforcement learning using baidu PARL(maze,flappy bird and so on)

Stars: ✭ 37 (+12.12%)

Mutual labels: policy-gradient

TicTacToeUI-Android

Check out the new style for App Design aims for Tic Tac Toe Game...😉😀😁😎

Stars: ✭ 40 (+21.21%)

Mutual labels: tictactoe-game

MCTS-agent-python

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games …

Stars: ✭ 22 (-33.33%)

Mutual labels: monte-carlo-tree-search

Btgym

Scalable, event-driven, deep-learning-friendly backtesting library

Stars: ✭ 765 (+2218.18%)

Mutual labels: policy-gradient

l2rpn-baselines

L2RPN Baselines a repository to host baselines for l2rpn competitions.

Stars: ✭ 57 (+72.73%)

Mutual labels: reinforcement-learning-algorithms

Reinforcement-Learning-on-google-colab

Reinforcement Learning algorithm's using google-colab

Stars: ✭ 33 (+0%)

Mutual labels: reinforcement-learning-algorithms

Pytorch Rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Stars: ✭ 658 (+1893.94%)

Mutual labels: policy-gradient

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+1839.39%)

Mutual labels: policy-gradient

Rlseq2seq

Deep Reinforcement Learning For Sequence to Sequence Models

Stars: ✭ 683 (+1969.7%)

Mutual labels: policy-gradient

alphaFive

alphaGo版本的五子棋(gobang, gomoku)