Master-ThesisDeep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
Stars: ✭ 33 (+0%)
Pytorch RlThis repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+1093.94%)
Lagomlagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+1003.03%)
alpha sigmaA pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Stars: ✭ 134 (+306.06%)
Rl algorithmsStructural implementation of RL key algorithms
Stars: ✭ 352 (+966.67%)
agentmodels.orgModeling agents with probabilistic programs
Stars: ✭ 66 (+100%)
Ppo PytorchMinimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: ✭ 325 (+884.85%)
alphastoneUsing self-play, MCTS, and a deep neural network to create a hearthstone ai player
Stars: ✭ 24 (-27.27%)
marltoolboxA toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
Stars: ✭ 25 (-24.24%)
policy-gradient-pongtensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/
Stars: ✭ 29 (-12.12%)
pytorch-rlPytorch Implementation of RL algorithms
Stars: ✭ 15 (-54.55%)
ReZero-ResNetUnofficial pytorch implementation of ReZero in ResNet
Stars: ✭ 23 (-30.3%)
Slm LabModular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+2639.39%)
quoridor-aiQuoridor AI based on Monte Carlo tree search
Stars: ✭ 23 (-30.3%)
TRPO-TensorFlowTrust Region Policy Optimization (TRPO) in pure TensorFlow
Stars: ✭ 17 (-48.48%)
Deep AlgotradingA resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+424.24%)
imitation learningPyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+181.82%)
HandyRLHandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Stars: ✭ 228 (+590.91%)
Show Adapt And TellCode for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (+342.42%)
breakout-Deep-Q-NetworkReinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout
Stars: ✭ 69 (+109.09%)
AnimalChessAnimal Fight Chess Game(斗兽棋) written in rust.
Stars: ✭ 76 (+130.3%)
UCThelloUCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Stars: ✭ 26 (-21.21%)
Mlds2018springMachine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (+275.76%)
ludorum.jsA board game framework, focused not on graphics or user interfaces, but on artificial players design, implementation and testing.
Stars: ✭ 13 (-60.61%)
Easy Rl强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+9003.03%)
xingtianxingtian is a componentized library for the development and verification of reinforcement learning algorithms
Stars: ✭ 229 (+593.94%)
rpgRanking Policy Gradient
Stars: ✭ 22 (-33.33%)
RL-code-resourcesA collection of Reinforcement Learning GitHub code resources divided by frameworks and environments
Stars: ✭ 51 (+54.55%)
RLA set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
Stars: ✭ 22 (-33.33%)
BenderEasily craft fast Neural Networks on iOS! Use TensorFlow models. Metal under the hood.
Stars: ✭ 1,728 (+5136.36%)
Deeprl algorithmsDeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+193.94%)
HypernetsA General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Stars: ✭ 221 (+569.7%)
Codegan[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Stars: ✭ 73 (+121.21%)
Parl SampleDeep reinforcement learning using baidu PARL(maze,flappy bird and so on)
Stars: ✭ 37 (+12.12%)
TicTacToeUI-AndroidCheck out the new style for App Design aims for Tic Tac Toe Game...😉😀😁😎
Stars: ✭ 40 (+21.21%)
MCTS-agent-pythonMonte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games …
Stars: ✭ 22 (-33.33%)
BtgymScalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+2218.18%)
l2rpn-baselinesL2RPN Baselines a repository to host baselines for l2rpn competitions.
Stars: ✭ 57 (+72.73%)
Pytorch RlPyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+1893.94%)
Rlseq2seqDeep Reinforcement Learning For Sequence to Sequence Models
Stars: ✭ 683 (+1969.7%)
alphaFivealphaGo版本的五子棋(gobang, gomoku)
Stars: ✭ 51 (+54.55%)
VREP-RL-botReinforcement Learning in Vrep
Stars: ✭ 14 (-57.58%)
Pytorch-RL-CPPA Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
Stars: ✭ 73 (+121.21%)
TAA-PGUsage of policy gradient reinforcement learning to solve portfolio optimization problems (Tactical Asset Allocation).
Stars: ✭ 26 (-21.21%)