Paddle-RLBooksPaddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (+242.42%)
Alpha Zero GeneralA clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+7830.3%)
yarllCombining deep learning and reinforcement learning.
Stars: ✭ 84 (+154.55%)
Alphazero gomokuAn implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (+7687.88%)
TianshouAn elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+12351.52%)
distributed rlPytorch implementation of distributed deep reinforcement learning
Stars: ✭ 66 (+100%)
Reinforcement LearningDeep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
Stars: ✭ 61 (+84.85%)
Deep-rl-mxnetMxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Stars: ✭ 26 (-21.21%)
king-pongDeep Reinforcement Learning Pong Agent, King Pong, he's the best
Stars: ✭ 23 (-30.3%)
rl-algorithmsReinforcement learning algorithms
Stars: ✭ 40 (+21.21%)
SharkStockAutomate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …
Stars: ✭ 63 (+90.91%)
banditsComparison of bandit algorithms from the Reinforcement Learning bible.
Stars: ✭ 16 (-51.52%)
alphazeroBoard Game Reinforcement Learning using AlphaZero method. including Makhos (Thai Checkers), Reversi, Connect Four, Tic-tac-toe game rules
Stars: ✭ 24 (-27.27%)
MultihopkgMulti-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (+512.12%)
deep rl acrobotTensorFlow A2C to solve Acrobot, with synchronized parallel environments
Stars: ✭ 32 (-3.03%)
A2cA Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (+412.12%)
Policy GradientMinimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (+309.09%)
ml-aiML-AI Community | Open Source | Built in Bharat for the World | Data science problem statements and solutions
Stars: ✭ 32 (-3.03%)
segmentation-enhanced-resunetUrban building extraction in Daejeon region using Modified Residual U-Net (Modified ResUnet) and applying post-processing.
Stars: ✭ 34 (+3.03%)
Fruit-APIA Universal Deep Reinforcement Learning Framework
Stars: ✭ 61 (+84.85%)
Pytorch RlTutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (+266.67%)
Master-ThesisDeep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
Stars: ✭ 33 (+0%)
alpha sigmaA pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Stars: ✭ 134 (+306.06%)
agentmodels.orgModeling agents with probabilistic programs
Stars: ✭ 66 (+100%)
alphastoneUsing self-play, MCTS, and a deep neural network to create a hearthstone ai player
Stars: ✭ 24 (-27.27%)
marltoolboxA toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
Stars: ✭ 25 (-24.24%)
pytorch-rlPytorch Implementation of RL algorithms
Stars: ✭ 15 (-54.55%)
TorchrlHighly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (+209.09%)
DRL in CVA course on Deep Reinforcement Learning in Computer Vision. Visit Website:
Stars: ✭ 59 (+78.79%)
Deep AlgotradingA resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+424.24%)
imitation learningPyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+181.82%)
Show Adapt And TellCode for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (+342.42%)
onnOnline Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Stars: ✭ 139 (+321.21%)
Deep Reinforcement Learning With PytorchPyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+3975.76%)
Mlds2018springMachine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (+275.76%)
ludorum.jsA board game framework, focused not on graphics or user interfaces, but on artificial players design, implementation and testing.
Stars: ✭ 13 (-60.61%)
Easy Rl强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+9003.03%)
rpgRanking Policy Gradient
Stars: ✭ 22 (-33.33%)
RLA set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
Stars: ✭ 22 (-33.33%)
Deeprl algorithmsDeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+193.94%)
HypernetsA General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Stars: ✭ 221 (+569.7%)
Codegan[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Stars: ✭ 73 (+121.21%)
LWDRLCLightweight deep RL Libraray for continuous control.
Stars: ✭ 14 (-57.58%)
UAV-DDPGCode for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"
Stars: ✭ 133 (+303.03%)
Parl SampleDeep reinforcement learning using baidu PARL(maze,flappy bird and so on)
Stars: ✭ 37 (+12.12%)
Slm LabModular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+2639.39%)
l2rpn-baselinesL2RPN Baselines a repository to host baselines for l2rpn competitions.
Stars: ✭ 57 (+72.73%)
breakout-Deep-Q-NetworkReinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout
Stars: ✭ 69 (+109.09%)
BtgymScalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+2218.18%)