Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Stars: ✭ 442 (+1909.09%)
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+322.73%)
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+18577.27%)
Easy Rl
Chinese-language reinforcement learning tutorial; read online at https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+13554.55%)
Trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
Stars: ✭ 343 (+1459.09%)
deep trading
This project aims to select a supervised algorithm that can predict stock prices based on historical data, and to use the resulting predictor to form trading strategies.
Stars: ✭ 18 (-18.18%)
Policy Gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (+513.64%)
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+4009.09%)
SeqGAN-PyTorch
Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch
Stars: ✭ 40 (+81.82%)
Deep Rl Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Stars: ✭ 395 (+1695.45%)
Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+6013.64%)
A2c
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (+668.18%)
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, TensorFlow, and Keras.
Stars: ✭ 313 (+1322.73%)
ADL2019
Applied Deep Learning (2019 Spring) @ NTU
Stars: ✭ 20 (-9.09%)
Paddle-RLBooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (+413.64%)
td-reg
TD-Regularized Actor-Critic Methods
Stars: ✭ 28 (+27.27%)
Reinforcement Learning
Deep Reinforcement Learning Algorithms implemented with TensorFlow 2.3
Stars: ✭ 61 (+177.27%)
Pytorch Rl
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (+450%)
Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+2890.91%)
Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Stars: ✭ 54 (+145.45%)
connect4
Solving board games like Connect4 using Deep Reinforcement Learning
Stars: ✭ 33 (+50%)
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in PyTorch
Stars: ✭ 394 (+1690.91%)
Deep Algotrading
A resource for learning about deep learning techniques, from regression to LSTM and reinforcement learning, using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+686.36%)
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+1554.55%)
Deeprl algorithms
Easy-to-read implementations of deep RL algorithms in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+340.91%)
Rl algorithms
Structured implementations of key RL algorithms
Stars: ✭ 352 (+1500%)
SharkStock
Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …
Stars: ✭ 63 (+186.36%)
Ppo Pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: ✭ 325 (+1377.27%)
Codegan
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Stars: ✭ 73 (+231.82%)
Show Adapt And Tell
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (+563.64%)
policy-gradient-pong
TensorFlow implementation of Andrej Karpathy's blog post on reinforcement learning: http://karpathy.github.io/2016/05/31/rl/
Stars: ✭ 29 (+31.82%)
Parl Sample
Deep reinforcement learning using Baidu PARL (maze, Flappy Bird, and so on)
Stars: ✭ 37 (+68.18%)
Pontryagin-Differentiable-Programming
A unified end-to-end learning and control framework that is able to learn a (neural) control objective function, dynamics equation, control policy, and/or optimal trajectory in a control system.
Stars: ✭ 111 (+404.55%)
Deep-rl-mxnet
MXNet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Stars: ✭ 26 (+18.18%)
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+3377.27%)
TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Stars: ✭ 17 (-22.73%)
Mlds2018spring
Machine Learning and having it Deep and Structured (MLDS), spring 2018
Stars: ✭ 124 (+463.64%)
Deep-Reinforcement-Learning-CS285-Pytorch
Solutions to the assignments of the Deep Reinforcement Learning course (CS285) taught at the University of California, Berkeley, implemented in PyTorch
Stars: ✭ 104 (+372.73%)
Rlseq2seq
Deep Reinforcement Learning For Sequence to Sequence Models
Stars: ✭ 683 (+3004.55%)
HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Stars: ✭ 228 (+936.36%)
Seqgan
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (+2181.82%)
LWDRLC
Lightweight deep RL library for continuous control.
Stars: ✭ 14 (-36.36%)
RL
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on a policy gradient algorithm
Stars: ✭ 22 (+0%)
TAA-PG
Using policy gradient reinforcement learning to solve portfolio optimization problems (Tactical Asset Allocation).
Stars: ✭ 26 (+18.18%)
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in TensorFlow
Stars: ✭ 480 (+2081.82%)
yarll
Combining deep learning and reinforcement learning.
Stars: ✭ 84 (+281.82%)
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (+818.18%)
Torchrl
Highly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (+363.64%)
Deer
DEEp Reinforcement learning framework
Stars: ✭ 455 (+1968.18%)