Easy Rl强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+13554.55%)
Mutual labels: policy-gradient, imitation-learning
imitation learningPyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+322.73%)
Mutual labels: policy-gradient, imitation-learning
TianshouAn elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+18577.27%)
Mutual labels: policy-gradient, imitation-learning
Reinforcement learning tutorial with demoReinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+1909.09%)
Mutual labels: policy-gradient, imitation-learning
A2cA Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (+668.18%)
Mutual labels: policy-gradient
Deep Reinforcement Learning With PytorchPyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+6013.64%)
Mutual labels: policy-gradient
Pontryagin-Differentiable-ProgrammingA unified end-to-end learning and control framework that is able to learn a (neural) control objective function, dynamics equation, control policy, or/and optimal trajectory in a control system.
Stars: ✭ 111 (+404.55%)
Mutual labels: imitation-learning
Reinforcement LearningMinimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+12913.64%)
Mutual labels: policy-gradient
Policy GradientMinimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (+513.64%)
Mutual labels: policy-gradient
Deep AlgotradingA resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+686.36%)
Mutual labels: policy-gradient
Deeprl algorithmsDeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+340.91%)
Mutual labels: policy-gradient
SharkStockAutomate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …
Stars: ✭ 63 (+186.36%)
Mutual labels: policy-gradient
Codegan[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Stars: ✭ 73 (+231.82%)
Mutual labels: policy-gradient
Mlds2018springMachine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (+463.64%)
Mutual labels: policy-gradient
Pytorch RlTutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (+450%)
Mutual labels: policy-gradient
Show Adapt And TellCode for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (+563.64%)
Mutual labels: policy-gradient
Deep-Reinforcement-Learning-With-PythonMaster classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+909.09%)
Mutual labels: policy-gradient