Tensorflow ReinforceImplementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (-29.72%)
Pytorch RlTutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (-82.28%)
Reinforcement learning tutorial with demoReinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-35.29%)
Mlds2018springMachine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (-81.84%)
A2cA Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (-75.26%)
Openai labAn experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-54.17%)
Slm LabModular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+32.36%)
Torch AcRecurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-89.75%)
Rl a3c pytorchA3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (-29.43%)
Deeprl TutorialsContains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Stars: ✭ 748 (+9.52%)
BtgymScalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+12.01%)
ChainerrlChainerRL is a deep reinforcement learning library built on top of Chainer.
Stars: ✭ 931 (+36.31%)
TorchrlHighly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (-85.07%)
Policy GradientMinimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (-80.23%)
Run Skeleton RunReason8.ai PyTorch solution for NIPS RL 2017 challenge
Stars: ✭ 83 (-87.85%)
Reinforcementlearning AtarigamePytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
Stars: ✭ 118 (-82.72%)
Show Adapt And TellCode for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (-78.62%)
Pytorch sacPyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (-74.52%)
Pytorch DrlPyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (-65.89%)
Reinforcement LearningDeep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
Stars: ✭ 61 (-91.07%)
Pytorch RlDeep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (+9.08%)
DrqDrQ: Data regularized Q
Stars: ✭ 268 (-60.76%)
ReaverReaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
Stars: ✭ 499 (-26.94%)
Deep Reinforcement Learning With PytorchPyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+96.93%)
Pytorch A3cPyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Stars: ✭ 879 (+28.7%)
Rl algosReinforcement Learning Algorithms
Stars: ✭ 14 (-97.95%)
Rl algorithmsStructural implementation of RL key algorithms
Stars: ✭ 352 (-48.46%)
Pytorch sac aePyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
Stars: ✭ 94 (-86.24%)
Deep AlgotradingA resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (-74.67%)
Paddle-RLBooksPaddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (-83.46%)
Easy Rl强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+339.82%)
Lagomlagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-46.71%)
Pytorch RlThis repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (-42.31%)
Pytorch A2c Ppo Acktr GailPyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+285.36%)
Deep Rl KerasKeras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Stars: ✭ 395 (-42.17%)
ExplorerExplorer is a PyTorch reinforcement learning framework for exploring new ideas.
Stars: ✭ 54 (-92.09%)
MultihopkgMulti-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-70.42%)
ADL2019Applied Deep Learning (2019 Spring) @ NTU
Stars: ✭ 20 (-97.07%)
Dissecting Reinforcement LearningPython code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Stars: ✭ 512 (-25.04%)
Pytorch Cpp RlPyTorch C++ Reinforcement Learning
Stars: ✭ 353 (-48.32%)
AdeptrlReinforcement learning framework to accelerate research
Stars: ✭ 173 (-74.67%)
TrpoTrust Region Policy Optimization with TensorFlow and OpenAI Gym
Stars: ✭ 343 (-49.78%)
Pytorch RlPyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (-3.66%)
SeqganA simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (-26.5%)
Habitat LabA modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
Stars: ✭ 587 (-14.06%)
Ai ToolboxA C++ framework for MDPs and POMDPs with Python bindings
Stars: ✭ 500 (-26.79%)
Gym AnytradingThe most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
Stars: ✭ 627 (-8.2%)