A3c
MXNet + OpenAI Gym implementation of A3C from "Asynchronous Methods for Deep Reinforcement Learning"
Stars: ✭ 9 (-97.72%)
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-82.28%)
Rainbow
A PyTorch implementation of the Rainbow DQN agent
Stars: ✭ 147 (-62.78%)
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-32.15%)
Deep Algotrading
A resource for learning about deep learning techniques, from regression to LSTM and reinforcement learning, using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (-56.2%)
Reinforcementlearning Atarigame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. It also uses Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm, which the authors describe as superior to and more efficient than DQN. It can play many games.
Stars: ✭ 118 (-70.13%)
Ctc Executioner
Master's thesis: Limit order placement with Reinforcement Learning
Stars: ✭ 112 (-71.65%)
Chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Stars: ✭ 931 (+135.7%)
Pysc2 Agents
A simple implementation of DeepMind's PySC2 RL agents.
Stars: ✭ 262 (-33.67%)
Rlgraph
RLgraph: Modular computation graphs for deep reinforcement learning
Stars: ✭ 272 (-31.14%)
Genrl
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations, with the aim of improving accessibility in RL
Stars: ✭ 356 (-9.87%)
Policy Gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) algorithm implementation in Keras
Stars: ✭ 135 (-65.82%)
Agents
TF-Agents: A reliable, scalable, and easy-to-use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Stars: ✭ 2,135 (+440.51%)
Hindsight Experience Replay
A PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.
Stars: ✭ 134 (-66.08%)
Deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state of the art in self-driving
Stars: ✭ 628 (+58.99%)
Mlds2018spring
Machine Learning and having it Deep and Structured (MLDS), spring 2018
Stars: ✭ 124 (-68.61%)
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-48.86%)
Pytorch Ddpg Naf
Implementation of algorithms for continuous control (DDPG and NAF).
Stars: ✭ 254 (-35.7%)
Gym Gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Stars: ✭ 257 (-34.94%)
Aleph star
Reinforcement learning with A* and a deep heuristic
Stars: ✭ 235 (-40.51%)
Deep Rl Trading
Playing idealized trading games with deep reinforcement learning
Stars: ✭ 228 (-42.28%)
Rl Book
Source code for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (+17.47%)
Robotics Rl Srl
S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics
Stars: ✭ 453 (+14.68%)
Reinforcement Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Stars: ✭ 3,329 (+742.78%)
Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for off-policy learning
Stars: ✭ 44 (-88.86%)
Drlkit
A high-level Python deep reinforcement learning library. Great for beginners, prototyping, and quickly comparing algorithms
Stars: ✭ 29 (-92.66%)
Rl algos
Reinforcement Learning Algorithms
Stars: ✭ 14 (-96.46%)
Trading Gym
A trading environment based on Gym
Stars: ✭ 71 (-82.03%)
Dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Stars: ✭ 75 (-81.01%)
Deep-Reinforcement-Learning-Notebooks
This repository contains a series of Google Colab notebooks that I created to help people dive into deep reinforcement learning. The notebooks contain both theory and implementations of different algorithms.
Stars: ✭ 15 (-96.2%)
Rlenv.directory
Explore and find reinforcement learning environments in a list of 150+ open source environments.
Stars: ✭ 79 (-80%)
Stable Baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Stars: ✭ 1,263 (+219.75%)
Rl Baselines Zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Stars: ✭ 839 (+112.41%)
Lf2gym
An OpenAI-gym-like environment for Little Fighter 2
Stars: ✭ 79 (-80%)
Cleanrl
High-quality single-file implementations of deep reinforcement learning algorithms with research-friendly features
Stars: ✭ 349 (-11.65%)
Stable Baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-70.89%)
Drqn Tensorflow
Deep recurrent Q-learning using TensorFlow, openai/gym, and openai/retro
Stars: ✭ 127 (-67.85%)
Pytorch sac ae
PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE)
Stars: ✭ 94 (-76.2%)
Sumo Rl
A simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control. Compatible with Gym Env from OpenAI and MultiAgentEnv from RLlib.
Stars: ✭ 145 (-63.29%)
Rl Baselines3 Zoo
A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
Stars: ✭ 161 (-59.24%)
Gym Sokoban
Sokoban environment for OpenAI Gym
Stars: ✭ 186 (-52.91%)
Naf Tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
Stars: ✭ 192 (-51.39%)
Paac.pytorch
PyTorch implementation of the PAAC algorithm presented in "Efficient Parallel Methods for Deep Reinforcement Learning" (https://arxiv.org/abs/1705.04862)
Stars: ✭ 22 (-94.43%)
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (-55.95%)
Pytorch A3c
Simple A3C implementation with PyTorch + multiprocessing
Stars: ✭ 364 (-7.85%)
omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
Stars: ✭ 43 (-89.11%)
yarll
Combining deep learning and reinforcement learning.
Stars: ✭ 84 (-78.73%)
TF2-RL
Reinforcement learning algorithms implemented for TensorFlow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]
Stars: ✭ 160 (-59.49%)
Ma Gym
A collection of multi-agent environments based on OpenAI Gym.
Stars: ✭ 226 (-42.78%)
UAV-DDPG
Code for the paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"
Stars: ✭ 133 (-66.33%)
Rainy
☔ Deep RL agents with PyTorch ☔
Stars: ✭ 39 (-90.13%)