Baby A3c
A high-performance Atari A3C agent in 180 lines of PyTorch
Stars: ✭ 144 (+1100%)
Dissecting Reinforcement Learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Stars: ✭ 512 (+4166.67%)
Machine Learning Is All You Need
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic code and notes using sklearn, PyTorch, TensorFlow, Keras and, most importantly, from scratch!💪 This repository is ALL You Need!
Stars: ✭ 173 (+1341.67%)
Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+11108.33%)
Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Stars: ✭ 54 (+350%)
Pytorch Rl
Deep Reinforcement Learning with PyTorch & visdom
Stars: ✭ 745 (+6108.33%)
Pytorch sac ae
PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE)
Stars: ✭ 94 (+683.33%)
Rl algos
Reinforcement Learning Algorithms
Stars: ✭ 14 (+16.67%)
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (+1841.67%)
Reinforcement Learning Algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO and TRPO. (More algorithms are still in progress.)
Stars: ✭ 426 (+3450%)
Master-Thesis
Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
Stars: ✭ 33 (+175%)
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (+1350%)
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (+2508.33%)
Pytorch A3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Stars: ✭ 879 (+7225%)
Fruit-API
A Universal Deep Reinforcement Learning Framework
Stars: ✭ 61 (+408.33%)
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Stars: ✭ 442 (+3583.33%)
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (+3900%)
Rl a3c pytorch
A3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (+3916.67%)
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (+483.33%)
Deeprl Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Stars: ✭ 748 (+6133.33%)
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+21833.33%)
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (+2133.33%)
Reinforcementlearning Atarigame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm, which is far superior to and more efficient than DQN and obsoletes it. Can play many games.
Stars: ✭ 118 (+883.33%)
jax-rl
JAX implementations of core Deep RL algorithms
Stars: ✭ 61 (+408.33%)
off-policy-continuous-control
[DeepRL Workshop, NeurIPS-21] Recurrent Off-policy Baselines for Memory-based Continuous Control (RDPG, RTD3 and RSAC)
Stars: ✭ 29 (+141.67%)
code summarization public
Source code for 'Improving automatic source code summarization via deep reinforcement learning'
Stars: ✭ 71 (+491.67%)
mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
Stars: ✭ 39 (+225%)
decentralized-rl
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)
Stars: ✭ 40 (+233.33%)
datascience-mashup
In this repo I will try to gather all of the projects related to data science with clean datasets and high-accuracy models to solve real-world problems.
Stars: ✭ 36 (+200%)
Carla-ppo
This repository hosts a customized PPO-based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement-learning-based agents by wrapping Carla in a gym-like environment that can handle custom reward functions, custom debug output, etc.
Stars: ✭ 122 (+916.67%)
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
Stars: ✭ 162 (+1250%)
DeepLearningFlappyFrog
Flappy Frog hack using Deep Reinforcement Learning (Deep Q-learning). Violent toad worship is not advisable.
Stars: ✭ 16 (+33.33%)
AutoPentest-DRL
AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning
Stars: ✭ 196 (+1533.33%)
DDPG
End to End Mobile Robot Navigation using DDPG (Continuous Control with Deep Reinforcement Learning) based on Tensorflow + Gazebo
Stars: ✭ 41 (+241.67%)
alpha sigma
A PyTorch-based Gomoku game model, built on AlphaZero-style reinforcement learning and Monte Carlo Tree Search.
Stars: ✭ 134 (+1016.67%)
mentalRL
Code for our AAMAS 2020 paper: "A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry".
Stars: ✭ 22 (+83.33%)
pytorch-noreward-rl
PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
Stars: ✭ 79 (+558.33%)
minerva
An out-of-the-box GUI tool for offline deep reinforcement learning
Stars: ✭ 80 (+566.67%)
Ramudroid
Ramudroid, an autonomous solar-powered robot to clean roads, with real-time object detection and WebRTC-based streaming
Stars: ✭ 22 (+83.33%)
AI booklet CE-AUT
Booklet and exam for the Artificial Intelligence Master's degree at Amirkabir University of Technology.
Stars: ✭ 14 (+16.67%)
AI
Using deep reinforcement learning to solve visual tracking and visual navigation problems
Stars: ✭ 16 (+33.33%)
Pytorch-PCGrad
PyTorch reimplementation of "Gradient Surgery for Multi-Task Learning"
Stars: ✭ 179 (+1391.67%)
interp-e2e-driving
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Stars: ✭ 159 (+1225%)
alphastone
Using self-play, MCTS, and a deep neural network to create a Hearthstone AI player
Stars: ✭ 24 (+100%)
racing dreamer
Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing
Stars: ✭ 31 (+158.33%)
pokeai
Develop the ultimate AI Pokémon trainer
Stars: ✭ 18 (+50%)
neural-mpc
No description or website provided.
Stars: ✭ 54 (+350%)
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+675%)