DQN-AtariDeep Q-Learning (DQN) implementation for Atari pong.
Stars: ✭ 53 (-98.71%)
pytorch-rlPytorch Implementation of RL algorithms
Stars: ✭ 15 (-99.63%)
connect4Solving board games like Connect4 using Deep Reinforcement Learning
Stars: ✭ 33 (-99.2%)
Deep-Reinforcement-Learning-CS285-PytorchSolutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Stars: ✭ 104 (-97.47%)
Gail TfTensorflow implementation of generative adversarial imitation learning
Stars: ✭ 179 (-95.64%)
breakout-Deep-Q-NetworkReinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout
Stars: ✭ 69 (-98.32%)
TRPO-TensorFlowTrust Region Policy Optimization (TRPO) in pure TensorFlow
Stars: ✭ 17 (-99.59%)
SRLFSimple Reinforcement Learning Framework
Stars: ✭ 24 (-99.42%)
dqn-pytorchDQN to play Atari Pong
Stars: ✭ 77 (-98.13%)
xingtianxingtian is a componentized library for the development and verification of reinforcement learning algorithms
Stars: ✭ 229 (-94.43%)
Tensorflow RlImplementations of deep RL papers and random experimentation
Stars: ✭ 176 (-95.72%)
RlgraphRLgraph: Modular computation graphs for deep reinforcement learning
Stars: ✭ 272 (-93.38%)
logrlLogarithmic Reinforcement Learning
Stars: ✭ 25 (-99.39%)
AtariAI research environment for the Atari 2600 games 🤖.
Stars: ✭ 174 (-95.77%)
DQN-pytorchA PyTorch implementation of Human-Level Control through Deep Reinforcement Learning
Stars: ✭ 23 (-99.44%)
Openai labAn experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-92.38%)
Pytorch TrpoPyTorch implementation of Trust Region Policy Optimization
Stars: ✭ 303 (-92.63%)
Ppo PytorchMinimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: ✭ 325 (-92.09%)
trading gyma unified environment for supervised learning and reinforcement learning in the context of quantitative trading
Stars: ✭ 36 (-99.12%)
Rl algorithmsStructural implementation of RL key algorithms
Stars: ✭ 352 (-91.43%)
TrpoTrust Region Policy Optimization with TensorFlow and OpenAI Gym
Stars: ✭ 343 (-91.65%)
Irl ImitationImplementation of Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Stars: ✭ 333 (-91.9%)
Reinforcement learning tutorial with demoReinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-89.24%)
PantheonPantheon of Congestion Control
Stars: ✭ 170 (-95.86%)
Dqn FlappybirdPlay flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch
Stars: ✭ 37 (-99.1%)
DrqDrQ: Data regularized Q
Stars: ✭ 268 (-93.48%)
RadRAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-93.48%)
Meta-SACAuto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
Stars: ✭ 19 (-99.54%)
OpenaigymSolving OpenAI Gym problems.
Stars: ✭ 98 (-97.61%)
Pytorch RlDeep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (-81.87%)
Gail ppo tfTensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
Stars: ✭ 99 (-97.59%)
learning-to-drive-in-5-minutesImplementation of reinforcement learning approach to make a car learn to drive smoothly in minutes
Stars: ✭ 227 (-94.48%)
TorchrlHighly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (-97.52%)
Pytorch RlTutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (-97.06%)
GymfcA universal flight control tuning framework
Stars: ✭ 210 (-94.89%)
Rl Baselines3 ZooA collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
Stars: ✭ 161 (-96.08%)
TracerbenchAutomated Chrome tracing for benchmarking.
Stars: ✭ 189 (-95.4%)
UibenchUI Benchmark
Stars: ✭ 163 (-96.03%)
MjrlReinforcement learning algorithms for MuJoCo tasks
Stars: ✭ 162 (-96.06%)
Are We Fast YetAre We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays
Stars: ✭ 161 (-96.08%)
D OptimizerMake Dota 2 fps great again
Stars: ✭ 161 (-96.08%)
KubestonePerformance benchmarks for Kubernetes
Stars: ✭ 159 (-96.13%)
Blue benchmarkBLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.
Stars: ✭ 159 (-96.13%)
Alphazero gomokuAn implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (-37.45%)
Ann BenchmarksBenchmarks of approximate nearest neighbor libraries in Python
Stars: ✭ 2,658 (-35.31%)
Sv BenchmarksCollection of Verification Tasks
Stars: ✭ 158 (-96.15%)
AgentsTF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Stars: ✭ 2,135 (-48.04%)
ChineseblueChinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Stars: ✭ 149 (-96.37%)