All Projects → Tianshou → Similar Projects or Alternatives

741 Open source projects that are alternatives of or similar to Tianshou

DQN-Atari
Deep Q-Learning (DQN) implementation for Atari pong.
Stars: ✭ 53 (-98.71%)
Mutual labels:  dqn, atari
pytorch-rl
Pytorch Implementation of RL algorithms
Stars: ✭ 15 (-99.63%)
Mutual labels:  dqn, ddpg
connect4
Solving board games like Connect4 using Deep Reinforcement Learning
Stars: ✭ 33 (-99.2%)
Mutual labels:  policy-gradient, double-dqn
Deep-Reinforcement-Learning-CS285-Pytorch
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Stars: ✭ 104 (-97.47%)
Mutual labels:  policy-gradient, mujoco
Gail Tf
Tensorflow implementation of generative adversarial imitation learning
Stars: ✭ 179 (-95.64%)
Mutual labels:  imitation-learning, trpo
breakout-Deep-Q-Network
Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout
Stars: ✭ 69 (-98.32%)
Mutual labels:  dqn, atari
TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Stars: ✭ 17 (-99.59%)
Mutual labels:  policy-gradient, trpo
SRLF
Simple Reinforcement Learning Framework
Stars: ✭ 24 (-99.42%)
Mutual labels:  ddpg, trpo
dqn-pytorch
DQN to play Atari Pong
Stars: ✭ 77 (-98.13%)
Mutual labels:  dqn, atari
xingtian
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Stars: ✭ 229 (-94.43%)
Mutual labels:  dqn, ppo
Learning To Communicate Pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Stars: ✭ 236 (-94.26%)
Mutual labels:  dqn, rl
Tensorflow Rl
Implementations of deep RL papers and random experimentation
Stars: ✭ 176 (-95.72%)
Mutual labels:  dqn, trpo
Rlgraph
RLgraph: Modular computation graphs for deep reinforcement learning
Stars: ✭ 272 (-93.38%)
Mutual labels:  dqn, ppo
logrl
Logarithmic Reinforcement Learning
Stars: ✭ 25 (-99.39%)
Mutual labels:  dqn, rl
Deep reinforcement learning course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Stars: ✭ 3,232 (-21.34%)
Mutual labels:  ppo, a2c
Atari
AI research environment for the Atari 2600 games 🤖.
Stars: ✭ 174 (-95.77%)
Mutual labels:  dqn, rl
DQN-pytorch
A PyTorch implementation of Human-Level Control through Deep Reinforcement Learning
Stars: ✭ 23 (-99.44%)
Mutual labels:  dqn, atari
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-92.38%)
Mutual labels:  policy-gradient, ddpg
Pytorch Trpo
PyTorch implementation of Trust Region Policy Optimization
Stars: ✭ 303 (-92.63%)
Mutual labels:  mujoco, trpo
Ppo Pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: ✭ 325 (-92.09%)
Mutual labels:  ppo, policy-gradient
trading gym
a unified environment for supervised learning and reinforcement learning in the context of quantitative trading
Stars: ✭ 36 (-99.12%)
Mutual labels:  ddpg, ppo
playing-mario-with-deep-reinforcement-learning
An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.
Stars: ✭ 55 (-98.66%)
Mutual labels:  dqn, double-dqn
Rl algorithms
Structural implementation of RL key algorithms
Stars: ✭ 352 (-91.43%)
Mutual labels:  dqn, policy-gradient
Trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
Stars: ✭ 343 (-91.65%)
Mutual labels:  policy-gradient, mujoco
Awesome Monte Carlo Tree Search Papers
A curated list of Monte Carlo tree search papers with implementations.
Stars: ✭ 387 (-90.58%)
Mutual labels:  policy-gradient, rl
Irl Imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Stars: ✭ 333 (-91.9%)
Mutual labels:  imitation-learning, rl
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-89.24%)
Deep Reinforcement Learning For Automated Stock Trading Ensemble Strategy Icaif 2020
Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy. ICAIF 2020. Please star.
Stars: ✭ 518 (-87.39%)
Mutual labels:  ppo, ddpg
Pantheon
Pantheon of Congestion Control
Stars: ✭ 170 (-95.86%)
Mutual labels:  imitation-learning, benchmark
Dqn Flappybird
Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch
Stars: ✭ 37 (-99.1%)
Mutual labels:  dqn, rl
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-93.48%)
Mutual labels:  mujoco, rl
Rad
RAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-93.48%)
Mutual labels:  ppo, rl
Reinforcement Learning Kr
[파이썬과 케라스로 배우는 강화학습] 예제
Stars: ✭ 282 (-93.14%)
Mutual labels:  dqn, policy-gradient
Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
Stars: ✭ 19 (-99.54%)
Mutual labels:  sac, mujoco
Openaigym
Solving OpenAI Gym problems.
Stars: ✭ 98 (-97.61%)
Mutual labels:  dqn, ddpg
Pytorch Rl
Deep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (-81.87%)
Mutual labels:  dqn, trpo
Gail ppo tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
Stars: ✭ 99 (-97.59%)
Mutual labels:  ppo, imitation-learning
Reinforcement learning
강화학습에 대한 기본적인 알고리즘 구현
Stars: ✭ 100 (-97.57%)
Mutual labels:  dqn, policy-gradient
learning-to-drive-in-5-minutes
Implementation of reinforcement learning approach to make a car learn to drive smoothly in minutes
Stars: ✭ 227 (-94.48%)
Mutual labels:  rl, sac
Torchrl
Highly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (-97.52%)
Mutual labels:  dqn, policy-gradient
Reinforcement Learning
🤖 Implements of Reinforcement Learning algorithms.
Stars: ✭ 104 (-97.47%)
Mutual labels:  dqn, ddpg
Pytorch Rl
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (-97.06%)
Mutual labels:  policy-gradient, rl
Gymfc
A universal flight control tuning framework
Stars: ✭ 210 (-94.89%)
Mutual labels:  benchmark, rl
Rl Baselines3 Zoo
A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
Stars: ✭ 161 (-96.08%)
Mutual labels:  rl
Tracerbench
Automated Chrome tracing for benchmarking.
Stars: ✭ 189 (-95.4%)
Mutual labels:  benchmark
Uibench
UI Benchmark
Stars: ✭ 163 (-96.03%)
Mutual labels:  benchmark
Mjrl
Reinforcement learning algorithms for MuJoCo tasks
Stars: ✭ 162 (-96.06%)
Mutual labels:  mujoco
Automlbenchmark
OpenML AutoML Benchmarking Framework
Stars: ✭ 210 (-94.89%)
Mutual labels:  benchmark
Hands On Intelligent Agents With Openai Gym
Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch
Stars: ✭ 189 (-95.4%)
Mutual labels:  dqn
Are We Fast Yet
Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays
Stars: ✭ 161 (-96.08%)
Mutual labels:  benchmark
D Optimizer
Make Dota 2 fps great again
Stars: ✭ 161 (-96.08%)
Mutual labels:  benchmark
Kubestone
Performance benchmarks for Kubernetes
Stars: ✭ 159 (-96.13%)
Mutual labels:  benchmark
Blue benchmark
BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.
Stars: ✭ 159 (-96.13%)
Mutual labels:  benchmark
Java Object Mapper Benchmark
JMH benchmark of Java object-to-object mapping frameworks
Stars: ✭ 227 (-94.48%)
Mutual labels:  benchmark
Alphazero gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (-37.45%)
Mutual labels:  rl
Ann Benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
Stars: ✭ 2,658 (-35.31%)
Mutual labels:  benchmark
Sv Benchmarks
Collection of Verification Tasks
Stars: ✭ 158 (-96.15%)
Mutual labels:  benchmark
Agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Stars: ✭ 2,135 (-48.04%)
Mutual labels:  dqn
Jax Rs Performance Comparison
⚡️ Performance Comparison of Jax-RS implementations and embedded containers
Stars: ✭ 181 (-95.6%)
Mutual labels:  benchmark
Chineseblue
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Stars: ✭ 149 (-96.37%)
Mutual labels:  benchmark
61-120 of 741 similar projects