Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (-89.24%)

Mutual labels: policy-gradient, imitation-learning

Deep Reinforcement Learning For Automated Stock Trading Ensemble Strategy Icaif 2020

Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy. ICAIF 2020. Please star.

Stars: ✭ 518 (-87.39%)

Mutual labels: ppo, ddpg

Pantheon

Pantheon of Congestion Control

Stars: ✭ 170 (-95.86%)

Mutual labels: imitation-learning, benchmark

Dqn Flappybird

Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch

Stars: ✭ 37 (-99.1%)

Mutual labels: dqn, rl

Drq

DrQ: Data regularized Q

Stars: ✭ 268 (-93.48%)

Mutual labels: mujoco, rl

Rad

RAD: Reinforcement Learning with Augmented Data

Stars: ✭ 268 (-93.48%)

Mutual labels: ppo, rl

Reinforcement Learning Kr

[파이썬과 케라스로 배우는 강화학습] 예제

Stars: ✭ 282 (-93.14%)

Mutual labels: dqn, policy-gradient

Meta-SAC

Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020

Stars: ✭ 19 (-99.54%)

Mutual labels: sac, mujoco

Openaigym

Solving OpenAI Gym problems.

Stars: ✭ 98 (-97.61%)

Mutual labels: dqn, ddpg

Pytorch Rl

Deep Reinforcement Learning with pytorch & visdom

Stars: ✭ 745 (-81.87%)

Mutual labels: dqn, trpo

Gail ppo tf

Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action

Stars: ✭ 99 (-97.59%)

Mutual labels: ppo, imitation-learning

Reinforcement learning

강화학습에 대한 기본적인 알고리즘 구현

Stars: ✭ 100 (-97.57%)

Mutual labels: dqn, policy-gradient

learning-to-drive-in-5-minutes

Implementation of reinforcement learning approach to make a car learn to drive smoothly in minutes

Stars: ✭ 227 (-94.48%)

Mutual labels: rl, sac

Torchrl

Highly Modular and Scalable Reinforcement Learning

Stars: ✭ 102 (-97.52%)

Mutual labels: dqn, policy-gradient

Reinforcement Learning

🤖 Implements of Reinforcement Learning algorithms.

Stars: ✭ 104 (-97.47%)

Mutual labels: dqn, ddpg

Pytorch Rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Stars: ✭ 121 (-97.06%)

Mutual labels: policy-gradient, rl

Gymfc

A universal flight control tuning framework

Stars: ✭ 210 (-94.89%)

Mutual labels: benchmark, rl

Rl Baselines3 Zoo

A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.

Stars: ✭ 161 (-96.08%)

Mutual labels: rl

Tracerbench

Automated Chrome tracing for benchmarking.

Stars: ✭ 189 (-95.4%)

Mutual labels: benchmark

Uibench

UI Benchmark

Stars: ✭ 163 (-96.03%)

Mutual labels: benchmark

Mjrl

Reinforcement learning algorithms for MuJoCo tasks

Stars: ✭ 162 (-96.06%)

Mutual labels: mujoco

Automlbenchmark

OpenML AutoML Benchmarking Framework

Stars: ✭ 210 (-94.89%)

Mutual labels: benchmark

Hands On Intelligent Agents With Openai Gym

Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch

Stars: ✭ 189 (-95.4%)

Mutual labels: dqn

Are We Fast Yet

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

Stars: ✭ 161 (-96.08%)

Mutual labels: benchmark

D Optimizer

Make Dota 2 fps great again

Stars: ✭ 161 (-96.08%)

Mutual labels: benchmark

Kubestone

Performance benchmarks for Kubernetes

Stars: ✭ 159 (-96.13%)

Mutual labels: benchmark

Blue benchmark

BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.

Stars: ✭ 159 (-96.13%)

Mutual labels: benchmark

Java Object Mapper Benchmark

JMH benchmark of Java object-to-object mapping frameworks

Stars: ✭ 227 (-94.48%)

Mutual labels: benchmark

Alphazero gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Stars: ✭ 2,570 (-37.45%)

Mutual labels: rl

Ann Benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Stars: ✭ 2,658 (-35.31%)

Mutual labels: benchmark

Sv Benchmarks

Collection of Verification Tasks

Stars: ✭ 158 (-96.15%)

Mutual labels: benchmark

Agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Stars: ✭ 2,135 (-48.04%)

Mutual labels: dqn

Jax Rs Performance Comparison

⚡️ Performance Comparison of Jax-RS implementations and embedded containers

Stars: ✭ 181 (-95.6%)

Mutual labels: benchmark

Chineseblue

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

Stars: ✭ 149 (-96.37%)

Mutual labels: benchmark

61-120 of 741 similar projects

‹

›

next*5