118 open source projects that are alternatives to or similar to td-reg

Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+4703.57%)
Mutual labels:  policy-gradient, trpo, ppo
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+692.86%)
Mutual labels:  policy-gradient, trpo, ppo
Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+2250%)
Mutual labels:  policy-gradient, trpo, ppo
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+14575%)
Mutual labels:  policy-gradient, trpo, ppo
Deeprl algorithms
Easy-to-read DeepRL algorithm implementations in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+246.43%)
Mutual labels:  policy-gradient, trpo, ppo
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+2185.71%)
Mutual labels:  policy-gradient, trpo, ppo
Ppo Pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: ✭ 325 (+1060.71%)
Mutual labels:  policy-gradient, ppo
TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Stars: ✭ 17 (-39.29%)
Mutual labels:  policy-gradient, trpo
Easy Rl
Reinforcement learning tutorial in Chinese; read online at https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+10628.57%)
Mutual labels:  policy-gradient, ppo
Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Stars: ✭ 54 (+92.86%)
Mutual labels:  policy-gradient, ppo
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+3128.57%)
Mutual labels:  policy-gradient, ppo
LWDRLC
Lightweight deep RL library for continuous control.
Stars: ✭ 14 (-50%)
Mutual labels:  policy-gradient, ppo
Run Skeleton Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
Stars: ✭ 83 (+196.43%)
Mutual labels:  trpo, ppo
Reinforcement Learning With Tensorflow
Simple reinforcement learning tutorials, 莫烦Python (Chinese-language AI tutorials)
Stars: ✭ 6,948 (+24714.29%)
Mutual labels:  policy-gradient, ppo
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+1200%)
Mutual labels:  policy-gradient, ppo
Reinforcement learning
Reinforcement learning tutorials
Stars: ✭ 82 (+192.86%)
Mutual labels:  policy-gradient, ppo
Reinforcement Learning
Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
Stars: ✭ 61 (+117.86%)
Mutual labels:  policy-gradient, ppo
Reinforcement Learning Algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress.)
Stars: ✭ 426 (+1421.43%)
Mutual labels:  trpo, ppo
Deeprl Tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (+1039.29%)
Mutual labels:  trpo, ppo
Ros2learn
ROS 2 enabled Machine Learning algorithms
Stars: ✭ 119 (+325%)
Mutual labels:  trpo, ppo
Torchrl
PyTorch implementation of reinforcement learning algorithms (Soft Actor Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)
Stars: ✭ 90 (+221.43%)
Mutual labels:  trpo, ppo
Machine Learning Is All You Need
🔥🌟《Machine Learning 格物志》: basic ML + DL + RL code and notes using sklearn, PyTorch, TensorFlow, Keras and, most importantly, from scratch!💪 This repository is ALL You Need!
Stars: ✭ 173 (+517.86%)
Mutual labels:  trpo, ppo
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+232.14%)
Mutual labels:  policy-gradient, ppo
Torchrl
Highly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (+264.29%)
Mutual labels:  policy-gradient
rpg
Ranking Policy Gradient
Stars: ✭ 22 (-21.43%)
Mutual labels:  policy-gradient
Reinforcement learning
Implementations of basic reinforcement learning algorithms
Stars: ✭ 100 (+257.14%)
Mutual labels:  policy-gradient
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (+39.29%)
Mutual labels:  ppo
deep rl acrobot
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
Stars: ✭ 32 (+14.29%)
Mutual labels:  policy-gradient
trading gym
a unified environment for supervised learning and reinforcement learning in the context of quantitative trading
Stars: ✭ 36 (+28.57%)
Mutual labels:  ppo
RL
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
Stars: ✭ 22 (-21.43%)
Mutual labels:  policy-gradient
Codegan
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Stars: ✭ 73 (+160.71%)
Mutual labels:  policy-gradient
Rl Course Experiments
Stars: ✭ 73 (+160.71%)
Mutual labels:  policy-gradient
Parl Sample
Deep reinforcement learning using Baidu PARL (maze, Flappy Bird, and so on)
Stars: ✭ 37 (+32.14%)
Mutual labels:  policy-gradient
mujoco-benchmark
Provides a full reinforcement learning benchmark on MuJoCo environments, including ddpg, sac, td3, pg, a2c, ppo, library
Stars: ✭ 101 (+260.71%)
Mutual labels:  ppo
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+2632.14%)
Mutual labels:  policy-gradient
yarll
Combining deep learning and reinforcement learning.
Stars: ✭ 84 (+200%)
Mutual labels:  policy-gradient
Rlseq2seq
Deep Reinforcement Learning For Sequence to Sequence Models
Stars: ✭ 683 (+2339.29%)
Mutual labels:  policy-gradient
connect4
Solving board games like Connect4 using Deep Reinforcement Learning
Stars: ✭ 33 (+17.86%)
Mutual labels:  policy-gradient
TAA-PG
Usage of policy gradient reinforcement learning to solve portfolio optimization problems (Tactical Asset Allocation).
Stars: ✭ 26 (-7.14%)
Mutual labels:  policy-gradient
Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+10125%)
Mutual labels:  policy-gradient
Seqgan
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (+1692.86%)
Mutual labels:  policy-gradient
SharkStock
Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …
Stars: ✭ 63 (+125%)
Mutual labels:  policy-gradient
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (+1614.29%)
Mutual labels:  policy-gradient
Deer
DEEp Reinforcement learning framework
Stars: ✭ 455 (+1525%)
Mutual labels:  policy-gradient
Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Ensemble-Strategy-ICAIF-2020
Live Trading. Please star.
Stars: ✭ 1,251 (+4367.86%)
Mutual labels:  ppo
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Stars: ✭ 442 (+1478.57%)
Mutual labels:  policy-gradient
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (+621.43%)
Mutual labels:  policy-gradient
Deep Rl Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Stars: ✭ 395 (+1310.71%)
Mutual labels:  policy-gradient
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+1307.14%)
Mutual labels:  policy-gradient
Deep Algotrading
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+517.86%)
Mutual labels:  policy-gradient
Awesome Monte Carlo Tree Search Papers
A curated list of Monte Carlo tree search papers with implementations.
Stars: ✭ 387 (+1282.14%)
Mutual labels:  policy-gradient
Relational Deep Reinforcement Learning
No description or website provided.
Stars: ✭ 44 (+57.14%)
Mutual labels:  ppo
rl trading
No description or website provided.
Stars: ✭ 14 (-50%)
Mutual labels:  ppo
A2c
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (+503.57%)
Mutual labels:  policy-gradient
Text summurization abstractive methods
Multiple implementations of abstractive text summarization, using Google Colab
Stars: ✭ 359 (+1182.14%)
Mutual labels:  policy-gradient
Rl algorithms
Structural implementation of RL key algorithms
Stars: ✭ 352 (+1157.14%)
Mutual labels:  policy-gradient
Show Adapt And Tell
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (+421.43%)
Mutual labels:  policy-gradient
Trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
Stars: ✭ 343 (+1125%)
Mutual labels:  policy-gradient
reinforcement learning ppo rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
Stars: ✭ 33 (+17.86%)
Mutual labels:  ppo
Policy Gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (+382.14%)
Mutual labels:  policy-gradient
1-60 of 118 similar projects