All Projects → MahanFathi → TRPO-TensorFlow

MahanFathi / TRPO-TensorFlow

Licence: other
Trust Region Policy Optimization (TRPO) in pure TensorFlow

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to TRPO-TensorFlow

Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+3770.59%)
Mutual labels:  policy-gradient, trpo
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+24070.59%)
Mutual labels:  policy-gradient, trpo
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+1205.88%)
Mutual labels:  policy-gradient, trpo
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+3664.71%)
Mutual labels:  policy-gradient, trpo
Deeprl algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (+470.59%)
Mutual labels:  policy-gradient, trpo
Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+7811.76%)
Mutual labels:  policy-gradient, trpo
td-reg
TD-Regularized Actor-Critic Methods
Stars: ✭ 28 (+64.71%)
Mutual labels:  policy-gradient, trpo
Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+16741.18%)
Mutual labels:  policy-gradient
LWDRLC
Lightweight deep RL Libraray for continuous control.
Stars: ✭ 14 (-17.65%)
Mutual labels:  policy-gradient
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (+1088.24%)
Mutual labels:  policy-gradient
A2c
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (+894.12%)
Mutual labels:  policy-gradient
SharkStock
Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …
Stars: ✭ 63 (+270.59%)
Mutual labels:  policy-gradient
RL
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
Stars: ✭ 22 (+29.41%)
Mutual labels:  policy-gradient
Deep-Reinforcement-Learning-CS285-Pytorch
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Stars: ✭ 104 (+511.76%)
Mutual labels:  policy-gradient
Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Stars: ✭ 54 (+217.65%)
Mutual labels:  policy-gradient
DRL in CV
A course on Deep Reinforcement Learning in Computer Vision. Visit Website:
Stars: ✭ 59 (+247.06%)
Mutual labels:  policy-gradient
HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Stars: ✭ 228 (+1241.18%)
Mutual labels:  policy-gradient
Deep Algotrading
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (+917.65%)
Mutual labels:  policy-gradient
deep rl acrobot
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
Stars: ✭ 32 (+88.24%)
Mutual labels:  policy-gradient
connect4
Solving board games like Connect4 using Deep Reinforcement Learning
Stars: ✭ 33 (+94.12%)
Mutual labels:  policy-gradient

TRPO-TensorFlow

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].