All Projects → keon → Policy Gradient

keon / Policy Gradient

Licence: mit
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Policy Gradient

Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+2020.74%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+191.85%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (+131.85%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Torchrl
Highly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (-24.44%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+387.41%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+169.63%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Easy Rl
强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+2125.19%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+227.41%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+374.07%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Tensorflow Reinforce
Implementations of Reinforcement Learning Models in Tensorflow
Stars: ✭ 480 (+255.56%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: ✭ 904 (+569.63%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Stars: ✭ 765 (+466.67%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Rl Course Experiments
Stars: ✭ 73 (-45.93%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: ✭ 1,345 (+896.3%)
Mutual labels:  deep-reinforcement-learning, policy-gradient
Deeprl algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Stars: ✭ 97 (-28.15%)
Mutual labels:  deep-reinforcement-learning, policy-gradient
Samsung Drl Code
Repository for codes of Deep Reinforcement Learning (DRL) lectured at Samsung
Stars: ✭ 99 (-26.67%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Awesome Deep Reinforcement Learning
Curated list for Deep Reinforcement Learning (DRL): software frameworks, models, datasets, gyms, baselines...
Stars: ✭ 95 (-29.63%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Reinforcement learning
강화학습에 대한 기본적인 알고리즘 구현
Stars: ✭ 100 (-25.93%)
Mutual labels:  reinforcement-learning, policy-gradient
Reinforcement Learning
🤖 Implements of Reinforcement Learning algorithms.
Stars: ✭ 104 (-22.96%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Hierarchical Actor Critic Hac Pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Stars: ✭ 116 (-14.07%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning

Policy Gradient

Minimal implementation of Stochastic Policy Gradient Algorithm in Keras

Pong Agent

pg

This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.

score

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].