Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → MahanFathi → TRPO-TensorFlow

MahanFathi / TRPO-TensorFlow

Licence: other

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Programming Languages

139335 projects - #7 most used programming language

Labels

reinforcement-learning tensorflow policy-gradient trpo trust-region-policy-optimization

Projects that are alternatives of or similar to TRPO-TensorFlow

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Stars: ✭ 658 (+3770.59%)

Mutual labels: policy-gradient, trpo

An elegant PyTorch deep reinforcement learning library.

Stars: ✭ 4,109 (+24070.59%)

Mutual labels: policy-gradient, trpo

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (+1205.88%)

Mutual labels: policy-gradient, trpo

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+3664.71%)

Mutual labels: policy-gradient, trpo

Deeprl algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Stars: ✭ 97 (+470.59%)

Mutual labels: policy-gradient, trpo

Deep Reinforcement Learning With Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Stars: ✭ 1,345 (+7811.76%)

Mutual labels: policy-gradient, trpo

TD-Regularized Actor-Critic Methods

Stars: ✭ 28 (+64.71%)

Mutual labels: policy-gradient, trpo

Reinforcement Learning

Minimal and Clean Reinforcement Learning Examples

Stars: ✭ 2,863 (+16741.18%)

Mutual labels: policy-gradient

Lightweight deep RL Libraray for continuous control.

Stars: ✭ 14 (-17.65%)

Mutual labels: policy-gradient

Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout

Stars: ✭ 202 (+1088.24%)

Mutual labels: policy-gradient

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Stars: ✭ 169 (+894.12%)

Mutual labels: policy-gradient

Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …

Stars: ✭ 63 (+270.59%)

Mutual labels: policy-gradient

A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm

Stars: ✭ 22 (+29.41%)

Mutual labels: policy-gradient

Deep-Reinforcement-Learning-CS285-Pytorch

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Stars: ✭ 104 (+511.76%)

Mutual labels: policy-gradient

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Stars: ✭ 54 (+217.65%)

Mutual labels: policy-gradient

A course on Deep Reinforcement Learning in Computer Vision. Visit Website:

Stars: ✭ 59 (+247.06%)

Mutual labels: policy-gradient

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Stars: ✭ 228 (+1241.18%)

Mutual labels: policy-gradient

Deep Algotrading

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Stars: ✭ 173 (+917.65%)

Mutual labels: policy-gradient

deep rl acrobot

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

Stars: ✭ 32 (+88.24%)

Mutual labels: policy-gradient

Solving board games like Connect4 using Deep Reinforcement Learning

Stars: ✭ 33 (+94.12%)

Mutual labels: policy-gradient

View All Similar Projects ➔

TRPO-TensorFlow

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 17

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗