ikostrikov / Pytorch Trpo

License: MIT

PyTorch implementation of Trust Region Policy Optimization

Programming Languages

python

139335 projects - #7 most used programming language

Labels

deep-learning pytorch reinforcement-learning deep-reinforcement-learning mujoco trpo

Projects that are alternatives to or similar to Pytorch Trpo

Mushroom Rl

Python library for Reinforcement Learning.

Stars: ✭ 442 (+45.87%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco, trpo

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (+20.13%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco

Deeprl Tensorflow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Stars: ✭ 319 (+5.28%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, trpo

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+111.22%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, trpo

Deeprl algorithms

Easy-to-read implementations of DeepRL algorithms in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Stars: ✭ 97 (-67.99%)

Mutual labels: deep-reinforcement-learning, mujoco, trpo

Pytorch sac ae

PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE)

Stars: ✭ 94 (-68.98%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (+30.03%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco

Pytorch Rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Stars: ✭ 658 (+117.16%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, trpo

Torchrl

PyTorch implementation of reinforcement learning algorithms (Soft Actor-Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)

Stars: ✭ 90 (-70.3%)

Mutual labels: reinforcement-learning, mujoco, trpo

Pytorch Rl

Deep Reinforcement Learning with pytorch & visdom

Stars: ✭ 745 (+145.87%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, trpo

Drq

DrQ: Data regularized Q

Stars: ✭ 268 (-11.55%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco

Pytorch sac

PyTorch implementation of Soft Actor-Critic (SAC)

Stars: ✭ 174 (-42.57%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+768.65%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, mujoco

Roboleague

A car soccer environment inspired by Rocket League for deep reinforcement learning experiments in an adversarial self-play setting.

Stars: ✭ 236 (-22.11%)