108 open source projects that are alternatives to or similar to rpg

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.

Stars: ✭ 442 (+1909.09%)

Mutual labels: policy-gradient, imitation-learning

imitation learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Stars: ✭ 93 (+322.73%)

Mutual labels: policy-gradient, imitation-learning

Tianshou

An elegant PyTorch deep reinforcement learning library.

Stars: ✭ 4,109 (+18577.27%)

Mutual labels: policy-gradient, imitation-learning

Easy Rl

A Chinese-language reinforcement learning tutorial. Read online at https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+13554.55%)

Mutual labels: policy-gradient, imitation-learning

Reinforcement learning

Reinforcement learning tutorials

Stars: ✭ 82 (+272.73%)

Mutual labels: policy-gradient

Awesome Monte Carlo Tree Search Papers

A curated list of Monte Carlo tree search papers with implementations.

Stars: ✭ 387 (+1659.09%)

Mutual labels: policy-gradient

Trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

Stars: ✭ 343 (+1459.09%)

Mutual labels: policy-gradient

deep trading

This project aims to select a supervised algorithm that can predict stock prices based on historical data and to use the resulting predictor to form trading strategies.

Stars: ✭ 18 (-18.18%)

Mutual labels: policy-gradient

Policy Gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

Stars: ✭ 135 (+513.64%)

Mutual labels: policy-gradient

Slm Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Stars: ✭ 904 (+4009.09%)

Mutual labels: policy-gradient

SeqGAN-PyTorch

Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch

Stars: ✭ 40 (+81.82%)

Mutual labels: policy-gradient

Deep Rl Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Stars: ✭ 395 (+1695.45%)

Mutual labels: policy-gradient

Deep Reinforcement Learning With Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3, and more.

Stars: ✭ 1,345 (+6013.64%)

Mutual labels: policy-gradient

Text summurization abstractive methods

Multiple implementations of abstractive text summarization, using Google Colab

Stars: ✭ 359 (+1531.82%)

Mutual labels: policy-gradient

A2c

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Stars: ✭ 169 (+668.18%)

Mutual labels: policy-gradient

Openai lab

An experimentation framework for Reinforcement Learning using OpenAI Gym, TensorFlow, and Keras.

Stars: ✭ 313 (+1322.73%)

Mutual labels: policy-gradient

Rl Course Experiments

Stars: ✭ 73 (+231.82%)

Mutual labels: policy-gradient

ADL2019

Applied Deep Learning (2019 Spring) @ NTU

Stars: ✭ 20 (-9.09%)

Mutual labels: policy-gradient

Reinforcement Learning

Minimal and Clean Reinforcement Learning Examples

Stars: ✭ 2,863 (+12913.64%)

Mutual labels: policy-gradient

Reinforcement Learning With Tensorflow

Simple reinforcement learning tutorials, from 莫烦Python (Chinese-language AI teaching)

Stars: ✭ 6,948 (+31481.82%)

Mutual labels: policy-gradient

Paddle-RLBooks

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Stars: ✭ 113 (+413.64%)

Mutual labels: policy-gradient

td-reg

TD-Regularized Actor-Critic Methods

Stars: ✭ 28 (+27.27%)

Mutual labels: policy-gradient

Reinforcement Learning

Deep Reinforcement Learning Algorithms implemented with TensorFlow 2.3

Stars: ✭ 61 (+177.27%)

Mutual labels: policy-gradient

Pytorch Rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Stars: ✭ 121 (+450%)

Mutual labels: policy-gradient

Pytorch Rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Stars: ✭ 658 (+2890.91%)

Mutual labels: policy-gradient

Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Stars: ✭ 54 (+145.45%)

Mutual labels: policy-gradient

connect4

Solving board games like Connect4 using Deep Reinforcement Learning

Stars: ✭ 33 (+50%)

Mutual labels: policy-gradient

Reinforcement learning

Implementations of basic reinforcement learning algorithms

Stars: ✭ 100 (+354.55%)

Mutual labels: policy-gradient

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in PyTorch

Stars: ✭ 394 (+1690.91%)

Mutual labels: policy-gradient

Deep Algotrading

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Stars: ✭ 173 (+686.36%)

Mutual labels: policy-gradient

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (+1554.55%)

Mutual labels: policy-gradient

Deeprl algorithms

Easy-to-read implementations of deep RL algorithms in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Stars: ✭ 97 (+340.91%)

Mutual labels: policy-gradient

Rl algorithms

Structured implementations of key RL algorithms

Stars: ✭ 352 (+1500%)

Mutual labels: policy-gradient

SharkStock

Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …

Stars: ✭ 63 (+186.36%)

Mutual labels: policy-gradient

Ppo Pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Stars: ✭ 325 (+1377.27%)

Mutual labels: policy-gradient

Codegan

[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks

Stars: ✭ 73 (+231.82%)

Mutual labels: policy-gradient

Reinforcement Learning Kr

Examples for the book "Reinforcement Learning with Python and Keras" (파이썬과 케라스로 배우는 강화학습)

Stars: ✭ 282 (+1181.82%)

Mutual labels: policy-gradient

Show Adapt And Tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Stars: ✭ 146 (+563.64%)

Mutual labels: policy-gradient

policy-gradient-pong

TensorFlow implementation of Andrej Karpathy's blog post about reinforcement learning: http://karpathy.github.io/2016/05/31/rl/

Stars: ✭ 29 (+31.82%)

Mutual labels: policy-gradient

Parl Sample

Deep reinforcement learning using Baidu PARL (maze, Flappy Bird, and so on)

Stars: ✭ 37 (+68.18%)

Mutual labels: policy-gradient

rl implementations

No description or website provided.

Stars: ✭ 40 (+81.82%)

Mutual labels: policy-gradient

Pontryagin-Differentiable-Programming

A unified end-to-end learning and control framework that is able to learn a (neural) control objective function, dynamics equation, control policy, and/or optimal trajectory in a control system.

Stars: ✭ 111 (+404.55%)

Mutual labels: imitation-learning

Deep-rl-mxnet

MXNet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO

Stars: ✭ 26 (+18.18%)

Mutual labels: policy-gradient

Btgym

Scalable, event-driven, deep-learning-friendly backtesting library

Stars: ✭ 765 (+3377.27%)

Mutual labels: policy-gradient

TRPO-TensorFlow

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Stars: ✭ 17 (-22.73%)

Mutual labels: policy-gradient

Mlds2018spring

Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

Stars: ✭ 124 (+463.64%)

Mutual labels: policy-gradient

Deep-Reinforcement-Learning-CS285-Pytorch

Solutions to the assignments of the Deep Reinforcement Learning course (CS285) at the University of California, Berkeley, implemented in PyTorch

Stars: ✭ 104 (+372.73%)

Mutual labels: policy-gradient

Rlseq2seq

Deep Reinforcement Learning For Sequence to Sequence Models

Stars: ✭ 683 (+3004.55%)

Mutual labels: policy-gradient

HandyRL

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Stars: ✭ 228 (+936.36%)

Mutual labels: policy-gradient

Seqgan

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

Stars: ✭ 502 (+2181.82%)

Mutual labels: policy-gradient

LWDRLC

Lightweight deep RL library for continuous control.

Stars: ✭ 14 (-36.36%)

Mutual labels: policy-gradient

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow