Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → yukezhu → Tensorflow Reinforce

yukezhu / Tensorflow Reinforce

Licence: mit

Implementations of Reinforcement Learning Models in Tensorflow

Programming Languages

python

139335 projects - #7 most used programming language

Labels

tensorflow reinforcement-learning deep-reinforcement-learning policy-gradient actor-critic deep-q-network

Projects that are alternatives of or similar to Tensorflow Reinforce

Reinforcement Learning

Minimal and Clean Reinforcement Learning Examples

Stars: ✭ 2,863 (+496.46%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient, actor-critic, deep-q-network

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (-7.92%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient, actor-critic

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+33.33%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient, deep-q-network

Pytorch Drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Stars: ✭ 233 (-51.46%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network

Mlds2018spring

Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

Stars: ✭ 124 (-74.17%)

Mutual labels: reinforcement-learning, policy-gradient, actor-critic, deep-q-network

Deeprl Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Stars: ✭ 748 (+55.83%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Stars: ✭ 6,948 (+1347.5%)

Mutual labels: reinforcement-learning, policy-gradient, actor-critic, deep-q-network

rl implementations

No description or website provided.

Stars: ✭ 40 (-91.67%)

Mutual labels: deep-reinforcement-learning, policy-gradient, deep-q-network, actor-critic

Rl Course Experiments

Stars: ✭ 73 (-84.79%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient, deep-q-network

Openai lab

An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

Stars: ✭ 313 (-34.79%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient, actor-critic

Pytorch sac

PyTorch implementation of Soft Actor-Critic (SAC)

Stars: ✭ 174 (-63.75%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, actor-critic

Reinforcementlearning.jl

A reinforcement learning package for Julia

Stars: ✭ 192 (-60%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, deep-q-network

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+448.33%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, actor-critic

A2c

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Stars: ✭ 169 (-64.79%)

Mutual labels: reinforcement-learning, policy-gradient, actor-critic

2048 Deep Reinforcement Learning

Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning

Stars: ✭ 169 (-64.79%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, deep-q-network

Accel Brain Code

The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.

Stars: ✭ 166 (-65.42%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, deep-q-network

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (-53.75%)

Mutual labels: deep-reinforcement-learning, policy-gradient, actor-critic

Curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Stars: ✭ 346 (-27.92%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, deep-q-network

Lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (-24.17%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient

Pytorch Rl

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Stars: ✭ 394 (-17.92%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning, policy-gradient

View All Similar Projects ➔

Tensorflow-Reinforce

A collection of Tensorflow implementations of reinforcement learning models. Models are evaluated in OpenAI Gym environments. Any contribution/feedback is more than welcome. Disclaimer: These implementations are used for educational purposes only (i.e., to learn deep RL myself). There is no guarantee that the exact models will work on any of your particular RL problems without changes.

Environments

This codebase works in both Python 2.7 and 3.5. The models are implemented in Tensorflow 1.0.

Models

Model	Code	References
Cross-Entropy Method	run_cem_cartpole	Cross-entropy method
Tabular Q Learning	rl/tabular_q_learner	Sutton and Barto, Chapter 8
Deep Q Network	rl/neural_q_learner	Mnih et al.
Double Deep Q Network	rl/neural_q_learner	van Hasselt et al.
REINFORCE Policy Gradient	rl/pg_reinforce	Sutton et al.
Actor-critic Policy Gradient	rl/pg_actor_critic	Minh et al.
Deep Deterministic Policy Gradient	rl/pg_ddpg	Lillicrap et al.

License

MIT

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 480

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (6) 🔗