All Projects → yukezhu → Tensorflow Reinforce

yukezhu / Tensorflow Reinforce

Licence: mit
Implementations of Reinforcement Learning Models in Tensorflow

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Tensorflow Reinforce

Reinforcement Learning
Minimal and Clean Reinforcement Learning Examples
Stars: ✭ 2,863 (+496.46%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient, actor-critic, deep-q-network
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-7.92%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient, actor-critic
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+33.33%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient, deep-q-network
Pytorch Drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stars: ✭ 233 (-51.46%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network
Mlds2018spring
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (-74.17%)
Mutual labels:  reinforcement-learning, policy-gradient, actor-critic, deep-q-network
Deeprl Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Stars: ✭ 748 (+55.83%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, actor-critic, deep-q-network
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+1347.5%)
Mutual labels:  reinforcement-learning, policy-gradient, actor-critic, deep-q-network
rl implementations
No description or website provided.
Stars: ✭ 40 (-91.67%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, deep-q-network, actor-critic
Rl Course Experiments
Stars: ✭ 73 (-84.79%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient, deep-q-network
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-34.79%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient, actor-critic
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (-63.75%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, actor-critic
Reinforcementlearning.jl
A reinforcement learning package for Julia
Stars: ✭ 192 (-60%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+448.33%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, actor-critic
A2c
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (-64.79%)
Mutual labels:  reinforcement-learning, policy-gradient, actor-critic
2048 Deep Reinforcement Learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
Stars: ✭ 169 (-64.79%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Accel Brain Code
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
Stars: ✭ 166 (-65.42%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (-53.75%)
Mutual labels:  deep-reinforcement-learning, policy-gradient, actor-critic
Curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Stars: ✭ 346 (-27.92%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, deep-q-network
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-24.17%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient
Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (-17.92%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, policy-gradient

Tensorflow-Reinforce

A collection of Tensorflow implementations of reinforcement learning models. Models are evaluated in OpenAI Gym environments. Any contribution/feedback is more than welcome. Disclaimer: These implementations are used for educational purposes only (i.e., to learn deep RL myself). There is no guarantee that the exact models will work on any of your particular RL problems without changes.

Environments

This codebase works in both Python 2.7 and 3.5. The models are implemented in Tensorflow 1.0.

Models

Model Code References
Cross-Entropy Method run_cem_cartpole Cross-entropy method
Tabular Q Learning rl/tabular_q_learner Sutton and Barto, Chapter 8
Deep Q Network rl/neural_q_learner Mnih et al.
Double Deep Q Network rl/neural_q_learner van Hasselt et al.
REINFORCE Policy Gradient rl/pg_reinforce Sutton et al.
Actor-critic Policy Gradient rl/pg_actor_critic Minh et al.
Deep Deterministic Policy Gradient rl/pg_ddpg Lillicrap et al.

License

MIT

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].