Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → awjuliani → Deeprl Agents

awjuliani / Deeprl Agents

Licence: mit

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

Programming Languages

Jupyter Notebook

11667 projects

139335 projects - #7 most used programming language

Labels

tensorflow reinforcement-learning

Projects that are alternatives of or similar to Deeprl Agents

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

Stars: ✭ 118 (-94.51%)

Mutual labels: jupyter-notebook, reinforcement-learning

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Stars: ✭ 126 (-94.14%)

Mutual labels: jupyter-notebook, reinforcement-learning

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Stars: ✭ 121 (-94.37%)

Mutual labels: jupyter-notebook, reinforcement-learning

Tensorflow2.0 Examples

🙄 Difficult algorithm, Simple code.

Stars: ✭ 1,397 (-34.99%)

Mutual labels: jupyter-notebook, reinforcement-learning

PyTorch implementation of Soft Actor-Critic (SAC)

Stars: ✭ 174 (-91.9%)

Mutual labels: jupyter-notebook, reinforcement-learning

Ctc Executioner

Master Thesis: Limit order placement with Reinforcement Learning

Stars: ✭ 112 (-94.79%)

Mutual labels: jupyter-notebook, reinforcement-learning

Teach a Quadcopter How to Fly!

Stars: ✭ 124 (-94.23%)

Mutual labels: jupyter-notebook, reinforcement-learning

60 days rl challenge

60_Days_RL_Challenge中文版

Stars: ✭ 92 (-95.72%)

Mutual labels: jupyter-notebook, reinforcement-learning

2048 Deep Reinforcement Learning

Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning

Stars: ✭ 169 (-92.14%)

Mutual labels: jupyter-notebook, reinforcement-learning

Chess Alpha Zero

Chess reinforcement learning by AlphaGo Zero methods.

Stars: ✭ 1,868 (-13.08%)

Mutual labels: jupyter-notebook, reinforcement-learning

Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]

Stars: ✭ 97 (-95.49%)

Mutual labels: jupyter-notebook, reinforcement-learning

Deep Algotrading

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Stars: ✭ 173 (-91.95%)

Mutual labels: jupyter-notebook, reinforcement-learning

Learning human driver models from NGSIM data with imitation learning.

Stars: ✭ 96 (-95.53%)

Mutual labels: jupyter-notebook, reinforcement-learning

Coursera reinforcement learning

Coursera Reinforcement Learning Specialization by University of Alberta & Alberta Machine Intelligence Institute

Stars: ✭ 114 (-94.7%)

Mutual labels: jupyter-notebook, reinforcement-learning

Rl Movie Recommender

The purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.

Stars: ✭ 93 (-95.67%)

Mutual labels: jupyter-notebook, reinforcement-learning

Advanced Deep Learning And Reinforcement Learning Deepmind

🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉

Stars: ✭ 121 (-94.37%)

Mutual labels: jupyter-notebook, reinforcement-learning

Tensorflow Tutorials

TensorFlow Tutorials with YouTube Videos

Stars: ✭ 8,919 (+315.03%)

Mutual labels: jupyter-notebook, reinforcement-learning

MAGNet: Multi-agents control using Graph Neural Networks

Stars: ✭ 88 (-95.91%)

Mutual labels: jupyter-notebook, reinforcement-learning

Data Science Question Answer

A repo for data science related questions and answers

Stars: ✭ 2,000 (-6.93%)

Mutual labels: jupyter-notebook, reinforcement-learning

Machine Learning And Reinforcement Learning In Finance

Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering

Stars: ✭ 173 (-91.95%)

Mutual labels: jupyter-notebook, reinforcement-learning

View All Similar Projects ➔

Deep Reinforcement Learning Agents

This repository contains a collection of reinforcement learning algorithms written in Tensorflow. The ipython notebook here were written to go along with a still-underway tutorial series I have been publishing on Medium. If you are new to reinforcement learning, I recommend reading the accompanying post for each algorithm.

The repository currently contains the following algorithms:

Q-Table - An implementation of Q-learning using tables to solve a stochastic environment problem.
Q-Network - A neural network implementation of Q-Learning to solve the same environment as in Q-Table.
Simple-Policy - An implementation of policy gradient method for stateless environments such as n-armed bandit problems.
Contextual-Policy - An implementation of policy gradient method for stateful environments such as contextual bandit problems.
Policy-Network - An implementation of a neural network policy-gradient agent that solves full RL problems with states and delayed rewards, and two opposite actions (ie. CartPole or Pong).
Vanilla-Policy - An implementation of a neural network vanilla-policy-gradient agent that solves full RL problems with states, delayed rewards, and an arbitrary number of actions.
Model-Network - An addition to the Policy-Network algorithm which includes a separate network which models the environment dynamics.
Double-Dueling-DQN - An implementation of a Deep-Q Network with the Double DQN and Dueling DQN additions to improve stability and performance.
Deep-Recurrent-Q-Network - An implementation of a Deep Recurrent Q-Network which can solve reinforcement learning problems involving partial observability.
Q-Exploration - An implementation of DQN containing multiple action-selection strategies for exploration. Strategies include: greedy, random, e-greedy, Boltzmann, and Bayesian Dropout.
A3C-Doom - An implementation of Asynchronous Advantage Actor-Critic (A3C) algorithm. It utilizes multiple agents to collectively improve a policy. This implementation can solve RL problems in 3D environments such as VizDoom challenges.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 2,149

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (42) 🔗