All Projects → pekaalto → DQN

pekaalto / DQN

Licence: other
Deep-Q-Network reinforcement learning algorithm applied to a simple 2d-car-racing environment

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to DQN

Wger
Self hosted FLOSS fitness/workout, nutrition and weight tracker written with Django
Stars: ✭ 1,372 (+3166.67%)
Mutual labels:  gym
Gym Sokoban
Sokoban environment for OpenAI Gym
Stars: ✭ 186 (+342.86%)
Mutual labels:  gym
multi car racing
An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.
Stars: ✭ 58 (+38.1%)
Mutual labels:  gym
Drqn Tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
Stars: ✭ 127 (+202.38%)
Mutual labels:  gym
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (+314.29%)
Mutual labels:  gym
Gym Unrealcv
Unreal environments for reinforcement learning
Stars: ✭ 202 (+380.95%)
Mutual labels:  gym
Torchrl
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Stars: ✭ 90 (+114.29%)
Mutual labels:  gym
omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
Stars: ✭ 43 (+2.38%)
Mutual labels:  gym
Atari
AI research environment for the Atari 2600 games 🤖.
Stars: ✭ 174 (+314.29%)
Mutual labels:  gym
Rls
Reinforcement Learning Algorithms Based on TensorFlow 2.x
Stars: ✭ 239 (+469.05%)
Mutual labels:  gym
Sumo Rl
A simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control. Compatible with Gym Env from OpenAI and MultiAgentEnv from RLlib.
Stars: ✭ 145 (+245.24%)
Mutual labels:  gym
A2c
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (+302.38%)
Mutual labels:  gym
Pytorch Reinforce
PyTorch Implementation of REINFORCE for both discrete & continuous control
Stars: ✭ 212 (+404.76%)
Mutual labels:  gym
Stable Baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (+173.81%)
Mutual labels:  gym
king-pong
Deep Reinforcement Learning Pong Agent, King Pong, he's the best
Stars: ✭ 23 (-45.24%)
Mutual labels:  deep-q-network
Pytorch sac ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
Stars: ✭ 94 (+123.81%)
Mutual labels:  gym
Naf Tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
Stars: ✭ 192 (+357.14%)
Mutual labels:  gym
good robot
"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer; and “Good Robot!”: Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
Stars: ✭ 84 (+100%)
Mutual labels:  deep-q-network
DQN-using-PyTorch-and-ML-Agents
A simple example of how to implement vector based DQN using PyTorch and a ML-Agents environment
Stars: ✭ 81 (+92.86%)
Mutual labels:  deep-q-network
Ma Gym
A collection of multi agent environments based on OpenAI gym.
Stars: ✭ 226 (+438.1%)
Mutual labels:  gym

Info

This project implements the DQN reinforcement learning agent similar to Human-level control through deep reinforcement learning

(See also David Silvers RL course lecture 6. This stuff is clearly and shortly explained in 1h15min onwards)

The agent is applied to the Open AI gym's 2d-car-racing environment

The structure of the q-network differs from the original paper. In particular, the network here is much smaller and can be easily trained without GPU. (It's easy to specify any other structures as well)

The agent learns to drive the car from pixels in a few hours and doesn't need any hand-crafted features. There are some minor environment specific tweaks for the car-racing but the base-agent doesn't know about car-racing.

Pre-trained agent

The checkpoint provided in the repo used the default parameters specified in the runner/agent and 150000~ playing steps for learning.

The training took about 5h with CPU. This agent is playing in the above gif and in this video: https://youtu.be/CVZQOAlQib0

The agent sometimes cuts corners but other than that it can drive flawlessly for minutes. There are some occasional mistakes though.

Running instuctions

Just clone the repo and use car_runner_main.py . The settings are specified in the beginning of the runner.

You can either train from scratch or load the existing checkpoint from this repo and see the agent driving somewhat properly right away. Or you can train the provided checkpoint more.

Requirements

  • Python 3.5 (will not work with python 2)
  • OpenAI Gym (the car-racing environment)
  • Tensorflow 1.0.0
  • numpy
  • scikit-image
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].