Alternatives and detailed information of DQN

pekaalto / DQN

Licence: other

Deep-Q-Network reinforcement learning algorithm applied to a simple 2d-car-racing environment

Programming Languages

python


Projects that are alternatives of or similar to DQN

Wger

Self hosted FLOSS fitness/workout, nutrition and weight tracker written with Django

Stars: ✭ 1,372 (+3166.67%)

Mutual labels: gym

Gym Sokoban

Sokoban environment for OpenAI Gym

Stars: ✭ 186 (+342.86%)

Mutual labels: gym

multi car racing

An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.

Stars: ✭ 58 (+38.1%)

Mutual labels: gym

Drqn Tensorflow

Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro

Stars: ✭ 127 (+202.38%)

Mutual labels: gym

Pytorch sac

PyTorch implementation of Soft Actor-Critic (SAC)

Stars: ✭ 174 (+314.29%)

Mutual labels: gym

Gym Unrealcv

Unreal environments for reinforcement learning

Stars: ✭ 202 (+380.95%)

Mutual labels: gym

Torchrl

PyTorch implementation of reinforcement learning algorithms (Soft Actor-Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)

Stars: ✭ 90 (+114.29%)

Mutual labels: gym

omd

JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"

Stars: ✭ 43 (+2.38%)

Mutual labels: gym

Atari

AI research environment for the Atari 2600 games 🤖.

Stars: ✭ 174 (+314.29%)

Mutual labels: gym

Rls

Reinforcement Learning Algorithms Based on TensorFlow 2.x

Stars: ✭ 239 (+469.05%)

Mutual labels: gym

Sumo Rl

A simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control. Compatible with Gym Env from OpenAI and MultiAgentEnv from RLlib.

Stars: ✭ 145 (+245.24%)

Mutual labels: gym

A2c

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Stars: ✭ 169 (+302.38%)

Mutual labels: gym

Pytorch Reinforce

PyTorch Implementation of REINFORCE for both discrete & continuous control

Stars: ✭ 212 (+404.76%)

Mutual labels: gym

Stable Baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Stars: ✭ 115 (+173.81%)

Mutual labels: gym

king-pong

Deep Reinforcement Learning Pong Agent, King Pong, he's the best

Stars: ✭ 23 (-45.24%)

Mutual labels: deep-q-network

Pytorch sac ae

PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)

Stars: ✭ 94 (+123.81%)

Mutual labels: gym

Naf Tensorflow

"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow

Stars: ✭ 192 (+357.14%)

Mutual labels: gym

good robot

"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer; and “Good Robot!”: Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

Stars: ✭ 84 (+100%)

Mutual labels: deep-q-network

DQN-using-PyTorch-and-ML-Agents

A simple example of how to implement vector based DQN using PyTorch and a ML-Agents environment

Stars: ✭ 81 (+92.86%)

Mutual labels: deep-q-network

Ma Gym

A collection of multi agent environments based on OpenAI gym.

Stars: ✭ 226 (+438.1%)

Mutual labels: gym


Info

This project implements a DQN reinforcement learning agent similar to the one described in "Human-level control through deep reinforcement learning".

(See also David Silver's RL course, lecture 6. The method is explained clearly and concisely from about 1h15min onwards.)
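As a rough illustration of the update at the core of DQN, the sketch below computes the one-step Q-learning target, reward plus the discounted maximum Q-value of the next state, for a single transition. The function name `td_target` and the discount values are assumptions made for illustration; they are not taken from this repo.

```python
def td_target(reward, next_q_values, done, gamma=0.99):
    """One-step Q-learning target for a single transition.

    next_q_values holds the Q-value estimates for every action in the
    next state. In DQN these usually come from a separate target network
    that is updated only periodically.
    """
    if done:
        # A terminal state has no future reward to bootstrap from.
        return reward
    return reward + gamma * max(next_q_values)


# Hypothetical transition: reward 1.0 and three actions in the next state.
target = td_target(1.0, [0.5, 2.0, -1.0], done=False, gamma=0.9)
```

The Q-network is then trained to move its prediction for the action actually taken towards this target.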

The agent is applied to OpenAI Gym's 2D car-racing environment.
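Gym's car-racing environment takes a continuous action of the form (steering, gas, brake), whereas DQN chooses among a finite set of actions. One common way to bridge the two, sketched here as an assumption rather than as this repo's actual approach, is a small lookup table of fixed control triples. The names `DISCRETE_ACTIONS` and `to_continuous` and the specific values are hypothetical.

```python
# Hypothetical discrete action set. Each entry is a
# (steering, gas, brake) triple for the car-racing environment.
DISCRETE_ACTIONS = [
    (0.0, 0.0, 0.0),   # coast
    (-1.0, 0.0, 0.0),  # steer left
    (1.0, 0.0, 0.0),   # steer right
    (0.0, 1.0, 0.0),   # accelerate
    (0.0, 0.0, 0.8),   # brake
]


def to_continuous(action_index):
    """Map the agent's discrete action index to a control triple."""
    return list(DISCRETE_ACTIONS[action_index])


# The Q-network would then need one output per entry in the table.
num_actions = len(DISCRETE_ACTIONS)
```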

The structure of the Q-network differs from the original paper. In particular, the network here is much smaller and can easily be trained without a GPU. (It is also easy to specify other structures.)

The agent learns to drive the car from pixels in a few hours and needs no hand-crafted features. There are some minor environment-specific tweaks for car racing, but the base agent knows nothing about car racing.
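Learning from pixels usually involves shrinking each frame before it reaches the network. The sketch below shows one plausible preprocessing step, grayscale conversion followed by downsampling, in pure Python. It is an assumption about the kind of step involved, not the repo's code; the function names and the downsampling factor are hypothetical, and in practice a library such as scikit-image (listed in the requirements) would normally do this work.

```python
def to_grayscale(frame):
    """Convert an RGB frame to grayscale floats in the range 0..1.

    frame is a list of rows, each row a list of (r, g, b) tuples with
    values in 0..255. The weights are the common BT.601 luma weights.
    """
    return [
        [(0.299 * r + 0.587 * g + 0.114 * b) / 255.0 for (r, g, b) in row]
        for row in frame
    ]


def downsample(image, factor=2):
    """Keep every factor-th pixel along both axes."""
    return [row[::factor] for row in image[::factor]]


# Hypothetical 4x4 all-white frame reduced to a 2x2 grayscale image.
frame = [[(255, 255, 255)] * 4 for _ in range(4)]
small = downsample(to_grayscale(frame))
```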

Pre-trained agent

The checkpoint provided in the repo was trained with the default parameters specified in the runner/agent, using roughly 150,000 playing steps.

Training took about 5 hours on a CPU. This is the agent playing in the GIF above and in this video: https://youtu.be/CVZQOAlQib0

The agent sometimes cuts corners and makes occasional mistakes, but otherwise it can drive cleanly for minutes at a time.

Running instructions

Clone the repo and run car_runner_main.py. The settings are specified at the beginning of the runner.

You can train from scratch, load the existing checkpoint from this repo and watch the agent drive reasonably well right away, or continue training the provided checkpoint.

Requirements

Python 3.5 (will not work with Python 2)
OpenAI Gym (the car-racing environment)
TensorFlow 1.0.0
numpy
scikit-image

