All Projects → KaleabTessera → DQN-Atari

KaleabTessera / DQN-Atari

Licence: other
Deep Q-Learning (DQN) implementation for Atari pong.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to DQN-Atari

breakout-Deep-Q-Network
Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout
Stars: ✭ 69 (+30.19%)
Mutual labels:  dqn, atari
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: ✭ 4,109 (+7652.83%)
Mutual labels:  dqn, atari
dqn-pytorch
DQN to play Atari Pong
Stars: ✭ 77 (+45.28%)
Mutual labels:  dqn, atari
DQN-pytorch
A PyTorch implementation of Human-Level Control through Deep Reinforcement Learning
Stars: ✭ 23 (-56.6%)
Mutual labels:  dqn, atari
retrore
A curated list of original and reverse-engineered vintage 6502 game sourcecode.
Stars: ✭ 22 (-58.49%)
Mutual labels:  atari
RetroLove
A collection of simple games built with the LOVE game framework.
Stars: ✭ 13 (-75.47%)
Mutual labels:  pong
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (-26.42%)
Mutual labels:  dqn
pong-almost-from-scratch
This is an attempt of making the game pong using modern programming languages
Stars: ✭ 14 (-73.58%)
Mutual labels:  pong
distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
Stars: ✭ 93 (+75.47%)
Mutual labels:  dqn
free-lunch-saliency
Code for "Free-Lunch Saliency via Attention in Atari Agents"
Stars: ✭ 15 (-71.7%)
Mutual labels:  atari
AI BIG DATAS ALGORITHM
大数据+人工智能+数据结构相关案例项目
Stars: ✭ 28 (-47.17%)
Mutual labels:  dqn
pongloader
🏓 Pong in 512 bytes. (boot sector)
Stars: ✭ 86 (+62.26%)
Mutual labels:  pong
6502.Net
A .Net-based Cross-Assembler for Several 8-Bit Microprocessors
Stars: ✭ 44 (-16.98%)
Mutual labels:  atari
pacman-ai
A.I. plays the original 1980 Pacman using Neuroevolution of Augmenting Topologies and Deep Q Learning
Stars: ✭ 26 (-50.94%)
Mutual labels:  dqn
PongC
Classic game of Pong in the terminal made in C.
Stars: ✭ 18 (-66.04%)
Mutual labels:  pong
dqn-obstacle-avoidance
Deep Reinforcement Learning for Fixed-Wing Flight Control with Deep Q-Network
Stars: ✭ 57 (+7.55%)
Mutual labels:  dqn
fastbasic
FastBasic - Fast BASIC interpreter for the Atari 8-bit computers
Stars: ✭ 108 (+103.77%)
Mutual labels:  atari
salvador
A free, open-source compressor for the ZX0 format
Stars: ✭ 35 (-33.96%)
Mutual labels:  atari
RL-FlappyBird
Using reinforcement learning to train FlappyBird.
Stars: ✭ 68 (+28.3%)
Mutual labels:  dqn
mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
Stars: ✭ 39 (-26.42%)
Mutual labels:  atari

DQN-Atari

Deep Q-network implementation for Pong-vo. The implementation follows from the paper - Playing Atari with Deep Reinforcement Learning and Human-level control through deep reinforcement learning.

Results

Video of Gameplay - DQN Nature Paper

DQN Video

Reward per Episode

Rewards Per Episode

Summary of Implementation

DQN Nature Architecture Implementation

  • Input : 84 × 84 × 4 image (using the last 4 frames of a history)
  • Conv Layer 1 : 32 8 × 8 filters with stride 4
  • Conv Layer 2: 64 4 × 4 filters with stride 2
  • Conv Layer 3: 64 3 × 3 filters with stride 1
  • Fully Connected 1 : fully-connected and consists of 256 rectifier units
  • Output : fully connected linear layer with a single output for each valid action.

DQN Neurips Architecture Implementation

  • Input : 84 × 84 × 4 image (using the last 4 frames of a history)
  • Conv Layer 1 : 16 8 × 8 filters with stride 4
  • Conv Layer 2: 32 4 × 4 filters with stride 2
  • Fully Connected 1 : fully-connected and consists of 256 rectifier units
  • Output : fully connected linear layer with a single output for each valid action.

Other Params

  • Optimizer: RMSProp
  • Batch Size: 32
  • E-greedy : 0.1

How to run

Create a new environment

Example:

conda create -n dqn_pong

Install Dependencies

pip install -r requirements.txt

To use gym.wrappers.Monitor to record the last episode

sudo apt-get install ffmpeg

Run Training from Scratch

python train_atari.py

Use a trained agent

python train_atari.py --load-checkpoint-file results/checkpoint_dqn_nature.pth

View Progress

A video is recorded every 50 episodes. See videos in /video/ folder.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].