flyyufelix / Vizdoom Keras Rl

Licence: mit
Reinforcement Learning in Keras on VizDoom

Implementation of Reinforcement Learning Algorithms in Keras tested on VizDoom

This repo includes implementations of Double Deep Q Network (DDQN), Dueling DDQN, Deep Recurrent Q Network (DRQN) with LSTM, REINFORCE, Advantage Actor Critic (A2C), A2C with LSTM, and C51 DDQN (Distributional Bellman). All implementations are tested on the VizDoom Defend the Center scenario, which is a 3D partially observable environment.
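As a rough illustration of what distinguishes DDQN from plain DQN, here is a minimal sketch of the Double DQN target for a single transition. It is not taken from this repo: the function name, the argument names, and the use of plain Python lists in place of network outputs are all assumptions made for the example.

```python
def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Double DQN target for one transition (illustrative sketch).

    q_online_next / q_target_next: per-action Q-values for the next state,
    as produced by the online and target networks (plain lists here).
    """
    if done:
        # Terminal transitions do not bootstrap from the next state.
        return reward
    # The online network selects the greedy next action ...
    best_action = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    # ... and the target network evaluates that action.
    return reward + gamma * q_target_next[best_action]


# Hypothetical Q-values for a two-action problem.
target = double_dqn_target(1.0, False, [1.0, 2.0], [3.0, 4.0], gamma=0.9)
print(target)
```

Selecting the action with one network and evaluating it with the other is what reduces the overestimation bias of standard Q-learning targets.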

For more details on the implementation, you can check out my blog post at https://flyyufelix.github.io/2017/10/12/dqn-vs-pg.html.

Results

The chart below shows the performance of DDQN, REINFORCE, and A2C over 20,000 episodes on Defend the Center. The y-axis is the average number of kills (moving average over 50 episodes).

Performance Chart 1
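The smoothing mentioned for the y-axis can be sketched as a trailing moving average. This is only one plausible way to compute it: the function name, the handling of the first episodes (averaging over however many values exist so far), and the sample numbers are assumptions, not details from the repo.

```python
from collections import deque


def moving_average(values, window=50):
    """Trailing moving average; early entries average over fewer values."""
    recent = deque(maxlen=window)
    running_sum = 0.0
    averages = []
    for value in values:
        if len(recent) == window:
            # The oldest value is about to be evicted from the window.
            running_sum -= recent[0]
        recent.append(value)
        running_sum += value
        averages.append(running_sum / len(recent))
    return averages


# Made-up kill counts, smoothed with a small window for readability.
kills_per_episode = [0, 2, 4, 6]
smoothed = moving_average(kills_per_episode, window=2)
print(smoothed)
```

With `window=50`, each plotted point would be the mean kill count of the most recent 50 episodes.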

The chart below shows the performance of C51 DDQN and DDQN over 15,000 episodes on Defend the Center.

Performance Chart 2
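For readers new to C51: instead of a single Q-value per action, the network outputs a probability distribution over a fixed set of return values ("atoms", 51 in the original algorithm), and the Q-value used for action selection is the mean of that distribution. The sketch below shows only that last step. The function name and the `v_min`/`v_max` defaults are assumptions for illustration and may differ from the values used in this repo.

```python
def c51_q_values(probabilities, v_min=-10.0, v_max=10.0):
    """Expected return per action from categorical return distributions.

    probabilities: one list per action, each holding the probability mass
    on every atom (each list should sum to 1).
    """
    num_atoms = len(probabilities[0])
    delta_z = (v_max - v_min) / (num_atoms - 1)
    # Evenly spaced atoms between v_min and v_max.
    support = [v_min + i * delta_z for i in range(num_atoms)]
    return [sum(z * p for z, p in zip(support, probs)) for probs in probabilities]


# Toy example with 3 atoms instead of 51, and two actions.
q_values = c51_q_values([[0.5, 0.5, 0.0], [0.0, 0.0, 1.0]], v_min=0.0, v_max=2.0)
greedy_action = max(range(len(q_values)), key=q_values.__getitem__)
print(q_values, greedy_action)
```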

Usage

First, follow the ViZDoom installation instructions. If you use Python, you can simply install it with pip:

$ pip install vizdoom

Second, clone ViZDoom to your machine and copy the Python files provided in this repo into examples/python.

To test if the environment is working, run

$ cd examples/python
$ python ddqn.py

You should see printouts indicating that DDQN is running successfully; otherwise, errors will be thrown.
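If the script fails right away, it can help to first confirm that the `vizdoom` package is visible to the Python interpreter you are using. The following is a small optional check, not part of this repo; the helper name is made up for the example.

```python
import importlib.util


def vizdoom_available():
    """Return True if the vizdoom package can be found by this interpreter."""
    return importlib.util.find_spec("vizdoom") is not None


if vizdoom_available():
    print("vizdoom found")
else:
    print("vizdoom not found; run `pip install vizdoom` first")
```

This only checks that the package can be located; it does not start a game or verify that the scenario files are present.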

Dependencies
