flyyufelix / Direct Future Prediction Keras

Licence: MIT
Direct Future Prediction (DFP) in Keras

Direct Future Prediction (DFP) implemented in Keras

This repo includes an implementation of the Direct Future Prediction (DFP) algorithm described in this paper. The implementation is tested on the ViZDoom Health Gathering scenario, which is a 3D partially observable environment.

For a general introduction to DFP and a step-by-step walkthrough of the implementation, please check out my blog post at https://flyyufelix.github.io/2017/11/17/direct-future-prediction.html.

Results

The chart below shows the performance of DFP and DDQN over 40,000 episodes on Health Gathering. The y-axis is the average survival time (moving average over 50 episodes).

DFP Performance Chart
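
For readers who want to reproduce this kind of smoothing on their own logs, here is a minimal sketch of a trailing moving average. The function name `moving_average` and the sample survival times are made up for illustration; this is not the plotting code used for the chart, and the handling of the first few episodes (averaging over whatever has been seen so far) is an assumption.

```python
from collections import deque


def moving_average(values, window=50):
    """Return the trailing moving average of `values`.

    For the first `window - 1` entries the average is taken over the
    episodes seen so far, so the output has the same length as the input.
    """
    recent = deque(maxlen=window)
    running_sum = 0.0
    averages = []
    for value in values:
        if len(recent) == window:
            running_sum -= recent[0]  # oldest value is about to be evicted
        recent.append(value)
        running_sum += value
        averages.append(running_sum / len(recent))
    return averages


# Hypothetical survival times for a handful of episodes (not real results).
survival_times = [300, 420, 380, 510, 640]
smoothed = moving_average(survival_times, window=3)
```

With per-episode survival times collected during training, calling `moving_average(times, window=50)` would give a curve comparable in spirit to the one described above.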

Usage

First, follow these instructions to install ViZDoom. If you use Python, you can simply install it with pip:

$ pip install vizdoom

Second, clone ViZDoom to your machine and copy the Python files provided in this repo into examples/python.

Next, edit the scenarios/health_gathering.cfg file. Replace this line

doom_scenario_path = health_gathering.wad

with

doom_scenario_path = health_gathering_supreme.wad
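
If you prefer to script this edit rather than make it by hand, the sketch below shows one way to do it. The helper names `switch_scenario_wad` and `patch_config` are hypothetical, and the code assumes the config uses simple `key = value` lines as in the snippet above; check the result before relying on it.

```python
from pathlib import Path

OLD_WAD = "health_gathering.wad"
NEW_WAD = "health_gathering_supreme.wad"


def switch_scenario_wad(cfg_text, old=OLD_WAD, new=NEW_WAD):
    """Return cfg_text with the doom_scenario_path line pointing at `new`.

    Only a line whose key is doom_scenario_path and whose value is exactly
    `old` is rewritten; every other line is passed through unchanged.
    """
    lines = []
    for line in cfg_text.splitlines(keepends=True):
        key, sep, value = line.partition("=")
        if sep and key.strip() == "doom_scenario_path" and value.strip() == old:
            newline = line[len(line.rstrip("\r\n")):]  # keep original line ending
            line = f"doom_scenario_path = {new}{newline}"
        lines.append(line)
    return "".join(lines)


def patch_config(cfg_path):
    """Rewrite the config file at cfg_path in place (hypothetical helper)."""
    path = Path(cfg_path)
    path.write_text(switch_scenario_wad(path.read_text()))


# Example call, run from the ViZDoom checkout (path taken from the step above):
# patch_config("scenarios/health_gathering.cfg")
```

Running the helper a second time leaves the file unchanged, since the line no longer matches the old value.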

To test if the environment is working, run

$ cd examples/python
$ python dfp.py

You should see some printouts indicating that DFP is running successfully. Errors will be thrown otherwise.

Dependencies
