Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → flyyufelix → Direct Future Prediction Keras

flyyufelix / Direct Future Prediction Keras

Licence: mit

Direct Future Prediction (DFP ) in Keras

Programming Languages

139335 projects - #7 most used programming language

Labels

keras reinforcement-learning

Projects that are alternatives of or similar to Direct Future Prediction Keras

PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)

Stars: ✭ 94 (-8.74%)

Mutual labels: reinforcement-learning

Researchpapernotes

Initiative to read research papers

Stars: ✭ 97 (-5.83%)

Mutual labels: reinforcement-learning

Samsung Drl Code

Repository for codes of Deep Reinforcement Learning (DRL) lectured at Samsung

Stars: ✭ 99 (-3.88%)

Mutual labels: reinforcement-learning

Reinforcement learning environments for Torch7

Stars: ✭ 94 (-8.74%)

Mutual labels: reinforcement-learning

Learning human driver models from NGSIM data with imitation learning.

Stars: ✭ 96 (-6.8%)

Mutual labels: reinforcement-learning

Papers Literature Ml Dl Rl Ai

Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning

Stars: ✭ 1,341 (+1201.94%)

Mutual labels: reinforcement-learning

Deep learning study

A curated list of Deep Learning

Stars: ✭ 93 (-9.71%)

Mutual labels: reinforcement-learning

Reinforcement learning

강화학습에 대한 기본적인 알고리즘 구현

Stars: ✭ 100 (-2.91%)

Mutual labels: reinforcement-learning

Seqgan tensorflow

SeqGAN tensorflow implementation

Stars: ✭ 96 (-6.8%)

Mutual labels: reinforcement-learning

Chemgan Challenge

Code for the paper: Benhenda, M. 2017. ChemGAN challenge for drug discovery: can AI reproduce natural chemical diversity? arXiv preprint arXiv:1708.08227.

Stars: ✭ 98 (-4.85%)

Mutual labels: reinforcement-learning

Awesome Deep Reinforcement Learning

Curated list for Deep Reinforcement Learning (DRL): software frameworks, models, datasets, gyms, baselines...

Stars: ✭ 95 (-7.77%)

Mutual labels: reinforcement-learning

Gym Electric Motor

Gym Electric Motor (GEM): An OpenAI Gym Environment for Electric Motors

Stars: ✭ 95 (-7.77%)

Mutual labels: reinforcement-learning

Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]

Stars: ✭ 97 (-5.83%)

Mutual labels: reinforcement-learning

Cs234 Reinforcement Learning Winter 2019

My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019

Stars: ✭ 93 (-9.71%)

Mutual labels: reinforcement-learning

Framework for developing OpenAI Gym robotics environments simulated with Ignition Gazebo

Stars: ✭ 97 (-5.83%)

Mutual labels: reinforcement-learning

Rl Movie Recommender

The purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.

Stars: ✭ 93 (-9.71%)

Mutual labels: reinforcement-learning

Connecting Torch to StarCraft

Stars: ✭ 1,341 (+1201.94%)

Mutual labels: reinforcement-learning

Highly Modular and Scalable Reinforcement Learning

Stars: ✭ 102 (-0.97%)

Mutual labels: reinforcement-learning

A framework using TensorFlow.js for Deep Reinforcement Learning

Stars: ✭ 101 (-1.94%)

Mutual labels: reinforcement-learning

Solving OpenAI Gym problems.

Stars: ✭ 98 (-4.85%)

Mutual labels: reinforcement-learning

View All Similar Projects ➔

Discounted Future Prediction (DFP) implemented in Keras

This repo includes implementation of Discounted Future Prediction (DFP) Algorithm describe in this paper. The implementation is tested on the VizDoom Health Gathering scenario, which is a 3D partially observable environment.

For a general introduction of DFP and step-by-step walkthrough of the implementation, please check out my blog post at https://flyyufelix.github.io/2017/11/17/direct-future-prediction.html.

Results

Below is the performance chart of 40,000 episodes of DFP and DDQN running on Health Gathering. Y-axis is the average survival time (moving average over 50 episodes).

Usage

First follow this instruction to install VizDoom. If you use python, you can simply do pip install:

$ pip install vizdoom

Second, clone ViZDoom to your machine, copy the python files provided in this repo over to examples/python.

Next, edit scenarios/health_gathering.cfg file. Replace this line

doom_scenario_path = health_gathering.wad

with

doom_scenario_path = health_gathering_supreme.wad

To test if the environment is working, run

$ cd examples/python
$ python dfp.py

You should see some printouts indicating that the DFP is running successfully. Errors will be thrown otherwise.

Dependencies

Keras 1.2.2 / 2.0.5
Tensorflow 0.12.0 / 1.2.1
VizDoom Environment

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 103

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗