All Projects → ShibiHe → Model Free Episodic Control

ShibiHe / Model Free Episodic Control

Licence: mit
This is the implementation of paper Model Free Episodic Control

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Model Free Episodic Control

Numpy neural net
A simple neural network (multilayer perceptron) with backpropagation implemented in Python with NumPy
Stars: ✭ 25 (-19.35%)
Mutual labels:  numpy
Gym Alttp Gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
Stars: ✭ 14 (-54.84%)
Mutual labels:  numpy
Shadowmusic
A temporal music synthesizer
Stars: ✭ 28 (-9.68%)
Mutual labels:  numpy
Machine Learning Experiments
🤖 Interactive Machine Learning experiments: 🏋️models training + 🎨models demo
Stars: ✭ 841 (+2612.9%)
Mutual labels:  numpy
Eliot
Eliot: the logging system that tells you *why* it happened
Stars: ✭ 874 (+2719.35%)
Mutual labels:  numpy
Glumpy
Python+Numpy+OpenGL: fast, scalable and beautiful scientific visualization
Stars: ✭ 882 (+2745.16%)
Mutual labels:  numpy
Skydetector
A Python implementation of Sky Region Detection in a Single Image for Autonomous Ground Robot Navigation (Shen and Wang, 2013)
Stars: ✭ 23 (-25.81%)
Mutual labels:  numpy
Machine Learning Alpine
Alpine Container for Machine Learning
Stars: ✭ 30 (-3.23%)
Mutual labels:  numpy
Tensorboardx
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
Stars: ✭ 7,185 (+23077.42%)
Mutual labels:  numpy
Pythondatasciencehandbook
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
Stars: ✭ 31,995 (+103109.68%)
Mutual labels:  numpy
Napari
napari: a fast, interactive, multi-dimensional image viewer for python
Stars: ✭ 847 (+2632.26%)
Mutual labels:  numpy
Pyda 2e Zh
📖 [译] 利用 Python 进行数据分析 · 第 2 版
Stars: ✭ 866 (+2693.55%)
Mutual labels:  numpy
Gym Dart
OpenAI Gym environments using DART
Stars: ✭ 20 (-35.48%)
Mutual labels:  openai-gym
Rl Baselines Zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Stars: ✭ 839 (+2606.45%)
Mutual labels:  openai-gym
Gym Panda
An OpenAI Gym Env for Panda
Stars: ✭ 29 (-6.45%)
Mutual labels:  openai-gym
Numpyro
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
Stars: ✭ 929 (+2896.77%)
Mutual labels:  numpy
Numsharp
High Performance Computation for N-D Tensors in .NET, similar API to NumPy.
Stars: ✭ 882 (+2745.16%)
Mutual labels:  numpy
Numpycpp
A c++ header library for matrix operation inspired Numpy Scipy, MATLAB only using Eigen.
Stars: ✭ 30 (-3.23%)
Mutual labels:  numpy
Drlkit
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-6.45%)
Mutual labels:  numpy
My Very Deep Caffe
This is an implementation of very deep two stream CNNs for action recognition. The implementation is inspired by Wang et. al., https://github.com/yjxiong/caffe. Some improvements from Wang's implementation include reading videos from LDMB database, faster testing using LDMB interface. The aim is to work better with big dataset such as UCF101, HMDB51, Sports1M and ActivityNet easily.
Stars: ✭ 21 (-32.26%)
Mutual labels:  deep

Model-Free-Episodic-Control

This is the implementation of DQN and Model Free Episodic Control

#Introduction This package provides DQN and Episodic Control. The DQN implementation is based on spragunr/deep_q_rl and the Episodic Control is written by myself.

Model Free Episodic Control, C. Blundell et al., arXiv, 2016.

Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015.

I have contacted the author C. Blundell of Model Free Episodic Control. He told me he was using approximate KNN to speed up episodic control, however, he did not tell me details. So I used annoy to do KNN, and I rebuild the search tree frequently.

related repo:https://github.com/astier/model-free-episodic-control

#Dependencies

Game roms should be stored in directory roms which stays next to dqn_ep.

Model-Free-Episodic-Control

├ dqn_ep -> source codes

├ roms -> game roms

└ README.md

###Tips: I made some changes to DQN so that we do not need OpenCV any more. In addition if your python has OpenAI gym then you do not need to install Arcade Learning Environment (https://github.com/mgbellemare/Arcade-Learning-Environment) Check spragunr/deep_q_rl's readMe to know more about how to install ALE.

Personally, I recommend using OpenAI gym because it not only can be installed by simply pip install gym but also provides us atari game roms (For instance /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/atari_py/atari_roms on my mac).

###Dependencies for running DQN

Arcade Learning Environment or OpenAI gym

Numpy and SciPy

Theano

Lasagne

A reasonable GPU

###Dependencies for running Episodic Control

Arcade Learning Environment or OpenAI gym

Numpy and SciPy

annoy for approximate KNN

A reasonable CPU

Running

examples:

THEANO_FLAGS='device=gpu0, floatX=float32' python run_nature.py

THEANO_FLAGS='device=gpu1, floatX=float32' python run_nature.py -r ms_pacman

python run_episodic_control.py

To get more running details, we can use python run_episodic_control.py -h or python run_nature.py -h

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].