All Projects → KokoMind → Recurrent Environment Simulators

KokoMind / Recurrent Environment Simulators

Licence: apache-2.0
Deepmind Recurrent Environment Simulators paper implementation in tensorflow

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Recurrent Environment Simulators

Ml In Tf
Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!
Stars: ✭ 45 (-38.36%)
Mutual labels:  reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks, deepmind
Tensorflow Tutorial
TensorFlow and Deep Learning Tutorials
Stars: ✭ 748 (+924.66%)
Mutual labels:  reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks
Learning To Communicate Pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Stars: ✭ 236 (+223.29%)
Mutual labels:  reinforcement-learning, recurrent-neural-networks, deepmind
Awesome Tensorlayer
A curated list of dedicated resources and applications
Stars: ✭ 248 (+239.73%)
Mutual labels:  reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks
Machine Learning Curriculum
💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+942.47%)
Mutual labels:  reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks
Easy Tensorflow
Simple and comprehensive tutorials in TensorFlow
Stars: ✭ 2,871 (+3832.88%)
Mutual labels:  reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks
Pysc2 Examples
StarCraft II - pysc2 Deep Reinforcement Learning Examples
Stars: ✭ 722 (+889.04%)
Mutual labels:  reinforcement-learning, deepmind
Udacity Deep Learning Nanodegree
This is just a collection of projects that made during my DEEPLEARNING NANODEGREE by UDACITY
Stars: ✭ 15 (-79.45%)
Mutual labels:  reinforcement-learning, recurrent-neural-networks
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-4.11%)
Mutual labels:  reinforcement-learning, recurrent-neural-networks
Deep Plant
Deep-Plant: Plant Classification with CNN/RNN. It consists of CAFFE/Tensorflow implementation of our PR-17, TIP-18 (HGO-CNN & PlantStructNet) and MalayaKew dataset.
Stars: ✭ 66 (-9.59%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Reaver
Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
Stars: ✭ 499 (+583.56%)
Mutual labels:  reinforcement-learning, deepmind
Price prediction lob
Deep learning for price movement prediction using high frequency limit order data
Stars: ✭ 27 (-63.01%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
3d Reconstruction With Neural Networks
3D reconstruction with neural networks using Tensorflow. See link for Video (https://www.youtube.com/watch?v=iI6ZMST8Ri0)
Stars: ✭ 71 (-2.74%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Trending Deep Learning
Top 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (+643.84%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Stanford Cs 230 Deep Learning
VIP cheatsheets for Stanford's CS 230 Deep Learning
Stars: ✭ 5,149 (+6953.42%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Reading comprehension tf
Machine Reading Comprehension in Tensorflow
Stars: ✭ 37 (-49.32%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Mujocounity
Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).
Stars: ✭ 47 (-35.62%)
Mutual labels:  reinforcement-learning, deepmind
Tensorflow Cnn Time Series
Feeding images of time series to Conv Nets! (Tensorflow + Keras)
Stars: ✭ 49 (-32.88%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Image Captioning
Image Captioning: Implementing the Neural Image Caption Generator with python
Stars: ✭ 52 (-28.77%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks
Eeglearn
A set of functions for supervised feature learning/classification of mental states from EEG based on "EEG images" idea.
Stars: ✭ 469 (+542.47%)
Mutual labels:  convolutional-neural-networks, recurrent-neural-networks

Recurrent-Environment-Simulators

Modeling the environment is an important task for intelligent agents to be able to plan and act efficiently. A Recurrent Environment Simulator network can achive this task easily by learning to predict the next observation given the history of observations and actions. Making the agent able to predict the consequences of its actions.

This repository contains a tensorflow implementation of the Recurrent Enviroment Simulators paper puplished by DeepMind at ICML 2017. (https://arxiv.org/abs/1704.02254)

Network Architecture

The following figure visualize the RES architecture. It can be devided to three main parts, the encoder, the decoder and the action-conditioned LSTM. The encoder extract the features from the observation at time step t, the action-conditioned LSTM keeps useful features from previous frames to help the decoder predict better observations for time step t+1.



Action-conditioned LSTM

One of the main contributions of their work is fusing the action with the hidden state representation when predicting the next hidden state representation in time. In previous work, the action was used instead to directly predict the next image. Why? Authors suggest it could “enable the model to incorporate action information more effectively”. so paper used a modified version of LSTM called Action conditioned LSTM. mainly it's an early fusion between actions and states. They used this approach as it enables them to explore how the model generalises to different action policies.



Data Collection using A2C RL agent

We trained a synchronous Advantage Actor Critic (A2C) agent and used it to explore the desired enviroment and collect data, using openAi Atari enviroments.

We'll provide some collected data from different Atari enviroments.

Usage

Dependencies

Python 3.X
tensorflow 1.3.0
numpy 1.13.1
tqdm

Run

  • Collect data from any atari enviroment using the method mentioned before, or use the provided data.
  • Edit the configration file to meet your need.
  • Run python res.py is_train=True

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].