Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → KokoMind → Recurrent Environment Simulators

KokoMind / Recurrent Environment Simulators

Licence: apache-2.0

Deepmind Recurrent Environment Simulators paper implementation in tensorflow

Programming Languages

python

139335 projects - #7 most used programming language

Labels

tensorflow neural-network reinforcement-learning convolutional-neural-networks recurrent-neural-networks deepmind

Projects that are alternatives of or similar to Recurrent Environment Simulators

Ml In Tf

Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!

Stars: ✭ 45 (-38.36%)

Mutual labels: reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks, deepmind

Tensorflow Tutorial

TensorFlow and Deep Learning Tutorials

Stars: ✭ 748 (+924.66%)

Mutual labels: reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks

Learning To Communicate Pytorch

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Stars: ✭ 236 (+223.29%)

Mutual labels: reinforcement-learning, recurrent-neural-networks, deepmind

Awesome Tensorlayer

A curated list of dedicated resources and applications

Stars: ✭ 248 (+239.73%)

Mutual labels: reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks

Machine Learning Curriculum

💻 Make machines learn so that you don't have to struggle to program them; The ultimate list

Stars: ✭ 761 (+942.47%)

Mutual labels: reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks

Easy Tensorflow

Simple and comprehensive tutorials in TensorFlow

Stars: ✭ 2,871 (+3832.88%)

Mutual labels: reinforcement-learning, convolutional-neural-networks, recurrent-neural-networks

Pysc2 Examples

StarCraft II - pysc2 Deep Reinforcement Learning Examples

Stars: ✭ 722 (+889.04%)

Mutual labels: reinforcement-learning, deepmind

Udacity Deep Learning Nanodegree

This is just a collection of projects that made during my DEEPLEARNING NANODEGREE by UDACITY

Stars: ✭ 15 (-79.45%)

Mutual labels: reinforcement-learning, recurrent-neural-networks

Torch Ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Stars: ✭ 70 (-4.11%)

Mutual labels: reinforcement-learning, recurrent-neural-networks

Deep Plant

Deep-Plant: Plant Classification with CNN/RNN. It consists of CAFFE/Tensorflow implementation of our PR-17, TIP-18 (HGO-CNN & PlantStructNet) and MalayaKew dataset.

Stars: ✭ 66 (-9.59%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Reaver

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.

Stars: ✭ 499 (+583.56%)

Mutual labels: reinforcement-learning, deepmind

Price prediction lob

Deep learning for price movement prediction using high frequency limit order data

Stars: ✭ 27 (-63.01%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

3d Reconstruction With Neural Networks

3D reconstruction with neural networks using Tensorflow. See link for Video (https://www.youtube.com/watch?v=iI6ZMST8Ri0)

Stars: ✭ 71 (-2.74%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Trending Deep Learning

Top 100 trending deep learning repositories sorted by the number of stars gained on a specific day.

Stars: ✭ 543 (+643.84%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Stanford Cs 230 Deep Learning

VIP cheatsheets for Stanford's CS 230 Deep Learning

Stars: ✭ 5,149 (+6953.42%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Reading comprehension tf

Machine Reading Comprehension in Tensorflow

Stars: ✭ 37 (-49.32%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Mujocounity

Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).

Stars: ✭ 47 (-35.62%)

Mutual labels: reinforcement-learning, deepmind

Tensorflow Cnn Time Series

Feeding images of time series to Conv Nets! (Tensorflow + Keras)

Stars: ✭ 49 (-32.88%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Image Captioning

Image Captioning: Implementing the Neural Image Caption Generator with python

Stars: ✭ 52 (-28.77%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

Eeglearn

A set of functions for supervised feature learning/classification of mental states from EEG based on "EEG images" idea.

Stars: ✭ 469 (+542.47%)

Mutual labels: convolutional-neural-networks, recurrent-neural-networks

View All Similar Projects ➔

Recurrent-Environment-Simulators

Modeling the environment is an important task for intelligent agents to be able to plan and act efficiently. A Recurrent Environment Simulator network can achive this task easily by learning to predict the next observation given the history of observations and actions. Making the agent able to predict the consequences of its actions.

This repository contains a tensorflow implementation of the Recurrent Enviroment Simulators paper puplished by DeepMind at ICML 2017. (https://arxiv.org/abs/1704.02254)

Network Architecture

The following figure visualize the RES architecture. It can be devided to three main parts, the encoder, the decoder and the action-conditioned LSTM. The encoder extract the features from the observation at time step t, the action-conditioned LSTM keeps useful features from previous frames to help the decoder predict better observations for time step t+1.

Action-conditioned LSTM

One of the main contributions of their work is fusing the action with the hidden state representation when predicting the next hidden state representation in time. In previous work, the action was used instead to directly predict the next image. Why? Authors suggest it could “enable the model to incorporate action information more effectively”. so paper used a modified version of LSTM called Action conditioned LSTM. mainly it's an early fusion between actions and states. They used this approach as it enables them to explore how the model generalises to different action policies.

Data Collection using A2C RL agent

We trained a synchronous Advantage Actor Critic (A2C) agent and used it to explore the desired enviroment and collect data, using openAi Atari enviroments.

We'll provide some collected data from different Atari enviroments.

Usage

Dependencies

Python 3.X
tensorflow 1.3.0
numpy 1.13.1
tqdm

Run

Collect data from any atari enviroment using the method mentioned before, or use the provided data.
Edit the configration file to meet your need.
Run python res.py is_train=True

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 73

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗