TheAbhiKumar / Tensorflow Value Iteration Networks

Licence: apache-2.0
TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper

Value Iteration Networks in TensorFlow

Tamar, A., Wu, Y., Thomas, G., Levine, S., and Abbeel, P. Value Iteration Networks. Neural Information Processing Systems (NIPS) 2016

This repository contains a TensorFlow implementation of Value Iteration Networks, the paper that won the Best Paper Award at NIPS 2016. The code is based on the authors' original Theano implementation.

Value Iteration Network and Module
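
For orientation, the classical value iteration recurrence that the VI module embeds can be sketched in plain Python. This is an illustrative sketch and not code from this repository: the function name value_iteration, the four-connected deterministic moves, the discount gamma, and the toy reward grid are all assumptions. In the paper, the VI module replaces the hand-written transition backup below with a learned convolution over the reward and value maps, and takes the maximum over the resulting action channels.

```python
def value_iteration(reward, gamma=0.9, iterations=20):
    """Run a fixed number of value iteration sweeps on a 2D grid.

    Hypothetical helper for illustration; moves are deterministic.
    """
    rows, cols = len(reward), len(reward[0])
    # Four-connected moves: up, down, left, right (an illustrative choice).
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    value = [[0.0] * cols for _ in range(rows)]
    for _ in range(iterations):
        new_value = [[0.0] * cols for _ in range(rows)]
        for r in range(rows):
            for c in range(cols):
                q_values = []
                for dr, dc in moves:
                    nr, nc = r + dr, c + dc
                    # Moving off the grid leaves the agent where it is.
                    if not (0 <= nr < rows and 0 <= nc < cols):
                        nr, nc = r, c
                    # Q(s, a) = R(s') + gamma * V(s') for the cell s' reached.
                    q_values.append(reward[nr][nc] + gamma * value[nr][nc])
                # V(s) = max over actions of Q(s, a).
                new_value[r][c] = max(q_values)
        value = new_value
    return value


# Toy 3x3 grid with a single rewarding cell in the bottom-right corner.
rewards = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0],
]
values = value_iteration(rewards)
```

Cells closer to the rewarding corner end up with higher values, which is the gradient a planner follows. The fixed number of sweeps plays the role of the recurrence depth K in the network.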

Training

  • Download the 16x16 and 28x28 GridWorld datasets from the authors' repository. The 8x8 GridWorld dataset is included in this repository for convenience, since it is small.
# Runs the 8x8 Gridworld with default parameters
python3 train.py

To monitor training progress, set config.log to True and launch tensorboard --logdir /tmp/vintf/. The log directory is /tmp/vintf/ by default and can be changed in config.logdir. The code currently runs the 8x8 GridWorld model by default.

The 8x8 GridWorld model converges in under 30 epochs to about 98.5% accuracy. The paper reports about 99.6%, which I was able to reproduce with the Theano code. Results for 16x16 and 28x28 can be seen here.

Dependencies

  • Python >= 3.6
  • TensorFlow >= 1.0
  • SciPy >= 0.18.1 (to load the data)
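
The version floors above can be checked before training with a small script. The following is a minimal sketch, not part of this repository: parse_version, meets_minimum, and check_dependencies are hypothetical helpers, the pip distribution names tensorflow and scipy are assumptions (some TensorFlow 1.x installs use a different distribution name), and importlib.metadata requires Python 3.8 or newer.

```python
import sys
from importlib import metadata

# Minimum versions taken from the dependency list; the distribution
# names used as keys are assumptions.
MINIMUMS = {"tensorflow": (1, 0), "scipy": (0, 18, 1)}


def parse_version(text):
    """Turn '1.15.0rc1' into (1, 15, 0), stopping at the first non-numeric piece."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare version tuples after padding the shorter one with zeros."""
    width = max(len(installed), len(minimum))
    padded_installed = tuple(installed) + (0,) * (width - len(installed))
    padded_minimum = tuple(minimum) + (0,) * (width - len(minimum))
    return padded_installed >= padded_minimum


def check_dependencies(minimums=MINIMUMS):
    """Return a list of human-readable problems; an empty list means all good."""
    problems = []
    if sys.version_info < (3, 6):
        problems.append("Python >= 3.6 is required")
    for name, minimum in minimums.items():
        try:
            installed = parse_version(metadata.version(name))
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed")
            continue
        if not meets_minimum(installed, minimum):
            wanted = ".".join(str(n) for n in minimum)
            problems.append(f"{name} is older than {wanted}")
    return problems


if __name__ == "__main__":
    for problem in check_dependencies():
        print(problem)
```

The script prints one line per unmet requirement and prints nothing when every check passes.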

Datasets

  • The GridWorld dataset used is from the authors' repository, which also contains Matlab scripts to generate the dataset. The code to process the dataset is from the original repository, with minor modifications, under this license.
  • The model was also originally tested on three other domains, and the authors' original code will be released eventually:
    • Mars Rover Navigation
    • Continuous control
    • WebNav
