Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → google-research → World_models

google-research / World_models

Licence: apache-2.0

Programming Languages

python

139335 projects - #7 most used programming language

World Models Library

World Models is a platform-agnostic library to facilitate visual based agents for planning. This notebook (run it in colab) shows how to use World Models library and its different components.

To run locally, use the following command:

python3 -m world_models.bin.train_eval  \
    --config_path=/path/to/config  \
    --output_dir=/path/to/output_dir  \
    --logtostderr

Experiment Results

Below is a summary of our findings. For full discussion please see our paper: Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning

Is predicting future rewards sufficient for achieving success in visual model-based reinforcement learning? We experimentally demonstrate that this is usually not the case in the online settings and the key is to predict future images too.

Amazingly, this also means there is a weak correlation between reward prediction accuracy and performance of the agent. However, we show that there is a much stronger correlation between image reconstruction error and the performance of the agent.

We show how this phenomenon is directly related to exploration: models that fit the data better usually perform better in an offline setup. Surprisingly, these are often not the same models that perform the best when learning and exploring from scratch!

How to Cite

If you use this work, please cite the following paper where it was first introduced:

   @article{2020worldmodels,
     title   = {Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning},
     author  = {Mohammad Babaeizadeh and Mohammad Taghi Saffar and Danijar Hafner and Harini Kannan and Chelsea Finn and Sergey Levine and Dumitru Erhan},
     year    = {2020},
     url     = {https://arxiv.org/abs/2012.04603}
   }

You can reach us at [email protected]

Dependencies

absl
gin-config
TensorFlow==1.15
TensorFlow probability==0.7
gym
dm_control
MuJoCo

Disclaimer: This is not an official Google product.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 85

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗