All Projects → openai → Large Scale Curiosity

openai / Large Scale Curiosity

Code for the paper "Large-Scale Study of Curiosity-Driven Learning"

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Large Scale Curiosity

Paper
On self sovereign human identity.
Stars: ✭ 537 (-23.61%)
Mutual labels:  paper
Pl Compiler Resource
程序语言与编译技术相关资料(持续更新中)
Stars: ✭ 578 (-17.78%)
Mutual labels:  paper
Awesome Relation Extraction
📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (NLP).
Stars: ✭ 656 (-6.69%)
Mutual labels:  paper
Hugo Paper
🥛 A simple, clean, flexible Hugo theme
Stars: ✭ 538 (-23.47%)
Mutual labels:  paper
Bert paper chinese translation
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 论文的中文翻译 Chinese Translation!
Stars: ✭ 564 (-19.77%)
Mutual labels:  paper
Recommendersystem Paper
This repository includes some papers that I have read or which I think may be very interesting.
Stars: ✭ 619 (-11.95%)
Mutual labels:  paper
Qlib
Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try your ideas to create better Quant investment strategies. An increasing number of SOTA Quant research works/papers are released in Qlib.
Stars: ✭ 7,582 (+978.52%)
Mutual labels:  paper
Awesome Economics
A curated collection of links for economists
Stars: ✭ 688 (-2.13%)
Mutual labels:  paper
Deeptype
Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"
Stars: ✭ 571 (-18.78%)
Mutual labels:  paper
Minecraftdev
Plugin for IntelliJ IDEA that gives special support for Minecraft modding projects.
Stars: ✭ 645 (-8.25%)
Mutual labels:  paper
Mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
Stars: ✭ 548 (-22.05%)
Mutual labels:  paper
Cv paperdaily
CV 论文笔记
Stars: ✭ 555 (-21.05%)
Mutual labels:  paper
Awesome Interaction Aware Trajectory Prediction
A selection of state-of-the-art research materials on trajectory prediction
Stars: ✭ 625 (-11.1%)
Mutual labels:  paper
Srflow
Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch
Stars: ✭ 537 (-23.61%)
Mutual labels:  paper
Dl Nlp Readings
My Reading Lists of Deep Learning and Natural Language Processing
Stars: ✭ 656 (-6.69%)
Mutual labels:  paper
Cvpr 2019 Paper Statistics
Statistics and Visualization of acceptance rate, main keyword of CVPR 2019 accepted papers for the main Computer Vision conference (CVPR)
Stars: ✭ 527 (-25.04%)
Mutual labels:  paper
Dnc Tensorflow
A TensorFlow implementation of DeepMind's Differential Neural Computers (DNC)
Stars: ✭ 587 (-16.5%)
Mutual labels:  paper
Densenet
DenseNet implementation in Keras
Stars: ✭ 693 (-1.42%)
Mutual labels:  paper
Multiagent Competition
Code for the paper "Emergent Complexity via Multi-agent Competition"
Stars: ✭ 663 (-5.69%)
Mutual labels:  paper
All About The Gan
All About the GANs(Generative Adversarial Networks) - Summarized lists for GAN
Stars: ✭ 630 (-10.38%)
Mutual labels:  paper

Status: Archive (code is provided as-is, no updates expected)

Large-Scale Study of Curiosity-Driven Learning

[Project Website] [Demo Video]

Yuri Burda*, Harri Edwards*, Deepak Pathak*,
Amos Storkey, Trevor Darrell, Alexei A. Efros
(* alphabetical ordering, equal contribution)

University of California, Berkeley
OpenAI
University of Edinburgh

This is a TensorFlow based implementation for our paper on large-scale study of curiosity-driven learning across 54 environments. Curiosity is a type of intrinsic reward function which uses prediction error as reward signal. In this paper, We perform the first large-scale study of purely curiosity-driven learning, i.e. without any extrinsic rewards, across 54 standard benchmark environments. We further investigate the effect of using different feature spaces for computing prediction error and show that random features are sufficient for many popular RL game benchmarks, but learned features appear to generalize better (e.g. to novel game levels in Super Mario Bros.). If you find this work useful in your research, please cite:

@inproceedings{largeScaleCuriosity2018,
    Author = {Burda, Yuri and Edwards, Harri and
              Pathak, Deepak and Storkey, Amos and
              Darrell, Trevor and Efros, Alexei A.},
    Title = {Large-Scale Study of Curiosity-Driven Learning},
    Booktitle = {arXiv:1808.04355},
    Year = {2018}
}

Installation and Usage

The following command should train a pure exploration agent on Breakout with default experiment parameters.

python run.py

To use more than one gpu/machine, use MPI (e.g. mpiexec -n 8 python run.py should use 1024 parallel environments to collect experience instead of the default 128 on an 8 gpu machine).

Data for plots in paper

Data for Figure-2: contains raw game score data along with the plotting code to generate Figure-2 in the paper.

Other helpful pointers

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].