
Guided Evolutionary Strategies

Link to demo notebooks:

Link to paper: [arXiv:1806.10230](https://arxiv.org/abs/1806.10230)

Overview

Many applications in machine learning require optimizing a function whose true gradient is unknown, but where surrogate gradient information (directions that may be correlated with, but not necessarily identical to, the true gradient) is available instead. This arises when an approximate gradient is easier to compute than the full gradient (e.g. in meta-learning or unrolled optimization), or when a true gradient is intractable and is replaced with a surrogate (e.g. in certain reinforcement learning applications, or when using synthetic gradients).

Here, we propose Guided Evolutionary Strategies (Guided ES), a method for optimally using surrogate gradient directions along with random search. We define a search distribution for evolutionary strategies that is elongated along a guiding subspace spanned by the surrogate gradients. This allows us to estimate a descent direction which can then be passed to a first-order optimizer.
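Concretely, the search distribution and gradient estimate take roughly the following form (a sketch from the paper's description; the exact parameterization and scaling constants, in particular β, should be checked against arXiv:1806.10230):

```latex
% Search covariance: a mix of isotropic noise and the guiding subspace
% spanned by the orthonormal columns of U \in \mathbb{R}^{n \times k}.
\Sigma = \frac{\alpha}{n} I_n + \frac{1-\alpha}{k} U U^\top,
\qquad \epsilon_i \sim \mathcal{N}(0, \sigma^2 \Sigma)

% Antithetic finite-difference gradient estimate over P sample pairs.
g = \frac{\beta}{2 \sigma^2 P} \sum_{i=1}^{P}
    \epsilon_i \left[ f(x + \epsilon_i) - f(x - \epsilon_i) \right]
```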

This repository contains a Colaboratory (Colab) notebook with a demo of the method on a toy problem (described below).

Introduction

Imagine you have a function you would like to optimize, but you only have access to approximate gradients of the function. There are two approaches to optimization. On one hand, you could ignore the surrogate gradient information entirely and perform zeroth-order optimization, using methods such as evolutionary strategies to estimate a descent direction. These methods exhibit poor convergence properties when the parameter dimension is large. On the other hand, you could directly feed the surrogate gradients to a first-order optimization algorithm. However, bias in the surrogate gradients will interfere with optimizing the target problem. Ideally, we would like a method that combines the complementary strengths of these two approaches: we would like to combine the unbiased descent direction estimated with evolutionary strategies with the low-variance estimate given by the surrogate gradient. We propose a method for doing this called guided evolutionary strategies (Guided ES).

Method

Our idea is to keep track of a low-dimensional subspace, defined by the recent history of surrogate gradients during optimization (inspired by quasi-Newton methods), which we call the guiding subspace.

We then perform a finite difference random search (as in evolutionary strategies) preferentially within this subspace. By concentrating our search samples in a low-dimensional subspace where the true gradient has non-negligible support, we can dramatically reduce the variance of our search direction.
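As a rough illustration (this is not the code from the demo notebook; the function name, defaults, and hyperparameters below are made up for the sketch), sampling from this elongated distribution and forming an antithetic gradient estimate might look like:

```python
import numpy as np

def guided_es_grad(f, x, surrogate_grads, sigma=0.1, alpha=0.5, num_pairs=10, beta=2.0):
    """Sketch of an antithetic Guided ES gradient estimate.

    f               : scalar loss function of x
    x               : current parameters, shape (n,)
    surrogate_grads : recent surrogate gradients, shape (n, k), spanning the
                      guiding subspace
    alpha           : trade-off between full-space and subspace search
    """
    n, k = surrogate_grads.shape
    # Orthonormal basis U for the guiding subspace.
    U, _ = np.linalg.qr(surrogate_grads)

    grad_est = np.zeros(n)
    for _ in range(num_pairs):
        # Sample from N(0, sigma^2 * (alpha/n * I + (1-alpha)/k * U U^T))
        # without forming the full covariance matrix.
        eps = sigma * (np.sqrt(alpha / n) * np.random.randn(n)
                       + np.sqrt((1 - alpha) / k) * U @ np.random.randn(k))
        # Antithetic finite-difference estimate along the sampled direction.
        grad_est += (f(x + eps) - f(x - eps)) * eps
    return beta / (2 * sigma**2 * num_pairs) * grad_est
```

The resulting estimate can then be handed to any first-order optimizer (SGD, Adam, etc.) in place of the true gradient.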

The figure panel (a) below depicts the geometry underlying our method. Instead of the true gradient (blue arrow), we are given a surrogate gradient (white arrow) which is correlated with the true gradient. We use this to form a guiding distribution (denoted by the white contours) and draw samples from it (white dots) as part of a random search procedure.

Schematic of guided evolutionary strategies

In panel (b), we demonstrate the performance of the method on a toy problem. The problem consists of a random quadratic function, where we add an explicit bias and random noise to the gradient. Following the gradient directly with SGD (orange curve) makes rapid initial progress but eventually diverges due to the bias in the gradient. Evolutionary strategies (and an adaptive variant, CMA-ES) succeed in minimizing the true function, but proceed slowly and ignore the gradient information.

Guided ES, on the other hand, combines the strengths of these two approaches.
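For reference, a minimal version of this toy setup can be built as follows. This is only a sketch under assumed bias and noise scales (the scales and learning rate used in the paper and notebook may differ); plain SGD on the surrogate gradient illustrates the failure mode described above:

```python
import numpy as np

rng = np.random.RandomState(0)
n = 1000                               # parameter dimension
A = rng.randn(n, n) / np.sqrt(n)       # random quadratic: f(x) = 0.5 ||A x - b||^2
b = rng.randn(n)

def loss(x):
    return 0.5 * np.sum((A @ x - b) ** 2)

def true_grad(x):
    return A.T @ (A @ x - b)

bias = rng.randn(n)                    # fixed bias direction, shared across steps

def surrogate_grad(x, bias_scale=1.0, noise_scale=1.0):
    # Surrogate = true gradient + explicit bias + fresh Gaussian noise.
    return true_grad(x) + bias_scale * bias + noise_scale * rng.randn(n)

# Plain SGD on the surrogate gradient: rapid initial progress, but the bias
# keeps it from reaching the true minimum. Guided ES would instead use these
# surrogate gradients only to define the guiding subspace for the estimator
# sketched above.
x = np.zeros(n)
for step in range(500):
    x -= 0.05 * surrogate_grad(x)
print("final loss:", loss(x))
```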

Citation

If you use this code, please consider citing our paper:

```bibtex
@article{maheswaranathan2018guided,
  title = {Guided evolutionary strategies: escaping the curse of dimensionality in random search},
  author = {Niru Maheswaranathan and Luke Metz and Dami Choi and George Tucker and Jascha Sohl-Dickstein},
  year = {2018},
  eprint = {arXiv:1806.10230},
  url = {https://arxiv.org/abs/1806.10230},
}
```

Contact

Authors: Niru Maheswaranathan, Luke Metz, Dami Choi, George Tucker, and Jascha Sohl-Dickstein.

This is not an officially supported Google product.
