All Projects → wangronin → MIP-EGO

wangronin / MIP-EGO

Licence: MIT license
Mixed-Integer Parallel Efficient Global Optimization

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to MIP-EGO

Hypertunity
A toolset for black-box hyperparameter optimisation.
Stars: ✭ 119 (+357.69%)
Mutual labels:  bayesian-optimization
Lstm anomaly thesis
Anomaly detection for temporal data using LSTMs
Stars: ✭ 178 (+584.62%)
Mutual labels:  bayesian-optimization
approxposterior
A Python package for approximate Bayesian inference and optimization using Gaussian processes
Stars: ✭ 36 (+38.46%)
Mutual labels:  bayesian-optimization
Pysot
Surrogate Optimization Toolbox for Python
Stars: ✭ 136 (+423.08%)
Mutual labels:  bayesian-optimization
Mlrmbo
Toolbox for Bayesian Optimization and Model-Based Optimization in R
Stars: ✭ 173 (+565.38%)
Mutual labels:  bayesian-optimization
Cornell Moe
A Python library for the state-of-the-art Bayesian optimization algorithms, with the core implemented in C++.
Stars: ✭ 198 (+661.54%)
Mutual labels:  bayesian-optimization
Chocolate
A fully decentralized hyperparameter optimization framework
Stars: ✭ 112 (+330.77%)
Mutual labels:  bayesian-optimization
mango
Parallel Hyperparameter Tuning in Python
Stars: ✭ 241 (+826.92%)
Mutual labels:  bayesian-optimization
Scikit Optimize
Sequential model-based optimization with a `scipy.optimize` interface
Stars: ✭ 2,258 (+8584.62%)
Mutual labels:  bayesian-optimization
pyrff
pyrff: Python implementation of random fourier feature approximations for gaussian processes
Stars: ✭ 24 (-7.69%)
Mutual labels:  bayesian-optimization
Limbo
A lightweight framework for Gaussian processes and Bayesian optimization of black-box functions (C++-11)
Stars: ✭ 157 (+503.85%)
Mutual labels:  bayesian-optimization
Goptuna
Decentralized hyperparameter optimization framework, inspired by Optuna.
Stars: ✭ 169 (+550%)
Mutual labels:  bayesian-optimization
Gpflowopt
Bayesian Optimization using GPflow
Stars: ✭ 229 (+780.77%)
Mutual labels:  bayesian-optimization
Nasbot
Neural Architecture Search with Bayesian Optimisation and Optimal Transport
Stars: ✭ 120 (+361.54%)
Mutual labels:  bayesian-optimization
open-box
Generalized and Efficient Blackbox Optimization System.
Stars: ✭ 64 (+146.15%)
Mutual labels:  bayesian-optimization
Bopp
BOPP: Bayesian Optimization for Probabilistic Programs
Stars: ✭ 112 (+330.77%)
Mutual labels:  bayesian-optimization
Hyperactive
A hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.
Stars: ✭ 182 (+600%)
Mutual labels:  bayesian-optimization
AutoPrognosis
Codebase for "AutoPrognosis: Automated Clinical Prognostic Modeling via Bayesian Optimization", ICML 2018.
Stars: ✭ 47 (+80.77%)
Mutual labels:  bayesian-optimization
chemprop
Fast and scalable uncertainty quantification for neural molecular property prediction, accelerated optimization, and guided virtual screening.
Stars: ✭ 75 (+188.46%)
Mutual labels:  bayesian-optimization
Tune Sklearn
A drop-in replacement for Scikit-Learn’s GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques.
Stars: ✭ 241 (+826.92%)
Mutual labels:  bayesian-optimization

Mixed-Integer Parallel Efficient Global Optimization

A Python implementation of the Efficient Global Optimization (EGO) / Bayesian Optimization (BO) algorithm for decision spaces composed of either real, integer, catergorical variables, or a mixture thereof.

Underpinned by surrogate models, this algorithm iteratively proposes candidate solutions using the so-called acquisition function which balances exploration with exploitation, and updates the surrogate model with newly observed objective values.

The project is structured as follows:

  • MIPEGO/base.py: the base class of Bayesian Optimization.
  • MIPEGO/BayesOpt.py contains several BO variants:
    • BO: noiseless + seqential
    • ParallelBO: noiseless + parallel (a.k.a. batch-sequential)
    • AnnealingBO: noiseless + parallel + annealling [WEB18]
    • SelfAdaptiveBO: noiseless + parallel + self-adaptive [WEB19]
    • NoisyBO: noisy + parallel
    • PCABO: noiseless + parallel + PCA-assisted dimensionality reduction [RaponiWBBD20] [Under Construction]
  • MIPEGO/InfillCriteria.py: the implemetation of acquisition functions (see below for the list of implemented ones).
  • MIPEGO/Surrogate.py: the implementation/wrapper of sklearn's random forests model.
  • MIPEGO/SearchSpace.py: implementation of the search/decision space.

Features

This implementation differs from alternative packages/libraries in the following features:

  • Parallelization, also known as batch-sequential optimization, for which several different approaches are implemented here.
  • Moment-Generating Function of the improvment (MGFI) [WvSEB17a] is a recently proposed acquistion function, which implictly controls the exploration-exploitation trade-off.
  • Mixed-Integer Evolution Strategy for optimizing the acqusition function, which is enabled when the search space is a mixture of real, integer, and categorical variables.

Acqusition Functions

The following infill-criteria are implemented in the library:

  • Expected Improvement (EI)
  • Probability of Improvement (PI) / Probability of Improvement
  • Upper Confidence Bound (UCB)
  • Moment-Generating Function of Improvement (MGFI)
  • Generalized Expected Improvement (GEI) [Under Construction]

For sequential working mode, Expected Improvement is used by default. For parallelization mode, MGFI is enabled by default.

Surrogate Model

The meta (surrogate)-model used in Bayesian optimization. The basic requirement for such a model is to provide the uncertainty quantification (either empirical or theorerical) for the prediction. To easily handle the categorical data, random forest model is used by default. The implementation here is based the one in scikit-learn, with modifications on uncertainty quantification.

Installation

pip install mipego

Exemplary Use Case

To use the optimizer you need to define an objective function, the search space and configure the optimizer. Below are two examples that describe most of the functionality.

Optimizing A Black-Box Function

In this example we optimize a mixed integer black box problem.

import numpy as np
from MIPEGO import ParallelBO, ContinuousSpace, OrdinalSpace, NominalSpace, RandomForest

seed = 666
np.random.seed(seed)
dim_r = 2  # dimension of the real values

def obj_fun(x):
    x_r = np.array([x['continuous_%d'%i] for i in range(dim_r)])
    x_i = x['ordinal']
    x_d = x['nominal']
    _ = 0 if x_d == 'OK' else 1
    return np.sum(x_r ** 2) + abs(x_i - 10) / 123. + _ * 2

# Continuous variables can be specified as follows:
# a 2-D variable in [-5, 5]^2
# for 2 variables, the naming scheme is continuous0, continuous1
C = ContinuousSpace([-5, 5], var_name='continuous') * dim_r

# Integer (ordinal) variables can be specified as follows:
# The domain of integer variables can be given as with continuous ones
# var_name is optional
I = OrdinalSpace([5, 15], var_name='ordinal')

# Discrete (nominal) variables can be specified as follows:
# No lb, ub... a list of categories instead
N = NominalSpace(['OK', 'A', 'B', 'C', 'D', 'E', 'F', 'G'], var_name='nominal')

# The whole search space can be constructed:
search_space = C + I + N

# Bayesian optimization also uses a Surrogate model
# For mixed variable type, the random forest is typically used
model = RandomForest(levels=search_space.levels)

opt = ParallelBO(
    search_space=search_space,
    obj_fun=obj_fun,
    model=model,
    max_FEs=50,
    DoE_size=3,    # the initial DoE size
    eval_type='dict',
    acquisition_fun='MGFI',
    acquisition_par={'t' : 2},
    n_job=3,       # number of processes
    n_point=3,     # number of the candidate solution proposed in each iteration
    verbose=True   # turn this off, if you prefer no output
)
xopt, fopt, stop_dict = opt.run()

print('xopt: {}'.format(xopt))
print('fopt: {}'.format(fopt))
print('stop criteria: {}'.format(stop_dict))

Neural Architecture Search

In this example we optimize a neural network architecture on the well-known MNIST dataset. The objective function is computed in this file. In the objective file the neural network architecture is defined and evaluated on the MNIST dataset. In this example, the optimizer proposes 4 candidates architectures which are trained on 4 GPUs simultaneously.

A brief Introduction to Bayesian Optimization

Bayesian Optimization [Moc74, JSW98] (BO) is a sequential optimization strategy originally proposed to solve the single-objective black-box optimiza-tion problem that is costly to evaluate. Here, we shall restrict our discussion to the single-objective case. BO typically starts with sampling an initial design of experiment (DoE) of size, X={x1,x2,...,xn}, which is usually generated by simple random sampling, Latin Hypercube Sampling [SWN03], or the more sophisticated low-discrepancy sequence [Nie88] (e.g., Sobol sequences). Taking the initial DoE X and its corresponding objective value, Y={f(x1), f(x2),..., f(xn)} ⊆ ℝ, we proceed to construct a statistical model M describing the probability distribution of the objective function conditioned onthe initial evidence, namely Pr(f|X,Y). In most application scenarios of BO, there is a lack of a priori knowledge about f and therefore nonparametric models (e.g., Gaussian process regression or random forest) are commonly chosen for M, which gives rise to a predictor f'(x) for all x ∈ X and an uncertainty quantification s'(x) that estimates, for instance, the mean squared error of the predic-tion E(f'(x)−f(x))2. Based on f' and s', promising points can be identified via the so-called acquisition function which balances exploitation with exploration of the optimization process.

Contributing

Please take a look at our contributing guidelines if you're interested in helping!

Cite Us

You can find our paper on IEEE Explore and on Arxiv.
When using MiP-EGO for your research, please cite us as follows:

@inproceedings{van2019automatic,
    title={Automatic Configuration of Deep Neural Networks with Parallel Efficient Global Optimization},
    author={van Stein, Bas and Wang, Hao and B{\"a}ck, Thomas},
    booktitle={2019 International Joint Conference on Neural Networks (IJCNN)},
    pages={1--7},
    year={2019},
    organization={IEEE}
}

Reference

  • [Moc74] Jonas Mockus. "On bayesian methods for seeking the extremum". In Guri I. Marchuk, editor, Optimization Techniques, IFIP Technical Conference, Novosibirsk, USSR, July 1-7, 1974, volume 27 of Lecture Notes in Computer Science, pages 400–404. Springer, 1974.
  • [JSW98] Donald R. Jones, Matthias Schonlau, and William J. Welch. "Efficient global optimization of expensive black-box functions". J. Glob. Optim., 13(4):455–492, 1998.
  • [SWN03] Thomas J. Santner, Brian J. Williams, and William I. Notz. "The Design and Analysis of Computer Experiments". Springer series in statistics. Springer, 2003.
  • [Nie88] Harald Niederreiter. "Low-discrepancy and low-dispersion sequences". Journal of number theory, 30(1):51–70, 1988.
  • [WvSEB17a] Hao Wang, Bas van Stein, Michael Emmerich, and Thomas Bäck. "A New Acquisition Function for Bayesian Optimization Based on the Moment-Generating Function". In Systems, Man, and Cybernetics (SMC), 2017 IEEE International Conference on, pages 507–512. IEEE, 2017.
  • [WEB18] Hao Wang, Michael Emmerich, and Thomas Bäck. "Cooling Strategies for the Moment-Generating Function in Bayesian Global Optimization". In 2018 IEEE Congress on Evolutionary Computation, CEC 2018, Rio de Janeiro, Brazil, July 8-13, 2018, pages 1–8. IEEE, 2018.
  • [WEB19] Hao, Wang, Michael Emmerich, and Thomas Bäck. "Towards self-adaptive efficient global optimization". In AIP Conference Proceedings, vol. 2070, no. 1, p. 020056. AIP Publishing LLC, 2019.
  • [RaponiWBBD20] Elena Raponi, Hao Wang, Mariusz Bujny, Simonetta Boria, and Carola Doerr: "High Dimensional Bayesian Optimization Assisted by Principal Component Analysis". In International Conference on Parallel Problem Solving from Nature, pp. 169-183. Springer, Cham, 2020.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].