Differentiating Policies for Non-Myopic Bayesian Optimization

Abstract

Bayesian optimization (BO) methods choose sample points by optimizing an acquisition function derived from a statistical model of the objective. These acquisition functions are chosen to balance sampling regions with predicted good objective values against exploring regions where the objective is uncertain. Standard acquisition functions are myopic, considering only the impact of the next sample, but non-myopic acquisition functions may be more effective. In principle, one could model the sampling by a Markov decision process, and optimally choose the next sample by maximizing an expected reward computed by dynamic programming; however, this is infeasibly expensive. More practical approaches, such as rollout, consider a parametric family of sampling policies. In this paper, we show how to efficiently estimate rollout acquisition functions and their gradients, enabling stochastic gradient-based optimization of sampling policies.

Software Design

To enable the computations of interest, we express each core computational concept as its own abstract type and build our implementations from there. We begin by enumerating the fundamental abstract types that appear throughout the framework. The depiction below follows Julia's type system: each leaf node is a concrete type, and all other nodes are abstract.

AbstractKernel
- StationaryKernel
  - RadialBasisFunction
AbstractCostFunction
- KnownCostFunction
  - UniformCost
  - NonUniformCost
- UnknownCostFunction (can be unknown and deterministic; fix)
  - GaussianProcessCost
AbstractSurrogate
- Surrogate
- AbstractFantasySurrogate
  - FantasySurrogate
  - AbstractPerturbationSurrogate
    - SpatialPerturbationSurrogate
    - DataPerturbationSurrogate
AbstractObservable
- DeterministicObservable
- StochasticObservable
AbstractTrajectory (fix typing on concrete types)
- ForwardTrajectory
- AdjointTrajectory
AbstractDecisionRule
- DecisionRule

Surrogate depends on RadialBasisFunction and on DecisionRule. FantasySurrogate depends on a base surrogate, which is of type Surrogate. SpatialPerturbationSurrogate and DataPerturbationSurrogate each depend on FantasySurrogate. AdjointTrajectory depends on either a DeterministicObservable or a StochasticObservable.
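
The hierarchy and dependencies above can be sketched in Julia roughly as follows. This is a minimal illustration, not the repository's actual definitions: only the type names come from the tree above, while every field name (`lengthscale`, `base`, `reference`, `horizon`, and so on), the squared-exponential form of the kernel, and the constructor arguments are assumptions made for the example. The cost-function types are omitted for brevity.

```julia
# Hypothetical sketch: type names follow the README; all fields, the kernel
# formula, and the constructor arguments are assumptions for illustration.

abstract type AbstractKernel end
abstract type StationaryKernel <: AbstractKernel end

struct RadialBasisFunction <: StationaryKernel
    lengthscale::Float64                 # assumed hyperparameter
end

# Assumed squared-exponential form, evaluated at a distance r.
(k::RadialBasisFunction)(r::Real) = exp(-r^2 / (2 * k.lengthscale^2))

abstract type AbstractDecisionRule end
struct DecisionRule{F} <: AbstractDecisionRule
    f::F                                 # assumed: a callable rule
end

abstract type AbstractSurrogate end

# Surrogate depends on a kernel and a decision rule.
struct Surrogate{K<:AbstractKernel,D<:AbstractDecisionRule} <: AbstractSurrogate
    kernel::K
    rule::D
    X::Matrix{Float64}                   # assumed layout: d x n sample locations
    y::Vector{Float64}                   # assumed: observed values
end

abstract type AbstractFantasySurrogate <: AbstractSurrogate end

# FantasySurrogate depends on a base Surrogate.
struct FantasySurrogate{S<:Surrogate} <: AbstractFantasySurrogate
    base::S
    horizon::Int                         # assumed: number of fantasized steps
end

abstract type AbstractPerturbationSurrogate <: AbstractFantasySurrogate end

# Both perturbation surrogates depend on a FantasySurrogate.
struct SpatialPerturbationSurrogate{F<:FantasySurrogate} <: AbstractPerturbationSurrogate
    reference::F
end
struct DataPerturbationSurrogate{F<:FantasySurrogate} <: AbstractPerturbationSurrogate
    reference::F
end

abstract type AbstractObservable end
struct DeterministicObservable{F} <: AbstractObservable
    f::F
end
struct StochasticObservable{F} <: AbstractObservable
    f::F
end

abstract type AbstractTrajectory end
struct ForwardTrajectory <: AbstractTrajectory
    horizon::Int
end

# AdjointTrajectory depends on either kind of observable.
struct AdjointTrajectory{O<:AbstractObservable} <: AbstractTrajectory
    observable::O
    horizon::Int
end

# Usage: build the dependency chain from the bottom up.
rbf  = RadialBasisFunction(1.0)
rule = DecisionRule(identity)
s    = Surrogate(rbf, rule, zeros(2, 0), Float64[])
fs   = FantasySurrogate(s, 2)
sp   = SpatialPerturbationSurrogate(fs)
dp   = DataPerturbationSurrogate(fs)
at   = AdjointTrajectory(DeterministicObservable(sin), 2)
```

Parameterizing each wrapper on the type it wraps keeps the dependencies visible in the type signature and lets methods dispatch on any level of the tree, for example on `AbstractFantasySurrogate` to cover the fantasy surrogate and both perturbation surrogates at once.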

DarianNwankwo/Rollout-Bayesian-Optimization
