Zi Wang* Beomjoon Kim* Leslie Pack Kaelbling
Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior
Dec 5 @ NeurIPS 18
Regret bounds for meta Bayesian optimization with an unknown Gaussian - - PowerPoint PPT Presentation
Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior Zi Wang* Beomjoon Kim* Leslie Pack Kaelbling Dec 5 @ NeurIPS 18 Poster #22 Bayesian optimization x * = argmax Goal: f ( x ) x
Zi Wang* Beomjoon Kim* Leslie Pack Kaelbling
Dec 5 @ NeurIPS 18
x∈𝔜
x∈𝔜
Assume a GP prior f ∼ GP(μ, k) LOOP
1 2 3
1 2
f(x) x
x∈𝔜
Assume a GP prior f ∼ GP(μ, k) LOOP
1 2 3
1 2
f(x) x
x∈𝔜
Assume a GP prior f ∼ GP(μ, k) LOOP
1 2 3
1 2
f(x) x
e.g. by maximizing marginal data likelihood every few iterations
x∈𝔜
Assume a GP prior f ∼ GP(μ, k)
LOOP
e.g. by maximizing marginal data likelihood every few iterations
x∈𝔜
Assume a GP prior f ∼ GP(μ, k)
LOOP
e.g. by maximizing marginal data likelihood every few iterations
x
x
x
x x
ˆ µ(x) ˆ µ(x) ± 3 q ˆ k(x)
Estimated prior
Estimate the GP prior from offline data sampled from the same prior
x
ˆ µ0(x) ˆ µ0(x) ± ζ1 q ˆ k0(x)
x
ˆ µ(x) ˆ µ(x) ± 3 q ˆ k(x)
Estimated prior
Estimate the GP prior from offline data sampled from the same prior Construct unbiased estimators of the posterior and use a variant of GP-UCB
x
ˆ µ1(x) ˆ µ1(x) ± ζ2 q ˆ k1(x)
x
ˆ µ(x) ˆ µ(x) ± 3 q ˆ k(x)
Estimated prior
Estimate the GP prior from offline data sampled from the same prior Construct unbiased estimators of the posterior and use a variant of GP-UCB
x
ˆ µ2(x) ˆ µ2(x) ± ζ3 q ˆ k2(x)
x
ˆ µ(x) ˆ µ(x) ± 3 q ˆ k(x)
Estimated prior
Estimate the GP prior from offline data sampled from the same prior Construct unbiased estimators of the posterior and use a variant of GP-UCB
x
ˆ µ3(x) ˆ µ3(x) ± ζ4 q ˆ k3(x)
x
ˆ µ(x) ˆ µ(x) ± 3 q ˆ k(x)
Estimated prior
Estimate the GP prior from offline data sampled from the same prior Construct unbiased estimators of the posterior and use a variant of GP-UCB
x
ˆ µ4(x) ˆ µ4(x) ± ζ5 q ˆ k4(x)
x
ˆ µ(x) ˆ µ(x) ± 3 q ˆ k(x)
Estimated prior
Estimate the GP prior from offline data sampled from the same prior Construct unbiased estimators of the posterior and use a variant of GP-UCB
ˆ µt(x) ˆ µt(x) ± ζt+1 q ˆ kt(x)
ˆ µt(x) ˆ µt(x) ± ζt+1 q ˆ kt(x)
x x
Important assumptions:
constant
Given , with high probability, simple regret RT ≲
linear kernel
≈ 10
Results for continuous input space @ poster #22
#evaluations of test function
Max observed value
—Our method —UCB —TransLearn —Rand
5 10 15 20 25 30 6 5 4 3 2
proportion of meta-training data
—Our method —UCB
0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 6 5 4 3 2