Bayesian excursion set estimation with GPs Dario Azzimonti 1 Joint - - PowerPoint PPT Presentation

bayesian excursion set estimation with gps
SMART_READER_LITE
LIVE PREVIEW

Bayesian excursion set estimation with GPs Dario Azzimonti 1 Joint - - PowerPoint PPT Presentation

Bayesian excursion set estimation with GPs Dario Azzimonti 1 Joint works with: David Ginsbourger, Cl ement Chevalier, Julien Bect, Yann Richet 1 Institute of Mathematical Statistics and Actuarial Science University of Bern, Switzerland (Visiting


slide-1
SLIDE 1

Bayesian excursion set estimation with GPs

Dario Azzimonti 1

Joint works with: David Ginsbourger, Cl´ ement Chevalier, Julien Bect, Yann Richet

1Institute of Mathematical Statistics and Actuarial Science

University of Bern, Switzerland (Visiting UQOD group, Idiap Research Institute)

Uncertainty Quantification Workshop Sheffield, UK September 15, 2016

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 1 / 43

slide-2
SLIDE 2

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

Outline

Introduction Expectations of random closed sets Vorob’ev expectation Distance average approach Quasi-realizations for excursion sets estimation Approximate field Optimal design Implementation Assessing uncertainties with the distance transform Conservative estimates Definition Computational issues GanMC method Test case

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 2 / 43

slide-3
SLIDE 3

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

Outline

Introduction Expectations of random closed sets Vorob’ev expectation Distance average approach Quasi-realizations for excursion sets estimation Approximate field Optimal design Implementation Assessing uncertainties with the distance transform Conservative estimates Definition Computational issues GanMC method Test case

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 3 / 43

slide-4
SLIDE 4

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

The framework

In this talk we focus on the problem of determining the set Γ ⋆ = {x ∈ D : f (x) ∈ T} = f −1(T) where D ⊂ Rd is compact, f : D − → Rk is measurable, T ⊂ Rk. Here: k = 1, f is continuous, and T = (−∞, t] for a fixed t ∈ R. Γ ⋆ = {x ∈ D : f (x) ≤ t} is denoted the excursion set of f below t. Objective Estimate Γ ⋆ and quantify uncertainty on it when f is evaluated only at a few points Xn = {x1, . . . , xn} ⊂ D.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 4 / 43

slide-5
SLIDE 5

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

The framework: IRSN test case

1 2 3 4 5 1 2 3 4 5

Moret test case (k_eff)

PuO2 density (cm3) Water thickness (cm3)

Test case:

◮ keff function of PuO2 density and

H2O thickness, D = [0.2, 5.2] × [0, 5];

◮ continuous function, expensive to

evaluate;

◮ n = 20 observations (black triangles);

Objective: estimate Γ ⋆ = {x ∈ D : f (x) ≤ t} and evaluate the uncertainty of the estimate.

Acknowledgements: Yann Richet, Institut de Radioprotection et de Sˆ uret´ e Nucleaire.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 5 / 43

slide-6
SLIDE 6

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

The framework: an example

Gaussian random field realization

Function f : D ⊂ Rd → R

◮ expensive to evaluate; ◮ continuous.

Evaluated at Xn = (x1, . . . , xn) (black triangles) with values fn = (f (x1), . . . , f (xn)). Objective: estimate Γ ⋆ = {x ∈ D : f (x) ≤ t} and evaluate the uncertainty of the estimate.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 6 / 43

slide-7
SLIDE 7

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

Bayesian approach

Bayesian framework: f is seen as one realization of a (GRF) (Zx)x∈D with prior mean m and covariance kernel k. Given the function evaluations fn the posterior field has a Gaussian distribution Z | (Z(Xn) = fn) with mean and covariance kernel mn(x) = m(x) + k(x, Xn)k(Xn, Xn)−1(fn − m(Xn)) kn(x, y) = k(x, y) − k(x, Xn)k(Xn, Xn)−1k(Xn, y) Γ ⋆ is a realization of Γ = {x ∈ D : Zx ≤ t} = Z −1((−∞, t])

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 7 / 43

slide-8
SLIDE 8

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

A prior on the space of functions

Assume: f realization of (Zx)x∈D, Gaussian Random Field (GRF) Prior: (Zx)x∈D with

◮ a.s. continuous paths; ◮ Mat´

ern covariance kernel k (ν = 3/2);

◮ constant mean function m.

Given n = 15 evaluations fn at Xn Posterior field: Z | ZXn = fn with mean mn and covariance kn.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 8 / 43

Posterior GRF realization

slide-9
SLIDE 9

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

Distribution of excursion sets

The posterior field defines posterior distribution on excursion sets. Γ = {x ∈ D : Zx ≤ t}

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 9 / 43

Excursion set realization

slide-10
SLIDE 10

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

How to summarize the distribution on sets?

The posterior excursion set is a random closed set. Here we focus on Expectations of random closed sets1

◮ Vorob’ev expectation ◮ distance average expectation

Conservative estimates, based on Vorob’ev quantiles.

  • 1. for more definitions of expectation see Molchanov, I. (2005). Theory of Random Sets. Springer.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 10 / 43

slide-11
SLIDE 11

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates

Main references:

  • E. Vazquez and M. P. Martinez. (2006). Estimation of the volume of an excursion set
  • f a Gaussian process using intrinsic kriging. Tech Report. arXiv:math/0611273.

Ranjan, P., Bingham, D., and Michailidis, G. (2008). Sequential experiment design for contour estimation from complex computer codes. Technometrics, 50(4):527541. Bect, J., Ginsbourger, D., Li, L., Picheny, V., and Vazquez, E. (2012). Sequential design of computer experiments for the estimation of a probability of failure. Stat. Comput., 22 (3):773793. Chevalier, C., Bect, J., Ginsbourger, D., Vazquez, E., Picheny, V., and Richet, Y. (2014). Fast kriging-based stepwise uncertainty reduction with application to the identification of an excursion set. Technometrics. Chevalier, C., Ginsbourger, D., Bect, J., and Molchanov, I. (2013). Estimating and quantifying uncertainties on level sets using the Vorobev expectation and deviation with Gaussian process models. mODa 10. Bolin, D. and Lindgren, F. (2015), French, J. P. and Sain, S. R. (2013) and references therein...

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 11 / 43

slide-12
SLIDE 12

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Outline

Introduction Expectations of random closed sets Vorob’ev expectation Distance average approach Quasi-realizations for excursion sets estimation Approximate field Optimal design Implementation Assessing uncertainties with the distance transform Conservative estimates Definition Computational issues GanMC method Test case

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 12 / 43

slide-13
SLIDE 13

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Vorob’ev quantiles

The function pn : x ∈ D → pn(x) = Pn(x ∈ Γ) ∈ [0, 1] is the coverage function of Γ, where Pn(·) = P(· | ZXn = fn).

Coverage probability function

In the Gaussian case

◮ fast to compute

pn(x) = Φ

  • mn(x)−t

kn(x,x)

  • ◮ marginal statement

◮ creates a family of set estimates

Qρ = {x ∈ D : pn(x) ≥ ρ}

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 13 / 43

slide-14
SLIDE 14

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Vorob’ev expectation

Consider a Borel measure µ on D. From the family of “quantiles” Qρ we can choose Q

ρ such that µ(Q ρ) = E[µ(Γ)].

Vorob'ev expectation

Properties:

◮ based on the measure µ; ◮ for some choices of µ, fast to

compute;

◮ no confidence statements on the

set.

Chevalier, C., Ginsbourger, D., Bect, J., and Molchanov, I. (2013). Estimating and quantifying uncertainties on level sets using the Vorobev expectation and deviation with Gaussian process models. mODa 10

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 14 / 43

slide-15
SLIDE 15

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Distance average approach

Consider the distance function d : (x, Γ) → d(x, Γ). Γ is random therefore d(x, Γ) is a random variable for each x ∈ D.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 15 / 43 Posterior GRF realization Distance transform of set realization

slide-16
SLIDE 16

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Distance average expectation

Given the distance function d(x, Γ), the expected distance function d(x) = E[d(x, Γ)] The distance average expectation of Γ is the set EDF[Γ] = {x ∈ D : d(x) ≤ ε} where ε is chosen in order to obtain a distance function for the set EDF[Γ] as “close” as possible to d in a L2 sense. An uncertainty assessment for the estimate is DFVΓ = Ed(·) − d(·, Γ)2

2

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 16 / 43

slide-17
SLIDE 17

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Distance average expectation

Given the distance function d(x, Γ), the expected distance function d(x) = E[d(x, Γ)] The distance average expectation of Γ is the set EDF[Γ] = {x ∈ D : d(x) ≤ ε} where ε is chosen in order to obtain a distance function for the set EDF[Γ] as “close” as possible to d in a L2 sense. An uncertainty assessment for the estimate is DFVΓ = Ed(·) − d(·, Γ)2

2

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 16 / 43

slide-18
SLIDE 18

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Distance average expectation

Given the distance function d(x, Γ), the expected distance function d(x) = E[d(x, Γ)] The distance average expectation of Γ is the set EDF[Γ] = {x ∈ D : d(x) ≤ ε} where ε is chosen in order to obtain a distance function for the set EDF[Γ] as “close” as possible to d in a L2 sense. An uncertainty assessment for the estimate is DFVΓ = Ed(·) − d(·, Γ)2

2

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 16 / 43

slide-19
SLIDE 19

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Distance average approach

Consider the distance function d : (x, Γ) → d(x, Γ). For each realization of Γ (expensive!) we compute the distance function and then consider an average over the functions.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 17 / 43 Posterior GRF realization Distance transform of set realization

slide-20
SLIDE 20

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Vorob’ev expectation Distance average approach

Distance average expectation

Distance average expectation

EDF[Γ] = {x ∈ D : d(x) ≤ ε}

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 18 / 43

slide-21
SLIDE 21

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Outline

Introduction Expectations of random closed sets Vorob’ev expectation Distance average approach Quasi-realizations for excursion sets estimation Approximate field Optimal design Implementation Assessing uncertainties with the distance transform Conservative estimates Definition Computational issues GanMC method Test case

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 19 / 43

slide-22
SLIDE 22

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

An approximate Gaussian random field

Assumption: the GRF Z has been evaluated at x1, . . . , xn ∈ D. We denote by ZE = (Ze1, . . . , Zem)′ the random vector of values of Z at E = {e1, . . . , em} ⊂ D. Here we focus on affine predictors of Z of the form

  • Zx = a(x) + bT(x)ZE

(x ∈ D), where a : D − → R is a trend function and b : D − → Rm is a vector-valued function of deterministic weights. Similarly, we approximate Γ by the excursion set of Z:

  • Γ = {x ∈ D :

Zx ≤ t}

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 20 / 43

slide-23
SLIDE 23

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Towards an optimal design of simulation points

The simulation points E could be chosen with a LHS design (m = 30) However, we do not control on how close is Γ to Γ

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 21 / 43 GRF quasi-realizations Distance transform

slide-24
SLIDE 24

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Towards an optimal design of simulation points

The simulation points E could be chosen with a LHS design (m = 30) However, we do not control on how close is Γ to Γ

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 21 / 43 GRF quasi-realizations Distance transform

slide-25
SLIDE 25

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

What distance between Γ to Γ?

Definition: the function (Γ1, Γ2) ∈ D × D − → dµ,n(Γ1, Γ2) = E[µ(Γ1∆Γ2) | ZXn = fn] is called expected distance in measure between Γ1, Γ2. Proposition: distance in measure between Γ and Γ a) If Z and Z are random fields such that Γ and Γ are random closed sets, D ⊂ Rd is compact and µ is a finite Borel measure on D, we have dµ,n(Γ, Γ) =

  • ρn,m(x)µ(dx) where

ρn,m(x) = Pn(x ∈ Γ∆ Γ) = Pn(Zx ≥ t, Zx < t) + Pn(Zx < t, Zx ≥ t)

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 22 / 43

slide-26
SLIDE 26

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Towards an optimal design of simulation points

Proposition: distance in measure between Γ and Γ b) If Z is Gaussian conditionally on ZXn with conditional mean mn and conditional covariance kernel kn, we get Pn(Zx ≥ t, Zx < t) = Φ2 (cn(x, E), Σn(x, E)), with cn(x, E) = mn(x) − t t − a(x) − b(x)Tmn(E)

  • and Σn(x, E) =
  • kn(x, x)

−b(x)Tkn(E, x) −b(x)Tkn(E, x) b(x)Tkn(E, E)b(x)

  • dario.azzimonti@stat.unibe.ch

Bayesian set estimation with GPs 23 / 43

slide-27
SLIDE 27

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Towards an optimal design of simulation points

c) Particular case: If Z is chosen as best linear unbiased predictor of Z given Z(Xn), then b(x) = kn(E, E)−1kn(E, x) so that Σn(x, E) = kn(x, x) −γn(x, E) −γn(x, E) γn(x, E)

  • where γn(x, E) = Varn[

Zx] = kn(E, x)Tkn(E, E)−1kn(E, x). Optimal design(s) of simulation points can be obtained by minimizing dµ,n(Γ, Γ(E)) =

  • Φ2 (cn(x, E), Σn(x, E)) + Φ2 (−cn(x, E), Σn(x, E)) µ(dx)
  • ver (e1, . . . , em) ∈ Dm.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 24 / 43

slide-28
SLIDE 28

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Towards an optimal design of simulation points

c) Particular case: If Z is chosen as best linear unbiased predictor of Z given Z(Xn), then b(x) = kn(E, E)−1kn(E, x) so that Σn(x, E) = kn(x, x) −γn(x, E) −γn(x, E) γn(x, E)

  • where γn(x, E) = Varn[

Zx] = kn(E, x)Tkn(E, E)−1kn(E, x). Optimal design(s) of simulation points can be obtained by minimizing dµ,n(Γ, Γ(E)) =

  • Φ2 (cn(x, E), Σn(x, E)) + Φ2 (−cn(x, E), Σn(x, E)) µ(dx)
  • ver (e1, . . . , em) ∈ Dm.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 24 / 43

slide-29
SLIDE 29

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Procedure overview

Approximate Z at each point with

  • Zx = a(x) + bT(x)ZE with E = {e1, . . . , em}

The points in E are chosen with one of the following algorithms: Algorithm A (Full criterion): sequential minimization of

dµ,n(Γ, Γ(E ∗

i )) =

  • Φ2 (cn(x, E ∗

i ), Σn(x, E ∗ i ))+Φ2 (−cn(x, E ∗ i ), Σn(x, E ∗ i )) µ(dx)

with respect to ei where E ∗

i = {e∗ 1, . . . , e∗ i−1} ∪ {ei};

Algorithm B (Fast heuristic): sequential maximization of ρn,E(x) = Φ2 (cn(x, E), Σn(x, E)) + Φ2 (−cn(x, E), Σn(x, E)) with respect to x;

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 25 / 43

slide-30
SLIDE 30

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Procedure overview

Approximate Z at each point with

  • Zx = a(x) + bT(x)ZE with E = {e1, . . . , em}

The points in E are chosen with one of the following algorithms: Algorithm A (Full criterion): sequential minimization of

dµ,n(Γ, Γ(E ∗

i )) =

  • Φ2 (cn(x, E ∗

i ), Σn(x, E ∗ i ))+Φ2 (−cn(x, E ∗ i ), Σn(x, E ∗ i )) µ(dx)

with respect to ei where E ∗

i = {e∗ 1, . . . , e∗ i−1} ∪ {ei};

Algorithm B (Fast heuristic): sequential maximization of ρn,E(x) = Φ2 (cn(x, E), Σn(x, E)) + Φ2 (−cn(x, E), Σn(x, E)) with respect to x;

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 25 / 43

slide-31
SLIDE 31

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Quasi realizations for distance average variability

A.D. and Bect, J. and Chevalier, C. and Ginsbourger, D. (2016) Quantifying uncertainties on excursion sets under a Gaussian random field prior. SIAM/ASA J.Uncertainty Quantification, 4(1):850–874. hal-01103644v2.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 26 / 43

GRF quasi-realizations Distance transform

slide-32
SLIDE 32

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Test case: negative Branin-Hoo function

Quantity of interest: DTVΓ,N = 1

N

N

i=1d∗ N(·) − d(·, Γi)2 2

Experimental set-up:

◮ 20 observation points; ◮ N = 10000 conditional simulations on a 50 × 50 grid; ◮ K = 100 replications of each experiment.

Methods:

  • 1. Full Monte Carlo simulations on the grid,
  • 2. Simulations at optimized points (A,B) and interpolation on the

same grid.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 27 / 43

slide-33
SLIDE 33

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Test case: negative Branin-Hoo function

Method 1: Full grid simulations

◮ Variability:

  • E [DTVΓ,10000] = 7.2 × 10−4(±4.71 × 10−8);
  • Var [DTVΓ,10000] = 2.22 × 10−11(±3.14 × 10−13);

◮ total computational cost: 10498 seconds.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 28 / 43

slide-34
SLIDE 34

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Approximate field Optimal design Implementation Assessing uncertainties with the distance transform

Test case: negative Branin-Hoo function

Method 2: quasi-realizations on 50 × 50 grid.

3e-04 4e-04 5e-04 6e-04 7e-04 10 20 50 75 100 120 150 175 2500

Number of simulation points (m) DTV( , m) type Algorithm A Algorithm B Benchmark Maximin LHS

Distributions of simulated distance transform variability

Total computing cost

◮ Algorithm A, m = 150:

11201 sec (10566 for simulation point

  • ptimization)

◮ Algorithm B, m = 150:

812 sec (250 for simulation point optimization)

◮ LHS, m = 150: 691 sec

(Benchmark: 10498 sec)

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 29 / 43

slide-35
SLIDE 35

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Outline

Introduction Expectations of random closed sets Vorob’ev expectation Distance average approach Quasi-realizations for excursion sets estimation Approximate field Optimal design Implementation Assessing uncertainties with the distance transform Conservative estimates Definition Computational issues GanMC method Test case

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 30 / 43

slide-36
SLIDE 36

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Vorob’ev quantiles

The coverage function pn : x → pn(x) = Pn(x ∈ Γ) defines the family of set estimates Qρ = {x ∈ D : pn(x) ≥ ρ}

Coverage probability function

If ρ = ρ we have Vorob’ev expectation High values of ρ gives us sets with high marginal probability of observing the set.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 31 / 43

slide-37
SLIDE 37

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

A conservative estimate of Γ ∗ is CΓ,n = Qρ∗ where ρ∗ ∈ arg max

ρ∈[0,1]

{µ(Qρ) : Pn(Qρ ⊂ {Zx ≤ t}) ≥ α}

Conservative estimate at 95%

Conservative estimate (95%) 0.95-level set True excursion

◮ joint confidence statement on the set

estimate;

◮ method introduced for Gauss Markov

random fields;

◮ expensive to compute otherwise. Bolin, D. and Lindgren, F. (2015). Excursion and contour uncertainty regions for latent Gaussian models. JRSS: B, 77(1):85-106.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 32 / 43

slide-38
SLIDE 38

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

The computation of conservative estimates

The family of sets Qρ is nested, therefore we can obtain CΓ,n with a dichotomy on the level ρ. At each iteration of the dichotomy we need to compute Pn(Qρ ⊂ {Zx ≤ t}) = Pn(Ze1 ≤ t, . . . , Zek ≤ t), where E = {e1, . . . , ek} is a the discretization of Qρ.

◮ randomized quasi Monte Carlo integration by Genz et al. :

(Fast, reliable, dimension dependent, available only k < 1000)

◮ standard Monte Carlo.

(dimension independent, many samples for low variance)

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 33 / 43

slide-39
SLIDE 39

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

A quasi Monte Carlo algorithm for orthant probabilities

Candidate ρ = 0.95

Pn(Qρ ⊂ {Zx ≤ t}) = Pn(Ze1 ≤ t, . . . , Zek ≤ t) = 1 − Pn(maxx∈E Zx > t) = 1 − p Main idea: p = Pn(maxE Zx > t) = pq + (1 − pq)Rq, where pq = Pn(max

Eq Zx > t) Genz algorithm (QRSVN)

, Rq = Pn(max

E\Eq Zx > t | max Eq Zx ≤ t) Monte Carlo methods

.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 34 / 43

slide-40
SLIDE 40

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

A quasi Monte Carlo algorithm for orthant probabilities

Candidate ρ = 0.95

Pn(Qρ ⊂ {Zx ≤ t}) = Pn(Ze1 ≤ t, . . . , Zek ≤ t) = 1 − Pn(maxx∈E Zx > t) = 1 − p Main idea: p = Pn(maxE Zx > t) = pq + (1 − pq)Rq, where pq = Pn(max

Eq Zx > t) Genz algorithm (QRSVN)

, Rq = Pn(max

E\Eq Zx > t | max Eq Zx ≤ t) Monte Carlo methods

.

  • pq = 0.47
  • Rq = 0.42

⇒ p = 0.69

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 34 / 43

slide-41
SLIDE 41

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Computation of the remainder

Rq = Pn(maxE\Eq Zx > t | maxEq Zx ≤ t) Standard Monte Carlo:

  • 1. draw realizations zq

1 , . . . , zq s from ZEq | maxEq Zx ≤ t;

  • 2. for each zq

i , draw a realization from ZE\Eq | ZEq = zq i ;

  • 3. Estimate Rq with RMC

q

= 1

s

s

i=1 1max(ZE\Eq (ωi)|ZEq =zq

i )>t

The cost of step 1 is higher than the cost of step 2. At fixed computational budget we reduce the variance of RMC

q

exploiting this difference with asymmetric nested Monte Carlo.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 35 / 43

slide-42
SLIDE 42

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Computation of the remainder

At fixed computational budget we reduce the variance of RMC

q

drawing many realizations of ZE\Eq | ZEq = zq

i for each zi.

  • 1

1 2

ZEq ZE−Eq

1 2 3 4 5

Standard marginal/conditional scheme

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 36 / 43

slide-43
SLIDE 43

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Computation of the remainder

At fixed computational budget we reduce the variance of RMC

q

drawing many realizations of ZE\Eq | ZEq = zq

i for each zi.

  • 1

1 2

ZEq ZE−Eq

1 2 3 4 5

Standard marginal/conditional scheme

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 37 / 43

slide-44
SLIDE 44

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Computation of the remainder

At fixed computational budget we reduce the variance of RMC

q

drawing many realizations of ZE\Eq | ZEq = zq

i for each zi.

  • 1

1 2

ZEq ZE−Eq

1 2 3 4 5

Asymmetric sampling scheme

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 37 / 43

slide-45
SLIDE 45

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Computation of the remainder: asymmetric nested MC

Rq = Pn(maxE\Eq Zx > t | maxEq Zx ≤ t)

  • 1. draw realizations zq

1 , . . . , zq s from ZEq | maxEq Zx ≤ t;

  • 2. for each zq

i , draw m∗ > 1 samples from ZE\Eq | ZEq = zq i ;

  • 3. RanMC

q

= 1

s 1 m∗

s

i=1

m∗

j=1 1max(ZE\Eq (ωi,j)|ZEq =zq

i )>t

var(RanMC

q

) is optimally reduced if: m∗ =

  • (α+c)B

β(A−B),

where A = var(1max(ZE\Eq |ZEq )>t), B = E

  • var(1max(ZE\Eq |ZEq )>t | maxEq Zx ≤ t)
  • and

α, β, c system dependent constants. A D. and Ginsbourger D. (2016). Estimating orthant probabilities of high dimensional Gaussian vectors with an application to set estimation. Submitted, hal-01289126

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 38 / 43

slide-46
SLIDE 46

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Comparison with standard Monte Carlo

Conservative estimate at 95%

Conservative estimate (95%) 0.95-level set True excursion

Full discretization: grid 100 × 100 Time for equivalent estimates:

◮ Full MC: 1520 seconds; ◮ GMC: 200 seconds; ◮ GanMC: 136 seconds.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 39 / 43

slide-47
SLIDE 47

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

The IRSN test case

1 2 3 4 5 1 2 3 4 5

Moret test case (k_eff)

PuO2 density (cm3) Water thickness (cm3)

Test case:

◮ keff function of PuO2 density and

H2O thickness, D = [0.2, 5.2] × [0, 5];

◮ n = 20 observations (LHS design); ◮ Γ ∗ = {x ∈ D : keff(x) ≤ t}, t = 0.92

GRF model

◮ constant prior mean,

Mat´ ern (ν = 5/2) covariance;

◮ MLE for parameters. Acknowledgements: Yann Richet, Institut de Radioprotection et de Sˆ uret´ e Nucleaire.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 40 / 43

slide-48
SLIDE 48

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

1 2 3 4 5 1 2 3 4 5

Initial design, conservative Estimate PuO2 density (cm3) Water thickness (cm3)

Conservative Estimate True excursion coverage function

Discretization on grid 50 × 50. Conservative estimate at 95%; Candidate sets dimension between 1659 and 2084; Volume of conservative estimate: 17.36 (true volume 22.0).

Sequential dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 41 / 43

slide-49
SLIDE 49

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Conclusion

◮ GP can be used for uncertainty quantification on sets; ◮ different types of estimates, depending on the final objective; ◮ Optimal quasi-realizations for excursion sets lower the

computational cost of quantities based on set realizations;

◮ Conservative estimates:

◮ sequential strategies to reduce uncertainty; ◮ GanMC: benchmark study with other algorithms; ◮ Currently developing R package ConservativeEstimates.

Thanks for your attention!

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 42 / 43

slide-50
SLIDE 50

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

Conclusion

◮ GP can be used for uncertainty quantification on sets; ◮ different types of estimates, depending on the final objective; ◮ Optimal quasi-realizations for excursion sets lower the

computational cost of quantities based on set realizations;

◮ Conservative estimates:

◮ sequential strategies to reduce uncertainty; ◮ GanMC: benchmark study with other algorithms; ◮ Currently developing R package ConservativeEstimates.

Thanks for your attention!

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 42 / 43

slide-51
SLIDE 51

Introduction Expectations of random closed sets Quasi-realizations for excursion sets estimation Conservative estimates Definition Computational issues GanMC method Test case

References

A D. and Ginsbourger D. (2016). Estimating orthant probabilities of high dimensional Gaussian vectors with an application to set estimation. Submitted, hal-01289126 A D. and Bect, J. and Chevalier, C. and Ginsbourger, D. (2015). Quantifying uncertainties on excursion sets under a Gaussian random field prior. Accepted, JUQ hal-01103644v2 Bolin, D. and Lindgren, F. (2015). Excursion and contour uncertainty regions for latent Gaussian models. JRSS: B, 77(1):85-106. Chevalier, C., Ginsbourger, D., Bect, J., and Molchanov, I. (2013). Estimating and quantifying uncertainties on level sets using the Vorobev expectation and deviation with Gaussian process models. mODa 10.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 43 / 43

slide-52
SLIDE 52

How to reduce the uncertainty on the estimate?

Stepwise uncertainty reduction: find a sequence of evaluation points X1, X2, . . . that optimally reduces the expected uncertainty on the future estimate, i.e. given an initial design Xn, select Xn+1 ∈ arg min

xn+1∈D En[Hn+1 | Xn+1 = xn+1]

Uncertainty function(s): many possible definitions, here Hsymm

n+1

= En+1[µ(Γ∆Qρn+1)], Γ∆Qρn+1 = Γ \ Qρn+1 ∪ Qρn+1 \ Γ see [Bect et al. (2012), Chevalier et al. (2014)] and references therein for different definitions of Hn+1.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 43 / 43

slide-53
SLIDE 53

How to reduce the uncertainty on the estimate?

Criterion: Jsymm

n+q (xq) = En[En+q[µ(Γ \ CΓ,n)] | Xn+q = xq], dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 43 / 43

1 2 3 4 5 1 2 3 4 5

Initial design, conservative Estimate PuO2 density (cm3) Water thickness (cm3)

Conservative Estimate True excursion coverage function

slide-54
SLIDE 54

Sequential strategies

1 2 3 4 5 1 2 3 4 5

Iteration 20, conservative Estimate PuO2 density (cm3) Water thickness (cm3)

Conservative Estimate True excursion coverage function

n = 75 new evaluations; next evaluation chosen in order to minimize the future expected volume of the set difference Γ \ Qρ∗; Volume of updated CE: 20.72 (true excursion: 22.0,

  • ld estimate: 17.36)

Back

Joint work with: David Ginsbourger, Cl´ ement Chevalier, Julien Bect, Yann Richet.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 43 / 43

slide-55
SLIDE 55

More comparisons anMC/MC

0.04 0.08 0.12 0.16 10 20 30 40 50 1000 stdMC

q Probability Method pGanMC Monte Carlo pGMC pq

G

Estimate p with pGanMC

3.5 4.0 4.5 5.0 5.5 10 20 30 40 50 stdMC

q log10 (efficiency) Method pGanMC Monte Carlo pGMC

Efficiency of pGMC and pGanMC

Benchmark: 6d GRF, discretization: 1000 Sobol’ points, k Mat´ ern (ν = 5/2) with θ = [0.5, 0.5, 1, 1, 0.5, 0.5]T and σ2 = 8, m constant.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 43 / 43

slide-56
SLIDE 56

More comparisons anMC/MC

104 105 106 107 108 109 1010 100 300 500 1000 2000 3000 4000 5000 6000 7000

Dimension efficiency (log10) Algorithm GanMC GHK GMC MET QRSVN

Efficiency

100 101 102 103 104 100 300 500 1000 2000 3000 4000 5000 6000 7000

Dimension sec (log10) Algorithm GanMC GHK GMC MET QRSVN

Time

Benchmark: 6d GRF, discretization: 1000 Sobol’ points, k Mat´ ern (ν = 5/2) with θ = [0.5, 0.5, 1, 1, 0.5, 0.5]T and σ2 = 8, m constant, t = 5.

dario.azzimonti@stat.unibe.ch Bayesian set estimation with GPs 43 / 43