and its application in UQ Wenyu Li Arun Hegde Jim Oreluk Andrew - - PowerPoint PPT Presentation

and its application in uq
SMART_READER_LITE
LIVE PREVIEW

and its application in UQ Wenyu Li Arun Hegde Jim Oreluk Andrew - - PowerPoint PPT Presentation

Uniform sampling of a feasible set and its application in UQ Wenyu Li Arun Hegde Jim Oreluk Andrew Packard Michael Frenklach SIAM NC17 SPRING 2017 Bound-to-Bound Data Collaboration (B2BDC) Model: Prior Uncertainty Data n Data n Data n


slide-1
SLIDE 1

SPRING 2017 SIAM NC17

Uniform sampling of a feasible set and its application in UQ

Wenyu Li Arun Hegde Jim Oreluk Andrew Packard Michael Frenklach

slide-2
SLIDE 2

SPRING 2017 SIAM NC17

Bound-to-Bound Data Collaboration (B2BDC)

Prior Uncertainty

Feasible set Model:

Data 1 Data 2 Data 3 Data n Data n Data n

slide-3
SLIDE 3

SPRING 2017 SIAM NC17

Uniform sampling

Goal: uniform sampling of feasible set

  • Sampling is useful in providing information about
  • B2BDC makes NO distribution assumptions, but as far as taking

samples, uniform distribution of is reasonable

  • Applying Bayesian analysis with specific prior assumptions also

leads to uniform distribution of as posterior (shown in next slide)

slide-4
SLIDE 4

SPRING 2017 SIAM NC17

What Bayesian analysis leads to

Prior distribution Measurement distribution

Posterior distribution

Bayesian analysis Deterministic model:

slide-5
SLIDE 5

SPRING 2017 SIAM NC17

B2BDC and Bayesian Calibration and Prediction (BCP)

[1] Frenklach, M., Packard, A., Garcia-Donato, G., Paulo, R. and

Sacks, J., 2016. Comparison of Statistical and Deterministic Frameworks of Uncertainty Quantification. SIAM/ASA Journal on Uncertainty Quantification, 4(1), pp.875-901.

Reference Nomenclature

  • sampling efficiency acceptance rate
  • feasible set
slide-6
SLIDE 6

SPRING 2017 SIAM NC17

“B2B” Box

Rejection sampling with box

Procedure: Pros & Cons

  • find a bounding box
  • available from B2BDC
  • generate uniformly distributed samples in

the box as candidates

  • reject the points outside of feasible set

Circumscribed box Feasible set

  • provably uniform in the feasible set
  • practical in low dimensions
  • impractical in higher dimensions
slide-7
SLIDE 7

SPRING 2017 SIAM NC17

Random walk (RW)

Procedure:

Feasible set

Moving direction Extreme point Extreme point Starting point New moving direction Next point

  • start from a feasible point
  • available from B2BDC
  • select a random direction, calculate extreme

points and choose the next point uniformly

  • repeat the process

Pros & Cons

  • NOT limited by problem dimensions
  • NOT necessarily uniform in the feasible set
slide-8
SLIDE 8

SPRING 2017 SIAM NC17

Rejection sampling with polytope

Procedure:

Feasible set

  • find a bounding polytope
  • generate candidate points by random walk
  • reject the points outside of feasible set

Pros & Cons

  • provably uniform in the feasible set
  • increased efficiency with more polytope facets

Circumscribed polytope 6 facets 8 facets

slide-9
SLIDE 9

SPRING 2017 SIAM NC17

Rejection sampling with polytope

Procedure:

Feasible set

  • find a bounding polytope
  • generate candidate points by random walk
  • reject the points outside of feasible set

Pros & Cons

  • provably uniform in the feasible set
  • increased efficiency with more polytope facets
  • practical in low to medium dimensions

Circumscribed polytope 6 facets 10 facets

  • limited by computational resource
slide-10
SLIDE 10

SPRING 2017 SIAM NC17

Approximation strategy

Procedure:

  • relax the requirement that the polytope needs

to contain the feasible set completely

  • generate candidate points by random walk
  • reject the points outside of feasible set

Feasible set Approximate polytope

Pros & Cons

  • practical in medium to high dimensions
  • samples don’t cover the whole feasible set
slide-11
SLIDE 11

SPRING 2017 SIAM NC17

Define the polytope: one facet

Inner and Outer bounds from B2B prediction Sample bound from random walk

  • Outer bound from optimization (NO

approximation, provably uniform)

  • Inner bound from optimization (less

aggressive approximation, very close to circumscribed bound)

  • Sample bound (more aggressive

approximation, performance depends

  • n problem)
slide-12
SLIDE 12

SPRING 2017 SIAM NC17

Effect on sampling efficiency

Condition for improved efficiency Efficiency density function

Projected area

slide-13
SLIDE 13

SPRING 2017 SIAM NC17

Effect on sampling efficiency

Posterior check Assumption in the polytope case Special case with bounding box

slide-14
SLIDE 14

SPRING 2017 SIAM NC17

Effect on sampled distribution

Approximated distribution Target distribution Difference of mean for a function

slide-15
SLIDE 15

SPRING 2017 SIAM NC17

Toy example

Polytope bound Efficiency (%) Outer bound 0.095 Inner bound 20.8 Sample bound 27.7 Posterior check

Outer -> Inner : 1.33 > 0.68 Inner -> Sample : 1.40 > 1.33

Test condition:

  • 5 parameters, 30 constraints
  • 1000 facets for each polytope
  • Optimization and sample bounds
  • 1000 sample points
slide-16
SLIDE 16

SPRING 2017 SIAM NC17

Toy example

Polytope bound Efficiency (%) Outer bound 0.095 Inner bound 20.8 Sample bound 27.7

Test condition:

  • 5 parameters, 30 constraints
  • 1000 facets for each polytope
  • Optimization and sample bounds
  • 1000 sample points

Passed the Kolmogorov-Smirnov test with 0.05 significance level

slide-17
SLIDE 17

SPRING 2017 SIAM NC17

Principal component analysis (PCA)

Procedure:

Feasible set Lower-dimensional subspace

  • collect RW samples from the feasible set
  • conduct PCA on RW samples
  • find a subspace based on PCA result
  • generate uniform samples in the subspace

Pros & Cons

  • reduced problem dimension
  • works only if feasible set approximates

lower-dimensional manifold/subspace

slide-18
SLIDE 18

SPRING 2017 SIAM NC17

GRI-Mech

Test condition:

  • 102 parameters
  • 76 experimental data
  • 107 RW samples for PCA
  • 10-65 subspace dimension
  • 104 facets for each polytope
  • 107 candidate points for sampling

Test methods:

  • polytope and box
  • inner and sample bounds

Subspace dimension

Sampling Efficiency

slide-19
SLIDE 19

SPRING 2017 SIAM NC17

GRI-Mech: 1-D posterior marginal uncertainty

Inner bound Outer bound Uniform histogram

Test condition:

  • 45 subspace dimension
  • Polytope with sample bound
  • 104 facets for the polytope
  • 1000 sample points
  • [-1, 1] are prior uncertainties
slide-20
SLIDE 20

SPRING 2017 SIAM NC17

GRI-Mech: 2-D posterior joint uncertainty

Plots:

  • 2-D projection
  • [-1 1] are prior uncertainties
  • Correlations observed
slide-21
SLIDE 21

SPRING 2017 SIAM NC17

Summary

  • We developed methods to generate uniformly distributed

samples of a feasible set

  • Approximation strategy and PCA further improves the

practicality of rejection sampling method

  • Hybrid statistical-deterministic uncertainty quantification

process combining B2BDC prediction and uniform sampling

slide-22
SLIDE 22

SPRING 2017 SIAM NC17

Acknowledgements

This work is supported as a part of the CCMSC at the University

  • f Utah, funded through PSAAP by the National Nuclear Security

Administration, under Award Number DE-NA0002375.

slide-23
SLIDE 23

SPRING 2017 SIAM NC17

Thank you Questions?

slide-24
SLIDE 24

SPRING 2017 SIAM NC17

slide-25
SLIDE 25

SPRING 2017 SIAM NC17

GRI-Mech: 1-D posterior marginal uncertainty

Inner bound Outer bound Uniform sampling, B2BDC Gaussian prior, MCMC Bayes

Test condition:

  • 45 subspace dimension
  • Polytope with sample bound
  • 104 facets for the polytope
  • 1000 sample points
  • [-1, 1] are prior uncertainties
slide-26
SLIDE 26

SPRING 2017 SIAM NC17

GRI-Mech: 1-D posterior marginal uncertainty

Inner bound Outer bound Sample histogram

Test condition:

  • 45 subspace dimension
  • Polytope with sample bound
  • 104 facets for the polytope
  • 1000 sample points
  • [-1, 1] are prior uncertainties

True bounds

slide-27
SLIDE 27

SPRING 2017 SIAM NC17

GRI-Mech: 1-D posterior marginal uncertainty

Inner bound Outer bound Uniform histogram Gaussian histogram

Test condition:

  • 45 subspace dimension
  • Polytope with sample bound
  • 104 facets for the polytope
  • 1000 sample points
  • [-1, 1] are prior uncertainties
slide-28
SLIDE 28

SPRING 2017 SIAM NC17

“B2B” Box

Rejection sampling with box

Procedure:

  • generate uniformly distributed samples in

the box as candidates

  • reject the points outside of feasible set

Bounding Box Feasible set “B2B” box with increased problem dimension

  • find a bounding box
  • available from B2B

Pros & Cons

  • provably uniform in the feasible set
  • practical in low dimensions
slide-29
SLIDE 29

SPRING 2017 SIAM NC17

Rejection sampling with polytope

Procedure:

Feasible set

  • find a bounding polytope
  • generate candidate points by random walk
  • reject the points outside of feasible set

Pros & Cons

  • provably uniform in the feasible set
  • increased efficiency with more polytope facets
  • practical in low to medium dimensions

Circumscribed polytope 6 facets 10 facets

Possible Convergence to the convex hull

slide-30
SLIDE 30

SPRING 2017 SIAM NC17

Conclusion

  • Polytope method is in general more practical than box

method

  • Approximation method further improves the practicality
  • PCA and dimension reduction increases efficiency

significantly when applicable

  • Samples of the feasible set provide extra information on

posterior uncertainty

Polytope method Box method

Subspace dimension

Sampling Efficiency

slide-31
SLIDE 31

SPRING 2017 SIAM NC17

Heuristic approximation strategy (continued…)

𝒖 = 𝒃𝑼𝒚

Remaining region Truncated region

  • Consider the statistical quality of samples returned

with heuristic approximation by estimating the difference in its statistical inference of a function Q(x). Denote the truncated and remaining area as and , then

  • Hypothesis. If the target distribution has a small

integrated probability in the truncated region, the inference difference of the returned samples are likely to be small compared to the target distribution

  • Hypothesis. If the target distribution has a small

integrated probability in the truncated region, the inferring difference of the returned samples are likely to be small compared to the target distribution

slide-32
SLIDE 32

SPRING 2017 SIAM NC17

Rejection sampling with polytope (continued…)

  • scales the parameters so the polytope with the scaled parameters is more isotropic
  • a 2-D example is given in the following figure for illustration
  • RW performs better (converges faster) with a more isotropic polytope[1]

Parameter scaling

[1] Lovász, L., 1999. Hit-and-run mixes fast. Mathematical Programming, 86(3), pp.443-461. Bounding polytope with

  • riginal parameter

l1 ≤ a1

Tx ≤ u1

l2 ≤ a2

Tx ≤ u2

Bounding polytope with scaled parameter

l1 ≤ b1

Ty ≤ u1

l2 ≤ b2

Ty ≤ u2

slide-33
SLIDE 33

SPRING 2017 SIAM NC17

Acknowledgement

We gratefully acknowledge the support by U.S. Department

  • f Energy, National Nuclear Security Administration, under

Award Number DE-NA0002375.

slide-34
SLIDE 34

SPRING 2017 SIAM NC17

Heuristic approximation strategy (continued…)

P(t) F(t)

𝑭(𝒖) = 𝑮(𝒖) 𝑸(𝒖)

Polytope Feasible set

𝒖 = 𝒃𝑼𝒚

  • A sufficient condition that the sampling efficiency will

increase with the heuristic approximation is derived:

  • Hypothesis. Parameterize the direction as

and specify the efficiency density function as . Denote the truncated region as and the remaining region as . If the sampling efficiency will increase with the approximation

  • Conjecture. If the target distribution approximately

satisfies the condition along the directions selected for heuristic approximation, then the efficiency is likely to increase.

  • A sufficient condition that the sampling efficiency will

increase with the heuristic approximation is derived:

  • Conjecture. If the target distribution approximates a

high-weight center, low-weight tail shape along the directions selected for heuristic approximation, then the efficiency is likely to increase.

slide-35
SLIDE 35

SPRING 2017 SIAM NC17

Motivation of uniform sampling of the feasible set

  • We don’t know the distribution of returned points if the feasible set is

not convex (and in general it isn’t).

  • Only qualitative conclusions can be made.
  • To make the analysis quantitatively valid, we assume the uniform

distribution of the feasible set.

  • This is also the posterior distribution from Bayesian method if we

assume uniform prior distributions

  • n

both parameter and measurement uncertainties

slide-36
SLIDE 36

SPRING 2017 SIAM NC17

University of California, Berkeley

Generate uniform samples of a feasible set and its application in uncertainty quantification

slide-37
SLIDE 37

SPRING 2017 SIAM NC17

Random walk application in DLR dataset

Test condition:

  • 55 parameters
  • 244 constraints
  • 106 samples
  • 2-D projection
  • Bounds are prior