Simulator Analysis and Validation Engine 2 Fei Liu Duke University - - PowerPoint PPT Presentation

simulator analysis and validation engine 2
SMART_READER_LITE
LIVE PREVIEW

Simulator Analysis and Validation Engine 2 Fei Liu Duke University - - PowerPoint PPT Presentation

Simulator Analysis and Validation Engine 2 Fei Liu Duke University Summer School on the Design and Analysis of Computer Experiments Simon Fraser University August 11-16, 2006 Outline Objective The SAVE2 methodology The SAVE2 software package


slide-1
SLIDE 1

Simulator Analysis and Validation Engine 2

Fei Liu

Duke University Summer School on the Design and Analysis of Computer Experiments Simon Fraser University August 11-16, 2006

slide-2
SLIDE 2

Outline

Objective The SAVE2 methodology The SAVE2 software package The Example Resources

slide-3
SLIDE 3

Outline

Objective The SAVE2 methodology The SAVE2 software package The Example Resources

slide-4
SLIDE 4

Objective

◮ Computer models may have functional outputs:

yM(x, u, t)

◮ Inputs to the computer models include:

◮ x: characteristics associated with the field experiments. ◮ u: calibration parameters associated with the computer

model.

◮ t: time when the measurements are taken. And the

computer model outputs are functions of t.

slide-5
SLIDE 5

Collected data consist of:

◮ Field: Time-histories of field experiments, denoted by

yF(x, t).

◮ Computer model: Time histories obtained by running the

computer model at different design points, denoted by yM(x, u, t).

slide-6
SLIDE 6

10 20 30 40 50 60 5 10 d y

Figure: A example data: field experiments (red); computer model runs (black).

slide-7
SLIDE 7

Questions

SAVE-2 provides answers to questions such as,

◮ Does the computer model adaquetely represent the reality

in terms of validation perspectives / intended uses, e.g. peaks / valleys?

◮ How can we provide optimal prediction for the reality? ◮ How can we learn the associated (unknown) characteristic

parameters for the field experiments?

◮ How can we predict for a new system?

slide-8
SLIDE 8

Outline

Objective The SAVE2 methodology The SAVE2 software package The Example Resources

slide-9
SLIDE 9

Wavelet representation

Represent the history data as: yM(zj; t) =

W

  • i=1

wM

i (zj)ψi(t),

j = 1, . . . , m; yF

r (x∗; t) = W

  • i=1

wF

ir (x∗)ψi(t),

j = 1, . . . , f. Inputs: x = (x1, . . . , xp1): key (unknown) characteristics. u = (u1, . . . , up2): calibration parameters for the computer models. z = (x, u).

slide-10
SLIDE 10

GaSP approximation

For each i (index of wavelet coefficient), model the coefficient as, wM

i (·) ∼ GP

  • µi, 1

λM

i

Corri(·, ·)

  • ◮ Inputs to GaSP: Design matrix, and wM

i

from model runs.

◮ Output from GaSP:

  • α(i), β(i), µi, λM

i

  • , i = 1, . . . , W.

◮ Response Surface:

  • wM

i (z) | wM i

  • ∼ N
  • ˆ

µi(z), ˆ σ2

i (z)

  • .
slide-11
SLIDE 11

Bayesian analysis

◮ Simulator Analysis and Validation Engine for each

coefficient: wR

i (x∗) = wM i (x∗, u∗) + bi(x∗),

i = 1, . . . , W wF

ir (x∗) = wR i (x∗) + ǫir,

r = 1, . . . , f

◮ bi(x∗) ∼ N

  • 0, τ 2

j

  • , ǫir ∼ N
  • 0, σ2

i

  • . j is the level for index i.

◮ Sufficient statistics: ¯

wF

i , S2 i = f r=1(wF ir − ¯

wF

i )2. ◮ Data:

D = ( ¯ wF

i , S2 i , ˆ

µi(·), ˆ σ2

i (·)),

i = 1, . . . , W

slide-12
SLIDE 12

Outline

Objective The SAVE2 methodology The SAVE2 software package The Example Resources

slide-13
SLIDE 13

Steps of Analysis

  • 1. Input data;
  • 2. Data registration and wavelet representation;
  • 3. Gaussian stochastic response surface approximations;
  • 4. Bayesian analysis
  • x∗, u∗, {bi}, {τ 2

j }, {σ2 i }, {wR i (x∗)}

  • ;
  • 5. Wavelet reconstruction;
  • 6. Output and plots;
slide-14
SLIDE 14

SAVE-2 software

Data Registration Computer model Field runs Wavelet Decomposition GaSP Posterior (MCMC) Prior Information Inferences

Wavelet

Reconstruction Posterior reports Posterior Curves histograms

R Language

C Language

slide-15
SLIDE 15

Use of SAVE2

◮ To install save2,

[chuck:save2]$ make Installation

◮ To load data,

[chuck:save2]$ make LoadData

◮ To remove the failed model runs,

[chuck:save2]$ make PreprocessData

slide-16
SLIDE 16

Use of SAVE2

◮ To apply the data registration procesure and wavelet

representation, [chuck:save2]$ make RegisWave

◮ To apply the GaSP approximation to the coefficients of the

model runs, [chuck:save2]$ make GASP

◮ To run the Bayesian analysis,

[chuck:save2]$ make MCMC-LoadData [chuck:save2]$ make MCMC-Run

slide-17
SLIDE 17

Use of SAVE2

◮ To summarize the posterior samples, reconstruct functions,

and create an R object to store the result, [chuck:save2]$ make Reconstruction-all

◮ To visualize the results, and get confidence intervals for the

peaks / valleys, [chuck:save2]$ make PLOT-all

◮ To remove old analysis for new data,

[chuck:save2]$ make removeOldAnalysis

slide-18
SLIDE 18

Input files

Three types of input files are required by the software:

◮ The design matrix for the computer model runs. ◮ The time histories for the computer model runs. ◮ The time histories for the field experiments.

slide-19
SLIDE 19

Example Inputs

◮ The design matrix:

u01 u02 x01 x02 x03 0.625000 0.546875 0.406250 0.593750 0.406250 1.000000 0.343750 0.718750 0.921875 0.203125 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

◮ The history data:

Time CH 1 CH 2 CH 3 Velocity .000000 3.83 0.29 0.092 25.049 .002441 3.84 0.13

  • 0.099

25.048 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

slide-20
SLIDE 20

Data registration

Peaks / Valleys of field and model-run should occur at the same “location”. Steps:

  • 1. Convert time-histories into distance-histories.
  • 2. Reference curve: Average of the model curves.
  • 3. Align curves to match the peaks of both major peaks.
slide-21
SLIDE 21

7 8 9 10 5 10 Time Load 37 38 39 40 41 5 10 Time Load 7 8 9 10 5 10 Time Load 37 38 39 40 41 5 10 Time Load

slide-22
SLIDE 22

Output Files

Output Files consist of,

◮ Posterior distribution of x∗, u∗. ◮ Posterior distribution of the bias function. ◮ Posterior distribution of the reality. ◮ Reality for a new field run. ◮ Extrapolate to new system.

slide-23
SLIDE 23

Outline

Objective The SAVE2 methodology The SAVE2 software package The Example Resources

slide-24
SLIDE 24

The Data

◮ The time histories are collected at 90843 time points. ◮ The computer model has two calibration parameters

(u1, u2), and 7 characteristics are measured for the field experiments, (x1, x2, . . . , x7).

◮ The computer model has been exercised at 60 design

points.

◮ The field data consists of 7 replicates associated with the

same characteristic parameters.

slide-25
SLIDE 25

π(x∗, u∗)

Parameter Type Variable Type Uncertainty Damping1 Calibration Calibration 15% Damping2 Calibration Calibration 15% x1 Manufacturing Uncertain 10% x2 Manufacturing Uncertain 10% x3 Manufacturing Uncertain 7% x4 Manufacturing Uncertain 8% x5 Manufacturing Uncertain 5% x6 Manufacturing Uncertain 12% x7 Manufacturing Uncertain 8%

Table: I/U Map. Uncertainty ranges for calibration (first two) and manufacturing (last seven) parameters.

slide-26
SLIDE 26

The Input/Uncertainty Map

u− 1

u− 1 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0

u− 2

u− 2 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0

x− 1

x− 1 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0

x− 2

x− 2 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0

x− 3

x− 3 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 4

x− 4

x− 4 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 4

x− 5

x− 5 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 5 10 15 20

x− 6

x− 6 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 2 4 6 8

x− 7

x− 7 Freq. 0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 4

slide-27
SLIDE 27

The prior distributions

π(u1) = π(u2) = Uniform on [0.125, 0.875] π(x1) = π(x2) ∼ N(0, 0.11112) truncated to the interval[0.1667, 0.8333] π(x3) ∼ N(0, 0.097232) truncated to the interval [0.2083, 0.7917] π(x4) = π(x7) ∼ N(0, 0.10262) truncated to the interval[0.1923, 0.8077] π(x5) ∼ N(0, 0.49032) truncated to the interval [0.3529, 0.6471] π(x6) ∼ N(0, 0.11762) truncated to the interval [0.1471, 0.8529]

slide-28
SLIDE 28

Posterior distribution of the bias function

7 8 9 10 −2 −1 1 2 Time Load

Bias function MCMC (90% Tolerance bounds)

slide-29
SLIDE 29

Posterior distribution of the reality

(yR)h(x∗, u∗, ·) =

W

  • i=1

((wM

i (x∗, u∗))h + bh i )ψi(·), h = 1, . . . , N

7 8 9 10 5 10 Time Load

Bias Corrected Prediction Field Data Pure Model Prediction 90% Tolerance bands from Reality

slide-30
SLIDE 30

Reality for a new field run

(yR)h(x, u∗, ·) =

W

  • i=1

((wM

i (x, u∗))h+bh i )ψi(·), h = 1, . . . , N; x ∼ π(x)

7 8 9 10 5 10 Time Load

Bias Corrected Prediction Field Data Pure Model Prediction 90% Tolerance bands from Field

slide-31
SLIDE 31

Extrapolate to new system

(yRB)h(x, u∗, ·) =

W

  • i=1
  • (wMB

i

(x, u∗))h · (bi)h ψi(·), x ∼ π(x)

7 8 9 10 5 10 15 20 25 30 Time Load

Field Data Bias Corrected Prediction 90% Tolerance bounds

slide-32
SLIDE 32

Outline

Objective The SAVE2 methodology The SAVE2 software package The Example Resources

slide-33
SLIDE 33

Resources

◮ System requirements:

◮ The R statistical software package (version 1.8.1 or higher). ◮ The wavethresh package (version 2.2-8 or higher) for R. ◮ The GaSP software. ◮ The ATLAS library. ◮ The GNU Scientific library.

◮ Software package and Manual may be available at NISS

webpage ...

slide-34
SLIDE 34

Thank you!