Weighted space-filling designs Dave Woods D.Woods@southampton.ac.uk - - PowerPoint PPT Presentation

weighted space filling designs
SMART_READER_LITE
LIVE PREVIEW

Weighted space-filling designs Dave Woods D.Woods@southampton.ac.uk - - PowerPoint PPT Presentation

Weighted space-filling designs Dave Woods D.Woods@southampton.ac.uk www.southampton.ac.uk/ davew Joint work with Veronica Bowman (Dstl) Supported by the Defence Threat Reduction Agency 1 / 34 Outline Motivation & background


slide-1
SLIDE 1

Weighted space-filling designs

Dave Woods

D.Woods@southampton.ac.uk www.southampton.ac.uk/∼davew

Joint work with Veronica Bowman (Dstl) Supported by the Defence Threat Reduction Agency

1 / 34

slide-2
SLIDE 2

Outline

  • Motivation & background
  • Designs incorporating prior information
  • Illustrative examples
  • Application to dispersion modelling
  • Graphical method of design evaluation
  • Work in progress . . .

2 / 34

slide-3
SLIDE 3

Dispersion modelling

Aim: Prediction of the downwind hazard generated by a chemical or biological (or other) release

  • Accident response; military

planning; volcanic ash; . . .

  • Variety of models
  • Gaussian plume (Clarke, 1979)
  • Gaussian puff (Sykes et al., 1998)
  • NAME (Jones et al., 2007)
  • Sensor placement

20 40 60 80 100 120 20 40 60 80 100 120 x.cord y.cord

3 / 34

slide-4
SLIDE 4

Dispersion modelling

  • Ex. 3 results

4 / 34

slide-5
SLIDE 5

Dispersion modelling

  • 1. The input variables are usually of two types (meteorological and

source), and can be quantitative or qualitative

  • 2. There is substantial prior information about the distribution of the

input variables from, for example, empirical observations (meteorological) or expert prior knowledge (source)

  • 3. These prior distributions are not usually independent, either within

type (for example, wind direction and speed is defined via a wind rose) or between type (wind direction and source location)

  • 4. The distributions define a joint probability density (or weight

function) on the design region, which is likely to have substantial areas of low weight

5 / 34

slide-6
SLIDE 6

Prior information

6 / 34

slide-7
SLIDE 7

Aim of this work

  • Reduce the number of model evaluations required through designed

experiments

  • Develop a class of criteria for constructing space-filling designs that

take account of prior information and, particularly, relationships between input variables

  • Evaluate designs from competing criteria in terms of (a) sampling

properties and (b) space-filling properties

7 / 34

slide-8
SLIDE 8

Weight function

For any point x ∈ X, define a weight function w(x)

  • w(x) ≥ 0 is a problem-specific weight function, e.g. defining the

probability of obtaining a useful response We can (almost) think about this in terms of fuzzy sets ˜ X = {(x, w(x)), x ∈ X} , where w(x) is the (unnormalised) membership function of the fuzzy set ˜ X [e.g. Dubois & Prade, 1980] Space-filling designs for fuzzy design spaces?

8 / 34

slide-9
SLIDE 9

Space-filling designs

  • Collections of combinations of values of the input variables that

(attempt to) provide information on every (relevant) region of the design space

  • Typically, they either
  • Spread the design points as far apart as possible
  • Cover the design region as evenly as possible
  • Only take one observation at each combination of input values (ideal

for deterministic models)

  • Allow a variety of statistical models to be fitted
  • Distance-based, Latin Hypercubes, Uniform designs [see, e.g.,

Santner et al., 2003]

9 / 34

slide-10
SLIDE 10

Extensions

To apply space-filling designs to dispersion modelling, we need to include

  • Qualitative variables
  • Weight function, including dependencies between variables

Consider

  • Quantitative variables x1, . . . , xk1
  • Qualitative variables xk1+1, . . . , xk1+k2, with xj having mj levels

denoted by Mj = {1, . . . , mj}

10 / 34

slide-11
SLIDE 11

Distance metrics

Define the distance between two points x, y ∈ X = R ×

j Mj, where

R ⊂ Rk1, as d(x, y) =

  • k1
  • i=1

(xi − yi)2 + α

k1+k2

  • j=k1+1

I[xj = yj)] , where I[r = s] is the indicator function that takes the value 1 if r = s and 0 otherwise In this talk, α = 1 Similar for ordered categorical variables [Qian et al. (2008)]

11 / 34

slide-12
SLIDE 12

Space-filling criteria

  • Coverage: minimise

φum(d) =

  • X
  • min

x∈d w(y)d(x, y)

m dy 1/m

  • Spread: minimise

φsp(d) = n

  • i=1
  • min

x∈d\{xi} w(x)w(xi)d(x, xi)

−p1/p

  • In this talk, m = p = 1

[See, e.g., Johnson et al. (1990) for unweighted versions]

12 / 34

slide-13
SLIDE 13

Space-filling criteria

  • For coverage designs, we want to “attract” the design to relevant

areas of the design space

  • Note that if w(y) = 0, minx∈d w(y)d(x, y) = 0 for all d
  • For spread designs, we want the design points to “repel” away from

each other

  • Note that as w(xi) → 0,
  • minx∈d\{xi } w(x)w(xi)d(x, xi)

−k → ∞

13 / 34

slide-14
SLIDE 14

Implementation

  • Computer search
  • Row exchange (e.g. Royle, 2002 and SAS)
  • Co-ordinate exchange
  • Efficient storage/updating of distances
  • e.g. cover.design in Fields
  • Quasi-random numbers (low-discrepancy sequence) used for the

candidate list and also to approximate X

14 / 34

slide-15
SLIDE 15

Alternative and connected approaches

  • Transform an iid (quasi-) random sample or space-filling design
  • Latin Hypercube Sample transformed to match marginal

distributions and pairwise correlations [McKay et al., 1979; Iman & Conover, 1982]

  • Design spaces with hard constraints on X
  • Constrained optimisation [e.g. Kleijnen et al., 2010]

15 / 34

slide-16
SLIDE 16

Graphical design assessment

  • 1. Fraction of Design Space (FDS) with respect to the distance

[Zahran et al., 2003] – assess the space-filling properties of a design

  • 2. Fraction of Design Points (FDP) with respect to the weight function

– assess the sampling properties of a design

16 / 34

slide-17
SLIDE 17

Weight functions

Assume some prior distribution, p(x), on X can be elicited from subject experts and/or derived from historical data We consider two weight functions (1) w(x) = p(x) w(x) ≥ 0 (2) w(x) = (1 − αp(x))−γ w(x) ≥ 1 [Joseph et al., 2010] with α < 1/ max p(x) and γ > 0

17 / 34

slide-18
SLIDE 18

Examples

  • Two quantitative variables (x1, x2); p(x) is a logistic function
  • Two quantitative variables (x1, x2) & one qualitative (x3);

conditional on x3, p(x) is a bivariate normal pdf

  • Seven quantitative variables (x1 − x7); application to the dispersion

example

  • Ex. 1
  • Ex. 2
  • Ex. 3

18 / 34

slide-19
SLIDE 19

Example 1 - coverage

(1)

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

(2) α = 1, γ = 1

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

(2) α = 1, γ = 0.75

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

(2) α = 0, γ = 0

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

19 / 34

slide-20
SLIDE 20

Example 1 - coverage

20 40 60 80 100 0.0 0.2 0.4 0.6 0.8 1.0 % design points p(x) 20 40 60 80 100 1 2 3 4 5 6 % design space Dist (1) (2) α = 1, γ = 1 (2) α = 1, γ = 0.75 (2) α = 0, γ = 0

20 / 34

slide-21
SLIDE 21

Example 1 - spread

(1)

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

(2) α = 1, γ = 1

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

(2) α = 0.75, γ = 0.5

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

(2) α = 0, γ = 0

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

21 / 34

slide-22
SLIDE 22

Example 1 - spread

20 40 60 80 100 0.0 0.2 0.4 0.6 0.8 1.0 % design points p(x) 20 40 60 80 100 2 4 6 % design space Dist (1) (2) α = 1, γ = 1 (2) α = 1, γ = 0.75 (2) α = 0, γ = 0

22 / 34

slide-23
SLIDE 23

Example 1 - comparison

20 40 60 80 100 0.0 0.2 0.4 0.6 0.8 1.0 % design points p(x) 20 40 60 80 100 1 2 3 4 % design space Dist (1) Coverage (1) Spread (2) Coverage α = 0, γ = 0 (2) Spread α = 0, γ = 0

  • Ex. 3

23 / 34

slide-24
SLIDE 24

Example 2 - coverage (1)

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3 x1 x2

x3 = 0

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 1

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 2

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

24 / 34

slide-25
SLIDE 25

Example 2 - coverage (2)

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3 x1 x2

x3 = 0

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 1

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 2

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

25 / 34

slide-26
SLIDE 26

Example 2 - spread (1)

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3 x1 x2

x3 = 0

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 1

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 2

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

26 / 34

slide-27
SLIDE 27

Example 2 - spread (2)

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3 x1 x2

x3 = 0

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 1

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

x3 = 2

x1 x2

  • 3
  • 2
  • 1

1 2 3

  • 3
  • 2
  • 1

1 2 3

27 / 34

slide-28
SLIDE 28

Example 2 - comparison

20 40 60 80 100 0.0 0.5 1.0 1.5 2.0 2.5 3.0 % design points p(x) 20 40 60 80 100 1 2 3 4 % design space Dist (1) Coverage (2) Coverage α = 1 2.71, γ = 0.75 (1) Spread (2) Spread α = 1 2.71, γ = 0.3 Coverage 28 / 34

slide-29
SLIDE 29

Example 3

  • Seven quantitative variables x1 − x7
  • Wind speed and direction
  • Cloud cover
  • x − y location, mass and time of release
  • Prior information . . .
  • from past data (meteorological) and subject experts (source)
  • defines a release scenario
  • Compare:
  • Weighted space-filling designs with weight function (1) w(x) = p(x)
  • Latin hypercube samples transformed to match marginal

distributions and pairwise correlations [McKay et al., 1979; Iman & Conover, 1982]

29 / 34

slide-30
SLIDE 30

Example 3 - comparison

20 40 60 80 100 0.0 0.1 0.2 0.3 0.4 0.5 % design points p(x) 20 40 60 80 100 0.0 0.2 0.4 0.6 0.8 % design space Dist (1) Coverage (2) Coverage α = 0, γ = 0 (1) Spread (2) Spread α = 0, γ = 0 LHS

30 / 34

slide-31
SLIDE 31

Example 3 - coverage

0.00 0.05 0.10 0.15 0.00 0.05 0.10 0.15

Wind speed

0.0 0.2 0.4 0.6 0.8 1.0

  • 0.0

0.2 0.4 0.6 0.8 1.0

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.00 0.05 0.10 0.15

  • Wind direc
  • 0.0

0.2 0.4 0.6 0.8 1.0

  • x−loc

0.0 0.2 0.4 0.6 0.8 1.0

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

y−loc

31 / 34

slide-32
SLIDE 32

Example 3 - preliminary results

Squared Error Monte Carlo LHS Coverage Mean 7 × 10−3 6 × 10−4 4 × 10−4 Max. 4 × 10−2 5 × 10−3 5 × 10−3

c.f. larger Monte Carlo study 32 / 34

slide-33
SLIDE 33

Summary & future work

  • Flexible space-filling design method incorporating prior information
  • Selection of weight function allows a trade-off between space-filling

and sampling high density areas

  • Other applications include physical spatial experiments with

covariates, and selection of subsets of meteorological ensembles

  • Space-filling in many dimensions?
  • Strong prior information can reduce the effective dimension

33 / 34

slide-34
SLIDE 34

Selected references

  • Iman, R.L. and Conover, W.J. (1982). Comm. Statist. Simul.

Comput., 11, 311-334.

  • Johnson, M.E. et al. (1990). JSPI, 26, 131-148.
  • Joseph, V.R. et al. (2010). Tech. report.
  • Lam, R.L.H. et al. (2002). Technometrics, 44, 99-109.
  • McKay, M.D. et al. (1979). Technometrics, 21, 239-245.
  • Qian, P.Z.G. et al. (2008). Technometrics, 50, 383-396.
  • Royle, J.A. (2002). JSPI, 100, 121-134.
  • Santner, T.J. (2003). The Design and Analysis of Computer
  • Experiments. Springer.
  • Zahran, A. et al. (2003). JQT, 35, 377-386.

34 / 34

slide-35
SLIDE 35

35 / 34

slide-36
SLIDE 36

References

  • Bedrick, E.J. et al. (2000). Biometrics, 56, 394-401.
  • Clarke, R.H. (1979). National Radiological Protection Board report NRPB-R91.
  • Dubois, D. & Prade, H. (1980). Fuzzy Sets and Systems. Academic Press.
  • Iman, R.L. and Conover, W.J. (1982). Comm. Statist.
  • Simul. Comput., 11,

311-334.

  • Johnson, M.E. et al. (1990). JSPI, 26, 131-148.
  • Jones, A.R. et al. (2007). Proceedings of the 27th NATO/CCMS International

Technical Meeting on Air Pollution Modelling and its Application.

  • Joseph, V.R. et al. (2010). Tech. report.
  • Kleijnen, J.P.C. et al. (2010). EJOR, 202, 164-174.
  • Lam, R.L.H. et al. (2002). Technometrics, 44, 99-109.
  • McKay, M.D. et al. (1979). Technometrics, 21, 239-245.
  • Qian, P.Z.G. et al. (2008). Technometrics, 50, 383-396.
  • Royle, J.A. (2002). JSPI, 100, 121-134.
  • Santner, T.J. (2003). The Design and Analysis of Computer Experiments.

Springer.

  • Sykes, R.I. et al. (1998). ARAP Report No. 718.
  • Zahran, A. et al. (2003). JQT, 35, 377-386.

36 / 34

slide-37
SLIDE 37

Graphical design assessment (1)

(i) Fraction of Design Space (FDS) with respect to the weighted

  • distance. For each point ˜

x ∈ X, we calculate φ(˜ x|d) = min

x∈d d(x, ˜

x) , and plot the inverse of the empirical distribution function Φ1(ν|d) = 1 D

  • A1

d˜ x ,

  • A1 = {˜

x ∈ X|φ(˜ x|d) ≤ ν}, D =

  • X dx and 0 ≤ Φ1(ν|d) ≤ 1 for all

ν ≥ 0

  • Design d1 dominates design d2 if and only if Φ1(ν, d1) ≥ Φ1(ν, d2)

for all ν, with Φ1(ν, d1) > Φ1(ν, d2) for at least one value of ν

  • Approximate A1 for any given ν using a quasi-random sample
  • See Zahran et al. (2003) for response surface designs

37 / 34

slide-38
SLIDE 38

Graphical design assessment (2)

(ii) Fraction of Design Points (FDP) with respect to the sampling density p(x). For each point x in the design, we calculate p(x) and then plot the inverse of the empirical distribution function Φ2(ρ|d) = 1 n|A2|

  • A2 = {x ∈ d|p(x) ≤ ρ} and 0 ≤ Φ2(ρ|d) ≤ 1 for all ρ ≥ 0
  • Design d1 dominates design d2 if and only if Φ2(ρ|d1) ≤ Φ2(ρ|, d2)

for all ρ, with Φ2(ρ|d1) < Φ2(ρ|d2) for at least one value of ρ

38 / 34

slide-39
SLIDE 39

39 / 34

slide-40
SLIDE 40

Weight functions

  • Example 1:

ln w 1 − w = 1.2 + 0.7x1 − 1.8x2 − 1.9x1x2 − 0.8x2

1 + 3.0x2 2

  • Example 2:

g(x, Σ, p|x3) = p 2π | Σ |1/2 exp

  • −1

2(x − µ(x3))′Σ−1(x − µ(x3))

  • µ(x3) =

   (0, 0)′ if x3 = 0 (1, 1)′ if x3 = 1 (−1, −1)′ if x3 = 2

40 / 34