Using R for the design and analysis of computer experiments with - - PowerPoint PPT Presentation

using r for the design and analysis of computer
SMART_READER_LITE
LIVE PREVIEW

Using R for the design and analysis of computer experiments with - - PowerPoint PPT Presentation

Using R for the design and analysis of computer experiments with the Nimrod toolkit 1 Using R for the design and analysis of computer experiments with the Nimrod toolkit Neil Diamond 1 , David Abramson 2 , Tom Peachey 2 1. Department of


slide-1
SLIDE 1

Using R for the design and analysis of computer experiments with the Nimrod toolkit 1

Using R for the design and analysis of computer experiments with the Nimrod toolkit

Neil Diamond1, David Abramson2, Tom Peachey2

  • 1. Department of Econometrics and Business Statistics
  • 2. Caulfield School of Information Technology

1 / 22

slide-2
SLIDE 2

Using R for the design and analysis of computer experiments with the Nimrod toolkit 2

Computer Experiments

The design and analysis of computer experiments to explore the behavior of complex systems is becoming increasingly important in science and engineering.

2 / 22

slide-3
SLIDE 3

Using R for the design and analysis of computer experiments with the Nimrod toolkit 2

Computer Experiments

The design and analysis of computer experiments to explore the behavior of complex systems is becoming increasingly important in science and engineering. At least two books on the topic:

2 / 22

slide-4
SLIDE 4

Using R for the design and analysis of computer experiments with the Nimrod toolkit 2

Computer Experiments

The design and analysis of computer experiments to explore the behavior of complex systems is becoming increasingly important in science and engineering. At least two books on the topic:

The Design and Analysis of Computer

  • Experiments. T. J. Santner, B. J. Williamns, W.I
  • Notz. (2003), Springer: New York.

2 / 22

slide-5
SLIDE 5

Using R for the design and analysis of computer experiments with the Nimrod toolkit 2

Computer Experiments

The design and analysis of computer experiments to explore the behavior of complex systems is becoming increasingly important in science and engineering. At least two books on the topic:

The Design and Analysis of Computer

  • Experiments. T. J. Santner, B. J. Williamns, W.I
  • Notz. (2003), Springer: New York.

Design and Modeling for Computer Experiments. K-T. Fang, R. Li, A. Sudjianto. (2006), Chapman & Hall/CRC: London.

2 / 22

slide-6
SLIDE 6

Using R for the design and analysis of computer experiments with the Nimrod toolkit 2

Computer Experiments

The design and analysis of computer experiments to explore the behavior of complex systems is becoming increasingly important in science and engineering. At least two books on the topic:

The Design and Analysis of Computer

  • Experiments. T. J. Santner, B. J. Williamns, W.I
  • Notz. (2003), Springer: New York.

Design and Modeling for Computer Experiments. K-T. Fang, R. Li, A. Sudjianto. (2006), Chapman & Hall/CRC: London.

Some R packages-more on that later.

2 / 22

slide-7
SLIDE 7

Using R for the design and analysis of computer experiments with the Nimrod toolkit 3

Nimrod

Developed by Computer Scientists at Monash University’s eScience and Grid Engineering Laboratory.

3 / 22

slide-8
SLIDE 8

Using R for the design and analysis of computer experiments with the Nimrod toolkit 3

Nimrod

Developed by Computer Scientists at Monash University’s eScience and Grid Engineering Laboratory. Automates the formulation, running, and collation of the individual experiments.

3 / 22

slide-9
SLIDE 9

Using R for the design and analysis of computer experiments with the Nimrod toolkit 3

Nimrod

Developed by Computer Scientists at Monash University’s eScience and Grid Engineering Laboratory. Automates the formulation, running, and collation of the individual experiments. Includes a distributed scheduling component that can manage the scheduling of individual jobs.

3 / 22

slide-10
SLIDE 10

Using R for the design and analysis of computer experiments with the Nimrod toolkit 4

Nimrod Set of Tools

Nimrod contains tools to perform a complete parameter sweep across all possible combinations (Nimrod/G),

4 / 22

slide-11
SLIDE 11

Using R for the design and analysis of computer experiments with the Nimrod toolkit 4

Nimrod Set of Tools

Nimrod contains tools to perform a complete parameter sweep across all possible combinations (Nimrod/G), search using non-linear optimization algorithms (Nimrod/O),

4 / 22

slide-12
SLIDE 12

Using R for the design and analysis of computer experiments with the Nimrod toolkit 4

Nimrod Set of Tools

Nimrod contains tools to perform a complete parameter sweep across all possible combinations (Nimrod/G), search using non-linear optimization algorithms (Nimrod/O),

  • r use fractional factorial design techniques

(Nimrod/E).

4 / 22

slide-13
SLIDE 13

Using R for the design and analysis of computer experiments with the Nimrod toolkit 4

Nimrod Set of Tools

Nimrod contains tools to perform a complete parameter sweep across all possible combinations (Nimrod/G), search using non-linear optimization algorithms (Nimrod/O),

  • r use fractional factorial design techniques

(Nimrod/E).

4 / 22

slide-14
SLIDE 14

Using R for the design and analysis of computer experiments with the Nimrod toolkit 4

Nimrod Set of Tools

Nimrod contains tools to perform a complete parameter sweep across all possible combinations (Nimrod/G), search using non-linear optimization algorithms (Nimrod/O),

  • r use fractional factorial design techniques

(Nimrod/E). These can be run stand-alone or accessed via the Nimrod portal

4 / 22

slide-15
SLIDE 15

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies

5 / 22

slide-16
SLIDE 16

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies Laser Physics

5 / 22

slide-17
SLIDE 17

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies Laser Physics Ecology

5 / 22

slide-18
SLIDE 18

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies Laser Physics Ecology Quantum Chemistry

5 / 22

slide-19
SLIDE 19

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies Laser Physics Ecology Quantum Chemistry CAD Digital Simulation

5 / 22

slide-20
SLIDE 20

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies Laser Physics Ecology Quantum Chemistry CAD Digital Simulation Antenna Design

5 / 22

slide-21
SLIDE 21

Using R for the design and analysis of computer experiments with the Nimrod toolkit 5

Nimrod Applications

Nimrod has been used in an extensive range of applications Air Pollution Studies Laser Physics Ecology Quantum Chemistry CAD Digital Simulation Antenna Design Cardiac Modelling

5 / 22

slide-22
SLIDE 22

Using R for the design and analysis of computer experiments with the Nimrod toolkit 6

Workflow Engines

There are a number of workflow engines which provide scientists with an environment with which they can manage data, the workflows of the various analytical steps in their investigation, and summaries of findings.

6 / 22

slide-23
SLIDE 23

Using R for the design and analysis of computer experiments with the Nimrod toolkit 6

Workflow Engines

There are a number of workflow engines which provide scientists with an environment with which they can manage data, the workflows of the various analytical steps in their investigation, and summaries of findings. Although existing workflow systems can specify arbitrary parallel programs, they are typically not effective with large and variable parallelism.

6 / 22

slide-24
SLIDE 24

Using R for the design and analysis of computer experiments with the Nimrod toolkit 6

Workflow Engines

There are a number of workflow engines which provide scientists with an environment with which they can manage data, the workflows of the various analytical steps in their investigation, and summaries of findings. Although existing workflow systems can specify arbitrary parallel programs, they are typically not effective with large and variable parallelism. Similarly, Nimrod was not designed to execute arbitrary workflows.

6 / 22

slide-25
SLIDE 25

Using R for the design and analysis of computer experiments with the Nimrod toolkit 6

Workflow Engines

There are a number of workflow engines which provide scientists with an environment with which they can manage data, the workflows of the various analytical steps in their investigation, and summaries of findings. Although existing workflow systems can specify arbitrary parallel programs, they are typically not effective with large and variable parallelism. Similarly, Nimrod was not designed to execute arbitrary workflows. Thus, it is difficult to run sweeps over workflows, and workflows containing sweeps.

6 / 22

slide-26
SLIDE 26

Using R for the design and analysis of computer experiments with the Nimrod toolkit 7

Nimrod/K

To overcome these problems, a new tool (Nimrod/K) is being developed, based on the Kepler workflow engine (Kepler Core, 2009).

7 / 22

slide-27
SLIDE 27

Using R for the design and analysis of computer experiments with the Nimrod toolkit 7

Nimrod/K

To overcome these problems, a new tool (Nimrod/K) is being developed, based on the Kepler workflow engine (Kepler Core, 2009). It leverages a number of the techniques developed in the earlier Nimrod tools for distributing tasks to the Grid.

7 / 22

slide-28
SLIDE 28

Using R for the design and analysis of computer experiments with the Nimrod toolkit 7

Nimrod/K

To overcome these problems, a new tool (Nimrod/K) is being developed, based on the Kepler workflow engine (Kepler Core, 2009). It leverages a number of the techniques developed in the earlier Nimrod tools for distributing tasks to the Grid. Kepler allows the user to specify R expressions and access R objects as part of the scientific workflow.

7 / 22

slide-29
SLIDE 29

Using R for the design and analysis of computer experiments with the Nimrod toolkit 8

Example Workflow

8 / 22

slide-30
SLIDE 30

Using R for the design and analysis of computer experiments with the Nimrod toolkit 9

Statistical Approach to Computer Experiments

Unlike physical experiments, repeated experiments give the same results.

9 / 22

slide-31
SLIDE 31

Using R for the design and analysis of computer experiments with the Nimrod toolkit 9

Statistical Approach to Computer Experiments

Unlike physical experiments, repeated experiments give the same results. Model the output as the realisation of a stochastic process with a correlation structure that depends on a distance to other points in the experiment.

9 / 22

slide-32
SLIDE 32

Using R for the design and analysis of computer experiments with the Nimrod toolkit 9

Statistical Approach to Computer Experiments

Unlike physical experiments, repeated experiments give the same results. Model the output as the realisation of a stochastic process with a correlation structure that depends on a distance to other points in the experiment. Allows estimates of untried experiments.

9 / 22

slide-33
SLIDE 33

Using R for the design and analysis of computer experiments with the Nimrod toolkit 9

Statistical Approach to Computer Experiments

Unlike physical experiments, repeated experiments give the same results. Model the output as the realisation of a stochastic process with a correlation structure that depends on a distance to other points in the experiment. Allows estimates of untried experiments. Gives an estimate of the uncertainty.

9 / 22

slide-34
SLIDE 34

Using R for the design and analysis of computer experiments with the Nimrod toolkit 10

Computer Experiments-Designs

Simplest method-Latin Hypercubes

10 / 22

slide-35
SLIDE 35

Using R for the design and analysis of computer experiments with the Nimrod toolkit 10

Computer Experiments-Designs

Simplest method-Latin Hypercubes Other more sophisticated methods include Orthogonal Arrays and Scrambled Nets.

10 / 22

slide-36
SLIDE 36

Using R for the design and analysis of computer experiments with the Nimrod toolkit 10

Computer Experiments-Designs

Simplest method-Latin Hypercubes Other more sophisticated methods include Orthogonal Arrays and Scrambled Nets. Various space filling designs.

10 / 22

slide-37
SLIDE 37

Using R for the design and analysis of computer experiments with the Nimrod toolkit 11

Computer Experiments-Model

Response = Linear Model + Departure y(x) = β + z(x) E(z(x) = 0 Cov(z(t), z(u)) = σ2

z d

  • j=1

Rj(tj, uj) Rj(tj, uj) = exp [−θj(tj − uj)pj]

11 / 22

slide-38
SLIDE 38

Using R for the design and analysis of computer experiments with the Nimrod toolkit 12

MLE of θ, p, β, and σ2

Reduces to numerically optimising −1 2(n ln ˆ σ2 + ln detRD) RD = Matrix of correlations for design points ˆ β = (1TR−1

D 1T)−11TR−1 D y

ˆ σ2 = 1 n(y − 1ˆ β)TR−1

D (y − 1ˆ

β)

12 / 22

slide-39
SLIDE 39

Using R for the design and analysis of computer experiments with the Nimrod toolkit 13

Best Linear Unbiased Predictor for an untried x

ˆ yx = ˆ β + r T(x)R−1

D (y − 1ˆ

β) where r(x) = [R(x1, x), R(x2, x), . . ., R(xn, x)]T Design point : [x1, x2, . . . , xn] Untried Input : x Interpolates the data points.

13 / 22

slide-40
SLIDE 40

Using R for the design and analysis of computer experiments with the Nimrod toolkit 14

Implementations in R

BACCO

14 / 22

slide-41
SLIDE 41

Using R for the design and analysis of computer experiments with the Nimrod toolkit 14

Implementations in R

BACCO

Emulator

14 / 22

slide-42
SLIDE 42

Using R for the design and analysis of computer experiments with the Nimrod toolkit 14

Implementations in R

BACCO

Emulator Approximator

14 / 22

slide-43
SLIDE 43

Using R for the design and analysis of computer experiments with the Nimrod toolkit 14

Implementations in R

BACCO

Emulator Approximator Calibrator

14 / 22

slide-44
SLIDE 44

Using R for the design and analysis of computer experiments with the Nimrod toolkit 14

Implementations in R

BACCO

Emulator Approximator Calibrator

mlegp: an R package for Gaussian process modeling and sensitivity analysis

14 / 22

slide-45
SLIDE 45

Using R for the design and analysis of computer experiments with the Nimrod toolkit 14

Implementations in R

BACCO

Emulator Approximator Calibrator

mlegp: an R package for Gaussian process modeling and sensitivity analysis Certainly others . . .

14 / 22

slide-46
SLIDE 46

Using R for the design and analysis of computer experiments with the Nimrod toolkit 15

Example Workflow

15 / 22

slide-47
SLIDE 47

Using R for the design and analysis of computer experiments with the Nimrod toolkit 16

Latin Hypercube Actor

16 / 22

slide-48
SLIDE 48

Using R for the design and analysis of computer experiments with the Nimrod toolkit 17

Latin Hypercube Design

17 / 22

slide-49
SLIDE 49

Using R for the design and analysis of computer experiments with the Nimrod toolkit 18

Nimrod/K Actor

Nimrod takes the experimental design and controls the running of the experiments and collation of results.

18 / 22

slide-50
SLIDE 50

Using R for the design and analysis of computer experiments with the Nimrod toolkit 18

Nimrod/K Actor

Nimrod takes the experimental design and controls the running of the experiments and collation of results. Passes the results onto mlegp actor which fits the Gaussian model to the data.

18 / 22

slide-51
SLIDE 51

Using R for the design and analysis of computer experiments with the Nimrod toolkit 19

mlegp predictions Actor

Takes fitted model and predicts at a grid of untried inputs.

19 / 22

slide-52
SLIDE 52

Using R for the design and analysis of computer experiments with the Nimrod toolkit 19

mlegp predictions Actor

Takes fitted model and predicts at a grid of untried inputs. Inputs are the granularity

  • f the grid, and which are

the primary and conditioning inputs.

19 / 22

slide-53
SLIDE 53

Using R for the design and analysis of computer experiments with the Nimrod toolkit 19

mlegp predictions Actor

Takes fitted model and predicts at a grid of untried inputs. Inputs are the granularity

  • f the grid, and which are

the primary and conditioning inputs. Uses Lattice graphics to produce a visualisation of the surface.

19 / 22

slide-54
SLIDE 54

Using R for the design and analysis of computer experiments with the Nimrod toolkit 20

Visualisation

x1 x2 yhat : x3 x1 x2 yhat : x3 8.33 x1 x2 yhat : x3 16.67 x1 x2 yhat : x3 25

2 4 6 8 10

20 / 22

slide-55
SLIDE 55

Using R for the design and analysis of computer experiments with the Nimrod toolkit 21

Key Message

Computer Experiments are very important.

21 / 22

slide-56
SLIDE 56

Using R for the design and analysis of computer experiments with the Nimrod toolkit 21

Key Message

Computer Experiments are very important. Many tools in R both to design and analyse computer experiments.

21 / 22

slide-57
SLIDE 57

Using R for the design and analysis of computer experiments with the Nimrod toolkit 21

Key Message

Computer Experiments are very important. Many tools in R both to design and analyse computer experiments. Nimrod tools are convenient in managing the execution of the computer experiments.

21 / 22

slide-58
SLIDE 58

Using R for the design and analysis of computer experiments with the Nimrod toolkit 21

Key Message

Computer Experiments are very important. Many tools in R both to design and analyse computer experiments. Nimrod tools are convenient in managing the execution of the computer experiments. Using Nimrod/K takes advantage of the Kepler workflow engine.

21 / 22

slide-59
SLIDE 59

Using R for the design and analysis of computer experiments with the Nimrod toolkit 21

Key Message

Computer Experiments are very important. Many tools in R both to design and analyse computer experiments. Nimrod tools are convenient in managing the execution of the computer experiments. Using Nimrod/K takes advantage of the Kepler workflow engine. Kepler and R are integrated, making it easy to use existing packages in R for computer experiments, and extends their usefulness.

21 / 22

slide-60
SLIDE 60

Using R for the design and analysis of computer experiments with the Nimrod toolkit 22

MeSsAGE Lab

Monash eScience and Grid Engineering Laboratory http://messagelab.monash.edu.au/

22 / 22