Part2: Analysis Prepared by: Paul Funkenbusch, Department of - - PowerPoint PPT Presentation

part2 analysis
SMART_READER_LITE
LIVE PREVIEW

Part2: Analysis Prepared by: Paul Funkenbusch, Department of - - PowerPoint PPT Presentation

Part2: Analysis Prepared by: Paul Funkenbusch, Department of Mechanical Engineering, University of Rochester Review ANOM ANOVA Error estimate Replication vs. Pooling ANOVA table Judging statistical significance DOE


slide-1
SLIDE 1

Part2: Analysis

Prepared by: Paul Funkenbusch, Department of Mechanical Engineering, University of Rochester

slide-2
SLIDE 2

 Review  ANOM  ANOVA

  • Error estimate  Replication vs. Pooling
  • ANOVA table
  • Judging statistical significance

DOE mini-course, part 2, Paul Funkenbusch, 2015 2

slide-3
SLIDE 3

Terms

Example

(measure the volume of a balloon as a function of temperature and pressure)

 Factors  variables whose

influence you want to study.

 Levels specific values given

to a factor during experiments (initially limit ourselves to 2- levels)

 Treatment condition  one

running of the experiment

 Response  result measured

for a treatment condition

 Temperature,

Pressure

 50C, 100C

1Pa, 2Pa

 Set T = 50C, P = 1Pa

and measure volume

 measured volume

DOE mini-course, part 2, Paul Funkenbusch, 2015 3

slide-4
SLIDE 4

Leve vel Factor

  • 1

+1

  • X1. Temperature (C)

50 100

  • X2. Pressure (Pa)

1 2 TC TC X1 X1 X2 X2 y 1

  • 1
  • 1

y1 2 +1

  • 1

y2 3

  • 1

+1 y3 4 +1 +1 y4 etc.

DOE mini-course, part 2, Paul Funkenbusch, 2015

Use X1, X2, etc. to designate factors. Use -1, +1 to designate levels X1 at level -1 means T = 50 C Use a table to show factor levels and response (a) for each treatment condition. For example, during TC2, set T = 100C and P = 1Pa, measure the balloon volume = y2

y  response y = volume of balloon

4

slide-5
SLIDE 5

 Test all combinations  Responses  4 DOF

  • 4 measured (y) values

 Effects  4 DOF

  • 4 values calculated
  • m* = (y1+y2+y3+y4)/4
  • DX1 = (y3+y4)/2 - (y1+y2)/2
  • DX2 = (y2+y4)/2 - (y1+y3)/2
  • D12 = (y1+y4)/2 - (y2+y3)/2

 Model  4 DOF

  • 4 constants in model
  • ypred = ao+a1X1+a2X2+a12X1X2

DOE mini-course, part 2, Paul Funkenbusch, 2015

Leve vel Factor

  • 1

+1

  • X1. Temp (C)

50 100

  • X2. Pressure (Pa)

1 2 TC TC X1 X1 X2 X2 X1*X2 *X2 y 1

  • 1
  • 1

+1 y1 2

  • 1

+1

  • 1

y2 3 +1

  • 1
  • 1

y3 4 +1 +1 +1 y4

5

slide-6
SLIDE 6

 Most basic level of analysis  Which effects are largest (which

factors/interactions most important)?

 Which levels produce the best (highest or

lowest) responses?

 Just based on the D values

  • you’ve already done this

DOE mini-course, part 2, Paul Funkenbusch, 2015

   

1 1

m

  • m

1 level at response average

  • 1

level at response average

 

    D

6

slide-7
SLIDE 7

Sign and Magnitude of D Graphically

 Positive D

  • +1 level should increase

the response

  • -1 level should decrease

the response

 Negative D  reversed  Magnitude of D

  • Indicates relative

importance

DOE mini-course, part 2, Paul Funkenbusch, 2015

m* A-1 A+1 B-1 B+1

 Choose A-1 and B+1 to

increase response

 A is more important

7

slide-8
SLIDE 8

 Can treat interactions terms the same way  Larger D  more important interaction  If interactions are large  “best settings” to

increase (or decrease) the response will depend

  • n the combination of factor levels

 Use model to test different combinations

  • ypred = ao+a1X1+a2X2+a12X1X2

DOE mini-course, part 2, Paul Funkenbusch, 2015 8

slide-9
SLIDE 9

DOE mini-course, part 2, Paul Funkenbusch, 2015

Leve vel Factor

  • 1

+1

  • X1. applied load (kg)

2 3

  • X2. previous cuts

(new) 20

 From part I.  Removal rate of

  • steotomy drills vs.

applied load and number of cuts.

 Which effects are

most important?

 How can you

increase the removal rate?

TC TC X1 X1 X2 X2 X1*X2 *X2 Remov

  • val

al rate (mm3/s) /s) 1

  • 1
  • 1

+1 3 2

  • 1

+1

  • 1

2 3 +1

  • 1
  • 1

5 4 +1 +1 +1 2

9

slide-10
SLIDE 10

DOE mini-course, part 2, Paul Funkenbusch, 2015

 Effects

  • DX1 = +1
  • Dx2 = -2
  • D12 = -1

 Number of previous cuts is most important  new drill (level -1) will increase removal rate the most  Applied load and the interaction are comparable  higher load (level +1) will increase removal rate  but need to test combinations (since interaction is

important too).

Factor Leve vel

  • 1

+1

  • X1. applied load (kg)

2 3

  • X2. previous cuts

(new) 20

10

slide-11
SLIDE 11

DOE mini-course, part 2, Paul Funkenbusch, 2015

 ypred = 3.0 + (0.5)X1 - (1.0)X2 - (0.5)X1X2  Set X2 = -1 (Much larger than X1 and interaction)  What is best level for X1?

  • For X1 = -1, X2 = -1  ypred = 3.0
  • For X1 = +1, X2 = -1  ypred = 5.0

 Best settings X1 = +1 (3kg load), X2 = -1 (new drill)  Note; This is a synergistic interaction,

  • Best level for interaction (-1) corresponds to best factor levels

[X1X2 = (+1)(-1) = -1]

  • Interaction enhances effects of “best” factor level choices

 For an anti-synergistic interaction,

  • Conflict between best factor settings and best interaction level
  • Best overall settings then depend on relative strength of the

interaction vs. factor

11

slide-12
SLIDE 12

 Second level of analysis  Which of the observed effects are statistically

significant?

  • Based on comparing observed effects against an estimate
  • f error.
  • Compares D2 for factors and interactions with D2 for

error.

  • Actually compare “mean square” or “MS”  proportional

to D2

 How much does each factor/interaction contribute

to the total variance in system?

DOE mini-course, part 2, Paul Funkenbusch, 2015 12

slide-13
SLIDE 13

Replication Pooling of higher-order interactions

Repeat (replicate) each of the treatment conditions

Independent experimental runs (not multiple measurements from the same TC)

Differences in the responses measured for identical TCs run at different times provide error

“Pure error”  not dependent on modeling assumptions

Best way to estimate error, but greatly increases effort

DOE mini-course, part 2, Paul Funkenbusch, 2015

Assume that higher-order interactions are unimportant/zero

Must choose these interactions upfront (before examining results)  these form a “pool” for error

Effects measured for pooled interactions are used to estimate error

“Error”  includes modeling error (i.e. assumptions about interactions)

Requires less experimental effort, but error estimate is not as good

13

slide-14
SLIDE 14

TC TC X1 X1 X2 X2 y 1

  • 1
  • 1

y1 2

  • 1

+1 y2 3 +1

  • 1

y3 4 +1 +1 y4

DOE mini-course, part 2, Paul Funkenbusch, 2015

1b

  • 1
  • 1

y1b 2b

  • 1

+1 y2b 3b +1

  • 1

y3b 4b +1 +1 y4b TC TC X1 X1 X2 X2 y 1a

  • 1
  • 1

y1a 2a

  • 1

+1 y2a 3a +1

  • 1

y3a 4a +1 +1 y4a

 Original design two factors at 2-

levels

 4 DOF  m*, DX1, Dx2, D12  Replicated design (2x)  8 DOF total  4 DOF  m*, DX1, Dx2, D12  + 4 DOF for error  Contrast responses measured

under nominally identical TC

2X

14

slide-15
SLIDE 15

DOE mini-course, part 2, Paul Funkenbusch, 2015

 Test more factors. Increase size and DOF.  Example: three 2-level factors instead of two.  8 DOF total

1 DOF for m* 3 DOF for factors DX1, DX2, DX3 3 DOF for 2-factor interactions D12, D13, D23 1 DOF for 3-factor interaction D123

 Pool all interactions  decide before examining results  assess m*, DX1, DX2, DX3 (4 DOF)  use D12, D13, D23 ,D123 for error (4 DOF)

TC TC X1 X1 X2 X2 X3 X3 y 1

  • 1 -1 -1 y1

2

  • 1 -1 +1 y2

3

  • 1 +1 -1 y3

4

  • 1 +1 +1 y4

5 +1 -1 -1 y5 6 +1 -1 +1 y6 7 +1 +1 -1 y7 8 +1 +1 +1 y8

Note: alternative choice (pool only the highest order , 3- factor,interaction) is possible, but only leaves 1 DOF for the error estimate  not desirable. For larger experiments this is not as much of a constraint (more higher-order interactions that can be pooled).

15

slide-16
SLIDE 16

 24 = 16TC = 16 DOF

  • m*

 1 DOF

  • factors

 4 DOF

  • 2-factor inter.

 6 DOF

  • 3-factor inter.

 4 DOF

  • 4-factor inter.

 1 DOF

 Pool 3 and 4 factor int.

  • Assess factors and 2-factor

interactions

  • 5 DOF for error estimate

 25 = 32TC = 32 DOF

  • m*

 1 DOF

  • factors

 5 DOF

  • 2-factor inter.

 10 DOF

  • 3-factor inter.

 10 DOF

  • 4-factor inter.

 5 DOF

  • 5-factor inter.

 1 DOF

 Pool 4 and 5 factor int.

  • Assess factors and 2-factor

and 3-factor interactions

  • 6 DOF for error estimate

DOE mini-course, part 2, Paul Funkenbusch, 2015

Four 2-level factors Five 2-level factors

16

slide-17
SLIDE 17

 Best for:

  • small numbers of factors
  • systems with large

uncertainties

DOE mini-course, part 2, Paul Funkenbusch, 2015

“Pure error” (no modeling assumptions needed) Assess more factors for same effort (or same number of factors for less effort)

 Best for:

  • large numbers of factors
  • systems with strong

time/cost constraints on experimental size

17

slide-18
SLIDE 18

Source ce SS SS DOF MS MS F p % SS A 100 1 100 20 0.011

56

B 50 1 50 10 0.034

28

AxB 10 1 10 2 0.230

6

error 20 4 5

  • 11

Total 180 7

  • 100

 Typical way of presenting ANOVA results.  Explain each column so you can interpret these

results.

 Most software packages will output their analysis in

some variant of this format.

 Will also show how you can do the calculations.

DOE mini-course, part 2, Paul Funkenbusch, 2015 18

slide-19
SLIDE 19

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20

0.011 56

X2 50 1 50 10

0.034 28

X1X2 10 1 10 2

0.230 6

error 20 4 5

  • 11

Total 180 7

  • 100

 ANOVA works by determining how much of the variance in the

experiment can be attributed to each source.

 Sources are factors, interactions, and the error.

  • Interactions “pooled” to get error are included in the error row and do

not appear as a separate source (don’t double count them).

 The “Total” includes everything except terms which are

attributed to the overall average (i.e. m*).

DOE mini-course, part 2, Paul Funkenbusch, 2015 19

slide-20
SLIDE 20

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20 0.011

56

X2 50 1 50 10 0.034

28

X1X2 10 1 10 2 0.230

6

error 20 4 5

  • 11

Total 180 7

  • 100

 Total  variance of the responses about the overall average.

(summation over all treatment conditions)

 For 2-level (factor or interaction) in a factorial design:

SS = D2 х (# of TC)/4

 Can obtain the error term by subtraction.

DOE mini-course, part 2, Paul Funkenbusch, 2015

   

n 1 = i 2 i

* m = SS Total y

Measures the variance attributable to each effect (factor

  • r interaction)

20

slide-21
SLIDE 21

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20

0.011 56

X2 50 1 50 10

0.034 28

X1X2 10 1 10 2

0.230 6

error 20 4 5

  • 11

Total 180 7

  • 100

 Total  One less than the total number of treatment

conditions (i.e. one less than the # of responses).

  • Because 1 DOF is used in calculating m*

 For 2-level (factors or interactions)  1 DOF  Can obtain the error term by subtraction.

DOE mini-course, part 2, Paul Funkenbusch, 2015 21

slide-22
SLIDE 22

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20 0.011

56

X2 50 1 50 10 0.034

28

X1X2 10 1 10 2 0.230

6

error 20 4 5

  • 11

Total 180 7

  • 100

 SS “normalized” against the DOF  For 2-level (factors or interactions)  1 DOF  MS = SS  For error, averages the different measurements of the

error variance.

DOE mini-course, part 2, Paul Funkenbusch, 2015 22

slide-23
SLIDE 23

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20 0.011

56

X2 50 1 50 10 0.034

28

X1X2 10 1 10 2 0.230

6

error 20 4 5

  • 11

Total 180 7

  • 100

 Compares size of effect with error  The larger the F, the more likely an effect is “real”  But judging statistical significance also depends on the DOF for

the effect, the DOF for the error, and chosen significance level.

 Having at least 2 (and preferably 3 or 4) DOF for error greatly

improves chances of identifying statistically significant effects.

DOE mini-course, part 2, Paul Funkenbusch, 2015 23

slide-24
SLIDE 24

 Standard tables are available in most statistics textbooks to

determine the critical value of F based on the DOF for error, DOF for effect, and the chosen significance level a).

 a estimates the chance that an F value larger than the critical

value could occur randomly (i.e. even if the effect was not real)

 If F > Fcritical , the factor or interaction is judged statistically

significant.

DOE mini-course, part 2, Paul Funkenbusch, 2015

DOF (error) DOF (effect) 1 2 3 1 161.45 199.50 215.71 2 18.51 19.00 19.16 3 10.13 9.55 9.28 4 7.71 6.94 6.59 Critical F for a = 0.05 Portion of an a = 0.05 table

(Note the very high values when there is only 1 DOF for error.)

24

slide-25
SLIDE 25

 Use a  0.05  Fcritical = 7.71  F (X1) = 20 > Fcritical  X1 significant  F (X2) = 10 > Fcritical  X2 significant  F (X1X2) = 2 < Fcritical  X1X2 not significant

DOE mini-course, part 2, Paul Funkenbusch, 2015

Source ce SS SS DOF MS MS F X1 100 1 100 20 X2 50 1 50 10 X1X2 10 1 10 2 error 20 4 5

  • Total

180 7

  • 25
slide-26
SLIDE 26

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20

0.011 56

X2 50 1 50 10

0.034 28

X1X2 10 1 10 2

0.230 6

error 20 4 5

  • 11

Total 180 7

  • 100

 Difficult/cumbersome to determine p values “manually”. But,

commonly provided by statistical software packages

 Estimates how likely an F value as big as that observed is to have

  • ccurred randomly (i.e. if the effect was not real)

 p values below a chosen a value (typically = 0.05)  indicate

statistical significance

 p depends on the DOF (effect) and DOF (error) in addition to F

DOE mini-course, part 2, Paul Funkenbusch, 2015 26

slide-27
SLIDE 27

 Choose a significance level, a  0.05  p (X1) = 0.011 < 0.05  X1 significant  p (X2) = 0.034 < 0.05  X2 significant  p (X1X2) = 0.23 > 0.05  X1X2 not significant

DOE mini-course, part 2, Paul Funkenbusch, 2015

Source ce SS SS DOF MS MS F p % SS X1 100 1 100 20 0.011

56

X2 50 1 50 10 0.034

28

X1X2 10 1 10 2 0.230

6

error 20 4 5

  • 11

Total 180 7

  • 100

27

slide-28
SLIDE 28

Source ce SS SS DOF MS MS F p %SS X1 100 1 100 20 0.011 56 X2 50 1 50 10 0.034 28 X1X2 10 1 10 2 0.230 6 error 20 4 5

  • 11

Total 180 7

  • 100

 Sometimes used to measure the

“importance” of each factor/interaction

 How much of the total variance in the

experiment can be attributed to each of the factors/interactions?

DOE mini-course, part 2, Paul Funkenbusch, 2015

84% of the total variance can be attributed to the two factor effects.

28

slide-29
SLIDE 29

1b

  • 1
  • 1

+1 3.2 2b

  • 1

+1

  • 1

1.9 3b +1

  • 1
  • 1

5.3 4b +1 +1 +1 1.8

DOE mini-course, part 2, Paul Funkenbusch, 2015

 Data on the removal

rate of osteotomy drills is collected as a function of the load applied and the number of previous cuts made.

 Assume the data

was collected as part of an experiment with one replication

TC TC X1 X1 X2 X2 X1*X2 *X2 Remov

  • val

al rate (mm3/s) /s) 1a

  • 1
  • 1

+1 2.8 2a

  • 1

+1

  • 1

2.1 3a +1

  • 1
  • 1

4.7 4a +1 +1 +1 2.2

29

slide-30
SLIDE 30

DOE mini-course, part 2, Paul Funkenbusch, 2015

Effects

m* = +3 DX1 = +1 Dx2 = -2 Dx1x2 = -1

 X1

  • SS = D2*(# of TC)/4 = 12*(8)/4 = 2; DOF = 1

 X2

  • SS = D2*(# of TC)/4 = (-22)*(8)/4 = 8; DOF = 1

 X1X2

  • SS = D2*(# of TC)/4 = 12*(8)/4 = 2; DOF = 1

 Total

  • DOF = (# of TC) – 1 = 8 – 1 = 7

 Error (by subtraction)

  • SS = 12.36 – (2 + 8 + 2) = 0.36
  • DOF = 7 – (1 + 1 + 1) = 4

       

36 . 12 3 8 . 1 ... 3 1 . 2 3 8 . 2 * m = SS

8 1 2 2 2 n 1 = i 2 i

       

 

 i

y

30

slide-31
SLIDE 31

DOE mini-course, part 2, Paul Funkenbusch, 2015

 Judge X1 (load), X2 (previous # of cuts), and X1X2

(interaction between load and # of cuts) significant.

Source ce SS SS DOF MS MS F X1 2 1 2 22.2 X2 8 1 8 88.8 X1X2 2 1 2 22.2 error 0.36 4 0.09

  • Total

12.36 7

  • F critical = 7.71

for a  0.05 Critical F for a = 0.05

31

slide-32
SLIDE 32

 ANalysis OF Means (ANOM)

  • Which effects are largest (which factors/interactions most

important)?

  • Which levels produce the best (highest or lowest) responses?

 ANalysis Of VAriance (ANOVA)

  • Which of the observed effects are statistically significant?

 Error estimate

  • Replication  pure error, multiplies required effort
  • Pooling  requires upfront assumptions (sparsity of effects)

DOE mini-course, part 2, Paul Funkenbusch, 2015 32

slide-33
SLIDE 33

 This material is based on work supported by

the National Science Foundation under grant CMMI-1100632.

 The assistance of Prof. Amy Lerner and Mr.

Alex Kotelsky in preparation of this material is gratefully acknowledged.

 This material was originally presented as a

module in the course BME 283/483, Biosolid Mechanics.

33 DOE mini-course, part 2, Paul Funkenbusch, 2015