Principles of Experimental Design Applied Statistics and - - PowerPoint PPT Presentation

principles of experimental design
SMART_READER_LITE
LIVE PREVIEW

Principles of Experimental Design Applied Statistics and - - PowerPoint PPT Presentation

Induction Model of a process or system Experiments and observational studies Principles of Experimental Design Applied Statistics and Experimental Design Chapter 1 Peter Hoff Statistics, Biostatistics and the CSSS University of Washington


slide-1
SLIDE 1

Induction Model of a process or system Experiments and observational studies

Principles of Experimental Design

Applied Statistics and Experimental Design Chapter 1

Peter Hoff

Statistics, Biostatistics and the CSSS University of Washington

slide-2
SLIDE 2

Induction Model of a process or system Experiments and observational studies

Induction

Much of our scientific knowledge about processes and systems is based on induction: reasoning from the specific to the general. Example(survey): Do you favor increasing the gas tax for public transportation?

  • Specific cases:

200 people called for a telephone survey

  • Inferential goal:

get information on the opinion of the entire city. Example (Women’s Health Initiative): Does hormone replacement improve health status in post-menopausal women?

  • Specific cases:

Health status monitored in 16,608 women over a 5-year period. Some took hormones, others did not.

  • Inferential goal:

Determine if hormones improve the health of women not in the study.

slide-3
SLIDE 3

Induction Model of a process or system Experiments and observational studies

Induction

Much of our scientific knowledge about processes and systems is based on induction: reasoning from the specific to the general. Example(survey): Do you favor increasing the gas tax for public transportation?

  • Specific cases:

200 people called for a telephone survey

  • Inferential goal:

get information on the opinion of the entire city. Example (Women’s Health Initiative): Does hormone replacement improve health status in post-menopausal women?

  • Specific cases:

Health status monitored in 16,608 women over a 5-year period. Some took hormones, others did not.

  • Inferential goal:

Determine if hormones improve the health of women not in the study.

slide-4
SLIDE 4

Induction Model of a process or system Experiments and observational studies

Induction

Much of our scientific knowledge about processes and systems is based on induction: reasoning from the specific to the general. Example(survey): Do you favor increasing the gas tax for public transportation?

  • Specific cases:

200 people called for a telephone survey

  • Inferential goal:

get information on the opinion of the entire city. Example (Women’s Health Initiative): Does hormone replacement improve health status in post-menopausal women?

  • Specific cases:

Health status monitored in 16,608 women over a 5-year period. Some took hormones, others did not.

  • Inferential goal:

Determine if hormones improve the health of women not in the study.

slide-5
SLIDE 5

Induction Model of a process or system Experiments and observational studies

Model of a variable process

Process x1 x2 ǫ y How do the inputs of a process affect an output? Input variables consist of

  • controllable factors x1: measured and determined by scientist.
  • uncontrollable factors x2: measured but not determined by scientist.
  • noise factors ǫ: unmeasured, uncontrolled factors, often called

experimental variability or “error”.

slide-6
SLIDE 6

Induction Model of a process or system Experiments and observational studies

Model of a variable process

Process x1 x2 ǫ y For any interesting process, there are inputs such that: variability in input → variability in output If variability in x leads to variability y, we say x is a source of variation. Good design and analysis of experiments can identify sources of variation.

slide-7
SLIDE 7

Induction Model of a process or system Experiments and observational studies

Experiments and observational studies

Information on how inputs affect output can be gained from: Observational studies: Input and output variables are observed from a pre-existing population. It may be hard to say what is input and what is output. Controlled experiments: One or more input variables are controlled and manipulated by the experimenter to determine their effect on the output.

slide-8
SLIDE 8

Induction Model of a process or system Experiments and observational studies

Experiments and observational studies

Information on how inputs affect output can be gained from: Observational studies: Input and output variables are observed from a pre-existing population. It may be hard to say what is input and what is output. Controlled experiments: One or more input variables are controlled and manipulated by the experimenter to determine their effect on the output.

slide-9
SLIDE 9

Induction Model of a process or system Experiments and observational studies

Experiments and observational studies

Information on how inputs affect output can be gained from: Observational studies: Input and output variables are observed from a pre-existing population. It may be hard to say what is input and what is output. Controlled experiments: One or more input variables are controlled and manipulated by the experimenter to determine their effect on the output.

slide-10
SLIDE 10

Induction Model of a process or system Experiments and observational studies

Women’s health initiative (WHI)

Population: Healthy, post-menopausal women in the U.S. Input variables:

  • 1. estrogen treatment, yes/no
  • 2. demographic variables (age, race, diet, etc.)
  • 3. unmeasured variables (?)

Output variables:

  • 1. coronary heart disease (eg. MI)
  • 2. invasive breast cancer
  • 3. other health related outcomes

Scientific question: How does estrogen treatment affect health outcomes?

slide-11
SLIDE 11

Induction Model of a process or system Experiments and observational studies

Women’s health initiative (WHI)

Population: Healthy, post-menopausal women in the U.S. Input variables:

  • 1. estrogen treatment, yes/no
  • 2. demographic variables (age, race, diet, etc.)
  • 3. unmeasured variables (?)

Output variables:

  • 1. coronary heart disease (eg. MI)
  • 2. invasive breast cancer
  • 3. other health related outcomes

Scientific question: How does estrogen treatment affect health outcomes?

slide-12
SLIDE 12

Induction Model of a process or system Experiments and observational studies

Women’s health initiative (WHI)

Population: Healthy, post-menopausal women in the U.S. Input variables:

  • 1. estrogen treatment, yes/no
  • 2. demographic variables (age, race, diet, etc.)
  • 3. unmeasured variables (?)

Output variables:

  • 1. coronary heart disease (eg. MI)
  • 2. invasive breast cancer
  • 3. other health related outcomes

Scientific question: How does estrogen treatment affect health outcomes?

slide-13
SLIDE 13

Induction Model of a process or system Experiments and observational studies

Women’s health initiative (WHI)

Population: Healthy, post-menopausal women in the U.S. Input variables:

  • 1. estrogen treatment, yes/no
  • 2. demographic variables (age, race, diet, etc.)
  • 3. unmeasured variables (?)

Output variables:

  • 1. coronary heart disease (eg. MI)
  • 2. invasive breast cancer
  • 3. other health related outcomes

Scientific question: How does estrogen treatment affect health outcomes?

slide-14
SLIDE 14

Induction Model of a process or system Experiments and observational studies

WHI observational study

Observational population:

  • 93,676 women enlisted starting in 1991;
  • tracked over eight years on average;
  • data consists of
  • x= input variables
  • y=health outcomes,

gathered concurrently on existing populations. Results: Good health/low rates of CHD generally associated with estrogen treatment. Conclusion: Estrogen treatment positively associated with health, such as CHD.

slide-15
SLIDE 15

Induction Model of a process or system Experiments and observational studies

WHI observational study

Observational population:

  • 93,676 women enlisted starting in 1991;
  • tracked over eight years on average;
  • data consists of
  • x= input variables
  • y=health outcomes,

gathered concurrently on existing populations. Results: Good health/low rates of CHD generally associated with estrogen treatment. Conclusion: Estrogen treatment positively associated with health, such as CHD.

slide-16
SLIDE 16

Induction Model of a process or system Experiments and observational studies

WHI observational study

Observational population:

  • 93,676 women enlisted starting in 1991;
  • tracked over eight years on average;
  • data consists of
  • x= input variables
  • y=health outcomes,

gathered concurrently on existing populations. Results: Good health/low rates of CHD generally associated with estrogen treatment. Conclusion: Estrogen treatment positively associated with health, such as CHD.

slide-17
SLIDE 17

Induction Model of a process or system Experiments and observational studies

WHI observational study

Observational population:

  • 93,676 women enlisted starting in 1991;
  • tracked over eight years on average;
  • data consists of
  • x= input variables
  • y=health outcomes,

gathered concurrently on existing populations. Results: Good health/low rates of CHD generally associated with estrogen treatment. Conclusion: Estrogen treatment positively associated with health, such as CHD.

slide-18
SLIDE 18

Induction Model of a process or system Experiments and observational studies

WHI observational study

Observational population:

  • 93,676 women enlisted starting in 1991;
  • tracked over eight years on average;
  • data consists of
  • x= input variables
  • y=health outcomes,

gathered concurrently on existing populations. Results: Good health/low rates of CHD generally associated with estrogen treatment. Conclusion: Estrogen treatment positively associated with health, such as CHD.

slide-19
SLIDE 19

Induction Model of a process or system Experiments and observational studies

WHI observational study

Observational population:

  • 93,676 women enlisted starting in 1991;
  • tracked over eight years on average;
  • data consists of
  • x= input variables
  • y=health outcomes,

gathered concurrently on existing populations. Results: Good health/low rates of CHD generally associated with estrogen treatment. Conclusion: Estrogen treatment positively associated with health, such as CHD.

slide-20
SLIDE 20

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-21
SLIDE 21

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-22
SLIDE 22

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-23
SLIDE 23

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-24
SLIDE 24

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-25
SLIDE 25

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-26
SLIDE 26

Induction Model of a process or system Experiments and observational studies

WHI randomized controlled trial

Experimental population: 373,092 women determined to be eligible ֒ → 18,845 provided consent to be in experiment ֒ → 16,608 included in the experiment 16,608 women randomized to either  x = 1 (estrogen treatment) x = 0 (no estrogen treatment) Women were of different ages and were treated at different clinics. Women were blocked together by age and clinic then treatments were randomly assigned within each age×treatment block. This type of random allocation is called a randomized block design.

slide-27
SLIDE 27

Induction Model of a process or system Experiments and observational studies

WHI Randomization scheme

age group 1 (50-59) 2 (60-69) 3 (70-79) clinic 1 n11 n12 n13 2 n21 n22 n23 . . . . . . . . . . . . ni,j = # of women in study, in clinic i and in age group j = # of women in block i, j Randomization scheme: For each block,

  • 50% of the women randomly assigned to treatment (x = 1)
  • remaining women assigned to control (x = 0).

Question: Why did they randomize within a block?

slide-28
SLIDE 28

Induction Model of a process or system Experiments and observational studies

WHI Randomization scheme

age group 1 (50-59) 2 (60-69) 3 (70-79) clinic 1 n11 n12 n13 2 n21 n22 n23 . . . . . . . . . . . . ni,j = # of women in study, in clinic i and in age group j = # of women in block i, j Randomization scheme: For each block,

  • 50% of the women randomly assigned to treatment (x = 1)
  • remaining women assigned to control (x = 0).

Question: Why did they randomize within a block?

slide-29
SLIDE 29

Induction Model of a process or system Experiments and observational studies

WHI RCT Results

Women on treatment had lower incidence rates for

  • colorectal cancer
  • hip fracture

but higher incidence rates for

  • CHD
  • breast cancer
  • stroke
  • pulmonary embolism

Conclusion: Estrogen isn’t a viable preventative measure for CHD in the general population. That is, our inductive inference is (specific) higher CHD rate in treatment population than control population suggests (general) if the whole population were treated, CHD incidence would increase

slide-30
SLIDE 30

Induction Model of a process or system Experiments and observational studies

WHI RCT Results

Women on treatment had lower incidence rates for

  • colorectal cancer
  • hip fracture

but higher incidence rates for

  • CHD
  • breast cancer
  • stroke
  • pulmonary embolism

Conclusion: Estrogen isn’t a viable preventative measure for CHD in the general population. That is, our inductive inference is (specific) higher CHD rate in treatment population than control population suggests (general) if the whole population were treated, CHD incidence would increase

slide-31
SLIDE 31

Induction Model of a process or system Experiments and observational studies

WHI RCT Results

Women on treatment had lower incidence rates for

  • colorectal cancer
  • hip fracture

but higher incidence rates for

  • CHD
  • breast cancer
  • stroke
  • pulmonary embolism

Conclusion: Estrogen isn’t a viable preventative measure for CHD in the general population. That is, our inductive inference is (specific) higher CHD rate in treatment population than control population suggests (general) if the whole population were treated, CHD incidence would increase

slide-32
SLIDE 32

Induction Model of a process or system Experiments and observational studies

Correlation, causation, confounding

Question: Why the different conclusions between the two studies? Consider the following possible explanation: Let x = estrogen treatment ǫ = “health consciousness” (not directly measured) y = health outcomes x ǫ y correlation Association between x and y may be due to an unmeasured variable ǫ.

slide-33
SLIDE 33

Induction Model of a process or system Experiments and observational studies

Correlation, causation, confounding

Question: Why the different conclusions between the two studies? Consider the following possible explanation: Let x = estrogen treatment ǫ = “health consciousness” (not directly measured) y = health outcomes x ǫ y correlation Association between x and y may be due to an unmeasured variable ǫ.

slide-34
SLIDE 34

Induction Model of a process or system Experiments and observational studies

Correlation, causation, confounding

Randomization breaks the association between ǫ and x. randomization x ǫ y correlation= causation

slide-35
SLIDE 35

Induction Model of a process or system Experiments and observational studies

Randomized experiments versus observational studies

Observational studies can suggest good experiments to run, but can’t definitively show causation. Randomization can eliminate correlation between x and y due to a different cause ǫ, aka a confounder. “No causation without randomization”

slide-36
SLIDE 36

Induction Model of a process or system Experiments and observational studies

Randomized experiments versus observational studies

Observational studies can suggest good experiments to run, but can’t definitively show causation. Randomization can eliminate correlation between x and y due to a different cause ǫ, aka a confounder. “No causation without randomization”

slide-37
SLIDE 37

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-38
SLIDE 38

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-39
SLIDE 39

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-40
SLIDE 40

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-41
SLIDE 41

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-42
SLIDE 42

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-43
SLIDE 43

Induction Model of a process or system Experiments and observational studies

Ingredients of an experimental design

  • 1. Identify research hypotheses to be tested.
  • 2. Choose a set of experimental units, which are the units to which

treatments will be randomized.

  • 3. Choose a response/output variable.
  • 4. Determine potential sources of variation in response:

4.1 factors of interest 4.2 nuisance factors

  • 5. Decide which variables to measure and control:

5.1 treatment variables 5.2 potential large sources of variation in the units (blocking variables)

  • 6. Decide on the experimental procedure and how treatments are to be

randomly assigned. The order of these steps may vary due to constraints such as budgets, ethics, time, etc..

slide-44
SLIDE 44

Induction Model of a process or system Experiments and observational studies

Three principles of Experimental Design

Replication, Randomization, Blocking

  • 1. Replication: Repetition of an experiment.

Replicates are runs of an experiment or sets of experimental units that have the same values of the control variables. More replication → more precise inference yA,i = response of the ith unit assigned to treatment A, i = 1, . . . , nA yB,i = response of the ith unit assigned to treatment B , i = 1, . . . , nB. |¯ yA − ¯ yB| = δ > 0 provides evidence that treatment affects response the amount of evidence is increasing with n.

  • 2. Randomization: Random assignment of treatments to experimental units.

Removes potential for systematic bias on the part of the researcher, and removes any preexperimental source of bias. Makes confounding the effect

  • f treatment with unobserved variables unlikely (but not impossible).
  • 3. Blocking:

Randomization within blocks of homogeneous units. The goal is to evenly distribute treatments across large potential sources

  • f variation.
slide-45
SLIDE 45

Induction Model of a process or system Experiments and observational studies

Three principles of Experimental Design

Replication, Randomization, Blocking

  • 1. Replication: Repetition of an experiment.

Replicates are runs of an experiment or sets of experimental units that have the same values of the control variables. More replication → more precise inference yA,i = response of the ith unit assigned to treatment A, i = 1, . . . , nA yB,i = response of the ith unit assigned to treatment B , i = 1, . . . , nB. |¯ yA − ¯ yB| = δ > 0 provides evidence that treatment affects response the amount of evidence is increasing with n.

  • 2. Randomization: Random assignment of treatments to experimental units.

Removes potential for systematic bias on the part of the researcher, and removes any preexperimental source of bias. Makes confounding the effect

  • f treatment with unobserved variables unlikely (but not impossible).
  • 3. Blocking:

Randomization within blocks of homogeneous units. The goal is to evenly distribute treatments across large potential sources

  • f variation.
slide-46
SLIDE 46

Induction Model of a process or system Experiments and observational studies

Three principles of Experimental Design

Replication, Randomization, Blocking

  • 1. Replication: Repetition of an experiment.

Replicates are runs of an experiment or sets of experimental units that have the same values of the control variables. More replication → more precise inference yA,i = response of the ith unit assigned to treatment A, i = 1, . . . , nA yB,i = response of the ith unit assigned to treatment B , i = 1, . . . , nB. |¯ yA − ¯ yB| = δ > 0 provides evidence that treatment affects response the amount of evidence is increasing with n.

  • 2. Randomization: Random assignment of treatments to experimental units.

Removes potential for systematic bias on the part of the researcher, and removes any preexperimental source of bias. Makes confounding the effect

  • f treatment with unobserved variables unlikely (but not impossible).
  • 3. Blocking:

Randomization within blocks of homogeneous units. The goal is to evenly distribute treatments across large potential sources

  • f variation.
slide-47
SLIDE 47

Induction Model of a process or system Experiments and observational studies

Three principles of Experimental Design

Replication, Randomization, Blocking

  • 1. Replication: Repetition of an experiment.

Replicates are runs of an experiment or sets of experimental units that have the same values of the control variables. More replication → more precise inference yA,i = response of the ith unit assigned to treatment A, i = 1, . . . , nA yB,i = response of the ith unit assigned to treatment B , i = 1, . . . , nB. |¯ yA − ¯ yB| = δ > 0 provides evidence that treatment affects response the amount of evidence is increasing with n.

  • 2. Randomization: Random assignment of treatments to experimental units.

Removes potential for systematic bias on the part of the researcher, and removes any preexperimental source of bias. Makes confounding the effect

  • f treatment with unobserved variables unlikely (but not impossible).
  • 3. Blocking:

Randomization within blocks of homogeneous units. The goal is to evenly distribute treatments across large potential sources

  • f variation.
slide-48
SLIDE 48

Induction Model of a process or system Experiments and observational studies

Three principles of Experimental Design

Replication, Randomization, Blocking

  • 1. Replication: Repetition of an experiment.

Replicates are runs of an experiment or sets of experimental units that have the same values of the control variables. More replication → more precise inference yA,i = response of the ith unit assigned to treatment A, i = 1, . . . , nA yB,i = response of the ith unit assigned to treatment B , i = 1, . . . , nB. |¯ yA − ¯ yB| = δ > 0 provides evidence that treatment affects response the amount of evidence is increasing with n.

  • 2. Randomization: Random assignment of treatments to experimental units.

Removes potential for systematic bias on the part of the researcher, and removes any preexperimental source of bias. Makes confounding the effect

  • f treatment with unobserved variables unlikely (but not impossible).
  • 3. Blocking:

Randomization within blocks of homogeneous units. The goal is to evenly distribute treatments across large potential sources

  • f variation.
slide-49
SLIDE 49

Induction Model of a process or system Experiments and observational studies

Crop study example

Hypothesis: Tomato type (A versus B) affects tomato yield. Experimental units: three plots of land, each to be divided into a 2 × 2 grid. Outcome: Tomato yield. Factor of interest: Tomato type, A or B. Nuisance factor: Soil quality. bad soil good soil

slide-50
SLIDE 50

Induction Model of a process or system Experiments and observational studies

Crop study example

Hypothesis: Tomato type (A versus B) affects tomato yield. Experimental units: three plots of land, each to be divided into a 2 × 2 grid. Outcome: Tomato yield. Factor of interest: Tomato type, A or B. Nuisance factor: Soil quality. bad soil good soil

slide-51
SLIDE 51

Induction Model of a process or system Experiments and observational studies

Crop study example

Hypothesis: Tomato type (A versus B) affects tomato yield. Experimental units: three plots of land, each to be divided into a 2 × 2 grid. Outcome: Tomato yield. Factor of interest: Tomato type, A or B. Nuisance factor: Soil quality. bad soil good soil

slide-52
SLIDE 52

Induction Model of a process or system Experiments and observational studies

Crop study example

Hypothesis: Tomato type (A versus B) affects tomato yield. Experimental units: three plots of land, each to be divided into a 2 × 2 grid. Outcome: Tomato yield. Factor of interest: Tomato type, A or B. Nuisance factor: Soil quality. bad soil good soil

slide-53
SLIDE 53

Induction Model of a process or system Experiments and observational studies

Crop study example

Hypothesis: Tomato type (A versus B) affects tomato yield. Experimental units: three plots of land, each to be divided into a 2 × 2 grid. Outcome: Tomato yield. Factor of interest: Tomato type, A or B. Nuisance factor: Soil quality. bad soil good soil

slide-54
SLIDE 54

Induction Model of a process or system Experiments and observational studies

Questions for discussion

  • What are the benefits of this design?
  • What other designs might work?
  • What other designs wouldn’t work?
  • Should the plots be divided up further?

If so, how should treatments then be assigned?