Business Statistics CONTENTS Two types of error The power of a - - PowerPoint PPT Presentation

โ–ถ
business statistics
SMART_READER_LITE
LIVE PREVIEW

Business Statistics CONTENTS Two types of error The power of a - - PowerPoint PPT Presentation

POWER AND DESIGN Business Statistics CONTENTS Two types of error The power of a test Experimental design Choosing sample size Power curves Power and big data Old exam question Further study TWO TYPES OF ERROR What is the precise


slide-1
SLIDE 1

POWER AND DESIGN

Business Statistics

slide-2
SLIDE 2

Two types of error The power of a test Experimental design Choosing sample size Power curves Power and โ€œbig dataโ€ Old exam question Further study CONTENTS

slide-3
SLIDE 3

What is the precise meaning of the significance level ๐›ฝ? โ–ช ๐›ฝ is the maximum acceptable probability of rejecting the null hypothesis when it is in fact true โ–ช so ๐‘„ reject ๐ผ0 ๐ผ0 There are two possible decisions: โ–ช reject ๐ผ0 โ–ช do not reject ๐ผ0 And there are two possible realities: โ–ช ๐ผ0 is true โ–ช ๐ผ0 is not true TWO TYPES OF ERROR

To be read as: โ€œthe conditional probability

  • f rejecting ๐ผ0, given that

๐ผ0 is trueโ€

slide-4
SLIDE 4

Organize the situations in a 2 ร— 2-table: Correct decision โ–ช no further concern, because OK Wrong decision โ–ช type I error โ–ช type II error TWO TYPES OF ERROR

slide-5
SLIDE 5

Probability of a type I error: ๐‘„ reject ๐ผ0 ๐ผ0 โ‰ค ๐›ฝ

โ–ช conventionally ๐›ฝ = 5% โ–ช but you may choose another significance level if you think that is appropriate โ–ช e.g., aircraft safety: better use ๐›ฝ = 1% or even ๐›ฝ = 0.1% โ–ช e.g., choosing a colour of your shampoo flask: ๐›ฝ = 10% is OK

โ–ช Anyhow, you control the maximum type I error by choosing ๐›ฝ in advance

โ–ช and accept that you will once in a while reject ๐ผ0 while it is true

TWO TYPES OF ERROR

slide-6
SLIDE 6

Probability of a type II error: ๐‘„ do not reject ๐ผ0 specific ๐ผ1 = ๐›พ

โ–ช how to choose ๐›พ?

โ–ช You cannot simply control the maximum type II error ๐›พ

โ–ช because it depends on the true value of the unknown (!) parameter (that is to be tested itself) โ–ช as well as on ๐›ฝ โ–ช but it could be good to know it, at least โ–ช usually, this is done through the power concept

TWO TYPES OF ERROR

for instance, ๐ผ0: ๐œˆ = 10 vs. ๐ผ1: ๐œˆ = 12

slide-7
SLIDE 7

Controlling ๐›ฝ and ๐›พ is important โ–ช Example: airport security

โ–ช ๐ผ0: bag does not contain a weapon

โ–ช Type I error:

โ–ช the bag did not contain a weapon, but the machine said it did โ–ช loss of time in manual search, offended clients, delayed flights โ–ช management will try to minimize the probability of this error type

โ–ช Type II error:

โ–ช the bag did contain a weapon, but the machine did not detect it โ–ช hijacks, loss of crew and aircraft, liability claims, loss of credibility โ–ช management will try to minimize the probability of this error type

TWO TYPES OF ERROR

slide-8
SLIDE 8

There is a trade-off: โ–ช minimizing ๐›ฝ typically leads to increasing ๐›พ โ–ช minimizing ๐›พ typically leads to increasing ๐›ฝ ๐›ฝ and ๐›พ can only be decreased simultaneously by changing the set-up of the research โ–ช most importantly, by increasing sample size TWO TYPES OF ERROR

slide-9
SLIDE 9

What error do we make?

  • a. A population has ๐œˆ = 120. We reject ๐ผ0: ๐œˆ โ‰ค 125.
  • b. A population has ๐œŒ = 0.3. We do not reject ๐ผ0: ๐œŒ โ‰ค 0.4.
  • c. A population has ๐œ2 = 2.5. We do not reject ๐ผ0: ๐œ2 โ‰ค 2.

EXERCISE 1

slide-10
SLIDE 10

In business and politics, a lot depends on what clients, the market and the public require So you do experiments: โ–ช market surveys โ–ช questionnaires โ–ช customer cards โ–ช polls โ–ช website analysis (tracking cookies, etc) โ–ช etc. EXPERIMENTAL DESIGN

slide-11
SLIDE 11

How to set up such experiments โ–ช qualitative research methods (interview techniques, etc.) โ–ช quantitative research methods (choosing sample size, etc.) Here we will focus on sample size for ๐œˆ and ๐œŒ EXPERIMENTAL DESIGN

slide-12
SLIDE 12

Recall that the confidence interval of a mean ๐œˆ is าง ๐‘ฆ โˆ’ ๐‘จ๐›ฝ/2 ๐œ ๐‘œ , าง ๐‘ฆ + ๐‘จ๐›ฝ/2 ๐œ ๐‘œ โ–ช This means that the width of the confidence interval scales with a factor

1 ๐‘œ

If we need to estimate ๐œˆ with a (minimal) precision of ยฑ๐น (the allowable error), you would need a certain (minimal) sample size CHOOSING SAMPLE SIZE

slide-13
SLIDE 13

This gives ๐น = ๐‘จ๐›ฝ/2 ๐œ ๐‘œ

โ–ช so

๐‘œ = ๐‘จ๐›ฝ/2 ๐œ ๐น

2

Example โ–ช to realize a 95% confidence interval for the mean of a population with ๐œ = 3 with precision ๐น = 1, ๐‘œ = 34.5, so use ๐‘œ = 35 CHOOSING SAMPLE SIZE

always round to the higher value in determining sample size

slide-14
SLIDE 14

Observe that we need to know ๐œ โ–ช how to know it? Three suggestions: โ–ช take a small preliminary sample and use the sample ๐‘ก instead of ๐œ in the formula โ–ช estimate rough upper and lower limits ๐‘ and ๐‘ and set ๐œ =

๐‘โˆ’๐‘ 12

โ–ช estimate rough upper and lower limits ๐‘ and ๐‘ and set ๐œ =

๐‘โˆ’๐‘ 4

CHOOSING SAMPLE SIZE

based on the fact that most of the values od a normal distribution are between ๐œˆ โˆ’ 2๐œ and ๐œˆ + 2๐œ based on a uniform distribution

slide-15
SLIDE 15

Likewise, the confidence interval of a proportion ๐œŒ is ๐‘ž โˆ’ ๐‘จ๐›ฝ/2 ๐œŒ 1 โˆ’ ๐œŒ ๐‘œ , ๐‘ž + ๐‘จ๐›ฝ/2 ๐œŒ 1 โˆ’ ๐œŒ ๐‘œ This gives ๐น = ๐‘จ๐›ฝ/2 ๐œŒ 1 โˆ’ ๐œŒ ๐‘œ โ–ช so ๐‘œ = ๐œŒ 1 โˆ’ ๐œŒ ๐‘จ๐›ฝ/2 ๐น

2

CHOOSING SAMPLE SIZE

slide-16
SLIDE 16

Observe that we need to know ๐œŒ โ–ช so to determine the sample size to estimate ๐œŒ, you need ๐œŒ Four suggestions โ–ช take a small preliminary sample and use the sample ๐‘ž instead of ๐œŒ in the sample size formula โ–ช take a small preliminary sample, find a confidence interval and from this interval use the value closest to 0.5 instead

  • f ๐œŒ in the sample size formula

โ–ช use a prior sample or historical data โ–ช assume that ๐œŒ = 0.50 CHOOSING SAMPLE SIZE

this conservative method ensures the desired precision (๐œŒ 1 โˆ’ ๐œŒ has a maximum at ๐œŒ = 0.50)

slide-17
SLIDE 17

We ask a sample of persons if they are in favor or against

  • Brexit. We want to deduce the proportion in favor, with a

margin of no more than ยฑ2%. What sample size to use? EXERCISE 2

slide-18
SLIDE 18

โ–ช Suppose we test a one-sample mean

โ–ช the null hypothesis is ๐ผ0: ๐œˆ = ๐œˆโ„Ž๐‘ง๐‘ž = 3 โ–ช at a significance level ๐›ฝ = 0.05

โ–ช If the true parameter is ๐œˆ = ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 3

โ–ช there is a probability ๐›ฝ = 0.05 to reject ๐ผ0 โ–ช so this is the probability to make a type I error

POWER

slide-19
SLIDE 19

โ–ช But if the true parameter is ๐œˆ = ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 3.1 instead

โ–ช there is a larger probability to reject ๐ผ0 โ–ช which is the correct decision โ–ช how large depends on ๐œ and ๐‘œ (recall ๐‘จ๐›ฝ/2

๐œ ๐‘œ)

โ–ช And if the true parameter is ๐œˆ = ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 10 instead

โ–ช there is an even larger probability to reject ๐ผ0 โ–ช which is the correct decision

POWER

slide-20
SLIDE 20

So, the probability of a rejecting an incorrect ๐ผ0 on the mean depends on โ–ช the pre-defined probability ๐›ฝ of not rejecting a correct ๐ผ0 โ–ช the sample size ๐‘œ โ–ช the standard deviation of the popolution ๐œ โ–ช the difference between the hypothesized ๐œˆ (๐œˆโ„Ž๐‘ง๐‘ž) and the true ๐œˆ (๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“) POWER

slide-21
SLIDE 21

Power is defined as the probability of rejecting ๐ผ0 when it should indeed be rejected

โ–ช when a specific ๐ผ1 is true

So: power = ๐‘„ reject ๐ผ0 specific ๐ผ1 Therefore: power = 1 โˆ’ ๐‘„ do not reject ๐ผ0 specific ๐ผ1 = 1 โˆ’ ๐›พ POWER

slide-22
SLIDE 22

To calculate the power of a test for the mean, you need โ–ช the significance level ๐›ฝ (you choose it) โ–ช the sample size ๐‘œ (you choose it) โ–ช the standard deviation ๐œ โ–ช the hypothesized mean ๐œˆโ„Ž๐‘ง๐‘ž (you choose it) โ–ช the true mean ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ (you have no clue) Therefore, we typically do not calculate power โ–ช but rather calculate a power function or power curves for different values of ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ POWER

slide-23
SLIDE 23

For a fixed ๐ผ0: ๐œˆ = ๐œˆโ„Ž๐‘ง๐‘ž what happens for different values

  • f ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“

POWER CURVES

๏ญ1 ๏ญ0 ๏ญ ๏‚ฎ ๏ก 1 P (R e j e c t H o ) ๏‚ฎ

๏ข = P (t y p e I I e r r o r )

๐œˆ0 = ๐œˆโ„Ž๐‘ง๐‘ž

  • ne specific

๐œˆ1 = ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ ๐œˆ-axis: different

  • ptions for ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“
slide-24
SLIDE 24

Effect of different values of ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“: ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 6 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =6

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-25
SLIDE 25

Effect of different values of ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“: ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 5 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-26
SLIDE 26

Effect of different values of ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“: ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 4 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =4

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-27
SLIDE 27

Effect of different values of ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“: ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 3.4 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3๏ญ =3.4

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-28
SLIDE 28

Effect of different values of ๐›ฝ: ๐›ฝ = 0.05 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5.5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-29
SLIDE 29

Effect of different values of ๐›ฝ: ๐›ฝ = 0.1 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5.5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.1

slide-30
SLIDE 30

Effect of different values of ๐›ฝ: ๐›ฝ = 0.27 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5.5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.27

slide-31
SLIDE 31

Effect of different values of ๐›ฝ: ๐›ฝ = 0.05 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5.5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-32
SLIDE 32

Effect of different values of ๐‘œ: ๐‘œ = 100 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-33
SLIDE 33

Effect of different values of ๐‘œ: ๐‘œ = 200 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4

๏ญ =3

๏ญ =5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-34
SLIDE 34

Effect of different values of ๐‘œ: ๐‘œ = 300 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4 0.5

๏ญ =3

๏ญ =5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-35
SLIDE 35

Effect of different values of ๐‘œ: ๐‘œ = 400 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4 0.5

๏ญ =3

๏ญ =5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-36
SLIDE 36

Effect of different values of ๐‘œ: ๐‘œ = 500 POWER CURVES

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6

๏ญ =3

๏ญ =5

  • 1

1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G

๏ก =0.05

slide-37
SLIDE 37

We perform a ๐‘จ-test for ๐ผ0: ๐œˆ = 10 (๐›ฝ = 0.05, ๐‘œ = 100, ๐œ = 2).

  • a. What is the rejection region?
  • b. What is the power of this test when ๐œˆ๐‘ข๐‘ ๐‘ฃ๐‘“ = 10.1?

EXERCISE 3

slide-38
SLIDE 38

โ–ช If sample size ๐‘œ gets very large (โ€œbig dataโ€)

โ–ช the standard error of the estimate gets very small โ–ช confidence intervals get very narrow โ–ช an exact null hypothesis (e.g., ๐œˆ = 5 or ๐œŒ = 0.50) is easily rejected โ–ช an exact two-sample test (e.g., ๐œˆ1 = ๐œˆ2) is easily rejected โ–ช every regression model becomes easily significant, even with low ๐‘†2 โ–ช etc.

โ–ช So, โ€œstatistical significanceโ€ becomes almost meaningless

โ–ช once more, also consider effect size โ–ช which is more like practical signficance

POWER AND โ€œBIG DATAโ€

slide-39
SLIDE 39

26 March 2015, Q1n OLD EXAM QUESTION

slide-40
SLIDE 40

Doane & Seward 5/E 8.8-8.9, 9.2, 9.7 Tutorial exercises week 6 errors and power power of ๐‘จ test sample size FURTHER STUDY