Business Statistics CONTENTS Two types of error The power of a - - PowerPoint PPT Presentation
Business Statistics CONTENTS Two types of error The power of a - - PowerPoint PPT Presentation
POWER AND DESIGN Business Statistics CONTENTS Two types of error The power of a test Experimental design Choosing sample size Power curves Power and big data Old exam question Further study TWO TYPES OF ERROR What is the precise
Two types of error The power of a test Experimental design Choosing sample size Power curves Power and โbig dataโ Old exam question Further study CONTENTS
What is the precise meaning of the significance level ๐ฝ? โช ๐ฝ is the maximum acceptable probability of rejecting the null hypothesis when it is in fact true โช so ๐ reject ๐ผ0 ๐ผ0 There are two possible decisions: โช reject ๐ผ0 โช do not reject ๐ผ0 And there are two possible realities: โช ๐ผ0 is true โช ๐ผ0 is not true TWO TYPES OF ERROR
To be read as: โthe conditional probability
- f rejecting ๐ผ0, given that
๐ผ0 is trueโ
Organize the situations in a 2 ร 2-table: Correct decision โช no further concern, because OK Wrong decision โช type I error โช type II error TWO TYPES OF ERROR
Probability of a type I error: ๐ reject ๐ผ0 ๐ผ0 โค ๐ฝ
โช conventionally ๐ฝ = 5% โช but you may choose another significance level if you think that is appropriate โช e.g., aircraft safety: better use ๐ฝ = 1% or even ๐ฝ = 0.1% โช e.g., choosing a colour of your shampoo flask: ๐ฝ = 10% is OK
โช Anyhow, you control the maximum type I error by choosing ๐ฝ in advance
โช and accept that you will once in a while reject ๐ผ0 while it is true
TWO TYPES OF ERROR
Probability of a type II error: ๐ do not reject ๐ผ0 specific ๐ผ1 = ๐พ
โช how to choose ๐พ?
โช You cannot simply control the maximum type II error ๐พ
โช because it depends on the true value of the unknown (!) parameter (that is to be tested itself) โช as well as on ๐ฝ โช but it could be good to know it, at least โช usually, this is done through the power concept
TWO TYPES OF ERROR
for instance, ๐ผ0: ๐ = 10 vs. ๐ผ1: ๐ = 12
Controlling ๐ฝ and ๐พ is important โช Example: airport security
โช ๐ผ0: bag does not contain a weapon
โช Type I error:
โช the bag did not contain a weapon, but the machine said it did โช loss of time in manual search, offended clients, delayed flights โช management will try to minimize the probability of this error type
โช Type II error:
โช the bag did contain a weapon, but the machine did not detect it โช hijacks, loss of crew and aircraft, liability claims, loss of credibility โช management will try to minimize the probability of this error type
TWO TYPES OF ERROR
There is a trade-off: โช minimizing ๐ฝ typically leads to increasing ๐พ โช minimizing ๐พ typically leads to increasing ๐ฝ ๐ฝ and ๐พ can only be decreased simultaneously by changing the set-up of the research โช most importantly, by increasing sample size TWO TYPES OF ERROR
What error do we make?
- a. A population has ๐ = 120. We reject ๐ผ0: ๐ โค 125.
- b. A population has ๐ = 0.3. We do not reject ๐ผ0: ๐ โค 0.4.
- c. A population has ๐2 = 2.5. We do not reject ๐ผ0: ๐2 โค 2.
EXERCISE 1
In business and politics, a lot depends on what clients, the market and the public require So you do experiments: โช market surveys โช questionnaires โช customer cards โช polls โช website analysis (tracking cookies, etc) โช etc. EXPERIMENTAL DESIGN
How to set up such experiments โช qualitative research methods (interview techniques, etc.) โช quantitative research methods (choosing sample size, etc.) Here we will focus on sample size for ๐ and ๐ EXPERIMENTAL DESIGN
Recall that the confidence interval of a mean ๐ is าง ๐ฆ โ ๐จ๐ฝ/2 ๐ ๐ , าง ๐ฆ + ๐จ๐ฝ/2 ๐ ๐ โช This means that the width of the confidence interval scales with a factor
1 ๐
If we need to estimate ๐ with a (minimal) precision of ยฑ๐น (the allowable error), you would need a certain (minimal) sample size CHOOSING SAMPLE SIZE
This gives ๐น = ๐จ๐ฝ/2 ๐ ๐
โช so
๐ = ๐จ๐ฝ/2 ๐ ๐น
2
Example โช to realize a 95% confidence interval for the mean of a population with ๐ = 3 with precision ๐น = 1, ๐ = 34.5, so use ๐ = 35 CHOOSING SAMPLE SIZE
always round to the higher value in determining sample size
Observe that we need to know ๐ โช how to know it? Three suggestions: โช take a small preliminary sample and use the sample ๐ก instead of ๐ in the formula โช estimate rough upper and lower limits ๐ and ๐ and set ๐ =
๐โ๐ 12
โช estimate rough upper and lower limits ๐ and ๐ and set ๐ =
๐โ๐ 4
CHOOSING SAMPLE SIZE
based on the fact that most of the values od a normal distribution are between ๐ โ 2๐ and ๐ + 2๐ based on a uniform distribution
Likewise, the confidence interval of a proportion ๐ is ๐ โ ๐จ๐ฝ/2 ๐ 1 โ ๐ ๐ , ๐ + ๐จ๐ฝ/2 ๐ 1 โ ๐ ๐ This gives ๐น = ๐จ๐ฝ/2 ๐ 1 โ ๐ ๐ โช so ๐ = ๐ 1 โ ๐ ๐จ๐ฝ/2 ๐น
2
CHOOSING SAMPLE SIZE
Observe that we need to know ๐ โช so to determine the sample size to estimate ๐, you need ๐ Four suggestions โช take a small preliminary sample and use the sample ๐ instead of ๐ in the sample size formula โช take a small preliminary sample, find a confidence interval and from this interval use the value closest to 0.5 instead
- f ๐ in the sample size formula
โช use a prior sample or historical data โช assume that ๐ = 0.50 CHOOSING SAMPLE SIZE
this conservative method ensures the desired precision (๐ 1 โ ๐ has a maximum at ๐ = 0.50)
We ask a sample of persons if they are in favor or against
- Brexit. We want to deduce the proportion in favor, with a
margin of no more than ยฑ2%. What sample size to use? EXERCISE 2
โช Suppose we test a one-sample mean
โช the null hypothesis is ๐ผ0: ๐ = ๐โ๐ง๐ = 3 โช at a significance level ๐ฝ = 0.05
โช If the true parameter is ๐ = ๐๐ข๐ ๐ฃ๐ = 3
โช there is a probability ๐ฝ = 0.05 to reject ๐ผ0 โช so this is the probability to make a type I error
POWER
โช But if the true parameter is ๐ = ๐๐ข๐ ๐ฃ๐ = 3.1 instead
โช there is a larger probability to reject ๐ผ0 โช which is the correct decision โช how large depends on ๐ and ๐ (recall ๐จ๐ฝ/2
๐ ๐)
โช And if the true parameter is ๐ = ๐๐ข๐ ๐ฃ๐ = 10 instead
โช there is an even larger probability to reject ๐ผ0 โช which is the correct decision
POWER
So, the probability of a rejecting an incorrect ๐ผ0 on the mean depends on โช the pre-defined probability ๐ฝ of not rejecting a correct ๐ผ0 โช the sample size ๐ โช the standard deviation of the popolution ๐ โช the difference between the hypothesized ๐ (๐โ๐ง๐) and the true ๐ (๐๐ข๐ ๐ฃ๐) POWER
Power is defined as the probability of rejecting ๐ผ0 when it should indeed be rejected
โช when a specific ๐ผ1 is true
So: power = ๐ reject ๐ผ0 specific ๐ผ1 Therefore: power = 1 โ ๐ do not reject ๐ผ0 specific ๐ผ1 = 1 โ ๐พ POWER
To calculate the power of a test for the mean, you need โช the significance level ๐ฝ (you choose it) โช the sample size ๐ (you choose it) โช the standard deviation ๐ โช the hypothesized mean ๐โ๐ง๐ (you choose it) โช the true mean ๐๐ข๐ ๐ฃ๐ (you have no clue) Therefore, we typically do not calculate power โช but rather calculate a power function or power curves for different values of ๐๐ข๐ ๐ฃ๐ POWER
For a fixed ๐ผ0: ๐ = ๐โ๐ง๐ what happens for different values
- f ๐๐ข๐ ๐ฃ๐
POWER CURVES
๏ญ1 ๏ญ0 ๏ญ ๏ฎ ๏ก 1 P (R e j e c t H o ) ๏ฎ
๏ข = P (t y p e I I e r r o r )
๐0 = ๐โ๐ง๐
- ne specific
๐1 = ๐๐ข๐ ๐ฃ๐ ๐-axis: different
- ptions for ๐๐ข๐ ๐ฃ๐
Effect of different values of ๐๐ข๐ ๐ฃ๐: ๐๐ข๐ ๐ฃ๐ = 6 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =6
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐๐ข๐ ๐ฃ๐: ๐๐ข๐ ๐ฃ๐ = 5 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐๐ข๐ ๐ฃ๐: ๐๐ข๐ ๐ฃ๐ = 4 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =4
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐๐ข๐ ๐ฃ๐: ๐๐ข๐ ๐ฃ๐ = 3.4 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3๏ญ =3.4
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐ฝ: ๐ฝ = 0.05 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5.5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐ฝ: ๐ฝ = 0.1 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5.5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.1
Effect of different values of ๐ฝ: ๐ฝ = 0.27 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5.5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.27
Effect of different values of ๐ฝ: ๐ฝ = 0.05 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5.5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐: ๐ = 100 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐: ๐ = 200 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4
๏ญ =3
๏ญ =5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐: ๐ = 300 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4 0.5
๏ญ =3
๏ญ =5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐: ๐ = 400 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.1 0.2 0.3 0.4 0.5
๏ญ =3
๏ญ =5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
Effect of different values of ๐: ๐ = 500 POWER CURVES
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6
๏ญ =3
๏ญ =5
- 1
1 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 p o w e r T W O - S I D E D T E S T I N G
๏ก =0.05
We perform a ๐จ-test for ๐ผ0: ๐ = 10 (๐ฝ = 0.05, ๐ = 100, ๐ = 2).
- a. What is the rejection region?
- b. What is the power of this test when ๐๐ข๐ ๐ฃ๐ = 10.1?