Business Statistics CONTENTS Comparing the variance of two - - PowerPoint PPT Presentation

β–Ά
business statistics
SMART_READER_LITE
LIVE PREVIEW

Business Statistics CONTENTS Comparing the variance of two - - PowerPoint PPT Presentation

TWO 2 S: COMPARISONS Business Statistics CONTENTS Comparing the variance of two populations The -distribution The -test Levenes test Old exam question Further study COMPARING THE VARIANCE OF TWO POPULATIONS So far, the


slide-1
SLIDE 1

TWO 𝜏2S: COMPARISONS

Business Statistics

slide-2
SLIDE 2

Comparing the variance of two populations The 𝐺-distribution The 𝐺-test Levene’s test Old exam question Further study CONTENTS

slide-3
SLIDE 3

β–ͺ So far, the emphasis was on differences in centrality β–ͺ There are also questions on differences in dispersion β–ͺ Context:

β–ͺ you can choose between two drilling machines β–ͺ both make holes of the specified size β–ͺ but the precision of the two may be different β–ͺ so you do an experiment (intended hole size: 3 mm)

COMPARING THE VARIANCE OF TWO POPULATIONS

slide-4
SLIDE 4

β–ͺ A second case β–ͺ Recall that we can compare two population means under the assumption of equal population variances

β–ͺ using the pooled variance

β–ͺ Thus, we may need to check if the populations variances are indeed equal COMPARING THE VARIANCE OF TWO POPULATIONS

slide-5
SLIDE 5

Test statistic to consider β–ͺ A combination of Sπ‘Œ

2 and S𝑍 2

β–ͺ but the null distribution of π‘‡π‘Œ

2 βˆ’ 𝑇𝑍 2 is problematic

β–ͺ Instead,

π‘‡π‘Œ

2

𝑇𝑍

2 is possible

β–ͺ or

𝑇𝑍

2

π‘‡π‘Œ

2 (sometimes easier)

β–ͺ What is the hypothesis?

β–ͺ 𝐼0: πœπ‘Œ

2 = πœπ‘ 2

β–ͺ 𝐼0: πœπ‘Œ

2 β‰₯ πœπ‘ 2

β–ͺ 𝐼0: πœπ‘Œ

2 ≀ πœπ‘ 2

β–ͺ but πœπ‘Œ

2 = πœπ‘ 2 + 3 etc. is not possible!

COMPARING THE VARIANCE OF TWO POPULATIONS

These hypotheses are equivalent to π‘‡π‘Œ

2

𝑇𝑍

2 = 1

(or β‰₯ 1 or ≀ 1)

slide-6
SLIDE 6

β–ͺ For normally distributed populations, it is known that:

β–ͺ under 𝐼0: πœπ‘Œ

2 = πœπ‘ 2:

π‘‡π‘Œ

2

𝑇𝑍

2 ~πΊπ‘œπ‘Œβˆ’1,π‘œπ‘βˆ’1

β–ͺ where 𝐺

df1,df2 is the F-distribution

β–ͺ with df1 and df2 degrees of freedom β–ͺ note: 𝐺 is a ratio of two variances, use df1 for the numerator and df2 for the denominator

THE 𝐺-DISTRIBUTION

slide-7
SLIDE 7

β–ͺ So, we compute the value of a test statistic 𝐺calc = π‘‘π‘Œ

2

𝑑𝑍

2

β–ͺ and expect it to be around 1 if 𝐼0 is true β–ͺ and reject 𝐼0 if 𝐺

calc is β€œtoo” small or β€œtoo” large

β–ͺ we need to look up the critical values of the 𝐺-distribution

THE 𝐺-DISTRIBUTION

𝐺crit,lower 𝐺𝑑𝑠𝑗𝑒,π‘£π‘žπ‘žπ‘“π‘  1 Of course (!) you expect 𝐺 = 1 when 𝐼0 is true

slide-8
SLIDE 8

β–ͺ 𝐺-distribution

β–ͺ like πœ“2 not symmetrical and strictly positive β–ͺ need to find 𝐺

crit,lower and 𝐺 crit,upper

THE 𝐺-DISTRIBUTION

slide-9
SLIDE 9

Looking up critical values for 𝐺 THE 𝐺-DISTRIBUTION

𝛽/2 df2 df1 Is this 𝐺crit,lower or 𝐺crit,upper?

slide-10
SLIDE 10

β–ͺ Finding 𝐺crit,lower when you know 𝐺crit,upper 𝐺crit,lower df1, df2 = 1 𝐺crit,upper df2, df1 β–ͺ so 𝐺crit,lower =

1 3.85 = 0.26

THE 𝐺-DISTRIBUTION

𝑇1

2

𝑇2

2 > 𝑏 ⇔ 𝑇2 2

𝑇1

2 < 1

𝑏

slide-11
SLIDE 11

Step 1: β–ͺ 𝐼0: 𝜏1

2 = 𝜏2 2; 𝐼1: 𝜏1 2 β‰  𝜏2 2; 𝛽 = 0.05

Step 2: β–ͺ sample statistic: 𝐺 =

𝑇1

2

𝑇2

2; reject for β€œtoo small” and β€œtoo large”

values Step 3: β–ͺ null distribution: 𝐺~𝐺df1,df2 β–ͺ both populations must be normally distributed (no CLT here!) Step 4: β–ͺ 𝐺calc =

𝑑1

2

𝑑2

2; 𝐺crit,lower = β‹―; 𝐺crit,upper = β‹― (use 𝛽/2)

Step 5: β–ͺ reject 𝐼0 if 𝐺calc < 𝐺crit,lower or 𝐺calc > 𝐺crit,upper

THE 𝐺-TEST

slide-12
SLIDE 12

Example: β–ͺ machine 1 gives 𝑑1

2 = 0.012 with π‘œ1 = 6

β–ͺ machine 2 gives 𝑑2

2 = 0.016 with π‘œ2 = 7

Computations: β–ͺ 𝐺calc =

0.012 0.016 = 0.75

β–ͺ swap the two machines

β–ͺ 𝐺

calc = 0.016 0.012 = 1.33

β–ͺ null distribution: 𝐺~𝐺6,5 β–ͺ 𝐺crit;upper = 𝐺6,5;0.025 = 6.98 β–ͺ 𝐺calc < 𝐺crit;upper, do not reject 𝐼0 THE 𝐺-TEST

upper critical values can be read in the 𝐺- table, so easier to look up 𝐺crit,upper

slide-13
SLIDE 13

Trick to avoid the use of 𝐺crit,lower in 𝐼0: 𝜏1

2 = 𝜏2 2

β–ͺ put largest sample variance in numerator β–ͺ so calculated value of test statistic is 𝐺calc =

𝑑1

2

𝑑2

2 or 𝐺calc =

𝑑2

2

𝑑1

2

β–ͺ with this, πΊπ‘‘π‘π‘šπ‘‘ > 1, so we need only consider 𝐺𝑑𝑠𝑗𝑒,π‘£π‘žπ‘žπ‘“π‘ 

β–ͺ because 𝐺

𝑑𝑠𝑗𝑒,π‘šπ‘π‘₯𝑓𝑠 is always < 1

β–ͺ formally reject for β€œtoo large” and β€œtoo small” β–ͺ so keep using 𝛽/2 and not 𝛽 (it is still a two-sided test) THE 𝐺-TEST

slide-14
SLIDE 14

One-sided tests: what is different compared to two-sided? Step 1: β–ͺ 𝐼0: 𝜏1

2 β‰₯ 𝜏2 2; 𝐼1: 𝜏1 2 < 𝜏2 2; 𝛽 = 0.05

β–ͺ reformulate as 𝐼0: 𝜏2

2 ≀ 𝜏1 2; 𝐼1: 𝜏2 2 > 𝜏1 2; 𝛽 = 0.05

Step 2: β–ͺ sample statistic: 𝐺 =

𝑇2

2

𝑇1

2; reject for β€œtoo large” values

Step 3: β–ͺ null distribution: 𝐺~𝐺

df2,df1

β–ͺ both populations must be normally distributed Step 4: β–ͺ 𝐺

calc = 𝑑2

2

𝑑1

2; 𝐺

crit,upper = β‹― (use 𝛽)

Step 5: β–ͺ reject 𝐼0 if 𝐺

π‘‘π‘π‘šπ‘‘ > 𝐺 crit,upper

THE 𝐺-TEST

slide-15
SLIDE 15

Given a sample 1 with π‘œ1 = 9 and 𝑑1 = 4 and a sample 2 with π‘œ2 =7 and 𝑑2 = 5. We want to test 𝐼0: 𝜏1

2 = 𝜏2 2.

  • a. What is step 2?
  • b. What is step 3?

EXERCISE 1

slide-16
SLIDE 16

Recall that SPSS also did a comparison of two variances when we asked for comparing two means β–ͺ Levene’s test LEVENE’S TEST

slide-17
SLIDE 17

The Levene test β–ͺ is based on a different principle β–ͺ requires no normal populations (!) β–ͺ also yields an 𝐺calc

β–ͺ but with different values of df

β–ͺ reject for large values only β–ͺ yields a π‘ž-value for 𝐼0: 𝜏1

2 = 𝜏2 2

LEVENE’S TEST

slide-18
SLIDE 18

β–ͺ When comparing two 𝜈s with the 𝑒-test, we needed to estimate 𝜏1

2 and 𝜏2 2

β–ͺ we could estimate 𝜏1

2 by 𝑑1 2 and 𝜏2 2 by 𝑑2 2

β–ͺ or assume 𝜏1

2 = 𝜏2 2 and use the pooled 𝑑P 2 to estimate both

β–ͺ The second is only allowed if the two sample variances 𝑑1

2

and 𝑑2

2 are not β€œtoo unequal”

β–ͺ Levene’s test tests this β–ͺ a low π‘ž-value means: unequal, so don’t do it β–ͺ do not necessarily use the same 𝛽 β–ͺ Why Levene and not the β€œnormal” 𝐺-test?

LEVENE’S TEST

Usually, a low π‘ž- value means bingo, but not so here ... Rule of thumb: use 𝛽 = 0.1 Levene is a non-parametric test ...

slide-19
SLIDE 19

26 March 2015, Q1c OLD EXAM QUESTION

slide-20
SLIDE 20

Doane & Seward 5/E 10.7 Tutorial exercises week 3 Fisher 𝐺-test FURTHER STUDY