Chapter 16 Nonparametric Statistics Introduction: Distribution-Free - - PowerPoint PPT Presentation

chapter 16
SMART_READER_LITE
LIVE PREVIEW

Chapter 16 Nonparametric Statistics Introduction: Distribution-Free - - PowerPoint PPT Presentation

Chapter 16 Nonparametric Statistics Introduction: Distribution-Free Tests Distribution-free tests statistical tests that dont rely on assumptions about the probability distribution of the sampled population Nonparametrics branch of


slide-1
SLIDE 1

Chapter 16

Nonparametric Statistics

slide-2
SLIDE 2

Introduction: Distribution-Free Tests

Distribution-free tests – statistical tests that don’t rely on assumptions about the probability distribution of the sampled population Nonparametrics – branch of inferential statistics devoted to distribution-free tests Rank statistics (Rank tests) – nonparametric statistics based on the ranks

  • f measurements
slide-3
SLIDE 3

Single Population Inferences: The Sign Test

The Sign test is used to make inferences about the central tendency of a single population Test is based on the median η Test involves hypothesizing a value for the population median, then testing to see if the distribution of sample values around the hypothesized median value reaches significance

slide-4
SLIDE 4

Single Population Inferences: The Sign Test

Sign Test for a Population Median η

One-Tailed Test Two-Tailed Test H0:η1 = η0 H0: η1 = η0 Ha :η1 < η0 {or Ha: η1> η0] Ha: η1  η0 Test Statistic S = Number of sample measurements greater than η0 [or S = number of measurements less than η0] S = Larger of S1 and S2, where S1 is the number of measurements less than η0 and S2 is the number

  • f measurements greater than η0

Observed Significance Level p-value = P(x ≥ S) p-value = 2P(x ≥ S) Where x has a binomial distribution with parameters n and p = .5 Rejection region: Reject H0 if p-value ≤ .05

Conditions required for sign test – sample must be randomly selected from a continuous probability distribution

slide-5
SLIDE 5

Single Population Inferences: The Sign Test

Large-Sample Sign Test for a Population Median η

Conditions required for sign test – sample must be randomly selected from a continuous probability distribution

One-Tailed Test Two-Tailed Test H0:η1 = η0 H0: η1 = η0 Ha :η1 < η0 {or Ha: η1> η0] Ha: η1  η0 Test Statistic

 

.5 .5 .5 S n z n   

Observed Significance Level p-value = P(x ≥ S) p-value = 2P(x ≥ S) Where x has a binomial distribution with parameters n and p = .5 Rejection region: z

z 

Rejection region:

/ 2

z z  

slide-6
SLIDE 6

Comparing Two Populations: The Wilcoxon Rank Sum Test for Independent Samples The Wilcoxon Rank Sum Test is used when two independent random samples are being used to compare two populations, and the t- test is not appropriate It tests the hypothesis that the probability distributions associated with the two populations are equivalent

slide-7
SLIDE 7

Comparing Two Populations: The Wilcoxon Rank Sum Test for Independent Samples

Rank Data from both samples from smallest to largest If populations are the same, ranks should be randomly mixed between the samples Test statistic is based on the rank sums – the totals of the ranks for each of the samples. T1 is the sum for sample 1, T2 is the sum for sample 2

Percentage Cost of Living Change, as Predicted by Government and University Economists

Government Economist (1) University Economist (2) Prediction Rank Prediction Rank 3.1 4 4.4 6 4.8 7 5.8 9 2.3 2 3.9 5 5.6 8 8.7 11 0.0 1 6.3 10 2.9 3 10.5 12 10.8 13

slide-8
SLIDE 8

Comparing Two Populations: The Wilcoxon Rank Sum Test for Independent Samples

Wilcoxon Rank Sum Test: Independent Samples

Required Conditions: random, independent samples Probability distributions samples drawn from are continuous

One-Tailed Test Two-Tailed Test H0:D1 and D2 are identical H0:D1 and D2 are identical Ha :D1 is shifted to the right of D2 {or Ha: D1 is shifted to the left of D2] Ha :D1 is shifted either to the left

  • r to the right of D2

Test Statistic T1, if n1<n2; T2, if n2 < n1 (Either rank sum can be used if n1 = n2) T1, if n1<n2; T2, if n2 < n1 (Either rank sum can be used if n1 = n2) We will denote this rank sum as T Rejection region: T1: T1 ≥ TU [or T1 ≤ TL] T1: T1 ≤ TL [or T1 ≥ TU] Rejection region: T ≤ TL or T ≥ TU Where TL and TU are obtained from table

slide-9
SLIDE 9

Comparing Two Populations: The Wilcoxon Rank Sum Test for Independent Samples

Wilcoxon Rank Sum Test for Large Samples(n1 and n2 ≥ 10)

One-Tailed Test Two-Tailed Test H0:D1 and D2 are identical H0:D1 and D2 are identical Ha :D1 is shifted to the right of D2 {or Ha: D1 is shifted to the left of D2] Ha :D1 is shifted either to the left

  • r to the right of D2

Test Statistic

1 1 2 1 1 2 1 2

( 1) 2 : ( 1) 1 2 n n n T T e s t s ta tis tic z n n n n      

Rejection region: z>z(or z<-z) Rejection region: |z|>z/2

slide-10
SLIDE 10

Comparing Two Populations: The Wilcoxon Signed Rank Test for the Paired Differences Experiment

An alternative test to the paired difference of means procedure Analysis is of the differences between ranks Any differences of 0 are eliminated, and n is reduced accordingly

Softness Ratings of Paper Product Difference Judge A B (A-B) Absolute Value of Difference Rank of Absolute Value 1 6 4 2 2 5 2 8 5 3 3 7.5 3 4 5

  • 1

1 2 4 9 8 1 1 2 5 4 1 3 3 7.5 6 7 9

  • 2

2 5 7 6 2 4 4 9 8 5 3 2 2 5 9 6 7

  • 1

1 2 10 8 2 6 6 10 T+ = Sum of positive ranks = 46 T- = Sum of negative ranks = 9

slide-11
SLIDE 11

Comparing Two Populations: The Wilcoxon Signed Rank Test for the Paired Differences Experiment

Wilcoxon Signed Rank Test for a Paired Difference Experiment

Let D1 and D2 represent the probability distributions for populations 1 and 2, respectively

One-Tailed Test Two-Tailed Test H0:D1 and D2 are identical H0:D1 and D2 are identical Ha :D1 is shifted to the right of D2 [or Ha: D1 is shifted to the left of D2] Ha :D1 is shifted either to the left

  • r to the right of D2

Test Statistic T-, the rank sum of the negative distances (or T+, the rank sum of the positive distances) T, the smaller of T+ or T- Rejection region: T-: ≤ T0 [or T+: ≤ T0] Rejection region: T ≤ T0 Where T0 is from table

Required Conditions Sample of differences is randomly selected Probability distribution from which sample is drawn is continuous

slide-12
SLIDE 12

The Kruskal-Wallis H-Test for a Completely Randomized Design

An alternative to the completely randomized ANOVA Based on comparison of rank sums

Number of Available Beds

Hospital 1 Hospital 2 Hospital 3 Beds Rank Beds Rank Beds Rank 6 5 34 25 13 9.5 38 27 28 19 35 26 3 2 42 30 19 15 17 13 13 9.5 4 3 11 8 40 29 29 20 30 21 31 22 1 15 11 9 7 7 6 16 12 32 23 33 24 25 17 39 28 18 14 5 4 27 18 24 16 R1 = 120 R2 = 210.5 R3 = 134.5

slide-13
SLIDE 13

The Kruskal-Wallis H-Test for a Completely Randomized Design

Kruskal-Wallis H-Test for Comparing p Probability Distributions

Required Conditions:

  • The p samples are random and independent
  • 5 or more measurements per sample
  • Probability distributions samples drawn from are continuous

H0: The p probability distributions are identical Ha: At least two of the p probability distributions differ in location Test statistic:

 

2

1 2 3( 1) 1

j j

R H n n n n    

Where Nj = Number of measurements in sample j Rj = Rank sum for sample j, where the rank of each measurement is computed according to its relative magnitude in the totality of data for the p samples n = Total Sample Size = n1 +n2 + ….+ np Rejection region:

2

H

 

with (p-1) degrees of freedom

slide-14
SLIDE 14

The Friedman Fr-Test for a Randomized Block Design

A nonparametric method for the randomized block design Based on comparison of rank sums

Reaction Time for Three Drugs

Subject Drug A Rank Drug B Rank Drug C Rank 1 1.21 1 1.48 2 1.56 3 2 1.63 1 1.85 2 2.01 3 3 1.42 1 2.06 3 1.70 2 4 2.43 2 1.98 1 2.64 3 5 1.16 1 1.27 2 1.48 3 6 1.94 1 2.44 2 2.81 3 R1 = 7 R2 = 12 R3 = 17

slide-15
SLIDE 15

The Friedman Fr-Test for a Randomized Block Design

Required Conditions:

  • Random assignment of treatments to units within blocks
  • Measurements can be ranked within blocks
  • Probability distributions samples within each block drawn from

are continuous

H0: The probability distributions for the p treatments are identical Ha: At least two of the p probability distributions differ in location Test statistic:

 

2

1 2 3 ( 1) 1

r j

F R b p b p p    

Where b = Number of blocks p = number of treatments Rj = Rank sum of the jth treatment; where the rank of each measurement is computed relative to its position within its own block Rejection region:

2 r

F

 

with (p-1) degrees of freedom

slide-16
SLIDE 16

Spearman’s Rank Correlation Coefficient

Provides a measure of correlation between ranks

Brake Rankings of New Car Models: Less than Perfect Agreement

Magazine Difference between Rank 1 and Rank 2 Car Model 1 2 D D2 1 4 5

  • 1

1 2 1 2

  • 1

1 3 9 10

  • 1

1 4 5 6

  • 1

1 5 2 1 1 1 6 10 9 1 1 7 7 7 8 3 3 9 6 4 2 4 10 8 8

2

1 0 d 

slide-17
SLIDE 17

Spearman’s Rank Correlation Coefficient

Conditions Required: Sample of experimental units is randomly selected Probability distributions of two variables are continuous

One-Tailed Test Two-Tailed Test H0:p = 0 H0: p = 0 Ha :p < 0 {or Ha: p> 0] Ha: p  0 Test Statistic

2 2

6 1 ( 1)

i s

d r n n   

Where di = ui –vi (difference in ranks of ith observations for samples 1 and 2 Rejection region:

, s s

r r

(or

, s s

r r

 

when Ha: p> 0) Rejection region:

, / 2 s s

r r