Order Statistics and Pitman Closeness Katherine F. Davies - - PowerPoint PPT Presentation

order statistics and pitman closeness
SMART_READER_LITE
LIVE PREVIEW

Order Statistics and Pitman Closeness Katherine F. Davies - - PowerPoint PPT Presentation

Order Statistics and Pitman Closeness Katherine F. Davies Department of Statistics University of Manitoba October 12, 2010 Outline 1 Introduction 2 Pitman Closeness of Order Statistics to Population Quantiles 3 SCP of Order Statistics to


slide-1
SLIDE 1

Order Statistics and Pitman Closeness

Katherine F. Davies

Department of Statistics University of Manitoba

October 12, 2010

slide-2
SLIDE 2

Outline

1 Introduction 2 Pitman Closeness of Order Statistics to Population Quantiles 3 SCP of Order Statistics to Population Quantiles 4 SCP Plotting Points 5 Conclusions

2 / 81

slide-3
SLIDE 3

Introduction Background

Framework

We recognize that much interest centers on comparisons of optimal estimators by different criteria. For example, one may want to compare an unbiased estimator to a minimum mean squared error estimator. If one uses unbiasedness or mean squared error, the outcome is

  • bvious.

In this regard, Rao recommended comparison by an alternative criterion, that of Pitman’s measure of closeness [15].

3 / 81

slide-4
SLIDE 4

Introduction Background

Pitman closeness

Pitman introduced his measure of closeness in 1937 as an alternative criterion in parameter estimation, for instance as an alternative to MSE. The measure is based on the probabilities of the relative closeness of competing estimators to an unknown parameter. There are two important notions that can be defined using Pitman’s measure of closeness. The first notion is that of the Pitman-closer estimator; the second is Pitman-closest estimator.

4 / 81

slide-5
SLIDE 5

Introduction Background

Two Important Definitions

Definition 1: When comparing two estimators ˆ θ1 and ˆ θ2 of θ, then ˆ θ1 is a Pitman-closer estimator than ˆ θ2 if Pr(|ˆ θ1 − θ| < |ˆ θ2 − θ|) ≥ 1/2 ∀θ ∈ Ω with strict inequality for at least one θ. [2] This formulation gives rise to the Pitman-closest estimator.

5 / 81

slide-6
SLIDE 6

Introduction Background

For the Pitman-closest estimator, we have the following definition: Definition 2: Let A be a nonempty class of estimators of a common parameter θ. Then, ˆ θ∗ is Pitman-closest among estimators in A provided for every ˆ θ ∈ A, such that ˆ θ = ˆ θ∗, i.e.: I P(θ; ˆ θ∗, ˆ θ) = Pr(|ˆ θ∗ − θ| < |ˆ θ − θ|) ≥ 1/2 ∀θ ∈ Ω with strict inequality for at least one θ, where I P(θ; ˆ θ∗, ˆ θ) is Pitman closeness or nearness of ˆ θ∗ to ˆ θ given θ. [14] For further details, we refer those interested to the monograph by Keating, Mason and Sen [14] for an elaborate discussion on this topic.

6 / 81

slide-7
SLIDE 7

Introduction Background

Recent Work

Recently, many problems involving ordered data and Pitman closeness have been investigated. Balakrishnan et al. [6] looked at comparing best linear unbiased and invariant estimators for the exponential mean parameter using Pitman closeness criterion, and similarly, a comparison of best linear unbiased and invariant predictors have been compared in Balakrishnan et al. [7]. Pitman closeness of records to population quantiles was explored in Ahmadi and Balakrishnan [1].

7 / 81

slide-8
SLIDE 8

Introduction Motivation

Motivating Work

In 2008, Balakrishnan et al. [5] performed a similar investigation to that of Ahmadi and Balakrishnan [1]; they investigated Pitman closeness of sample median to population median. Further interesting results regarding the sample median and Pitman closeness were provided in a followup paper by Iliopoulous and Balakrishnan [11]. The population median is a particular quantile and a natural followup problem is to look at population quantiles in general and that is our primary focus here.

8 / 81

slide-9
SLIDE 9

Introduction Settings and Notation

Common Conditions

Throughout this talk we shall assume the following common conditions:

1 Let Y1, · · · , Yn be a random sample of size n from a continuous

population with a cumulative distribution function (cdf) G(y) and density function (pdf) g(y) and Y1:n ≤ · · · ≤ Yn:n denote the corresponding order statistics.

2 Let ξ⋆

p be the pth quantile of G(y), i.e.,

Pr

  • Y ≤ ξ⋆

p

  • = p

for p ∈ (0, 1). Furthermore, we shall make an assumption about the distribution G(·) which will restrict our attention.

9 / 81

slide-10
SLIDE 10

Introduction Settings and Notation

An Assumption

Suppose G(·) belongs to the location-scale family of distributions, viz., G(y) = F y − µ σ

  • and

g(y) = 1 σf y − µ σ

  • ∀ y ∈ ℜ,

where µ ∈ ℜ is the location parameter and σ > 0 is the scale parameter, then we have the following additional conditions:

1 We have ξp = (ξ⋆

p − µ)/σ is the pth quantile of F(x).

2 We shall let X1, · · · , Xn be a random sample of size n from the

population with cdf F(x) and pdf f (x), and X1:n ≤ · · · ≤ Xn:n denote the corresponding order statistics.

10 / 81

slide-11
SLIDE 11

Pitman Closeness of Order Statistics to Population Quantiles Objectives

Aim

Our first primary objective is to extend the work on population median to the case of population quantiles. Goal Our goal is to consider the probability with which one order statistic will be Pitman-closer to a quantile (say, the pth quantile) than another order statistic from the same sample (i.e. pairwise comparisons). These closeness probabilities will lead to identification of that order statistic which is Pitman-closest.

11 / 81

slide-12
SLIDE 12

Pitman Closeness of Order Statistics to Population Quantiles General Results

A Definition

In the context of order statistics, we have an analogue to Definition 2 for comparing the closeness of order statistics to population quantiles. Definition 3: The ℓth order statistic (for some ℓ ∈ {1, · · · , n}) is the Pitman-closest order statistic to the population quantile ξp if: Pr (|Xℓ:n − ξp| < |Xi:n − ξp|) ≥ 1 2 ∀ i ∈ {1, · · · , n}\{ℓ}. Knowing that order statistics have the ordering preserving property under location-scale transforms, we have the following result.

12 / 81

slide-13
SLIDE 13

Pitman Closeness of Order Statistics to Population Quantiles General Results

General Results

Result 1: Suppose Xℓ:n (for some ℓ ∈ {1, · · · , n}) is the Pitman-closest

  • rder statistic to ξp, i.e.,

Pr (|Xℓ:n − ξp| < |Xi:n − ξp|) ≥ 1 2 ∀ i ∈ {1, · · · , n}\{ℓ}. Then, Yℓ:n is the Pitman-closest order statistic to ξ⋆

p.

Remark : Because of Result 1, it suffices to look for the Pitman-closest

  • rder statistic to a quantile ξp for the standard distribution F(·).

13 / 81

slide-14
SLIDE 14

Pitman Closeness of Order Statistics to Population Quantiles General Results

Another Important Result

In the case when the population distribution is symmetric about

  • rigin, we can easily establish the following symmetry property.

Result 2: Suppose Xℓ:n is the Pitman-closest order statistic to the pth quantile ξp of a distribution symmetric about origin, i.e., Pr (|Xℓ:n − ξp| < |Xi:n − ξp|) ≥ 1 2 for all i ∈ {1, · · · , n}\{ℓ}. Then, Xn−ℓ+1:n is the Pitman-closest

  • rder statistic to the (1 − p)th quantile ξ1−p, i.e.,

Pr (|Xn−ℓ+1:n − ξ1−p| < |Xi:n − ξ1−p|) ≥ 1 2 for all i ∈ {1, · · · , n}\{n − ℓ + 1}.

14 / 81

slide-15
SLIDE 15

Pitman Closeness of Order Statistics to Population Quantiles General Results

Some New Notation

For some p ∈ (0, 1), let us denote π(ℓ)i(p) = Pr (|Xℓ:n − ξp| < |Xi:n − ξp|) for i = {1, · · · , n} \ {ℓ} for the probability of Pitman closeness to ξp associated with any two

  • rder statistics.

Using what we know about order statistics, we have the following general expressions for π(ℓ)i(p).

15 / 81

slide-16
SLIDE 16

Pitman Closeness of Order Statistics to Population Quantiles General Results

Pitman Closeness Probabilities

Result 3: For i = ℓ + 1, ℓ + 2, · · · , n,

π(ℓ)i(p) = 1 − Ip (ℓ, n − ℓ + 1) + kℓ,i,n

i−ℓ−1

  • j=0

(−1)i−ℓ−1−j i − ℓ − 1 j

  • 1

n − ℓ − j × ξp

−∞

{F(x)}ℓ−1 {1 − F(x)}j {1 − F (−x + 2ξp)}n−ℓ−j f (x)dx, and for i = 1, 2, · · · , ℓ − 1, π(ℓ)i(p) = Ip (ℓ, n − ℓ + 1) + ki,ℓ,n

ℓ−i−1

  • j=0

(−1)ℓ−i−1−j ℓ − i − 1 j

  • 1

ℓ − j − 1 × ∞

ξp

{F(y)}j {1 − F(y)}n−ℓ {F (2ξp − y)}ℓ−j−1 f (y)dy,

16 / 81

slide-17
SLIDE 17

Pitman Closeness of Order Statistics to Population Quantiles General Results

where Ix(α, β) = 1 B(α, β) x uα−1(1 − u)β−1du, 0 < x < 1, is the incomplete beta ratio, B(α, β) = Γ(α)Γ(β)/Γ(α + β) is the complete beta function, and kℓ,i,n = n! (ℓ − 1)!(i − ℓ − 1)!(n − i)! for 1 ≤ ℓ < i ≤ n.

17 / 81

slide-18
SLIDE 18

Pitman Closeness of Order Statistics to Population Quantiles Applications

Three Cases of Interest

As can be seen, the expressions for the probability of Pitman closeness are distribution dependent. We consider three location-scale distributions: the Uniform, exponential and power function. For each distribution we can determine the probability of Pitman closeness to ξp associated with any two order statistics. With these probabilities, we can determine the Pitman-closest order statistic to a population quantile ξp.

18 / 81

slide-19
SLIDE 19

Pitman Closeness of Order Statistics to Population Quantiles Applications

Quantities of interest

Let us consider the U(−1, 1) distribution with pdf and cdf as f (x) = 1 2 and F(x) = 1 + x 2 for − 1 < x < 1, and its pth quantile as ξp = 2p − 1 for p ∈ (0, 1). With these quantities, we can use Result 3 to yield the following expressions for the Pitman closeness probability associated with any two order statistics from a uniform sample, and then identify the closest order statistic in the Pitman sense.

19 / 81

slide-20
SLIDE 20

Pitman Closeness of Order Statistics to Population Quantiles Applications

Results

Result 4: For i = ℓ + 1, · · · , n,

π(ℓ)i(p) = 1 − Ip (ℓ, n − ℓ + 1) + kℓ,i,n

i−ℓ−1

  • j=0

(−1)i−ℓ−1−j i − ℓ − 1 j

  • 1

n − ℓ − j × 1 2n

j

  • a=0

n−ℓ−j

  • b=0

j a n − ℓ − j b

  • (1 − ξp)j−a(−2ξp)n−ℓ−j−b

×(1 + ξp)ℓ+a+b B(ℓ + b, a + 1) for 0 < p < 1 2 = 1 − Ip (ℓ, n − ℓ + 1) + kℓ,i,n

i−ℓ−1

  • j=0

(−1)i−ℓ−1−j i − ℓ − 1 j

  • 1

n − ℓ − j × 1 2n

l−1

  • a=0

j

  • b=0

ℓ − 1 a j b

  • (2ξp)ℓ−1−a(1 − ξp)a+n−ℓ+1

×B(a + n − ℓ − j + 1, b + 1) for 1 2 ≤ p < 1.

20 / 81

slide-21
SLIDE 21

Pitman Closeness of Order Statistics to Population Quantiles Applications

Results cont’d

Similarly, for i = 1, · · · , ℓ − 1, π(ℓ)i(p) = Ip (ℓ, n − ℓ + 1) + ki,ℓ,n

ℓ−i−1

  • j=0

(−1)ℓ−i−1−j ℓ − i − 1 j

  • 1

ℓ − j − 1 × 1 2n

j

  • a=0

n−ℓ

  • b=0

j a n − ℓ b

  • (1 + ξp)ℓ+b(−2ξp)n−ℓ−b

×B(a + 1, ℓ − j + b) for 0 < p < 1 2 = Ip (ℓ, n − ℓ + 1) + ki,ℓ,n

ℓ−i−1

  • j=0

(−1)ℓ−i−1−j ℓ − i − 1 j

  • 1

ℓ − j − 1 × 1 2n

j

  • a=0

ℓ−j−1

  • b=0

j a ℓ − j − 1 b

  • (1 + ξp)j−a(2ξp)ℓ−j−1−b

×(1 − ξp)a+n−ℓ+b+1 B(a + 1, n − ℓ + b + 1) for 1 2 ≤ p < 1.

21 / 81

slide-22
SLIDE 22

Pitman Closeness of Order Statistics to Population Quantiles Applications ℓ i 1 2 3 4 5 6 7 8 9 10 1 p = 0.10 – 0.5900 0.8042 0.9305 0.9819 0.9966 0.9995 1.0000 1.0000 1.0000 0.25 – 0.1117 0.2126 0.3704 0.5690 0.7610 0.8990 0.9698 0.9944 0.9995 0.75 – 0.0000 0.0000 0.0000 0.0000 0.0000 0.0001 0.0001 0.0002 0.0005 0.90 – 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 2 p = 0.10 0.4100 – 0.8822 0.9603 0.9900 0.9982 0.9998 1.0000 1.0000 1.0000 0.25 0.8883 – 0.3764 0.5402 0.7101 0.8511 0.9412 0.9835 0.9971 0.9998 0.75 1.0000 – 0.0001 0.0001 0.0002 0.0004 0.0008 0.0015 0.0029 0.0056 0.90 1.0000 – 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 3 p = 0.10 0.1958 0.1178 – 0.9774 0.9945 0.9990 0.9999 1.0000 1.0000 1.0000 0.25 0.7874 0.6236 – 0.6748 0.8093 0.9086 0.9661 0.9910 0.9985 0.9999 0.75 1.0000 0.9999 – 0.0008 0.0014 0.0027 0.0049 0.0090 0.0165 0.0302 0.90 1.0000 1.0000 – 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 4 p = 0.10 0.0695 0.0397 0.0226 – 0.9970 0.9995 0.9999 1.0000 1.0000 1.0000 0.25 0.6296 0.4598 0.3252 – 0.8770 0.9446 0.9806 0.9951 0.9992 0.9999 0.75 1.0000 0.9999 0.9992 – 0.0062 0.0110 0.0194 0.0339 0.0588 0.1010 0.90 1.0000 1.0000 1.0000 – 0.0000 0.0000 0.0001 0.0001 0.0002 0.0005 5 p = 0.10 0.0181 0.0100 0.0055 0.0030 – 0.9997 1.0000 1.0000 1.0000 1.0000 0.25 0.4310 0.2899 0.1907 0.1230 – 0.9668 0.9890 0.9973 0.9996 1.0000 0.75 1.0000 0.9998 0.9986 0.9938 – 0.0332 0.0554 0.0914 0.1489 0.2390 0.90 1.0000 1.0000 1.0000 1.0000 – 0.0003 0.0005 0.0010 0.0018 0.0034 6 p = 0.10 0.0034 0.0018 0.0010 0.0005 0.0003 – 1.0000 1.0000 1.0000 1.0000 0.25 0.2390 0.1489 0.0914 0.0554 0.0332 – 0.9938 0.9986 0.9998 1.0000 0.75 1.0000 0.9996 0.9973 0.9890 0.9668 – 0.1230 0.1907 0.2899 0.4310 0.90 1.0000 1.0000 1.0000 1.0000 0.9997 – 0.0030 0.0055 0.0100 0.0181 7 p = 0.10 0.0005 0.0002 0.0001 0.0001 0.0000 0.0000 – 1.0000 1.0000 1.0000 0.25 0.1010 0.0588 0.0339 0.0194 0.0110 0.0062 – 0.9992 0.9999 1.0000 0.75 0.9999 0.9992 0.9951 0.9806 0.9446 0.8770 – 0.3252 0.4598 0.6296 0.90 1.0000 1.0000 1.0000 0.9999 0.9995 0.9970 – 0.0226 0.0397 0.0695 8 p = 0.10 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 1.0000 1.0000 0.25 0.0302 0.0165 0.0090 0.0049 0.0027 0.0014 0.0008 – 0.9999 1.0000 0.75 0.9999 0.9985 0.9910 0.9661 0.9086 0.8093 0.6748 – 0.6236 0.7874 0.90 1.0000 1.0000 1.0000 0.9999 0.9990 0.9945 0.9774 – 0.1178 0.1958 9 p = 0.10 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 1.0000 0.25 0.0056 0.0029 0.0015 0.0008 0.0004 0.0002 0.0001 0.0001 – 1.0000 0.75 0.9998 0.9971 0.9835 0.9412 0.8511 0.7101 0.5402 0.3764 – 0.8883 0.90 1.0000 1.0000 1.0000 0.9998 0.9982 0.9900 0.9603 0.8822 – 0.4100 10 p = 0.10 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 0.25 0.0005 0.0002 0.0001 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 – 0.75 0.9995 0.9944 0.9698 0.8990 0.7610 0.5690 0.3704 0.2126 0.1117 – 0.90 1.0000 1.0000 1.0000 0.9995 0.9966 0.9819 0.9305 0.8042 0.5900 –

Table 1: Closeness probabilities for Uniform for n=10

22 / 81

slide-23
SLIDE 23

Pitman Closeness of Order Statistics to Population Quantiles Applications

Figure 1: Closeness probabilities for uniform with n = 10, p = 0.10, 0.25, 0.75, 0.90

23 / 81

slide-24
SLIDE 24

Pitman Closeness of Order Statistics to Population Quantiles Applications

A Useful Table

n\p 0.10 0.25 0.75 0.90 5 1 2 4 5 10 1 3 8 10 15 2 4 12 14 20 2 5 16 19

Table 2: Pitman-closest order statistic to the pth quantile of U(-1,1) from a sample of size n

24 / 81

slide-25
SLIDE 25

Pitman Closeness of Order Statistics to Population Quantiles Applications

Quantities of interest

Let us consider the standard exponential distribution with pdf and cdf as f (x) = e−x and F(x) = 1 − e−x for x > 0, and its pth quantile as ξp = − ln (1 − p) for p ∈ (0, 1). Result 3 can again be used to derive the following expressions for the Pitman closeness probability associated with any two order statistics. We can then identify for different sample sizes and quantiles that

  • rder statistic which is Pitman-closest.

25 / 81

slide-26
SLIDE 26

Pitman Closeness of Order Statistics to Population Quantiles Applications

Results

Result 5: For i = ℓ + 1, · · · , n,

π(ℓ)i(p) = 1 − Ip (ℓ, n − ℓ + 1) + kℓ,i,n

i−ℓ−1

  • j=0

(−1)i−ℓ−1−j i − ℓ − 1 j

  • 1

n − ℓ − j ×(1 − p)2(n−ℓ−j)

ℓ−1

  • a=0

(−1)a ℓ − 1 a

  • Ha,j,ℓ,n(p),

where Ha,j,ℓ,n(p) =

  • 1

a+2j+ℓ−n+1

  • 1 − (1 − p)a+2j+ℓ−n+1

if a + 2j + ℓ − n + 1 = 0 ξp if a + 2j + ℓ − n + 1 = 0.

26 / 81

slide-27
SLIDE 27

Pitman Closeness of Order Statistics to Population Quantiles Applications

Similarly, for i = 1, · · · , ℓ − 1, π(ℓ)i(p) = Ip (ℓ, n − ℓ + 1) + ki,ℓ,n

ℓ−i−1

  • j=0

(−1)ℓ−i−1−j ℓ − i − 1 j

  • 1

ℓ − j − 1 ×

ℓ−j−1

  • a=0

j

  • b=0

(−1)a+b ℓ − j − 1 a j b

  • (1 − p)2aJa,b,ℓ,n(p),

where Ja,b,ℓ,n(p) =       

1 n−ℓ−a+b+1

  • (1 − p)n−ℓ−a+b+1 − (1 − p)2(n−ℓ−a+b+1)

if n − ℓ − a + b + 1 = 0 ξp if n − ℓ − a + b + 1 = 0.

27 / 81

slide-28
SLIDE 28

Pitman Closeness of Order Statistics to Population Quantiles Applications

A Remark

Suppose we have two independent standard exponential random variables, and look at the corresponding order statistics X1:2 and X2:2. Then, intuition suggests that X1:2 will be closer to ξp for p up to some value after which X2:2 will be closer to the quantile ξp. With n = 2, ℓ = 1 and i = 2, we find from our expression above that π(1)2(p) = Pr (|X1:2 − ξp| < |X2:2 − ξp|) = 1 − Ip(1, 2) + 2(1 − p)2{− ln(1 − p)} = 1 − {1 − (1 − p)2} − 2(1 − p)2 ln(1 − p) = (1 − p)2{1 − 2 ln(1 − p)}.

28 / 81

slide-29
SLIDE 29

Pitman Closeness of Order Statistics to Population Quantiles Applications

Remark Cont’d

Since ∂π(1)2(p) ∂p = 4(1 − p) ln(1 − p) < 0, we see that π(1)2(p) is a monotonic decreasing function in p. Moreover, since π(1)2(0) = 1, we can find a value of p, say p0, such that π(1)2(p) ≥ 1

2 for 0 < p ≤ p0 and π(1)2(p) < 1 2 for p0 < p < 1.

In fact, we determine that point to be p0 = 0.5675, i.e. we have X1:2 to be Pitman-closer (than X2:2) to ξp whenever p > 0.5675.

29 / 81

slide-30
SLIDE 30

Pitman Closeness of Order Statistics to Population Quantiles Applications ℓ i 1 2 3 4 5 6 7 8 9 10 1 p = 0.10 – 0.5969 0.8135 0.9365 0.9842 0.9972 0.9996 1.0000 1.0000 1.0000 0.25 – 0.1197 0.2439 0.4340 0.6506 0.8312 0.9395 0.9850 0.9977 0.9998 0.75 – 0.0000 0.0000 0.0000 0.0001 0.0007 0.0048 0.0313 0.1540 0.5122 0.90 – 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0002 0.0056 0.1067 2 p = 0.10 0.4031 – 0.8841 0.9620 0.9908 0.9984 0.9998 1.0000 1.0000 1.0000 0.25 0.8803 – 0.3862 0.5662 0.7461 0.8828 0.9596 0.9903 0.9986 0.9999 0.75 1.0000 – 0.0001 0.0002 0.0005 0.0021 0.0100 0.0474 0.1902 0.5531 0.90 1.0000 – 0.0000 0.0000 0.0000 0.0000 0.0000 0.0004 0.0076 0.1206 3 p = 0.10 0.1865 0.1159 – 0.9777 0.9948 0.9991 0.9999 1.0000 1.0000 1.0000 0.25 0.7561 0.6138 – 0.6815 0.8221 0.9212 0.9737 0.9939 0.9991 0.9999 0.75 1.0000 0.9999 – 0.0009 0.0021 0.0060 0.0203 0.0726 0.2364 0.5981 0.90 1.0000 1.0000 – 0.0000 0.0000 0.0000 0.0001 0.0007 0.0109 0.1383 4 p = 0.10 0.0635 0.0380 0.0223 – 0.9970 0.9995 0.9999 1.0000 1.0000 1.0000 0.25 0.5660 0.4338 0.3185 – 0.8799 0.9487 0.9834 0.9962 0.9995 1.0000 0.75 1.0000 0.9998 0.9991 – 0.0070 0.0157 0.0401 0.1108 0.2946 0.6472 0.90 1.0000 1.0000 1.0000 – 0.0000 0.0001 0.0002 0.0017 0.0164 0.1616 5 p = 0.10 0.0158 0.0092 0.0052 0.0030 – 0.9997 1.0000 1.0000 1.0000 1.0000 0.25 0.3494 0.2539 0.1779 0.1201 – 0.9676 0.9898 0.9977 0.9997 1.0000 0.75 0.9999 0.9995 0.9979 0.9930 – 0.0367 0.0755 0.1669 0.3663 0.6995 0.90 1.0000 1.0000 1.0000 1.0000 – 0.0003 0.0010 0.0041 0.0263 0.1929 6 p = 0.10 0.0028 0.0016 0.0009 0.0005 0.0003 – 1.0000 1.0000 1.0000 1.0000 0.25 0.1688 0.1172 0.0788 0.0513 0.0324 – 0.9939 0.9987 0.9998 1.0000 0.75 0.9993 0.9979 0.9940 0.9843 0.9633 – 0.1342 0.2453 0.4515 0.7538 0.90 1.0000 1.0000 1.0000 0.9999 0.9997 – 0.0036 0.0106 0.0448 0.2362 7 p = 0.10 0.0004 0.0002 0.0001 0.0001 0.0000 0.0000 – 1.0000 1.0000 1.0000 0.25 0.0605 0.0404 0.0263 0.0166 0.0102 0.0061 – 0.9992 0.9999 1.0000 0.75 0.9952 0.9900 0.9797 0.9599 0.9245 0.8658 – 0.3484 0.5484 0.8080 0.90 1.0000 1.0000 0.9999 0.9998 0.9990 0.9964 – 0.0277 0.0798 0.2968 8 p = 0.10 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 1.0000 1.0000 0.25 0.0150 0.0097 0.0061 0.0038 0.0023 0.0013 0.0008 – 0.9999 1.0000 0.75 0.9687 0.9526 0.9274 0.8892 0.8331 0.7547 0.6516 – 0.6524 0.8595 0.90 0.9998 0.9996 0.9993 0.9983 0.9959 0.9894 0.9723 – 0.1456 0.3822 9 p = 0.10 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 1.0000 0.25 0.0023 0.0014 0.0009 0.0005 0.0003 0.0002 0.0001 0.0001 – 1.0000 0.75 0.8460 0.8098 0.7636 0.7054 0.6337 0.5485 0.4516 0.3476 – 0.9057 0.90 0.9944 0.9924 0.9891 0.9836 0.9737 0.9552 0.9202 0.8544 – 0.4995 10 p = 0.10 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 0.25 0.0002 0.0001 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 – 0.75 0.4878 0.4469 0.4019 0.3528 0.3005 0.2462 0.1920 0.1405 0.0943 – 0.90 0.8933 0.8794 0.8617 0.8384 0.8071 0.7638 0.7032 0.6178 0.5005 –

Table 3: Closeness probabilities for the exponential for n=10

30 / 81

slide-31
SLIDE 31

Pitman Closeness of Order Statistics to Population Quantiles Applications

A Useful Table

n\p 0.10 0.25 0.75 0.90 5 1 2 4 5 10 1 3 8 10 15 2 4 12 14 20 2 5 16 19

Table 4: Pitman-closest order statistic to the pth quantile of the standard exponential from a sample of size n

31 / 81

slide-32
SLIDE 32

Pitman Closeness of Order Statistics to Population Quantiles Applications

Quantities of interest

Let us consider the power function distribution with pdf and cdf as f (x) = αxα−1 and F(x) = xα for x ∈ (0, 1), α > 0, and its pth quantile as ξp = p1/α for p ∈ (0, 1). Similar to the previous two examples, we can use Result 3 to derive the following expressions for the Pitman closeness probability associated with any two order statistics. Varying α and n, we can then determine the Pitman-closest order statistic to the quantiles of interest.

32 / 81

slide-33
SLIDE 33

Pitman Closeness of Order Statistics to Population Quantiles Applications

Results

Result 6: For i = ℓ + 1, · · · , n, π(ℓ)i(p) = 1 − Ip (ℓ, n − ℓ + 1) + kℓ,i,n

i−ℓ−1

  • j=0

(−1)i−ℓ−1−j i − ℓ − 1 j

  • 1

n − ℓ − j ×α

j

  • a=0

n−ℓ−j

  • b=0

(−1)a+b j a n − ℓ − j b

  • (2ξp)α(ℓ+a+b)B(αb + 1, α(a + ℓ))

×

  • 1 − I 1

2 (αb + 1, α(a + ℓ))

  • for 0 < p < 1

2α , = 1 − Ip (ℓ, n − ℓ + 1) + kℓ,i,n

i−ℓ−1

  • j=0

(−1)i−ℓ−1−j i − ℓ − 1 j

  • 1

n − ℓ − j ×α

j

  • a=0

n−ℓ−j

  • b=0

(−1)a+b j a n − ℓ − j b

  • (2ξp)α(a+ℓ+b)B(α(a + ℓ), αb + 1)

×

  • I 1

2 (α(a + ℓ), αb + 1) − I1− 1 2ξp (α(a + ℓ), αb + 1)

  • for

1 2α ≤ p < 1.

33 / 81

slide-34
SLIDE 34

Pitman Closeness of Order Statistics to Population Quantiles Applications

Similarly, for i = 1, · · · , ℓ − 1, π(ℓ)i(p) = Ip (ℓ, n − ℓ + 1) + ki,ℓ,n

ℓ−i−1

  • j=0

(−1)ℓ−i−1−j ℓ − i − 1 j

  • 1

ℓ − j − 1 ×α

n−ℓ

  • a=0

(−1)a n − ℓ a

  • (2ξp)α(a+ℓ)B(α(j + a + 1), α(ℓ − j − 1) + 1)

×

  • 1 − I 1

2 (α(j + a + 1), α(ℓ − j − 1) + 1)

  • for 0 < p < 1

2α , = Ip (ℓ, n − ℓ + 1) + ki,ℓ,n

ℓ−i−1

  • j=0

(−1)ℓ−i−1−j ℓ − i − 1 j

  • 1

ℓ − j − 1 ×α

n−ℓ

  • a=0

(−1)a n − ℓ a

  • (2ξp)α(a+ℓ)B(α(j + a + 1), α(ℓ − j − 1) + 1)

×

  • I

1 2ξp (α(j + a + 1), α(ℓ − j − 1) + 1)

−I 1

2 (α(j + a + 1), α(ℓ − j − 1) + 1)

  • for

1 2α ≤ p < 1.

34 / 81

slide-35
SLIDE 35

Pitman Closeness of Order Statistics to Population Quantiles Applications

A Useful Table

n p 0.10 0.25 0.75 0.90 5 α = 0.01 1 1 4 5 0.03 1 1 4 5 0.05 1 1 4 5 0.07 1 1 4 5 0.10 1 1 4 5 0.25 1 1 4 5 10 α = 0.01 1 2 8 9 0.03 1 2 8 9 0.05 1 2 8 9 0.07 1 3 8 9 0.10 1 3 8 9 0.25 1 3 8 10 15 α = 0.01 1 4 11 14 0.03 1 4 12 14 0.05 1 4 12 14 0.07 1 4 12 14 0.10 1 4 12 14 0.25 2 4 12 14 20 α = 0.01 2 5 15 18 0.03 2 5 15 18 0.05 2 5 15 18 0.07 2 5 15 19 0.10 2 5 15 19 0.25 2 5 16 19

Table 5: Pitman-closest order statistic to the pth quantile of the power function distribution from a sample of size n

35 / 81

slide-36
SLIDE 36

Pitman Closeness of Order Statistics to Population Quantiles Applications

Observations

The extension from population median to population quantiles was straightforward. Among all distributions, we observed the following:

1

We found the same order statistic to be closest to each quantile for the uniform distribution and exponential distribution; this reveals the natural robustness aspect of the Pitman closeness concept in this regard.

2

In the case of the power function distribution, we started to see changes, and yet they are quite consistent with the corresponding results for the uniform and exponential distributions.

3

The slight difference in the Pitman-closest order statistics in the case

  • f the power function distribution are seen to occur for extreme values
  • f α, viz., 0 < α < 0.25.

36 / 81

slide-37
SLIDE 37

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

Is there another way?

Notice the comparisons in our Pitman closeness probabilities involve two estimators at a time, and consequently one may be required to conduct as many as m(m − 1)/2 paired comparisons if m estimators are compared as a class. With so many paired comparisons, Blyth [9] advises us to reduce the number of comparisons by considering the joint distributions of their respective loss functions. In simultaneous comparisons, consider the following two simultaneous-closeness criteria given by Blyth [9].

37 / 81

slide-38
SLIDE 38

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

Blyth’s Criteria

Criterion 1 Choose ˆ θi from among estimators in the class C for which max

i∈K Pr[Li = min j∈K(Lj)],

(1) where K is an index set for the class C, and Li = |ˆ θi − θ|. This criterion chooses the estimator within C which is most-frequently closest to the value of an unknown parameter θ. This simultaneous criterion can be thought of as a max-min criterion in that we are maximizing the probability of ˆ θi having the smallest loss among all estimators in C.

38 / 81

slide-39
SLIDE 39

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

Blyth’s Criteria

Criterion 2 Choose ˆ θi from among estimators in the class C for which min

i∈K Pr[Li = max j∈K (Lj)].

(2) This criterion directs us to choose the estimator within a class C which is least-frequently farthest from θ This can therefore be thought of as a min-max criterion in that we are minimizing the probability that ˆ θi has the maximum loss among the estimators in C. Whenever the size of the index set, K, is two, these criteria are equivalent to the definition of a Pitman-closer estimator.

39 / 81

slide-40
SLIDE 40

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

Other criteria

Another closely related criterion has been suggested by Banks [8]: For a given ǫ > 0, he suggests that we prefer an estimator, δ1(x), where x is a vector of data obtained from a sample, over a competing estimator, δ2(x), if Pr[ δ1(x) − θ < ǫ] > Pr[ δ2(x) − θ < ǫ] (3) ∀ θ ∈ Θ, where x is a loss function. Banks’ criterion can be generalized to the simultaneous comparison of multiple estimators of θ as well. The list goes on but it is Blythe’s criteria which we will find most useful here.

40 / 81

slide-41
SLIDE 41

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

A Useful Reference

For the computations required in Blyth’s criteria, we turn to the work

  • f Fountain et al. [10].

While the geometry of their arguments is complex, we consider a special case in which estimators are ordered, as in the case of order statistics. Combining the concepts of Pitman closeness and simultaneous comparisons, we have a new goal and with it, introduce a new concept and some subsequent some new results.

41 / 81

slide-42
SLIDE 42

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

Aim and Method

Objective We would like to determine the probability with which each order statistic, Xi:n for each i ∈ {1, 2, · · · , n}, is simultaneously Pitman-closest to θ when compared with the remaining order statistics. Our procedure for calculating our probabilities of interest is based on partitioning the essential range of a random vector of observations into regions in which each order statistic is the “best”.

42 / 81

slide-43
SLIDE 43

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

An Important Definition

In accordance with Blyth’s first criterion, we have the following definition for the simultaneous-closeness probability. Definition The simultaneous-closeness probability (SCP) of Xi:n, i ∈ {1, · · · , n}, among the order statistics, X1:n, · · · , Xn:n, in the estimation of a population parameter θ is πi:n(θ) = Pr

  • |Xi:n − θ| < min

j,j=i |Xj:n − θ|

  • .

(4)

43 / 81

slide-44
SLIDE 44

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

A Useful Result

Using the geometric arguments of by Fountain et al. [10], this probability can be found by carrying out just two comparisons: πi:n(θ) = I P(Xi:n, Xi−1:n|θ, n) − I P(Xi+1:n, Xi:n|θ, n), (5) where I P(Xi:n, Xi−1:n|θ, n) = Pr(|Xi:n − θ| < |Xi−1:n − θ|) We can define the probability of interest in the case of both the unbounded and bounded support. Just as we looked at applications of Pitman closeness probabilities in the previous section, we can do the same here.

44 / 81

slide-45
SLIDE 45

SCP of Order Statistics to Population Quantiles Simultaneous Closeness Probabilities

Settings

The results to follow are so determined based on the support of the random variable X (i.e., whether the support is the entire real line or a bounded open interval). In particular, we are interested in parameters with a parameter space that coincides with the support of X. We first present general results for random variables with complete support (i.e., ℜ) and then present similar ones for random variables with bounded support.

45 / 81

slide-46
SLIDE 46

SCP of Order Statistics to Population Quantiles Some Results

Unbounded Case

Theorem Assume that the common conditions hold for X, with support ℜ. Then for i = 2, · · · , n − 1, the simultaneous-closeness probability, πi:n (θ), of Xi:n to θ is given by πi:n (θ) = n i − 1

  • [F(θ)]i−1 [1 − F(θ)]n−i+1

(6) + F(θ) n! (i − 1)!(n − i)! ¯ F

  • 2θ − F −1(u)

n−i ui−1du − F(θ) n! (i − 2)!(n − i + 1)! ¯ F

  • 2θ − F −1(u)

n−i+1 ui−2du.

46 / 81

slide-47
SLIDE 47

SCP of Order Statistics to Population Quantiles Some Results

Bounded Case

Theorem Assume the common conditions for X, where the support of X is bounded

  • n the interval (a, b). Then, the simultaneous-closeness probability πi:n(θ)

is given by πi:n(θ) = n n − 1 i − 2 b∗

a

f (x) [F(x)]i−2 (7)

  • [F(h2(x))]n−i+1 − [F(h1(x))]n−i+1

dx, where b∗ = min(b, 2θ − a), h1(x) = max(a, x) and h2(x) = min (b, 2θ − x).

47 / 81

slide-48
SLIDE 48

SCP of Order Statistics to Population Quantiles Some Results

Population Quantiles

By taking θ=ξp, we can derive explicit expression for the SCP probabilities as seen in Eqs.(6) and (7). First, for the unbounded case, assuming that the common conditions hold for X, with support ℜ. Then for i = 2, · · · , n − 1, πi:n(p) = n i − 1

  • pi−1(1 − p)n−i+1

+ n! (i − 1)!(n − i)! p

  • 1 − F
  • 2F −1(p) − F −1(u)

n−i ui−1du − n! (i − 2)!(n − i + 1)! p

  • 1 − F
  • 2F −1(p) − F −1(u)

n−i+1

48 / 81

slide-49
SLIDE 49

SCP of Order Statistics to Population Quantiles Some Results

Some Notes

Noting the previous expression is free of µ and σ, this suggests that the probabilities can be computed and cataloged for various choices of p and n for different families such as Normal, extreme-value, Cauchy, Laplace, logistic, and so on. These simultaneous-closeness probabilities may then be used in developing efficient goodness-of-fit methods and also as “good” plotting points in graphical model validity methods.

49 / 81

slide-50
SLIDE 50

SCP of Order Statistics to Population Quantiles Some Results

A Useful Observation

We have the following useful corollary which simplifies the computation in the case when the standard distribution is symmetric about the origin. Corollary If the standard pdf is symmetric about the origin (i.e., f (z) = f (−z), F(−z) = 1 − F(z) and F −1(p) = −F −1(q), where q = 1 − p), then we have πi:n(p) = πn−i+1:n(q) (9) for i = 1, · · · , n.

50 / 81

slide-51
SLIDE 51

SCP of Order Statistics to Population Quantiles Some Results

Bounded Case

Similarly, we have developed a general expression for πi:n(θ) when the support of X is bounded on the interval (a, b). If X is bounded on the interval (a, b), then Z = (X − µ)/σ is bounded on the interval (a′, b′), where b′ = (b − µ)/σ and a′ = (a − µ)/σ. Let zp = (ξp − µ)/σ. The simultaneous-closeness probability of Xi:n to ξp in this case is given by πi:n(p) = n n − 1 i − 2 b∗∗

a′

f (z) [F(z)]i−2 (10) ×

  • [F(h2(z))]n−i+1 − [F(h1(z))]n−i+1

dz, where b∗∗ = min(b′, 2zp − a′), h1(z) = max(a′, z) and h2(z) = min (b′, 2zp − z).

51 / 81

slide-52
SLIDE 52

SCP of Order Statistics to Population Quantiles Applications

Two Cases

Being the forementioned results are predicated on the support of the random variable and this is further separated into two categories (bounded and unbounded), we can look at applications in these two cases. Here we consider applying the unbounded results to the normal distribution and the bounded results to the exponential distribution.

52 / 81

slide-53
SLIDE 53

SCP of Order Statistics to Population Quantiles Applications

SCP for Normal

In the Normal distribution, the standard forms are f (z) = φ(z) = 1 √ 2π exp

  • −z2

2

  • , F(z) = Φ(z) and F −1(p) = Φ−1(p).

When we substitute these expressions into Eq.(8) and use the fact that Φ(−z) = 1 − Φ(z), we obtain

πi:n(p) = n i − 1

  • pi−1(1 − p)n−i+1

+ n! (i − 1)!(n − i)! p

  • Φ
  • Φ−1(u) − 2Φ−1(p)

n−i ui−1du − n! (i − 2)!(n − i + 1)! p

  • Φ
  • Φ−1(u) − 2Φ−1(p)

n−i+1 ui−2du.

53 / 81

slide-54
SLIDE 54

SCP of Order Statistics to Population Quantiles Applications

Computations

We have tabulated these probabilities for values of i = 1, . . . , 10, p = 0.05(0.05)0.50 and n = 10 in Table 6. Then, the values for p = 0.55(0.05)0.95 can be found readily by using the symmetry property in the corollary just mentioned.

54 / 81

slide-55
SLIDE 55

SCP of Order Statistics to Population Quantiles Applications

SCP for Normal

p\i 1 2 3 4 5 6 7 8 9 10 0.05 0.78921 0.18365 0.02478 0.00222 0.00014 0.00001 0.00000 0.00000 0.00000 0.00000 0.10 0.52231 0.34097 0.11082 0.02252 0.00308 0.00029 0.00002 0.00000 0.00000 0.00000 0.15 0.31927 0.37855 0.21151 0.07179 0.01613 0.00247 0.00026 0.00002 0.00000 0.00000 0.20 0.18403 0.33583 0.28126 0.14078 0.04616 0.01024 0.00153 0.00015 0.00001 0.00000 0.25 0.10058 0.25937 0.30255 0.20883 0.09375 0.02833 0.00576 0.00076 0.00006 0.00000 0.30 0.05210 0.18049 0.28047 0.25626 0.15155 0.06011 0.01598 0.00274 0.00028 0.00001 0.35 0.02548 0.11483 0.23100 0.27212 0.20682 0.10513 0.03573 0.00783 0.00100 0.00006 0.40 0.01169 0.06712 0.17152 0.25607 0.24613 0.15794 0.06765 0.01865 0.00300 0.00022 0.45 0.00499 0.03599 0.11543 0.21606 0.26008 0.20878 0.11177 0.03848 0.00773 0.00069 0.50 0.00195 0.01758 0.07031 0.16406 0.24609 0.24609 0.16406 0.07031 0.01758 0.00195

Table 6: SCP of order statistics for Normal distribution when n = 10 for various values of p and i

Table 6 presents a clear picture of the most-preferred order statistic for the pth quantile and we do observe from this table that the most-preferred order statistic, with index i∗, increases from 1 to 10 as p increases from 0.05 to 0.95.

55 / 81

slide-56
SLIDE 56

SCP of Order Statistics to Population Quantiles Applications

SCP for Exponential

In the case of the standard exponential distribution, we have a′ = 0 and b′ = ∞, and the standard forms, for z > 0, in this case are f (z) = e−z, F(z) = 1 − e−z and F −1(z) = − ln(1 − z). We can use expressions from Balakrishnan et al. [2] to express the simultaneous probabilities for ξp as πi:n(p) = I P(Xi:n, Xi−1:n|p, n) + I P(Xi:n, Xi+1:n|p, n) − 1 = π(i)i−1(p) + π(i)i+1(p) − 1, (11) where π(ℓ),i(p) is given in Result 7 (in Eq. (23) for i = ℓ + 1, · · · , n and Eq. (25) for i = 1, . . . , ℓ − 1) of Balakrishnan et al. [2].

56 / 81

slide-57
SLIDE 57

SCP of Order Statistics to Population Quantiles Applications

The combined expression for the simultaneous-closeness probabilities in the exponential distribution can be simplified using the following integral: Iℓ,m(p) = p uℓ−1 (1 − u)m du for ℓ, m = 1, 2, · · · . (12) With the integral in Eq.(12), πi:n(p) can be expressed in this case as πi:n(p) = n i − 1

  • pi−1(1 − p)n−i+1

(13) +ai:nq2(n−i)Ii,n−i(p) − bi:nq2(n−i+1)Ii−1,n−i+1(p), where ai:n =

n! (i−1)!(n−i)! and bi:n = n! (i−2)!(n−i+1)!.

We have tabulated these values for n = 10 and i = 1, · · · , 10 for incremental values of p = 0.05(0.05)0.95 in Table 7.

57 / 81

slide-58
SLIDE 58

SCP of Order Statistics to Population Quantiles Applications p\i 1 2 3 4 5 6 7 8 9 10 0.05 0.85064 0.12850 0.01894 0.00180 0.00012 0.00001 0.00000 0.00000 0.00000 0.00000 0.10 0.59691 0.28721 0.09362 0.01930 0.00268 0.00026 0.00002 0.00000 0.00000 0.00000 0.15 0.37591 0.35232 0.19064 0.06422 0.01444 0.00222 0.00023 0.00002 0.00000 0.00000 0.20 0.21907 0.33210 0.26540 0.13020 0.04232 0.00936 0.00140 0.00014 0.00001 0.00000 0.25 0.11966 0.26652 0.29531 0.19842 0.08774 0.02629 0.00532 0.00070 0.00005 0.00000 0.30 0.06152 0.19003 0.28078 0.24896 0.14444 0.05656 0.01490 0.00254 0.00025 0.00001 0.35 0.02975 0.12273 0.23570 0.26931 0.20031 0.10023 0.03367 0.00731 0.00093 0.00005 0.40 0.01348 0.07236 0.17753 0.25737 0.24188 0.15247 0.06438 0.01754 0.00280 0.00020 0.45 0.00567 0.03895 0.12074 0.21997 0.25894 0.20397 0.10742 0.03646 0.00723 0.00064 0.50 0.00219 0.01905 0.07410 0.16882 0.24792 0.24318 0.15927 0.06714 0.01653 0.00181 0.55 0.00077 0.00836 0.04075 0.11602 0.21257 0.25985 0.21190 0.11114 0.03402 0.00463 0.60 0.00024 0.00324 0.01981 0.07071 0.16233 0.24854 0.25378 0.16663 0.06384 0.01087 0.65 0.00006 0.00108 0.00832 0.03754 0.10896 0.21100 0.27263 0.22665 0.11001 0.02375 0.70 0.00001 0.00029 0.00292 0.01685 0.06275 0.15616 0.25967 0.27827 0.17438 0.04869 0.75 0.00000 0.00006 0.00080 0.00609 0.02976 0.09752 0.21416 0.30402 0.25325 0.09434 0.80 0.00000 0.00001 0.00016 0.00162 0.01081 0.04845 0.14623 0.28690 0.33239 0.17344 0.85 0.00000 0.00000 0.00002 0.00027 0.00261 0.01707 0.07561 0.21977 0.38173 0.30291 0.90 0.00000 0.00000 0.00000 0.00002 0.00031 0.00328 0.02407 0.11796 0.35384 0.50052 0.95 0.00000 0.00000 0.00000 0.00000 0.00001 0.00015 0.00241 0.02732 0.20140 0.76871

Table 7: SCP of order statistics for exponential distribution when n=10 for various values of p and i

58 / 81

slide-59
SLIDE 59

SCP of Order Statistics to Population Quantiles Applications

Observations

Quite interestingly, we find the simultaneous-closest order statistic to the pth quantile in this case to be quite close to that of the Normal case and the corresponding simultaneous-closeness probabilities to be quite close as well; this reveals that the concept discussed here is naturally quite robust. In Table 7, we observe the lack of symmetry in the simultaneous-closeness probabilities since the standard exponential is skewed to the right. In addition, while the probabilities πi:n(p) and πn−i+1:n(q) are not equal, the range of quantiles over which an order statistic is preferred varies, although slightly as mentioned. For example, X1:10 is most frequently closest to ξp, for p = 0 to p = 0.15+, whereas X10:10 is most frequently closest to ξp after p = 0.85.

59 / 81

slide-60
SLIDE 60

SCP Plotting Points Motivation

Idea

Plotting points of order statistics are often used in the determination

  • f goodness-of-fit of observed data to theoretical percentiles.

In the literature, there has been controversy on the choice of plotting points. Plotting points are often determined using nonparametric methods which produce, for example, the mean- and median-ranks commonly used in practice. To motivate the applicability of SCPs in the plotting points problem, consider the following diagram:

60 / 81

slide-61
SLIDE 61

SCP Plotting Points Motivation

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.1 0.2 0.3 0.4 0.5 p πi:n

Figure 2: SCP for Normal order statistics when n=10 and i=1, · · · , 10

61 / 81

slide-62
SLIDE 62

SCP Plotting Points Motivation

What does this picture tell us?

In Figure 2, for a given i, we can see the value of p for which the SCP is maximized. This leads to identification of optimal plotting points based on Pitman simultaneous-closeness probabilities. That is, we use a distribution-based approach which selects plotting points (quantiles) based on the simultaneous-closeness of order statistics to population quantiles.

62 / 81

slide-63
SLIDE 63

SCP Plotting Points Background

Other Plotting Points

The concept of mean-rank, denoted by ei:n, is based on the fact that E [F (Xi:n)] = i n + 1 = ei:n, since F(Xi:n) d = Ui:n, the i-th order statistic in a sample of size n from the uniform U(0,1) distribution. In an alternative approach, letting M (X) denote the median of X,

  • ne could use the median-rank, mi:n of the i-th order statistic given by

M [F (Xi:n)] = b0.5;i,n−i+1 = mi:n, where b0.5;α,β is the median of a B (α, β) due to (1) and one can establish that ei:n < mi:n ∀ i < n 2 and ei:n > mi:n ∀ i > n 2.

63 / 81

slide-64
SLIDE 64

SCP Plotting Points Determining SCP Plotting Points

A Reminder

Recall the result for the simultaneous-closeness probability:

Result 1 For i = 2, · · · , n − 1, πi:n(p) = n i − 1

  • pi−1(1 − p)n−i+1

+ n! (i − 1)!(n − i)! p

  • 1 − F
  • 2F −1(p) − F −1(u)

n−i ui−1du − n! (i − 2)!(n − i + 1)! p

  • 1 − F
  • 2F −1(p) − F −1(u)

n−i+1 ui−2du (14)

64 / 81

slide-65
SLIDE 65

SCP Plotting Points Determining SCP Plotting Points

Determining SCP Plotting Points

A natural question that arises here is as follows. For given values of i and n and the choice of the standard normal distribution, for example, for what value p is the SCP πi:n(p) in (14) maximized. Such a determination, as mentioned, would give an “optimal” plotting position corresponding to Xi:n while assessing the adequacy

  • f the fit of a normal distribution to the observed data.

65 / 81

slide-66
SLIDE 66

SCP Plotting Points Determining SCP Plotting Points

How?

Determining where the probability is maximized would require us to take the partial derivative of πi:n(p) in (14) with respect to p and equate it to zero. Now, setting ∂πi:n

∂p

equal to zero, the terms without integrals cancel and we obtain an objective equation.

66 / 81

slide-67
SLIDE 67

SCP Plotting Points Determining SCP Plotting Points

Objective Equation

The equation to be solved is:

(n − i) p

  • F
  • F −1(u) − 2F −1(p)

n−i−1 f

  • F −1(u) − 2F −1(p)
  • ui−1du

= (i − 1) p

  • F
  • F −1(u) − 2F −1(p)

n−i f

  • F −1(u) − 2F −1(p)
  • ui−2du,

which is equivalent to

p

  • F
  • F −1(u) − 2F −1(p)

n−i−1 f

  • F −1(u) − 2F −1(p)
  • ui−2

×

  • (n − i)u − (i − 1)F
  • F −1(u) − 2F −1(p)
  • du = 0.

(15)

We shall refer to the solution to this equation as SCP plotting point.

67 / 81

slide-68
SLIDE 68

SCP Plotting Points Applications

Many Cases

For a specific F(z) and f (z), such as the standard normal distribution, we can solve Eq. (15) numerically to determine the value

  • f p that maximizes the SCP πi:n(p).

Naturally, we would expect the plotting points derived by maximization of πi:n(p) with respect to p not to differ markedly from plotting points based on mean- or median-ranks. We show you the results for the normal case and several others.

68 / 81

slide-69
SLIDE 69

SCP Plotting Points Applications

Normal

In the case of normal distribution, we need to solve the equation p

  • Φ
  • Φ−1(u) − 2Φ−1(p)

n−i−1 φ

  • Φ−1(u) − 2Φ−1(p)
  • ui−2

×

  • (n − i)u − (i − 1)Φ
  • Φ−1(u) − 2Φ−1(p)
  • du = 0, (16)

where φ(z) and Φ(z) are the standard normal pdf and cdf, respectively.

69 / 81

slide-70
SLIDE 70

SCP Plotting Points Applications

Some Results

We determine the SCP for values of p ranging over 0.001(0.001)0.999 for all values of i corresponding to different choices of n. Denoting the solution of (16) by si:n for each i = 1, . . . , n, we first note that s1:n ≡ 0 and sn:n ≡ 1, as one would expect logically. In Table 8, we present, for n = 10 and i = 2(1)9, the values of si:n and the median-and mean-ranks.

70 / 81

slide-71
SLIDE 71

SCP Plotting Points Applications

i 2 3 4 5 6 7 8 9 si:n 0.1433 0.2472 0.3487 0.4496 0.5504 0.6513 0.7528 0.8567 mi:n 0.1623 0.2586 0.3551 0.4517 0.5483 0.6449 0.7414 0.8377 ei:n 0.1818 0.2727 0.3636 0.4545 0.5455 0.6364 0.7273 0.8182

Table 8: Plotting points for normal distribution when n=10

71 / 81

slide-72
SLIDE 72

SCP Plotting Points Applications

More Results

Similar results for the SCP plotting positions for the normal distribution when n = 15 are presented in Figure 3 and Table 9. i 2 3 4 5 6 7 8 si:n 0.0951 0.1644 0.2320 0.2992 0.3662 0.4331 0.5000 mi:n 0.1094 0.1743 0.2394 0.3045 0.3697 0.4348 0.5000 ei:n 0.1250 0.1875 0.2500 0.3125 0.3750 0.4375 0.5000

Table 9: Plotting points for normal distribution when n=15

72 / 81

slide-73
SLIDE 73

SCP Plotting Points Applications

A Picture

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.1 0.2 0.3 0.4 0.5 p πi:n

Figure 3: SCP for normal order statistics when n=15 and i=1, · · · , 15

73 / 81

slide-74
SLIDE 74

SCP Plotting Points Applications

Other distributions

As mentioned earlier, the SCP plotting points can also be determined for other distributions. For example, we can carry out the necessary computations for the logistic, Laplace and Cauchy distributions. The corresponding results for n = 10 and n = 15 are presented in Tables 10 and 11, respectively, for the aforementioned distributions, alongside the normal SCP plotting points and mean- and median-ranks. From these tables, we observe that the plotting points are nearly the same for the inner order statistics, while those for extreme order statistics vary.

74 / 81

slide-75
SLIDE 75

SCP Plotting Points Applications

Some More Results

i 2 3 4 5 6 7 8 9 Cauchy 0.1281 0.2375 0.3433 0.4479 0.5521 0.6567 0.7625 0.8819 Laplace 0.1381 0.2425 0.3443 0.4465 0.5535 0.6557 0.7575 0.8619 logistic 0.1405 0.2456 0.3479 0.4494 0.5506 0.6521 0.7544 0.8595 normal 0.1433 0.2472 0.3487 0.4496 0.5504 0.6513 0.7528 0.8567 uniform 0.1584 0.2542 0.3521 0.4506 0.5494 0.6479 0.7458 0.8416 m 0.1623 0.2586 0.3551 0.4517 0.5483 0.6449 0.7414 0.8377 e 0.1818 0.2727 0.3636 0.4545 0.5455 0.6364 0.7273 0.8182 Table 10: SCP plotting points for some symmetric families of distributions when n=10

75 / 81

slide-76
SLIDE 76

SCP Plotting Points Applications

Results Cont’d

i 2 3 4 5 6 7 8 Cauchy 0.0849 0.1572 0.2269 0.2957 0.3640 0.4320 0.5000 Laplace 0.0922 0.1620 0.2299 0.2971 0.3642 0.4313* 0.5000 logistic 0.0932 0.1632 0.2313 0.2987 0.3659 0.4330 0.5000 normal 0.0951 0.1644 0.2320 0.2992 0.3662 0.4331 0.5000 uniform 0.1058 0.1699 0.2353 0.3012 0.3674 0.4337 0.5000 m 0.1094 0.1743 0.2394 0.3045 0.3697 0.4348 0.5000 e 0.1250 0.1875 0.2500 0.3125 0.3750 0.4375 0.5000 Table 11: SCP plotting points for some symmetric families of distributions when n=15

76 / 81

slide-77
SLIDE 77

SCP Plotting Points Applications

Observations

Looking at the results for the various symmetric distributions just considered, we observe that the SCP plotting points for the inner

  • rder statistics were close and consistent across a spectrum of

distributions and that the greatest disparities occurred for the extreme

  • rder statistics at each end of the sample.

The natural question then is how robust is this method to a sequence

  • f distributions with tails that are progressively heavier?

77 / 81

slide-78
SLIDE 78

SCP Plotting Points Applications

What did we find?

To answer this, one may consider the family of t-distributions with f degrees of freedom. We observe that the values for interior order statistics are very nearly the same for all f , while those for the extreme order statistics vary a bit, thus displaying the inherent robustness of these plotting points. In addition, we also observe the behavior to be monotone with respect to f .

78 / 81

slide-79
SLIDE 79

Conclusions Recap

Summary

The concept of Pitman Closeness has much history and continues to prove to be a useful tool in many contexts. We have looked at its use in comparing order statistics as estimators and defined a new concept, that of simultaneous closeness. The concept of simultaneous-closeness builds on previous ideas and yet finds a new use in goodness-of-fit testing in the form of plotting points.

79 / 81

slide-80
SLIDE 80

Conclusions A Note

What followed?

Extending the work done on plotting points, a correlation-type test statistic was developed which uses SCP plotting points; work has been done for goodness-of-fit to Normal and Weibull. Current work is being carried out for Pitman closeness of order statistics to population quantiles based on progressively Type-II right censored samples.

80 / 81

slide-81
SLIDE 81

Conclusions A Note

I would just like to say thank you to...

  • DR. BALAKRISHNAN FOR “INVITING” ME

AND, WILLIAM VOLTERMAN FOR LISTENING TO MOST OF THIS MATERIAL FOR THE 3rd TIME AND, TO ALL OF YOU FOR YOUR ATTENTION!

81 / 81

slide-82
SLIDE 82

References

[1] Ahmadi, J., Balakrishnan, N., 2009. Pitman closeness of record values to population quantiles. Statistics and Probability Letters, 79, 2037–2044. [2] Balakrishnan, N., Davies, K., Keating J.P., 2009. Pitman closeness

  • f order statistics to population quantiles. Communications in

Statistics - Theory & Methods, 38, 802–820. [3] Balakrishnan, N., Davies, K., Keating, J.P. and R. L. Mason Optimal Plotting Points based on Simultaneous Closeness Probabilities, submitted to Computational Statistics & Data Analysis. [4] Balakrishnan, N., Davies, K.F., Keating, J.P. and R. L. Mason (2010) Simultaneous Closeness among Order Statistics to Population Quantiles, Journal of Statistical Planning and Inference, Vol.140, 9:2408–2415. [5] Balakrishnan, N., Iliopoulos, G., Keating, J.P. and Mason, R.L., Pitman closeness of sample median to population median, Statistics & Probability Letters, to appear (2009).

81 / 81

slide-83
SLIDE 83

References

[6] Balakrishnan, N., Davies, Katherine F., Keating, J.P. and R. L. Mason Pitman Closeness, Monotonicity and Consistency of Best Linear Unbiased and Invariant Estimators for Exponential Distribution under Type-II Censoring, Journal of Statistical Computation and Simulation, to appear. [7] Balakrishnan, N., Davies, K.F., Keating, J.P. and R. L. Mason Pitman Closeness of Best Linear Unbiased and Invariant Predictors for Exponential Distribution in One- and Two-Sample Situations, Communications in Statistics: Theory and Methods, to appear. [8] Banks, D. (1997). Some geometry for location estimators, Classification Society of North American Newsletter, Issue 49 (April 1997); available at http://www.public.iastate.edu/ larsen/csna/previous/csnanews49- 97apr.htm.

81 / 81

slide-84
SLIDE 84

References

[9] Blyth, C.R., 1972. Some probability paradoxes in choice from among random alternatives. Journal of the American Statistical Association, 67, 366–381. [10] Fountain, R.L., Keating, J.P., Maynard, H.B., 1996. The simulatneous comparison of estimators. Mathematical Methods of Statistics, 5, 187–198. [11] Iliopoulos, G., Balakrishnan, N., 2010. An odd property of sample median from odd sample sizes. Statistical Methodology, 7, 678–686. [12] Keating, J.P. and Gupta, R.C. (1984). Simultaneous comparison of scale estimators, Sankhy¯ a, B46, pp. 275-280. [13] Keating, J.P. and Mason, R.L. (1985). Pitman’s measure of closeness, Sankhy¯ a, B47, pp. 22-32. [14] Keating, J.P., Mason, R.L. and Sen, P.K., Pitman’s measure of closeness, Society for Industrial and Applied Mathematics, Philadelphia (1993).

81 / 81

slide-85
SLIDE 85

References

[15] Rao, C.R. (1981) Some comments on the minimum mean square error as a criterion of estimation, Statistics and Related Topics, North Holland, Amsterdam, 123-143.

81 / 81