Regression Diagnostics and the Forward Search 3. A Single Multivariate Sample
Anthony Atkinson, LSE
Multivariate Normality
Much multivariate data is
Squared Mahalanobis distance:
$$d_i^2 = \{y_i - \hat{\mu}\}^T \hat{\Sigma}^{-1} \{y_i - \hat{\mu}\}$$
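The squared Mahalanobis distance of each observation from the estimated mean can be sketched in NumPy as follows. This is a minimal illustration, not code from the slides: it assumes the sample mean and the unbiased sample covariance matrix as the estimates, and the function name `squared_mahalanobis` and the simulated data are hypothetical.

```python
import numpy as np

def squared_mahalanobis(Y):
    """Squared Mahalanobis distance of each row of Y from the sample mean."""
    Y = np.asarray(Y, dtype=float)
    mu_hat = Y.mean(axis=0)               # estimated mean vector
    sigma_hat = np.cov(Y, rowvar=False)   # unbiased covariance (divisor n - 1)
    resid = Y - mu_hat
    # Solve sigma_hat @ Z = resid.T instead of forming the inverse explicitly.
    Z = np.linalg.solve(sigma_hat, resid.T)
    # Row i of resid times column i of Z gives the quadratic form for row i.
    return np.einsum("ij,ji->i", resid, Z)

# Simulated stand-in data: 200 observations on 6 variables (an assumption).
rng = np.random.default_rng(0)
Y = rng.normal(size=(200, 6))
d2 = squared_mahalanobis(Y)
```

With these estimates the distances sum to (n - 1) times the number of variables, which gives a quick sanity check on any implementation.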
$$d_i^2(m) = \{y_i - \hat{\mu}(m)\}^T \hat{\Sigma}^{-1}(m) \{y_i - \hat{\mu}(m)\}$$
i (m∗)
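One step of a forward search, as it is usually described, estimates the mean and covariance from the current subset of m observations, computes the distances of all n observations from those estimates, and takes the m + 1 observations with the smallest distances as the next subset. The sketch below illustrates that step under those assumptions. The function names, the simulated data, and the starting subset are hypothetical; in practice the starting subset is chosen robustly rather than by position.

```python
import numpy as np

def subset_distances(Y, subset):
    """Squared distances of all rows of Y, using estimates from Y[subset]."""
    Y = np.asarray(Y, dtype=float)
    S = Y[subset]
    mu_m = S.mean(axis=0)                # mean estimated from the subset
    sigma_m = np.cov(S, rowvar=False)    # covariance estimated from the subset
    resid = Y - mu_m
    return np.einsum("ij,ji->i", resid, np.linalg.solve(sigma_m, resid.T))

def forward_step(Y, subset):
    """Return the subset of size m + 1: the m + 1 smallest distances."""
    d2 = subset_distances(Y, subset)
    m = len(subset)
    return np.sort(np.argsort(d2)[: m + 1])

rng = np.random.default_rng(1)
Y = rng.normal(size=(50, 3))
subset = np.arange(10)                   # hypothetical starting subset
new_subset = forward_step(Y, subset)
```

Because the new subset is chosen by ordering all n distances, observations can leave the subset as well as join it from one step to the next.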
[Figure: scatterplot matrix of y1 to y6, with observations 104 and 111 labelled.]
[Figure: one panel for each of y1 to y6.]
$d_i^2(n)$ is scaled Beta, approximated by a
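A standard form of the scaled Beta result, for a multivariate normal sample of n observations on v variables with the unbiased covariance estimate, is that each squared distance is distributed as ((n - 1)^2 / n) Beta(v/2, (n - v - 1)/2). The sketch below draws from that distribution to obtain reference quantiles. The symbol v, the function name `scaled_beta_sample`, and the values of n and v are assumptions made for illustration, and whether this is exactly the form the slide refers to is also an assumption.

```python
import numpy as np

def scaled_beta_sample(n, v, size, seed=0):
    """Draws from ((n - 1)^2 / n) * Beta(v / 2, (n - v - 1) / 2)."""
    rng = np.random.default_rng(seed)
    scale = (n - 1) ** 2 / n
    return scale * rng.beta(v / 2, (n - v - 1) / 2, size=size)

n, v = 200, 6                       # illustrative sample size and dimension
draws = scaled_beta_sample(n, v, size=200_000)
upper = np.quantile(draws, 0.99)    # simulated 99% point for a squared distance
```

The mean of this distribution is (n - 1)v / n, which matches the average of the squared distances computed from the full sample.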
[Figure: forward plot of Mahalanobis distances against subset size m (vertical axis 1 to 6), with observations 104 and 111 labelled.]
[Figure: forward plot of Mahalanobis distances against subset size m (vertical axis 2 to 12), with observations 104 and 111 labelled.]
$d_{[m+1]}(m)$. If observation $[m+1]$ is an
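The quantity monitored in the plots of minimum Mahalanobis distance can be sketched as the smallest distance among the observations not in the current subset, recorded at each subset size as the search moves forward. The code below is an illustration under stated assumptions, not the author's implementation: the function names, the simulated data with one planted outlier, and the positional starting subset are all hypothetical.

```python
import numpy as np

def distances_from_subset(Y, subset):
    """Mahalanobis distances of all rows of Y, using estimates from Y[subset]."""
    S = Y[subset]
    resid = Y - S.mean(axis=0)
    sigma_m = np.cov(S, rowvar=False)
    d2 = np.einsum("ij,ji->i", resid, np.linalg.solve(sigma_m, resid.T))
    return np.sqrt(d2)

def min_distance_trace(Y, start):
    """Minimum distance among observations outside the subset, for each m."""
    Y = np.asarray(Y, dtype=float)
    n = len(Y)
    subset = np.asarray(start)
    trace = []
    for m in range(len(subset), n):
        d = distances_from_subset(Y, subset)
        outside = np.ones(n, dtype=bool)
        outside[subset] = False
        trace.append(d[outside].min())     # closest observation not yet included
        subset = np.argsort(d)[: m + 1]    # move from m to m + 1
    return np.array(trace)

rng = np.random.default_rng(2)
Y = rng.normal(size=(100, 2))
Y[-1] = [50.0, 50.0]                       # one planted outlier
trace = min_distance_trace(Y, start=np.arange(10))
```

In this simulated example the planted outlier is the last observation to enter, so the final value of `trace` is far larger than the earlier ones.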
[Figure: minimum Mahalanobis distance (MD) against subset size m.]
[Figure: minimum Mahalanobis distance against subset size m (vertical axis 3.0 to 5.5).]
[Figure: scatterplot matrix of y1 to y6.]
[Figure: two panels against subset size m, one of minimum Mahalanobis distance and one of minimum scaled Mahalanobis distance.]
[Figure: minimum scaled Mahalanobis distance against subset size m.]
[Figure: four panels of Mahalanobis distances against subset size m, titled "Try n=84", "Try n=85", "Try n=86" and "Try n=87".]
Swiss Banknotes: forward plot of minimum Mahalanobis distance. When n = 84 and 85, the observed curve lies within the 99% envelope, but there is clear evidence of an outlier when n = 86. The evidence becomes even stronger when another observation is included.
Box, G. E. P. and D. R. Cox (1964). An analysis of transformations (with discussion). Journal of the Royal Statistical Society, Series B 26, 211–246.
Flury, B. and H. Riedwyl (1988). Multivariate Statistics: A Practical Approach. London: Chapman and Hall.