Robust Statistics using Stata
First Belgian Stata Users Meeting
Vincenzo Verardi
Fnrs, UNamur, ULB
September 2016
Vincenzo Verardi (Fnrs, UNamur, ULB) First Belgian Stata Users Meeting 6/09/2016 1 / 77
Outliers do matter and are not always ...
Figure: log of light intensity against log of temperature, with least squares and robust estimator fits.
Source: P. J. Rousseeuw and A. M. Leroy (1987)
Figure: log of brain weight against log of body weight. Labelled points: water opossum, human, Triceratops, Diplodocus, Brachiosaurus.
LS: y = 2.17 + 0.59 x
Robust: y = 1.98 + 0.75 x
Source: Weisberg, S. (1985)
Figure: series plotted against year (50 to 75), with least squares and robust estimator fits.
Source: P. J. Rousseeuw and A. M. Leroy (1987)
Figure: median and mean of a sample with X~N(0,1), N=20.
Figure: median and mean in the (x, y) plane, X~N(0,1).
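The two figures above contrast the median and the mean on standard normal samples. A minimal Stata sketch of that contrast follows, assuming the point is that one gross outlier drags the mean while leaving the median in place. The seed, the outlier value of 100, and the scalar names are illustrative choices, not taken from the slides.

```stata
* Sketch: effect of one gross outlier on the mean and on the median
* (seed, outlier value and scalar names are hypothetical)
clear
set seed 12345
set obs 20
generate x = rnormal()          // X ~ N(0,1), N = 20 as on the slide

quietly summarize x, detail
scalar mean_clean = r(mean)
scalar med_clean  = r(p50)

* replace the largest observation by a gross outlier
sort x
replace x = 100 in 20

quietly summarize x, detail
scalar mean_cont = r(mean)
scalar med_cont  = r(p50)

display "mean:   " mean_clean " -> " mean_cont
display "median: " med_clean  " -> " med_cont
```

Because the largest observation stays the largest, the two middle order statistics are untouched and the median does not move, whereas the mean shifts by (100 minus the old maximum) divided by 20.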
Figure: influence functions (IF) of location estimators plotted against x: µ, Q0.5, HL, µ0.25 and µ0.05.
Figure: influence functions (IF) of scale estimators plotted against x: σ, IQR, Qn and MAD.
Figure: influence functions (IF) of skewness measures: γ1, SK0.25 and MC.
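Figure: boxplot with P25, P75, the IQR and the median marked. Whisker bounds: P25 - 1.5 IQR and P75 + 1.5 IQR, with 0.35% of observations beyond each bound. Axis marked in standard deviations (1 sd to 4 sd).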
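The 0.35% share beyond each whisker bound can be checked by hand. The sketch below assumes a standard normal distribution, which the slide does not state explicitly, and uses only built-in Stata functions. The scalar names are illustrative.

```stata
* Sketch: share of a N(0,1) distribution beyond the standard boxplot bounds
* Assumption: X ~ N(0,1); scalar names are hypothetical
scalar q1       = invnormal(0.25)
scalar q3       = invnormal(0.75)
scalar iqr      = q3 - q1
scalar hi_fence = q3 + 1.5*iqr
scalar lo_fence = q1 - 1.5*iqr
scalar p_upper  = 1 - normal(hi_fence)
scalar p_lower  = normal(lo_fence)

display "upper bound = " %6.4f hi_fence
display "share above = " %6.4f 100*p_upper "%"
display "share below = " %6.4f 100*p_lower "%"
```

Under this assumption the upper bound sits at roughly 2.7 standard deviations and each tail share is about 0.35%, in line with the figure.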
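Figure: boxplot with P25, P75, the IQR and the median marked. Whisker bounds: P25 - 1.5 IQR and P75 + 1.5 IQR, with 2.75% of observations beyond each bound. Axis marked in standard deviations (1 sd to 4 sd).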
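Figure: boxplot with P25, P75, the IQR and the median marked. Whisker bounds: max(P25 - 1.5 IQR, 0) and P75 + 1.5 IQR, with 0% of observations below the lower bound and 2.80% above the upper bound. Axis marked in standard deviations (1 sd to 6 sd).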
Transformation
Figure: generalized and standard boxplots of daily earnings in British pounds (axis from 50000 to 200000). Labelled points: Ibrahimovic, Messi, Ronaldo.
Figure: y against x, with true line and fitted line.
t-stats: (0.22) and (19.49)
Vertical points
Figure: y against x, with true line and fitted line.
t-stats without contamination: (0.22) and (19.49)
t-stats with vertical points: (2.44) and (1.88)
Bad leverage points
Figure: y against x, with true line and fitted line.
t-stats without contamination: (0.22) and (19.49)
t-stats with vertical points: (2.44) and (1.88)
t-stats with bad leverage points: (1.33) and (3.34)
Good leverage points
Figure: y against x, with true line and fitted line.
t-stats without contamination: (0.22) and (19.49)
t-stats with vertical points: (2.44) and (1.88)
t-stats with bad leverage points: (1.33) and (3.34)
t-stats with good leverage points: (0.29) and (47.92)
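The four figures above compare a least squares fit on clean data with fits under vertical points, bad leverage points and good leverage points. The simulation below is a rough sketch of the same comparison using only built-in Stata commands. The data-generating process (true line y = x, 100 observations, 10 contaminated) and all variable and scalar names are hypothetical and are not the data behind the slides.

```stata
* Sketch: OLS slope under three kinds of contamination
* (hypothetical design: true line y = x, n = 100, 10 contaminated points)
clear
set seed 2016
set obs 100
generate x = 10*runiform()
generate e = rnormal()
generate y = x + e

quietly regress y x
scalar b_clean = _b[x]

* vertical points: outlying in y only
generate y_vert = y
replace y_vert = y + 20 in 1/10
quietly regress y_vert x
scalar b_vert = _b[x]

* bad leverage points: outlying in x and far from the true line
generate x_bad = x
generate y_bad = y
replace x_bad = 30 in 1/10
replace y_bad = 0 in 1/10
quietly regress y_bad x_bad
scalar b_bad = _b[x_bad]

* good leverage points: outlying in x but on the true line
generate x_good = x
generate y_good = y
replace x_good = 30 in 1/10
replace y_good = 30 + e in 1/10
quietly regress y_good x_good
scalar b_good = _b[x_good]

display "slope, clean data:           " %6.3f b_clean
display "slope, vertical points:      " %6.3f b_vert
display "slope, bad leverage points:  " %6.3f b_bad
display "slope, good leverage points: " %6.3f b_good
```

In this design the bad leverage points pull the slope far away from 1, while the good leverage points leave it close to 1 and shrink its standard error, which is the pattern the t-statistics on the slides suggest.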
Figure: four panels plotted against r/σ: two panels showing ρ(r/σ) and two panels showing ψ(r/σ).
Figure: ρ0(u) against u for three tuning constants:
c = 1.56: eff = 28%, BP = 50%
c = 3.42: eff = 85%, BP = 20%
c = 4.68: eff = 95%, BP = 10%
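The tuning constants listed above are close to those usually quoted for Tukey's bisquare (biweight) function, and the robreg output in this deck reports a "Bisquare k". Assuming ρ0 is the bisquare, one common parameterisation and its derivative ψ are written out below. The normalisation used on the slide may differ.

```latex
\rho_0(u) =
\begin{cases}
\dfrac{c^2}{6}\left[1 - \left(1 - \left(\dfrac{u}{c}\right)^2\right)^3\right] & \text{if } |u| \le c \\[2ex]
\dfrac{c^2}{6} & \text{if } |u| > c
\end{cases}
\qquad
\psi(u) = \rho_0'(u) =
\begin{cases}
u\left(1 - \left(\dfrac{u}{c}\right)^2\right)^2 & \text{if } |u| \le c \\[1ex]
0 & \text{if } |u| > c
\end{cases}
```

A smaller c makes ρ0 flatten out earlier, so large residuals stop contributing sooner: this raises the breakdown point (BP) and lowers the Gaussian efficiency (eff), which is the trade-off the three curves illustrate.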
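. reg perdiabet percphys perco

      Source |       SS       df       MS              Number of obs =     500
-------------+------------------------------           F(  2,   497) =  411.34
       Model |  2171.32424     2  1085.66212           Prob > F      =  0.0000
    Residual |  1311.76184   497  2.63935983           R-squared     =  0.6234
-------------+------------------------------           Adj R-squared =  0.6219
       Total |  3483.08608   499  6.98013242           Root MSE      =  1.6246

------------------------------------------------------------------------------
   perdiabet |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    percphys |   .1718203   .0190935     9.00   0.000     .1343063    .2093343
      percob |   .3003667   .0232706    12.91   0.000     .2546458    .3460876
       _cons |
------------------------------------------------------------------------------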
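. robreg s perdiabet percphys perco, hausman nodots

enumerating 50 candidates ... done
refining 2 best candidates ... done

S-Regression (28.7% efficiency)                 Number of obs   =        500
                                                Subsamples      =         50
                                                Breakdown point =         50
                                                Bisquare k      =   1.547645
                                                Scale estimate  =  1.5505239

------------------------------------------------------------------------------
             |               Robust
   perdiabet |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    percphys |   .2316495   .0239441     9.67   0.000     .1847199     .278579
      percob |   .1825398   .0320588     5.69   0.000     .1197057     .245374
       _cons |
------------------------------------------------------------------------------
Hausman test of S against LS:   chi2(2) = 12.104611   Prob > chi2 = 0.0024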
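MM-Regression (85% efficiency)                  Number of obs   =        500
                                                Subsamples      =         50
                                                Breakdown point =         50
                                                M-estimate: k   =  3.4436898
                                                S-estimate: k   =   1.547645
                                                Scale estimate  =  1.5505239
                                                Robust R2 (w)   =  .68991266
                                                Robust R2 (rho) =  .44064435

------------------------------------------------------------------------------
             |               Robust
   perdiabet |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    percphys |   .1808932   .0228649     7.91   0.000     .1360789    .2257075
      percob |   .2659837   .0318083     8.36   0.000     .2036406    .3283269
       _cons |
------------------------------------------------------------------------------
Hausman test of MM against S:   chi2(2) = 8.4158528   Prob > chi2 = 0.0149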
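MM-Regression (60% efficiency)                  Number of obs   =        500
                                                Subsamples      =         50
                                                Breakdown point =         50
                                                M-estimate: k   =  2.3666372
                                                S-estimate: k   =   1.547645
                                                Scale estimate  =  1.5505239
                                                Robust R2 (w)   =  .76053069
                                                Robust R2 (rho) =  .34941524

------------------------------------------------------------------------------
             |               Robust
   perdiabet |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    percphys |   .2026506   .0265639     7.63   0.000     .1505864    .2547149
      percob |   .2308929   .0350134     6.59   0.000     .1622679    .2995178
       _cons |
------------------------------------------------------------------------------
Hausman test of MM against S:   chi2(2) = 4.5984588   Prob > chi2 = 0.1003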
Figure: x1 against x2 (both axes from 5 to 20), with points marked as None, SD, or AO & SD.
Figure: standardized residuals against ASO, with points labelled by state: South Dakota, Pennsylvania, Mississippi, Georgia, Nebraska, Montana, Kentucky, South Carolina, Alabama, California, Utah, Washington, Colorado, New York, Virginia.
Figure: d0 against b0, with individual points labelled by name (for example Allison, Master. Hudson Trevor; Ismay, Mr. Joseph Bruce; Straus, Mrs. Isidor (Rosalie Ida Blun); Abbott, Mr. Rossmore Edward).
Figure: d0 against b0, with the following points labelled: Allison, Mr. Hudson Joshua Creighton; Allison, Mrs. Hudson J C (Bessie Waldo Daniels); Cleaver, Miss. Alice; Daniels, Miss. Sarah; Brown, Miss. Amelia "Mildred"; Swane, Mr. George; Allison, Master. Hudson Trevor; Allison, Miss. Helen Loraine.

Mr H.J.C. Allison (30)      Head
Mrs H.J.C. Allison (25)     Spouse
Miss H.L. Allison (2)       Daughter
Master H.T. Allison (1)     Son
Miss A.C. Cleaver (22)      Nurse
Miss S. Daniels (33)        Nurse
Miss A.M. Brown (18)        Cook
                            Chauffeur
$$\frac{P_{0.9}(\{w \mid \dots\}) - P_{0.1}(\{w \mid \dots\})}{P_{0.9}(\{w \mid \dots\}) + P_{0.1}(\{w \mid \dots\})}$$
Boxplot