Inference with Arbitrary Clustering
Fabrizio Colella,∗ Rafael Lalive,∗ Seyhun O. Sakalli,∗ Mathias Thoenig∗
Swiss Stata Users Group Meeting, October 2018
∗University of Lausanne
Inference with Arbitrary Clustering Fabrizio Colella, Rafael Lalive, - - PowerPoint PPT Presentation
Inference with Arbitrary Clustering Fabrizio Colella, Rafael Lalive, Seyhun O. Sakalli, Mathias Thoenig Swiss Stata Users Group Meeting, October 2018 University of Lausanne Introduction Motivation A tremendous surge of
∗University of Lausanne
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
n
T
n
T
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Go
Go
Illustration
Go
Go
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Data generating process: Bartlett kernel Unit: U.S. towns U.S. counties Sample size: N=101 N=1001 N=3141 (1) (2) (3) Spatial correlation Correction Endogeneity Estimator Null-rejection rate Panel A: Cross section, t = 1 OLS 5.9% 5.0% 5.0%
5.6% 5.1% 5.2%
37.8% 50.2% 28.2%
33.4% 48.3% 26.5%
16.8% 7.2% 5.6%
16.7% 8.4% 5.5% Panel B: Panel, t = 5 OLS 5.8% 5.1% 5.3%
5.3% 5.0% 4.6%
39.1% 46.1% 17.9%
37.3% 44.3% 15.5%
19.4% 11.2% 10.1%
19.0% 11.1% 9.6% Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
.15 .3 .45 .6 .05 Null−rejection rate 2 4 6 8 10 12 14 16 18 20 Number of cities per state Not corrected Corrected
.15 .3 .45 .6 .05 Null−rejection rate 2 4 6 8 10 12 14 16 18 20 Number of cities per state Not corrected Corrected
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
.15 .3 .45 .6 .05 Null−rejection rate 2 4 6 8 10 12 14 16 18 20 Number of cities per state Not corrected Corrected
.15 .3 .45 .6 .05 Null−rejection rate 2 4 6 8 10 12 14 16 18 20 Number of cities per state Not corrected Corrected
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Data generating process: First-degree friends Unit: Top of the distribution Random sample Sample size: N=1000 N=2500 N=1000 N=2500 (1) (2) (3) (4) Network correlation Correction Endogeneity Estimator Null-rejection rate OLS 5.1% 4.7% 4.7% 5.1%
5.3% 4.9% 5.4% 4.7%
64.9% 59.0% 26.9% 36.2%
63.0% 58.2% 25.4% 35.4%
13.2% 9.2% 7.5% 8.1%
13.4% 9.7% 7.2% 8.4% Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
Colella, Lalive, Sakalli, and Thoenig Inference with Arbitrary Clustering
1Y and X1 can be any number. Given that Y and X1 are iid, statistical theory predicts that if we regress Y on X1,
the null hypothesis that the β coefficient is equal to 0 at a 5% level, will be rejected with 5% probability. Back
N
Ni
Back
Endogeneity Panel dimension Back
i + ςεYi)
Back
Legend Idiosyncratic component
0.01 - 1.5 1.51 - 3.0 3.01 - 5.0 5.01 - 10.19
Correlated Shocks Back
Legend Total common shocks
0.01 - 3.0 3.01 - 6.5 6.51 - 13.0 13.01 - 23.41
Back
We introduce endogeneity to the model by adding an endogenous variable, End, as a regressor: Yi = α1i + δ1X1i + δ2Endi + µi (2) We generate a random variable IV , which is independent and identically distributed (iid) to Y and X1: IV = IV + ǫIV , ǫIV ∼ N(0, σǫIV ); We define Endi as: Endi = F(X1i, IVi) + ǫYi We introduce correlation to the 2SLS model by adding the sum of common random shocks, ςεIVi , to the variable IV and computing End as a function of correlated variables and common shocks: ˆ IVi = IVi + ςεIVi ˆ Endi = F( ˆ X1i, ˆ IV i) + ǫYi + ςεYi
Back
Back