Understanding Persistence Morgan Kelly 22nd October 2020 Long Run - - PowerPoint PPT Presentation

understanding persistence
SMART_READER_LITE
LIVE PREVIEW

Understanding Persistence Morgan Kelly 22nd October 2020 Long Run - - PowerPoint PPT Presentation

Understanding Persistence Morgan Kelly 22nd October 2020 Long Run Impact of History European mortality determines quality of institutions. Spanish Mita still affects Peruvian living standards. Common law countries have better judicial


slide-1
SLIDE 1

Understanding Persistence

Morgan Kelly 22nd October 2020

slide-2
SLIDE 2

Long Run Impact of History

◮ European mortality determines quality of institutions. ◮ Spanish Mita still affects Peruvian living standards. ◮ Common law countries have better judicial systems. ◮ Towns with pogroms after the Black Death gave more support to the Nazis. ◮ Genetic diversity determines modern income. ◮ Plough adoption determines women’s rights. ◮ Potato determined city growth. ◮ Slave trade determines incomes and levels of mistrust in modern Africa.

slide-3
SLIDE 3

Objections

Simplistic monocausal explanations of complex phenomena. What is mechanism? p hacking and publication bias. Answers in search of questions. Reversals. All irrelevant because t statistics huge. Examine 25 studies here. 14 report a t above 3.3 (p = 10−3) and six above 5.1 (p = 10−7).

slide-4
SLIDE 4

Spatial Regressions.

However, maybe these results are just too good to be true. Persistence regressions are spatial regressions. Tobler’s First Law of Geography. “Everything is related to everything else. But near things are more related than distant things.” Spatial data highly autocorrelated and, moreover, show strong spatial trends.

slide-5
SLIDE 5

Dangers of spatial autocorrelation.

◮ Just like time series, it is easy to fit spurious trends. ◮ Because observations resemble not only immediate neighbours but distant ones as well, many observations contribute little to increasing the precision of coefficient estimates: standard errors can be much larger than you think.

slide-6
SLIDE 6

25 papers. American Economic Review (10), Quarterly Journal of Economics (8), and Econometrica (2), with one each taken from the American Economic Journal: Macroeconomics, Journal of Political Economy, Journal of Politics, Review of Economics and Statistics, and Science Chosen to include IV, diff-in-diff, regression discontinuity, and non-linear regressions. Mix “Attitudes and Institutions” with “Genetics and Geology”.

slide-7
SLIDE 7

What Paper is NOT About

◮ Not concerned with issues of data construction or estimation; plausibility of persistence story, alternative explanations, quality of historical scholarship (this is high in most cases). ◮ Not interested in individual papers except insofar as they illustrate the general contours of literature. ◮ Not concerned with potential statistical issues with

  • riginal studies: follow their specification completely.

◮ Not out to “disprove” anyone’s results or provide a two sentence critique. ◮ Not implying that because regression examined here may be problematic, others in paper are.

slide-8
SLIDE 8

Robustness checks against trend fitting.

Persistence regressions start with single explanatory variable: misspecified. Add extra control variables to proxy for omitted confounders: continent, latitude, distance from sea etc. Propose three simple robustness checks here.

  • 1. Add dummy for World Bank Regions for global studies.
  • 2. Add longitude-latitude for studies on a smaller scale.
  • 3. Remove regions, usually with extremely high or low

values.

slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11
slide-12
SLIDE 12

Fitting directional trend: Dell, Mita.

  • 5.0

5.5 6.0 6.5 7.0 14 15

Latitude Household Consumption

slide-13
SLIDE 13
  • 1.00

1.00 0.53 0.51 0.51 0.50 0.47 0.46 0.45 0.41 0.41 0.38 0.35 0.34 0.33 0.32 0.31 0.31 0.30 0.25 0.24 0.23 0.17 0.13 0.10

Caicedo, Mission. Alsan, Tsetse. Michalopoulos, Pre−Colonial. Michalopoulos, Scramble. Dell, Mita. Ambrus, Cholera. Becker, Anti−Semitism. Voigtlaender, Persecution. La Porta, Law Finance. Galor, Time Preference. Alesina, Plough. Acemoglu, Colonial Origins. Spolaore, Diffusion. Ashraf, Malthusian. Acemoglu, Reversal. Schulz, Kinship. Acharya, American Slavery. Becker, Weber. Ashraf, Out of Africa. Nunn, Slavery. Banerjee, Land Tenure. Nunn, Ruggedness. Nunn, Potato. Nunn, Mistrust. Comin, 1000 BC.

0.00 0.25 0.50 0.75 1.00

Coefficients after applying controls for WB Regions, directional trends or extreme areas, relative to original values.

Change in effect sizes after robustness checks.

slide-14
SLIDE 14
slide-15
SLIDE 15

Spatially correlated standard errors.

Usual talisman against spatial autocorrelation of residuals is to cluster at some arbitrary geographical level. Not a great idea. Hard to decide on clustering (Abadie et al) and different assumptions can lead to very different standard errors. In order for standard errors to be consistent, residuals cannot be correlated between clusters. (Very large clusters consistent but give wildly varying standard error estimates in practice.)

slide-16
SLIDE 16

HC Clustered

0.0 0.1 0.2 0.3 0.4 5 10 15 20

Proportion significant at 5 per cent

African Ethnic Groups.

HC Clustered

0.0 0.1 0.2 0.3 0.4 5 10 15 20

Correlation Range of Noise. Proportion significant at 5 per cent

US Cities.

Spatial correlation can cause marked inflation of t statistics even with clustered standard errors.

Spatial noise regressions.

slide-17
SLIDE 17

HAC Standard Errors

Var

  • ˆ

β

  • =
  • X

′X

−1 X

′ΩX

  • X

′X

−1 = 1

N X

′X

−1 Φ 1

N X

′X

−1 (1) Spectral approach, pioneered by Conley, is to estimate Φ as a weighted sum of cross products ˆ Φ = 1 N

  • si,sj

K (si, sj) xsi ˆ usix

sj ˆ

usj (2) where K (si, sj) is a weighting kernel that must be chosen. Currently no automatic, data-driven procedure for choosing elements of K. Conley: rectangular kernel. Widely differing standard errors as assumed cutoff varies.

slide-18
SLIDE 18

Proposed Approach.

Decompose the kernel into a systematic spatial component and idiosyncratic noise K (si, sj) = ρ C (si, sj) + (1 − ρ) 1ij (3) where the indicator 1ij = 1 when i = j and 0 otherwise, and 0 ≤ ρ ≤ 1. The structure parameter ρ reflects the ratio of spatial signal to noise in the residuals. ρ = 0 gives standard heteroskedasticity consistent standard errors.

slide-19
SLIDE 19

Need to choose kernel C. Workhorse of geostatistics is Matérn function. Correlation between sites si, sj at distance h apart is M (h; θ, κ) = 21−κ Γ (κ) h θ κ Bκ h θ

  • (κ > 0, θ > 0)

(4) θ is range parameter, κ is smoothness. κ = 0.5, exponential falloff of correlation, κ → ∞ Gaussian.

slide-20
SLIDE 20

Matern Function θ = 1.

0.00 0.25 0.50 0.75 1.00 2 4 6

Distance h.

Smoothness

κ = 0.5 κ = 1.5 κ = 4.0

slide-21
SLIDE 21

We have then a weighting kernel giving the correlation between the residuals at every location K (si, sj) = ρ M (h; θ, κ) + (1 − ρ) 1ij (5) whose three parameters θ, ρ and κ can be estimated by maximum likelihood from the estimated residuals. K is then substituted into (2) to estimate Φ. Potential problems. 1) Residuals do not obey a Matern function. 2) Economic locations, which is what matters, and not the same as geographical ones, which is what we get to observe. Simulations in Appendix show limited downward bias in both situations.

slide-22
SLIDE 22
  • 1.0

1.0 1.0 1.0 1.0 1.0 1.1 1.3 1.3 1.3 1.4 1.4 1.4 1.5 1.5 1.5 1.6 1.6 1.9 2.0 2.0 2.5 2.5 2.5 3.1

Low Spatial Correlation. Nunn, Slavery. La Porta, Law Finance. Michalopoulos, Scramble. Ambrus, Cholera. Nunn, Ruggedness. Galor, Time Preference. Acemoglu, Reversal. Michalopoulos, Pre−Colonial. Acemoglu, Colonial Origins. Nunn, Mistrust. Acharya, American Slavery. Becker, Anti−Semitism. Voigtlaender, Persecution. Alesina, Plough. Alsan, Tsetse. Dell, Mita. Banerjee, Land Tenure. Ashraf, Malthusian. Comin, 1000 BC. Schulz, Kinship. Ashraf, Out of Africa. Nunn, Potato. Becker, Weber. Spolaore, Diffusion. Caicedo, Mission.

1 2 3 4

SEs estimated with exponential kernel relative to original values. Before spatial robustness checks.

Change in standard errors after HAC adjustment.

slide-23
SLIDE 23

t Statistics

Coefficients lower and standard errors higher than suspected. Increasing scepticism of “significance” although this seems in large measure to drive this literature. A result that is interesting only because it is “significant” is not an interesting result to begin with.

slide-24
SLIDE 24

0.5 0.3 0.7 1.0 1.3 1.7 0.7 1.5 1.2 1.0 1.6 1.1 1.4 1.0 0.3 1.4 0.4 1.4 0.9 1.5 0.3 1.5 1.1 1.6 1.4 2.1 2.2 2.3 2.5 2.6 2.6 2.6 2.8 3.0 3.1 3.1 3.3 3.4 3.4 3.5 3.9 3.9 4.0 4.3 5.1 5.4 5.7 6.0 8.5 9.6

  • Nunn, Ruggedness.

Banerjee, Land Tenure. Caicedo, Mission. Voigtlaender, Persecution. Michalopoulos, Pre−Colonial. Alsan, Tsetse. Acharya, American Slavery. Michalopoulos, Scramble. Ambrus, Cholera. Becker, Anti−Semitism. La Porta, Law Finance. Acemoglu, Reversal. Galor, Time Preference. Dell, Mita. Comin, 1000 BC. Alesina, Plough. Nunn, Potato. Acemoglu, Colonial Origins. Schulz, Kinship. Ashraf, Out of Africa. Nunn, Mistrust. Nunn, Slavery. Ashraf, Malthusian. Spolaore, Diffusion. Becker, Weber.

2 4 6

  • Adjusted t

Reported t Values have been truncated at 6.

Change in t statistics after robustness checks and standard error adjustments.

slide-25
SLIDE 25

Detecting problems: Placebo test.

If your regression can explain everything, perhaps there is a problem. Generate artificial dependent variables with same spatial trend and correlation structure as original variable. ˆ y = f (Lon, Lat) + ˆ e Simulate noise variables ˜ e with same spatial correlation structure as ˆ e. Run regressions with synthetic dependent variables ˜ y = ˆ y + ˜ e

slide-26
SLIDE 26

0.00 0.01 0.01 0.03 0.03 0.03 0.04 0.06 0.12 0.14 0.10 0.09 0.12 0.24 0.20 0.20 0.33 0.51 0.48 0.56 0.47 0.83 0.88 0.05 0.09 0.11 0.12 0.14 0.16 0.22 0.22 0.30 0.33 0.34 0.37 0.45 0.46 0.49 0.50 0.52 0.67 0.70 0.74 0.75 0.91 0.94

  • LaPorta, Law Finance.

Voigtlaender, Persecution. Michalopoulos, Scramble. Ambrus, Cholera. Acharya, Slavery. Galor, Time Preference. Michalopoulos, Pre−Colonial. Alsan, Tsetse. Nunn, Ruggedness. Nunn, Mistrust. Banerjee, Land Tenure. Acemoglu, Reversal. Alesina, Plough. Acemoglu, Colonial Origins. Nunn, Slavery. Dell, Mita. Caicedo, Mission. Ashraf, Malthusian. Schulz, Kinship. Comin, 1000BC. Ashraf, Out of Africa. Becker, Weber. Spolaore, Diffusion.

0.0 0.5 1.0

  • t=2

t=3 Proportion of synthetic regressions returning t statistics of 2 or 3.

Ability of Persistence Variables to Explain Spatial Noise.

slide-27
SLIDE 27
  • 40

60 80 100 25 50 75 100

Protestant Literate Protestantism and Literacy

48 50 52 54 56 10 15 20

Literacy by location.

48 50 52 54 56 10 15 20

Protestantism by location.

slide-28
SLIDE 28

Conclusions

Strong claims require strong evidence and, for persistence regressions, large t values do not constitute strong evidence. Numerators of t statistics inflated by fitting spatial trends. Denominators underestimated by failing to account for spatial correlation in residuals. Ultimately, strong evidence comes down to historical scholarship.