Linear Panel Data Models Michael R. Roberts Department of Finance - PowerPoint PPT Presentation

First Difference Estimators Fixed Effects Estimation Random Effects (RE) Estimation Dynamic Linear Panel Data Models (DLPDM) Linear Panel Data Models Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania October 5, 2009 Michael R. Roberts Linear Panel Data Models 1/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Example Link between crime and unemployment. Data for 46 cities in 1982 and 1987. Consider CS regression using 1987 data � R 2 = 0 . 033 crimeRate = 128 . 38 − 4 . 16 unem , (20 . 76) (3 . 42) Higher unemployment decreases the crime rate (insignificantly)?!?!?! Problem = omitted variables Solution = add more variables (age distribution, gender distribution, education levels, law enforcement, etc.) Use like lagged crime rate to control for unobservables Michael R. Roberts Linear Panel Data Models 2/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Panel Data Approach Panel data approach to unobserved factors. 2 types: constant across time 1 vary across time 2 y it = β 0 + δ 0 d 2 t + β 1 x it + a i + u it , t = 1 , 2 where d 2 = 1 when t = 2 and 0 when t = 1 Intercept for period 1 is β 0 , for period 2 β 0 + δ 0 Allowing intercept to change over time is important to capture secular trends. a i captures all variables that are constant over time but different across cross-sectional units. (a.k.a. unobserved effect , unobserved heterogeneity ) u it is idiosyncratic error or time-varying error and represents unobserved factors that change over time and effect y it Michael R. Roberts Linear Panel Data Models 3/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Example (Cont) Panel approach to link between crime and unemployment. crimeRate it = β 0 + δ 0 d 78 t + β 1 unem it + a i + u it where d 87 = 1 if year is 1987, 0 otherwise, and a i is an unobserved city effect that doesn’t change over time or are roughly constant over the 5-year window. Examples: Geographical features of city 1 Demographics (race, age, education) 2 Crime reporting methods 3 Michael R. Roberts Linear Panel Data Models 4/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Pooled OLS Estimation How do we estimate β 1 on the variable of interest? Pooled OLS. Ignore a i . But we have to assume that a i is ⊥ to unem since it would fall in the error term. crimeRate it = β 0 + δ 0 d 78 t + β 1 unem it + v it where v it = a i + u it . SRF: � R 2 = 0 . 012 crimeRate = 93 . 42 + 7 . 94 d 87 + 0 . 427 unem , (12 . 74) (7 . 98) (1 . 188) Positive coef on unem but insignificant Michael R. Roberts Linear Panel Data Models 5/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel First Difference Estimation Difference the regression equation across time to get rid of fixed effect and estimate differenced equation via OLS. y i 2 = ( β 0 + δ 0 ) + β 1 x i 2 + a i + u i 2 , ( t = 2) y i 1 = β 0 + β 1 x i 1 + a i + u i 1 , ( t = 1) Differencing yields ∆ y i = δ 0 + β 1 ∆ x i + ∆ u i where ∆ denotes period 2 minus period 1. Key assumption: ∆ x i ⊥ ∆ u i , which holds if at each time t , u it ⊥ x it ∀ t . (i.e., strict exogeneity). This rules out lagged dependent variables. Key assumption: ∆ x i must vary across some i Michael R. Roberts Linear Panel Data Models 6/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel First Difference Example Reconsider crime example: � R 2 = 0 . 012 crimeRate = 15 . 40 + 2 . 22∆ unem , (4 . 70) (0 . 88) Now positive and significant effect of unemployment on crime Intercept = ⇒ crime expected to increase even if unemployment doesn’t change! This reflects secular increase in crime rate from 1982 to 1987 Michael R. Roberts Linear Panel Data Models 7/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Practical Issues Differencing can really reduce variation in x x may vary greatly in cross-section but ∆ x may not Less variation in explanatory variable means larger standard errors on corresponding coefficient Can combat by either Increasing size of cross-section (if possible) 1 Taking longer differences (over several periods as opposed to adjacent 2 periods) Michael R. Roberts Linear Panel Data Models 8/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Example Michigan job training program on worker productivity of manufacturing firms in 1987 and 1988 scrap i t = β 0 + δ 0 y 88 t + β 1 grant i t + a i + u i t where i , t index firm-year, scrap = scrap rate = # of items per 100 that must be tossed due to defects, grant = 1 if firm i in year t received job training grant. a i is firm fixed effect and captures average employee ability, capital, and managerial skill...things constant over 2-year period. Difference to zap a i and run 1st difference (FD) regression N = 54 , R 2 = 0 . 022 ∆ � scrap = − 0 . 564 − 0 . 739∆ grant , (0 . 405) (0 . 683) Job training grant lowered scrap rate but insignificantly Michael R. Roberts Linear Panel Data Models 9/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Example (Cont) Is level-level model correct? � N = 54 , R 2 = 0 . 067 ∆ log ( scrap ) = − 0 . 57 − 0 . 317∆ grant , (0 . 097) (0 . 164) Job training grant lowered scrap rate by 31.7% (or 27.2% = exp(-0.317) - 1). Pooled OLS estimate implies insignificant 5.7% reduction Large difference between pooled OLS and first difference suggests that firms with lower-ability workers (low a i ) are more likely to receive a grant. I.e., Cov ( a i , grant it ) < 0. Pooled OLS ignores a i and we get a downward omitted variables bias Michael R. Roberts Linear Panel Data Models 10/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Program Evaluation Problem Let y = outcome variable, prog = program participation dummy. y it = β 0 + δ 0 d 2 t + β 1 prog it + a i + u it Difference regression ∆ y it = δ 0 + β 1 ∆ prog it + ∆ u it If program participation only occurs in the 2nd period then OLS estimator of β 1 in the differenced equation is just: ˆ β 1 = ∆ y treat − ∆ y control (1) Intuition: ∆ prog it = prog i 2 since participation in 2nd period only. (i.e., ∆ prog it 1 is just an indicator identify the treatment group) Omitted group is non-participants. 2 So β 1 measures the average outcome for the participants relative to 3 the average outcome of the nonparticipants Michael R. Roberts Linear Panel Data Models 11/56

First Difference Estimators Two Period Model Fixed Effects Estimation Policy Analysis Random Effects (RE) Estimation Three Period Panel Dynamic Linear Panel Data Models (DLPDM) General Period Panel Program Evaluation Problem (Cont) Note: This is just a difference-in-differences (dif-in-dif) estimator “Equivalent” model: = β 0 + δ 0 d 2 t + β 1 prog it + β 2 d 2 t × prog it + a i + u it y it where β 2 has same interpretation as β 1 from above. If program participation can take place in both periods, we can’t write the estimator as in (1) but it has the same interpretation: change in average value of y due to program participation Adding additional time-varying controls poses no problem. Just difference them as well. This allows us to control for variables that might be correlated with program designation. y it = β 0 + δ 0 d 2 t + β 1 prog it + γ ′ X it + a i + u it Michael R. Roberts Linear Panel Data Models 12/56

Linear Panel Data Models Michael R. Roberts Department of Finance - PowerPoint PPT Presentation

First Difference Estimators Fixed Effects Estimation Random Effects (RE) Estimation Dynamic Linear Panel Data Models (DLPDM) Linear Panel Data Models Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania

Introduction to Data Science: Logistic 0 1 1 according to a data fit criterion. account

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

PS4000 Assembly Guide Part List: A. 1 x Left Panel B. 1 x Right Panel C. 1 x Bottom Panel

Limitations of linear models Richard Erickson Instructor DataCamp Generalized Linear Models in

Workshop 3 Building from Linear Models to Generalised Linear Models Part 2: GLMs 2 2 What are

Functional Linear Models 1 66 / 181 Functional Linear Models Statistical Models So far we have

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

Workshop 2 Building from Linear Models to Generalised Linear Models Part 1: understanding LMs 2

Overview of logistic regression Richard Erickson Instructor DataCamp Generalized Linear Models

Linear Classifiers: Expressiveness Machine Learning 1 Lecture outline Linear models:

Introduction to the R Statistical Computing Environment Linear and Generalized Linear Models in R

SEPG 2007 SEPG 2007 SPIN Panel SPIN Panel SEPG2007 - SPIN Panel Session SEPG2007 - SPIN Panel

FEC403EN Extinguishing Control Panel FEC403EN Extinguishing panel Table of contents Panel

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

ECON 950 Winter 2020 Prof. James MacKinnon 9. Going Beyond Linear Models Linear regression,

Applications of Random Forest Algorithm Rosie Zou 1 Matthias Schonlau, Ph.D. 2 1 Department of

Type safety and exception handling Gabriele Keller Ron Vanderfeesten Overview higher &

Fast and accurate Bessel function computation John Harrison, Intel Corporation ARITH-19 Portland,

Sensors Approximate computing approximate edge detection Machine learning hidden

JUST THE MATHS SLIDES NUMBER 14.3 PARTIAL DIFFERENTIATION 3 (Small increments and small

sr t rrr s

A functor is a generic module A new form of parametric polymorphism: lambda and type-lambda

Troubleshooting Career Technical Education (CT (CTE) Reports Overv ervie iew CTE reports are

Sambuz

Useful Links

Newsletter

Mail Us

Linear Panel Data Models Michael R. Roberts Department of Finance - PowerPoint PPT Presentation

First Difference Estimators Fixed Effects Estimation Random Effects (RE) Estimation Dynamic Linear Panel Data Models (DLPDM) Linear Panel Data Models Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania

Introduction to Data Science: Logistic 0 1 1 according to a data fit criterion. account

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

PS4000 Assembly Guide Part List: A. 1 x Left Panel B. 1 x Right Panel C. 1 x Bottom Panel

Limitations of linear models Richard Erickson Instructor DataCamp Generalized Linear Models in

Workshop 3 Building from Linear Models to Generalised Linear Models Part 2: GLMs 2 2 What are

Functional Linear Models 1 66 / 181 Functional Linear Models Statistical Models So far we have

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

Workshop 2 Building from Linear Models to Generalised Linear Models Part 1: understanding LMs 2

Overview of logistic regression Richard Erickson Instructor DataCamp Generalized Linear Models

Linear Classifiers: Expressiveness Machine Learning 1 Lecture outline Linear models:

Introduction to the R Statistical Computing Environment Linear and Generalized Linear Models in R

SEPG 2007 SEPG 2007 SPIN Panel SPIN Panel SEPG2007 - SPIN Panel Session SEPG2007 - SPIN Panel

FEC403EN Extinguishing Control Panel FEC403EN Extinguishing panel Table of contents Panel

Graphics 2014 Linear Algebra II Linear Maps &amp; Matrices Linear Maps &amp; Matrices CORE

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

ECON 950 Winter 2020 Prof. James MacKinnon 9. Going Beyond Linear Models Linear regression,

Applications of Random Forest Algorithm Rosie Zou 1 Matthias Schonlau, Ph.D. 2 1 Department of

Type safety and exception handling Gabriele Keller Ron Vanderfeesten Overview higher &amp;

Fast and accurate Bessel function computation John Harrison, Intel Corporation ARITH-19 Portland,

Sensors Approximate computing approximate edge detection Machine learning hidden

JUST THE MATHS SLIDES NUMBER 14.3 PARTIAL DIFFERENTIATION 3 (Small increments and small

sr t rrr s

A functor is a generic module A new form of parametric polymorphism: lambda and type-lambda

Troubleshooting Career Technical Education (CT (CTE) Reports Overv ervie iew CTE reports are

Sambuz

Useful Links

Newsletter

Mail Us

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Type safety and exception handling Gabriele Keller Ron Vanderfeesten Overview higher &