Contents 1 Causal Inference and Predictive Comparison 2 1.1 How - PDF document

Causal Inference in Observational Studies Contents 1 Causal Inference and Predictive Comparison 2 1.1 How Predictive Comparison Can Mislead . . . . . . . . . . . . . 2 1.2 Adding Predictors as a Solution . . . . . . . . . . . . . . . . . . . 2 1.3 Omitted Variable Bias . . . . . . . . . . . . . . . . . . . . . . . . 3 2 Causal Inference – Problems and Solutions 3 2.1 The Fundamental Problem . . . . . . . . . . . . . . . . . . . . . 3 2.2 Ways of Getting Around the Problem . . . . . . . . . . . . . . . 4 2.3 Randomization . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 2.4 Controlling for a Pre-Treatment Predictor . . . . . . . . . . . . . 5 2.5 The assumption of no interference between units . . . . . . . . . 8 3 Treatment interactions and post-stratification 9 3.1 Post-stratification . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 4 Observational Studies 12 4.1 Electric Company example . . . . . . . . . . . . . . . . . . . . . 12 4.2 Assumption of ignorable treatment assignment . . . . . . . . . . 13 4.3 Judging the reasonableness of regression as a modeling approach, assuming ignorability . . . . . . . . . . . . . . . . . . . . . . . . . 14

1 Causal Inference and Predictive Comparison Causal Inference and Predictive Comparison • We have been using regression in the predictive sense, to determine what values of Y tend to be associated with particular values of X in a given hypothetical “superpopulation” modeled with random variables and prob- ability distributions. • In causal inference, we attempt to answer a fundamentally different ques- tion, namely, what would happen if different treatments had been applied to the same units . 1.1 How Predictive Comparison Can Mislead Examples Example 1 (Example 1) . • Suppose a medical treatment is of no value. It has no effect on any individual. • However, in our society, healthier people are more likely to receive the treatment. • What would/could happen? (C.P.) Example 2 (Example 2) . • Suppose a medical treatment has positive value. It increases IQ on any individual. • However, in our society, lower IQ people are more likely to receive the treatment. • What would/could happen? (C.P.) 1.2 Adding Predictors as a Solution Adding Predictors as a Solution • In the preceding two examples, there was a solution, i.e., to compare treatments and controls conditional on previous health status. Intuitively, we compare current health status across treatment and control groups only within each previous health strategy. • Another alternative is to include treatment status and previous health status as predictors in a regression equation. 2

• Gelman and Hill assert that “in general, causal effects can be estimated using regression if the model includes all confounding covariates and if the model is correct.” 1.3 Omitted Variable Bias Omitted Variable Bias Suppose the “correct” specification for confounding covariate x i is y i = β 0 + β 1 T i + β 2 x i + ǫ i (1) Moreover, suppose that the regression for predicting x i from the treatment is x i = γ 0 + γ 1 T i + ν i Omitted Variable Bias – 2 Substituting, we get = β 0 + β 1 T i + β 2 ( γ 0 + γ 1 T i + ν i ) + ǫ i y i = β 0 + β 2 γ 0 + β 1 T i + β 2 γ 1 T i + ( ǫ i + β 2 ν i ) = ( β 0 + β 2 γ 0 ) + ( β 1 + β 2 γ 1 ) T i + ( β 2 γ 1 T i ) (2) Note that this can be written as y i = β ∗ 0 + β ∗ 1 T i + ǫ ∗ i where β ∗ 1 = β 1 + β 2 γ 1 2 Causal Inference – Problems and Solutions 2.1 The Fundamental Problem The Fundamental Problem 3

• The potential outcomes of y 1 i and y 0 i under T are the values that the i th unit would have demonstrated had level 1 or level 0 of the treatment actually been received by that unit. • In general, of course, the i th unit (or, for simplicity, individual i ) will not receive both treatments so either y 1 i or y 2 i is a counterfactual and will not be observed. We can think of the counterfactuals as “missing data.” 2.2 Ways of Getting Around the Problem Possible Solutions We can think of causal inference as a prediction of what would happen to unit i if T i = 1 or T i = 0. There are 3 basic strategies: 1. Obtain close substitutes for the potential outcomes. Examples: (a) T=1 one day, T=0 another (b) Break plastic into two pieces and test simultaneously (c) Measure new diet using previous weight as proxy for y 0 i . 2. Randomize. Since we cannot compare on identical units, compare on similar units. In the long run, randomization confers similarity. 3. Do a statistical adjustment. Predict with a more complex model, or block to achieve similarity. 2.3 Randomization Randomization In a completely randomized experiment, we can estimate the average treatment effect easily as average treatment effect = avg ( y 1 i − y 0 i ) The standard test on means can be applied. Of course, issues of external validity apply too. The results are relevant only for the population from which the sample was taken. 4

An Electric Example Example 3 (Electric Company Study) . • 4 grades, 2 cities • For each city and grade, approximately 10-20 schools were chosen • 2 weakest classes randomly assigned to either treatment or control • T = 1 classes given opportunity to watch The Electric Company, and educational show • At the end of the year, students in all classes were given a reading test Post-Test Results Test scores in control classes Test scores in treated classes mean = 69 mean = 77 Grade 1 sd = 13 sd = 16 0 50 100 0 50 100 mean = 93 mean = 102 Grade 2 sd = 12 sd = 10 0 50 100 0 50 100 mean = 106 mean = 107 Grade 3 sd = 7 sd = 8 0 50 100 0 50 100 mean = 110 mean = 114 Grade 4 sd = 7 sd = 4 0 50 100 0 50 100 2.4 Controlling for a Pre-Treatment Predictor Controlling for Pre-Treatment Score • The preceding results are suggestive. 5

• However, in this study, a pre-test was also given. In this case, the treatment effect can also be estimated using a regression model: y i = α + θT i + βx i + error i . • First, we fit a model where post-test score is predicted from pre-test score, with constant slopes treatment and control groups. • Treatment group is represented by a solid regression line and circles, control by dotted regression line and filled dots. • In this case, – The treatment effect is estimated as a constant across individuals within treatment group – The regression lines are parallel, and – The treatment effect is the difference between the lines. Results with No Interaction grade 1 grade 2 grade 3 grade 4 post−test, y i post−test, y i post−test, y i post−test, y i ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 60 ● ● ● ● ● ● ● 60 ● 60 60 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 0 0 0 0 60 0 60 0 60 0 60 pre−test, x i pre−test, x i pre−test, x i pre−test, x i Including an Interaction Term 6

• The preceding model failed to take into account the fact that the relation- ship between pre-test and post-test scores might differ between treatment and control groups. • We can add an interaction term to the model, thus allowing treatment and control groups to have regression lines with differing slopes. • In this model, y i = α + θT i + β 1 x i + β 2 T i x i + error i . • Note that in this mode, the treatment effect can be written as θ + β 2 x i . In other words, the treatment effect changes as a function of pre-test status . Interaction Model Results grade 1 grade 2 grade 3 grade 4 post−test, y i post−test, y i post−test, y i post−test, y i ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 60 ● ● ● ● ● ● ● ● 60 ● 60 60 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 0 0 0 0 60 0 60 0 60 0 60 pre−test, x i pre−test, x i pre−test, x i pre−test, x i A Combined Picture This next plot shows T regression coefficient estimates, 50% and 90% confi- dence intervals by grade. You can see clearly how “controlling” for pre-test score reduces variability in the estimators and smooths them out. 7

Contents 1 Causal Inference and Predictive Comparison 2 1.1 How - PDF document

Causal Inference in Observational Studies Contents 1 Causal Inference and Predictive Comparison 2 1.1 How Predictive Comparison Can Mislead . . . . . . . . . . . . . 2 1.2 Adding Predictors as a Solution . . . . . . . . . . . . . . . . . .

Level 1, V2.0 Level 1, V2.0 1 Course Contents Course Contents Course Contents Course

Oasys Post Processing New Features in Version 16.0 www.arup.com/dyna Back to Contents Back to

Contents averages averages Contents Contents Harmonic mean (average) Harmonic mean (average)

Sage as a Calculator By Samaneh shafi naderi By Samaneh shafi naderi Sage as a Calculator

Contents Contents Fluid

Contents Contents.....2 Butter

PRODUCT LAW WORLDVIEW PRODUCT LAW WORLDVIEW TABLE OF CONTENTS TABLE OF CONTENTS INTRODUCTION

The Waterbase Limited Investor Presentation June - 2016 Contents Contents 2 Safe Harbour

17 www.scad.ae Table of Contents Table of Contents

Scytls voter-verifiability solutions Pnyx.DRE and Pnyx.VVPAT Contents Contents

Cencosud April 2016 Corporate Presentation | Contents | 2 Contents Investment Highlights

3 August 2006 Hong Kong www.solomon-systech.com Table of contents Table of contents

CONTENTS CONTENTS A. Company Profile 03 B. Products 06 Appendix 29 2/30 A. Company Profile

INVESTOR PRESENTATION February 2020 CONTENTS TABLE OF CONTENTS Majid Al Futtaim 2019

Marine Biodiversity Yoshihisa Shirayama Contents Contents Characteristics of Marine

Taeil Enterprise the antimicrobial material technology Table of Contents Table of Contents

WorldWide Telescope in the Classroom Patricia Udomprasert, Alyssa Goodman ,

CS 171: Introduction to Computer Science II Department of Mathematics and Computer Science Li

The impact of a Hausman pretest on the size of a hypothesis test Patrik Guggenberger Department

The role of Gold in Indian Economy Presented By Group: 05 Sameera Gorle Krishna Chaitanya

Utility value of management tools Advanced Herd Management Anders Ringgaard Kristensen What kind

Prototyping Preventive Maintenance Tools with R Erich Neuwirth, Julia Theresa Csar The R User

Software Evolution Dr. James A. Bednar jbednar@inf.ed.ac.uk

Increasing L Lane C Capacity w with thin Ex Existing ROW US US-23 F 3 Flex x Rout ute