(Mis)use of matching techniques Pawe Strawiski University of Warsaw - - PowerPoint PPT Presentation

mis use of matching techniques
SMART_READER_LITE
LIVE PREVIEW

(Mis)use of matching techniques Pawe Strawiski University of Warsaw - - PowerPoint PPT Presentation

Introduction and motivation Matching Numerical examples Final (Mis)use of matching techniques Pawe Strawiski University of Warsaw 5th Polish Stata Users Meeting, Warsaw, 27th November 2017 Research financed under National Science


slide-1
SLIDE 1

Introduction and motivation Matching Numerical examples Final

(Mis)use of matching techniques

Paweł Strawiński

University of Warsaw

5th Polish Stata Users Meeting, Warsaw, 27th November 2017

Research financed under National Science Center, Poland grant 2015/19/B/HS4/03231

Paweł Strawiński (Mis)use of matching techniques

slide-2
SLIDE 2

Introduction and motivation Matching Numerical examples Final

Outline

1

Introduction and motivation

2

Matching Matching algorithms Determining closest match

3

Numerical examples Example 1 Small grumble Example 2

4

Final

Paweł Strawiński (Mis)use of matching techniques

slide-3
SLIDE 3

Introduction and motivation Matching Numerical examples Final

Motivation

Matching techniques became very popular among researchers Search on ideas.repec.org for documents created since 2007 gives result: ”propensity: 11794, score : 14188, matching : 23245” However, matching is often overlooked as a ”magic bullet” that solves all statistical problems The problem is that practitioners either are not aware or ignore shortcomings of matching procedures

Paweł Strawiński (Mis)use of matching techniques

slide-4
SLIDE 4

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

The idea of matching

Lets assume we observe two groups. One is exposed to experimental treatment and the other to control treatment We would like to compare the values of the outcome variable in two groups If a study is non-randomized simple difference in outcome variable may be biased estimator of the effect of treatment Matching is statistical technique that allows to transform non-randomised data to randomised one.

Paweł Strawiński (Mis)use of matching techniques

slide-5
SLIDE 5

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

How it works

Assume that each observation is described by pair of random variables (Y , X). We call Y outcome variable(s), and X

  • bject characteristic(s)

1

For each observation in the treatment group, find observation in the control group with the same (or at least very similar) X value(s).

2

The Y values of these matching observations from the control group are then used to compute the counterfactual outcome without treatment for the observation from treatment group.

3

An estimate for the average treatment effect can be obtained as the mean of the differences between the observed values from the treatment and the matched counterfactual values from the control group.

Paweł Strawiński (Mis)use of matching techniques

slide-6
SLIDE 6

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

Outcome of interest

The Average Treatment on the Treated (ATT) ATT =

  • i

Y 1

i − E(Y 0 i |T = 1)

The Average Treatment Effect (ATE) ATE = #T = 1 N ˆ ATT + #T = 0 N

  • i

E(Y 1

i |T = 0) − Y 0 i

  • Those expected values are not directly observed. They are

retrieved from observed data by reweighting procedure. Different algorithms uses different reweighting.

Paweł Strawiński (Mis)use of matching techniques

slide-7
SLIDE 7

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

Matching methods

Exact matching Distance matching Propensity Score Matching

Paweł Strawiński (Mis)use of matching techniques

slide-8
SLIDE 8

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

Exact Matching

Exact matching is matching on discrete metric Observations are matched if and only if Xi = Xj The result is perfect matching on covariate values Is the problem is multivariate the result could be an empty set

Paweł Strawiński (Mis)use of matching techniques

slide-9
SLIDE 9

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

Distance Matching

Matching on distance metric that measures proximity of

  • bservations

The idea then is to use close observations, but not necessarily ideally matched The most popular algorithm is Mahalanobis distance matching MD = (Xi − Xj)′Σ(Xi − Xj) where Σ is empirical covariance matrix of X Performs well when X are discrete When X are continuous can be computationally burdensome

Paweł Strawiński (Mis)use of matching techniques

slide-10
SLIDE 10

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

Propensity Score Matching

Method proposed by Rosenbaum and Rubin (1983). Instead matching on multidimensional X matching is done on propensity score π(X) which is E(T = 1 | X) It requires estimation of the propensity score π(X) usually by logit or probit model. Then observations with closest values of π(X) are matched Performs well when X are continuous When X are discrete it often is difficult to choose best match

Paweł Strawiński (Mis)use of matching techniques

slide-11
SLIDE 11

Introduction and motivation Matching Numerical examples Final Matching algorithms Determining closest match

Pair matching (no replacement) Nearest neighbour match (with replacement) Nearest k-neighbour match (with replacement) Caliper matching Kernel matching

Paweł Strawiński (Mis)use of matching techniques

slide-12
SLIDE 12

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

The aim of examples is to show in which circumstances exact matching and matching on the propensity score leads to poor quality results

Paweł Strawiński (Mis)use of matching techniques

slide-13
SLIDE 13

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1

We analyse sample of 1000 observations There are 500 treated objects and 500 non treated object Objects are described by 5 dummy variables and 5 variables with continuous distribution In each model for propensity score 5 dummy variables are used and different number of continuously distributed variables from 0 to 5

Paweł Strawiński (Mis)use of matching techniques

slide-14
SLIDE 14

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. 5 dummy

20 40 60 80 .2 .4 .6 .8 .2 .4 .6 .8

1 Frequency Pr(T)

Graphs by T

Paweł Strawiński (Mis)use of matching techniques

slide-15
SLIDE 15

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. 5 dummy and 1 continuous covariate

.5 1 .2 .4 .6 .8 .2 .4 .6 .8

1 Frequency Pr(T)

Graphs by T

Paweł Strawiński (Mis)use of matching techniques

slide-16
SLIDE 16

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. 5 dummy and 2 continuous covariates

.5 1 .5 1 .5 1

1 Frequency Pr(T)

Graphs by T

Paweł Strawiński (Mis)use of matching techniques

slide-17
SLIDE 17

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. 5 dummy and 3 continuous covariates

.5 1 .5 1 .5 1

1 Frequency Pr(T)

Graphs by T

Paweł Strawiński (Mis)use of matching techniques

slide-18
SLIDE 18

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. 5 dummy and 4 continuous covariates

.5 1 .5 1 .5 1

1 Frequency Pr(T)

Graphs by T

Paweł Strawiński (Mis)use of matching techniques

slide-19
SLIDE 19

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. 5 dummy and 5 continuous covariates

.5 1 .5 1 .5 1

1 Frequency Pr(T)

Graphs by T

Paweł Strawiński (Mis)use of matching techniques

slide-20
SLIDE 20

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. Number of distinct pscore values

With 0 continuous variables 31 distinct values With at least 1 continuous variable 1000 distinct values Therefore, distinct values of the propensity score are important but not sufficient to find good matches Its distribution also matters

Paweł Strawiński (Mis)use of matching techniques

slide-21
SLIDE 21

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. What I do not like about teffects

. teffects psmatch (y) (T d1 d2 d3 d4 d5), atet Treatment-effects estimation Number of obs = 1,000 Estimator : propensity-score matching Matches: requested = 1 Outcome model : matching min = 1 Treatment model: logit max = 82

  • |

AI Robust y1 | Coef.

  • Std. Err.

z P>|z| [95% Conf. Interval]

  • ------------+----------------------------------------------------------------

ATET | T | (1 vs 0) | 46.52442 105.3741 0.44 0.659

  • 160.005

253.0538

  • Paweł Strawiński

(Mis)use of matching techniques

slide-22
SLIDE 22

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. What I do not like about teffects

. psmatch2 T d1 d2 d3 d4 d5, outcome(y) logit quietly There are observations with identical propensity score values. The sort order of the data could affect your results. Make sure that the sort order is random before calling psmatch2.

  • Variable

Sample | Treated Controls Difference S.E. T-stat

  • ---------------------------+-----------------------------------------------------------

y1 Unmatched | 2107.59819 2074.21094 33.3872521 71.9959108 0.46 ATT | 2107.59819 1879.20512 228.393068 330.29759 0.69

  • ---------------------------+-----------------------------------------------------------

Note: S.E. does not take into account that the propensity score is estimated. Paweł Strawiński (Mis)use of matching techniques

slide-23
SLIDE 23

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 1. What I do not like about teffects

. teffects psmatch (y) (T x1 d1 d2 d3 d4 d5), atet Treatment-effects estimation Number of obs = 1,000 Estimator : propensity-score matching Matches: requested = 1 Outcome model : matching min = 1 Treatment model: logit max = 1

  • |

AI Robust y1 | Coef.

  • Std. Err.

z P>|z| [95% Conf. Interval]

  • ------------+----------------------------------------------------------------

ATET | T | (1 vs 0) | 33.21557 145.23 0.23 0.819

  • 251.43

317.8611

  • . psmatch2 T x1 d1 d2 d3 d4 d5, outcome(y) logit qui
  • Variable

Sample | Treated Controls Difference S.E. T-stat

  • ---------------------------+-----------------------------------------------------------

y1 Unmatched | 2107.59819 2074.21094 33.3872521 71.9959108 0.46 ATT | 2107.59819 2074.38262 33.2155662 127.669286 0.26

  • ---------------------------+-----------------------------------------------------------

Note: S.E. does not take into account that the propensity score is estimated. Paweł Strawiński (Mis)use of matching techniques

slide-24
SLIDE 24

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 2.

We analyse National Longitudinal Surveys 1988, Women samples data

sysuse nlsw88.dta, clear (NLSW, 1988 extract) drop if race==3 (26 observations deleted) keep if hours==40 (1,142 observations deleted) gen black=(race==2)

We restrict sample to white and black woman And consider only those who work for 40 hours a week In model for propensity score we use 8 dummy variables are used and 3 continuously distributed variables

Paweł Strawiński (Mis)use of matching techniques

slide-25
SLIDE 25

Introduction and motivation Matching Numerical examples Final Example 1 Small grumble Example 2

Example 2. 8 dummy and 3 continuous covariates

.5 1 .5 1 .5 1

white black Frequency propensity score

Graphs by black

Paweł Strawiński (Mis)use of matching techniques

slide-26
SLIDE 26

Introduction and motivation Matching Numerical examples Final

Conclusions and Recomendations

The Propensity Score Matching and other matching techniques should be used with caution If You lacking continuously distributed covariates the PSM matching is bad idea. Distance matching is better choice but remember about problem of identical distances Three continuously distributed covariates are usually enough to receive close to continuous distribution for the estimated propensity score

Paweł Strawiński (Mis)use of matching techniques