Geographic Data Science - Lecture IX Causal Inference Dani - - PowerPoint PPT Presentation

geographic data science lecture ix
SMART_READER_LITE
LIVE PREVIEW

Geographic Data Science - Lecture IX Causal Inference Dani - - PowerPoint PPT Presentation

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs Causation Causal inference Why/when causality matters Hurdles to causal inference & strategies to overcome them Correlation Vs Causation


slide-1
SLIDE 1

Geographic Data Science - Lecture IX

Causal Inference

Dani Arribas-Bel

slide-2
SLIDE 2

Today

Correlation Vs Causation Causal inference Why/when causality matters Hurdles to causal inference & strategies to

  • vercome them
slide-3
SLIDE 3

Correlation Vs Causation

slide-4
SLIDE 4

"Association breeds similarity" (sometimes)

Nasir bin Olu Dara Jones (a.k.a. Nas)

slide-5
SLIDE 5

Correlation Vs Causation

Two fundamental ways to look at the relationship between two (or more) variables:

slide-6
SLIDE 6

Correlation Vs Causation

Two fundamental ways to look at the relationship between two (or more) variables: Correlation Two variables have co-movement. If we know the value of one, we know something about the value of the other one.

slide-7
SLIDE 7

Correlation Vs Causation

Two fundamental ways to look at the relationship between two (or more) variables: Correlation Two variables have co-movement. If we know the value of one, we know something about the value of the other one. Causation There is a "cause-effect" link between the two and, as a result, they display co-movement.

slide-8
SLIDE 8

Correlation Vs Causation

Both are useful, but for different purposes Causation implies correlation but not the other way around It is vital to keep this distinction in mind for meaningful and credible analysis

slide-9
SLIDE 9

Examples

Sign correlation? Causal link? Take a guess (2mins)... Temperature and ice-cream consumption Non-commercial space launches & Sociology PhDs awarded Crime & policing IMD Moran Plot in Liverpool

slide-10
SLIDE 10

Examples

Sign correlation? Causal link? Take a guess (2mins)... Temperature and ice-cream consumption →

  • Positive. Positive.

Non-commercial space launches & Sociology PhDs awarded Crime & policing IMD Moran Plot in Liverpool

slide-11
SLIDE 11

Worldwide non-commercial space launches

correlates with

Sociology doctorates awarded (US)

Sociology doctorates awarded (US) Worldwide non-commercial space launches

2000 2001 2002 2003 2004 2005 2006 2007 2000 2001 2002 2003 2004 2005 2006 2007

[ ] Source

slide-12
SLIDE 12

Examples

Positive or negative correlation? Causal link? Take a guess (2mins)... Temperature and ice-cream consumption →

  • Positive. Positive.

Non-commercial space launches & Sociology PhDs awarded → Positive. None. Crime & policing IMD Moran Plot in Liverpool

slide-13
SLIDE 13

Examples

Positive or negative correlation? Causal link? Take a guess (2mins)... Temperature and ice-cream consumption →

  • Positive. Positive.

Non-commercial space launches & Sociology PhDs awarded → Positive. None. Crime & policing → Positive. Negative. IMD Moran Plot in Liverpool

slide-14
SLIDE 14
slide-15
SLIDE 15

Examples

Positive or negative correlation? Causal link? Take a guess (2mins)... Temperature and ice-cream consumption →

  • Positive. Positive.

Non-commercial space launches & Sociology PhDs awarded → Positive. None. Crime & policing → Positive. Negative. IMD Moran Plot in Liverpool → Positive. ?

slide-16
SLIDE 16

Causal inference

slide-17
SLIDE 17

Playback isn't supported on this device.

Causal inference is hard (or how I learned to stop worrying a…

0:00 / 5:29

[ ] Source

slide-18
SLIDE 18

Why/When get causal?

slide-19
SLIDE 19

Why

Most often, we are interested in understanding the processes that generate the world, not only in

  • bserving its outcomes

Many of these processes are only indirectly

  • bservable through outcomes

The only way to link both is through causal channels

slide-20
SLIDE 20

When

Essentially when the core interest is to find out if something causes something else Policy interventions Medical trials Business decisions (product/feature development...) Empirical (Social) Sciences ...

slide-21
SLIDE 21

When not (necessarily)

slide-22
SLIDE 22

When not (necessarily)

Exploratory analysis When you are not sure what you are after, inferring causality might be too high of a price to pay to get a sense of the main relationships

slide-23
SLIDE 23

When not (necessarily)

Exploratory analysis When you are not sure what you are after, inferring causality might be too high of a price to pay to get a sense of the main relationships Predictive settings Interest not in understanding the underlying mechanisms but want to obtain best possible estimates of a variable you do not have by combining

  • thers you do have

E.g. Population density in a specific point using population density in all available nearby locations

slide-24
SLIDE 24

Hurdles to causal inference

slide-25
SLIDE 25

Hurdles to causal inference

Causation implies Correlation Correlation does not imply Causation Why?

slide-26
SLIDE 26

Hurdles to causal inference

Causation implies Correlation Correlation does not imply Causation Why?

slide-27
SLIDE 27

Hurdles to causal inference

Causation implies Correlation Correlation does not imply Causation Why? Reverse causality Confounding factors/endogeneity

slide-28
SLIDE 28

Reverse causality

There is a causal link between the two variables but it either runs the oposite direction as we think, or runs in both

slide-29
SLIDE 29

Reverse causality

There is a causal link between the two variables but it either runs the oposite direction as we think, or runs in both E.g. Education and income

slide-30
SLIDE 30

Confounding factors

Two variables are correlated because they are both determined by other, unobserved, variables (factors) that confound the effect

slide-31
SLIDE 31

Confounding factors

Two variables are correlated because they are both determined by other, unobserved, variables (factors) that confound the effect E.g. Ice cream and cold beverages consumption

slide-32
SLIDE 32

Strategies

slide-33
SLIDE 33

Is there any way to overcome reverse causality and confounding factors to recover causal effects?

slide-34
SLIDE 34

Is there any way to overcome reverse causality and confounding factors to recover causal effects? The key is to get an exogenous source of variation

slide-35
SLIDE 35

Strategies

slide-36
SLIDE 36

Strategies

Randomized Control Trials Treated and control groups Probability of treatment is independent of everything else

slide-37
SLIDE 37

Strategies

Randomized Control Trials Treated and control groups Probability of treatment is independent of everything else Quasi-natural experiments Like a RCT, but that just "happen to occur naturally" (natural dissasters, exogenous law changes...)

slide-38
SLIDE 38

Strategies

Randomized Control Trials Treated and control groups Probability of treatment is independent of everything else Quasi-natural experiments Like a RCT, but that just "happen to occur naturally" (natural dissasters, exogenous law changes...) Econometric techniques For the interested reader: space-time regression, instrumental variables, propensity score matching, differences-in-differences, regression discontinuity...

slide-39
SLIDE 39

So, why correlation at all?

slide-40
SLIDE 40

So, why correlation at all?

Establishing causality is much harder than identifying correlation, and sometimes it is just not possible with a given dataset (e.g. many observational surveys).

slide-41
SLIDE 41

So, why correlation at all?

Establishing causality is much harder than identifying correlation, and sometimes it is just not possible with a given dataset (e.g. many observational surveys). ... correlation most often precludes causation and, depending on the application/analysis, it is all that is needed.

slide-42
SLIDE 42

So, why correlation at all?

Establishing causality is much harder than identifying correlation, and sometimes it is just not possible with a given dataset (e.g. many observational surveys). ... correlation most often precludes causation and, depending on the application/analysis, it is all that is needed. It is important to always draw conclusions based on analysis, know what the data can and cannot tell, and stay honest.

slide-43
SLIDE 43

Recapitulation

Correlation does NOT imply causation Causality implies more than correlation, a direct effect channel that is harder to identify but might be worthwhile There are several techniques to identify causality, all usually based on obtaining exogenous sources

  • f variation

You don't always need causality

slide-44
SLIDE 44

[ ] Source

slide-45
SLIDE 45

Geographic Data Science'16 - Lecture 9 by is licensed under a . Dani Arribas-Bel Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License