Q u antif y ing the strength of bi v ariate relationships C OR R E - - PowerPoint PPT Presentation

q u antif y ing the strength of bi v ariate relationships
SMART_READER_LITE
LIVE PREVIEW

Q u antif y ing the strength of bi v ariate relationships C OR R E - - PowerPoint PPT Presentation

Q u antif y ing the strength of bi v ariate relationships C OR R E L ATION AN D R E G R E SSION IN R Ben Ba u mer Assistant Professor at Smith College Correlation Correlation coe cient bet w een -1 and 1 Sign > direction Magnit u


slide-1
SLIDE 1

Quantifying the strength of bivariate relationships

C OR R E L ATION AN D R E G R E SSION IN R

Ben Baumer

Assistant Professor at Smith College

slide-2
SLIDE 2

CORRELATION AND REGRESSION IN R

Correlation

Correlation coecient between -1 and 1 Sign —> direction Magnitude —> strength

slide-3
SLIDE 3

CORRELATION AND REGRESSION IN R

Near perfect correlation

slide-4
SLIDE 4

CORRELATION AND REGRESSION IN R

Strong

slide-5
SLIDE 5

CORRELATION AND REGRESSION IN R

Moderate

slide-6
SLIDE 6

CORRELATION AND REGRESSION IN R

Weak

slide-7
SLIDE 7

CORRELATION AND REGRESSION IN R

Zero

slide-8
SLIDE 8

CORRELATION AND REGRESSION IN R

Negative

slide-9
SLIDE 9

CORRELATION AND REGRESSION IN R

Non-linear

slide-10
SLIDE 10

CORRELATION AND REGRESSION IN R

Non-linear correlation

run10 %>% filter(divPlace <= 10) %>% ggplot(aes(x = age, y = pace, color = gender)) + geom_point()

slide-11
SLIDE 11

CORRELATION AND REGRESSION IN R

Pearson product-moment correlation

slide-12
SLIDE 12

CORRELATION AND REGRESSION IN R

Pearson product-moment correlation

r(x,y) = √ (x − ) ⋅ (y − ) ∑i=1

n i

x ¯ 2 ∑i=1

n i

y ¯ 2 x − y − ∑i=1

n

( i x ¯) ( i y ¯)

slide-13
SLIDE 13

Let's practice!

C OR R E L ATION AN D R E G R E SSION IN R

slide-14
SLIDE 14

The Anscombe dataset

C OR R E L ATION AN D R E G R E SSION IN R

Ben Baumer

Assistant Professor at Smith College

slide-15
SLIDE 15

CORRELATION AND REGRESSION IN R

Anscombe

ggplot(data = Anscombe, aes(x = x, y = y)) + geom_point() + facet_wrap(~ set)

slide-16
SLIDE 16

CORRELATION AND REGRESSION IN R

Anscombe 1

Anscombe %>% filter(set == 1) %>% ggplot(aes(x = x, y = y)) + geom_point()

slide-17
SLIDE 17

CORRELATION AND REGRESSION IN R

Anscombe 2

Anscombe %>% filter(set == 2) %>% ggplot(aes(x = x, y = y)) + geom_point()

slide-18
SLIDE 18

CORRELATION AND REGRESSION IN R

Anscombe 3

Anscombe %>% filter(set == 3) %>% ggplot(aes(x = x, y = y)) + geom_point()

slide-19
SLIDE 19

CORRELATION AND REGRESSION IN R

Anscombe 4

Anscombe %>% filter(set == 4) %>% ggplot(aes(x = x, y = y)) + geom_point()

slide-20
SLIDE 20

Let's practice!

C OR R E L ATION AN D R E G R E SSION IN R

slide-21
SLIDE 21

Interpretation of Correlation

C OR R E L ATION AN D R E G R E SSION IN R

Ben Baumer

Assistant Professor at Smith College

slide-22
SLIDE 22

CORRELATION AND REGRESSION IN R

Exercise and beer

Source: hp://well.blogs.nytimes.com/2015/12/02/the-close-ties-between- exercise-and-beer/

1

slide-23
SLIDE 23

CORRELATION AND REGRESSION IN R

Exercise and beer

Source: hp://well.blogs.nytimes.com/2015/12/02/the-close-ties-between- exercise-and-beer/

1

slide-24
SLIDE 24

CORRELATION AND REGRESSION IN R

Exercise and beer

Source: hp://well.blogs.nytimes.com/2015/12/02/the-close-ties-between- exercise-and-beer/

1

slide-25
SLIDE 25

CORRELATION AND REGRESSION IN R

Exercise and beer

Source: hp://well.blogs.nytimes.com/2015/12/02/the-close-ties-between- exercise-and-beer/

1

slide-26
SLIDE 26

CORRELATION AND REGRESSION IN R

Exercise and beer

Source: hp://well.blogs.nytimes.com/2015/12/02/the-close-ties-between- exercise-and-beer/

1

slide-27
SLIDE 27

CORRELATION AND REGRESSION IN R

Exercise and beer

Source: hp://well.blogs.nytimes.com/2015/12/02/the-close-ties-between- exercise-and-beer/

1

slide-28
SLIDE 28

CORRELATION AND REGRESSION IN R

Exercise and beer

slide-29
SLIDE 29

CORRELATION AND REGRESSION IN R

Exercise and beer

slide-30
SLIDE 30

CORRELATION AND REGRESSION IN R

Exercise and beer

slide-31
SLIDE 31

CORRELATION AND REGRESSION IN R

Exercise and beer

slide-32
SLIDE 32

CORRELATION AND REGRESSION IN R

NFL arrests

Source: hps://www.nytimes.com/2014/09/13/upshot/what-the-numbers- show-about-n-player-arrests.html

1

slide-33
SLIDE 33

CORRELATION AND REGRESSION IN R

NFL arrests

slide-34
SLIDE 34

CORRELATION AND REGRESSION IN R

NFL arrests

slide-35
SLIDE 35

CORRELATION AND REGRESSION IN R

NFL arrests

slide-36
SLIDE 36

CORRELATION AND REGRESSION IN R

Correlation vs. regression

Source: hp://www.nytimes.com/2012/11/02/business/questions-raised-on- withdrawal-of-congressional-research-services-report-on-tax-rates.html

1

slide-37
SLIDE 37

CORRELATION AND REGRESSION IN R

Correlation vs. regression

Source: hp://www.nytimes.com/2012/11/02/business/questions-raised-on- withdrawal-of-congressional-research-services-report-on-tax-rates.html

1

slide-38
SLIDE 38

CORRELATION AND REGRESSION IN R

Correlation vs. regression

Source: hp://www.nytimes.com/2012/11/02/business/questions-raised-on- withdrawal-of-congressional-research-services-report-on-tax-rates.html

1

slide-39
SLIDE 39

CORRELATION AND REGRESSION IN R

Can you plot a correlation?

Source: hp://heatst.com/world/no-correlation-between-voting-for-brexit- and-racism-study-nds/

1

slide-40
SLIDE 40

Let's practice!

C OR R E L ATION AN D R E G R E SSION IN R

slide-41
SLIDE 41

Spurious correlations

C OR R E L ATION AN D R E G R E SSION IN R

Ben Baumer

Assistant Professor at Smith College

slide-42
SLIDE 42

CORRELATION AND REGRESSION IN R

Spurious over time

slide-43
SLIDE 43

CORRELATION AND REGRESSION IN R

Spurious over time

slide-44
SLIDE 44

CORRELATION AND REGRESSION IN R

Spurious over space

slide-45
SLIDE 45

CORRELATION AND REGRESSION IN R

Spurious for whatever reason

slide-46
SLIDE 46

Let's practice!

C OR R E L ATION AN D R E G R E SSION IN R