H Y P OT H E S I S T E S T I N G MPA 630: Data Science for Public - - PowerPoint PPT Presentation

h y p ot h e s i s t e s t i n g
SMART_READER_LITE
LIVE PREVIEW

H Y P OT H E S I S T E S T I N G MPA 630: Data Science for Public - - PowerPoint PPT Presentation

H Y P OT H E S I S T E S T I N G MPA 630: Data Science for Public Management November 15, 2018 Fill out your reading report on Learning Suite P L A N F O R T O D A Y Randomness, repetition, and replicability Why are we even doing this?


slide-1
SLIDE 1

H Y P OT H E S I S T E S T I N G

MPA 630: Data Science for Public Management November 15, 2018

Fill out your reading report

  • n Learning Suite
slide-2
SLIDE 2

P L A N F O R T O D A Y Why are we even doing this?

(again!)

Randomness, repetition, and replicability Burdens of proof How to test any hypothesis

slide-3
SLIDE 3

R A N D O M N E S S , R E P E T I T I O N , & R E P L I C A B I L I T Y

slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6

P O W E R P O S I N G

Increases individual perception of power

She made a guess at a population parameter and published it This is the process

  • f science!

Increases testosterone and decreases cortisol

slide-7
SLIDE 7

R U H R O H

slide-8
SLIDE 8

B U T W A I T

Increases individual perception of power Increases testosterone and decreases cortisol

slide-9
SLIDE 9

M O R A L O F T H E S T O R Y

Randomness is weird Capturing true population parameters is hard Replication and repetition are needed to check the net

slide-10
SLIDE 10

W H Y A R E W E E V E N D O I N G T H I S ?

Round 2!

slide-11
SLIDE 11

P O P U L A T I O N P A R A M E T E R S

Key assumption in the flavor of statistics we’re doing:

There are true, fixed population parameters out in the world

slide-12
SLIDE 12

P O P U L A T I O N V S . S A M P L E

Proportion Mean Difference between proportions Difference between means Intercept ! " !# − !% "# − "% &' ̂ ! ̅ * ̅ *# − ̅ *% ̂ !# − ̂ !% + &'

,

slide-13
SLIDE 13

W H Y W E S I M U L A T E

Is ̅ " an accurate guess of #? Is ̅ "$ − ̅ "& or ̂ ($ − ̂ (& real? Does it matter? Is it accurate? Is it real? Is it substantive?

Width of confidence interval Important numbers included in confidence interval

slide-14
SLIDE 14

B U R D E N S O F P R O O F

slide-15
SLIDE 15

A M E R I C A N L E G A L S Y S T E M

Accused must be judged Accuser has burden of proving guilt Presumption of innocence

We never prove innocence; we try (and fail) to reject innocence

Judge/jury decide guilt based

  • n amount of evidence
slide-16
SLIDE 16

L E G A L E V I D E N T I A R Y S T A N D A R D S Preponderance of evidence Beyond reasonable doubt Clear and convincing evidence

Why do we have these different levels? We’re afraid of locking up an innocent person

slide-17
SLIDE 17

Actual truth Guilty Not guilty Jury decision Not guilty Guilty

Yay!

True positive

Yay!

True negative

Oh no!

False positive (I)

Oh no!

False negative (II)

slide-18
SLIDE 18
slide-19
SLIDE 19

S T A T I S T I C A L “ L E G A L” S Y S T E M

You have burden of proving effect Presumption of no effect (null)

We never prove that the null is true; we try (and fail) to reject the null

You decide “guiltiness” of effect based on amount of evidence Sample statistic (δ) must be judged

slide-20
SLIDE 20

Actual truth Yes effect No effect Result of hypothesis test No effect Yes effect

Yay!

True positive

Yay!

True negative

Oh no!

False positive (I)

Oh no!

False negative (II)

!

0.10 0.05 0.01

slide-21
SLIDE 21

S T A T I S T I C A L S I G N I F I C A N C E

There’s enough evidence to safely reject the null hypothesis

slide-22
SLIDE 22

P - V A L U E S

The probability of observing an effect at least that large when no effect exists

slide-23
SLIDE 23

N O B O D Y U N D E R S T A N D S T H E S E

http://fivethirtyeight.com/pvalue

slide-24
SLIDE 24

H O W TO T E S T A N Y H Y P OT H E S I S

slide-25
SLIDE 25
slide-26
SLIDE 26
slide-27
SLIDE 27
slide-28
SLIDE 28

S I M U L A T I O N S A N D H Y P O T H E S E S

slide-29
SLIDE 29

Find δ

The sample statistic: diff in means, mean, diff in props, etc.

Invent world where δ is null

Simulate what the world would look like if there was no effect.

Look at δ in the null world

Is it big and extraordinary, or is it a normal thing?

Calculate probability that δ could exist in the null world

This is your p-value!

Decide!