Dealing with Separation in Logistic Regression Models Carlisle - - PowerPoint PPT Presentation

dealing with separation in logistic regression models
SMART_READER_LITE
LIVE PREVIEW

Dealing with Separation in Logistic Regression Models Carlisle - - PowerPoint PPT Presentation

Dealing with Separation in Logistic Regression Models Carlisle Rainey Assistant Professor Texas A&M University crainey@tamu.edu paper, data, and code at crain.co/research The prior matters a lot, so choose a good one. The prior matters


slide-1
SLIDE 1

Dealing with Separation in Logistic Regression Models

Carlisle Rainey Assistant Professor Texas A&M University crainey@tamu.edu

paper, data, and code at crain.co/research

slide-2
SLIDE 2

The prior matters a lot, so choose a good one.

slide-3
SLIDE 3

The prior matters a lot, so choose a good one.

  • 1. in practice
  • 2. in theory
  • 3. concepts
  • 4. software
slide-4
SLIDE 4

The Prior Matters

in Practice

slide-5
SLIDE 5

politics need

slide-6
SLIDE 6

Variable Coefficient Confidence Interval Democratic Governor

  • 26.35

[-126,979.03; 126,926.33] % Uninsured (Std.) 0.92 [-3.46; 5.30] % Favorable to ACA 0.01 [-0.17; 0.18] GOP Legislature 2.43 [-0.47; 5.33] Fiscal Health 0.00 [-0.02; 0.02] Medicaid Multiplier

  • 0.32

[-2.45; 1.80] % Non-white 0.05 [-0.12; 0.21] % Metropolitan

  • 0.08

[-0.17; 0.02] Constant 2.58 [-7.02; 12.18]

slide-7
SLIDE 7

Variable Coefficient Confidence Interval Democratic Governor

  • 26.35

[-126,979.03; 126,926.33] % Uninsured (Std.) 0.92 [-3.46; 5.30] % Favorable to ACA 0.01 [-0.17; 0.18] GOP Legislature 2.43 [-0.47; 5.33] Fiscal Health 0.00 [-0.02; 0.02] Medicaid Multiplier

  • 0.32

[-2.45; 1.80] % Non-white 0.05 [-0.12; 0.21] % Metropolitan

  • 0.08

[-0.17; 0.02] Constant 2.58 [-7.02; 12.18]

slide-8
SLIDE 8

Variable Coefficient Confidence Interval Democratic Governor

  • 26.35

[-126,979.03; 126,926.33] % Uninsured (Std.) 0.92 [-3.46; 5.30] % Favorable to ACA 0.01 [-0.17; 0.18] GOP Legislature 2.43 [-0.47; 5.33] Fiscal Health 0.00 [-0.02; 0.02] Medicaid Multiplier

  • 0.32

[-2.45; 1.80] % Non-white 0.05 [-0.12; 0.21] % Metropolitan

  • 0.08

[-0.17; 0.02] Constant 2.58 [-7.02; 12.18]

This is a failure of maximum likelihood.

slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11
slide-12
SLIDE 12

Different default priors produce different results.

slide-13
SLIDE 13

The Prior Matters

in Theory

slide-14
SLIDE 14

For

  • 1. a monotonic likelihood p(y|β) decreasing in βs,
  • 2. a proper prior distribution p(β|σ), and
  • 3. a large, negative βs,

the posterior distribution of βs is proportional to the prior distribution for βs, so that p(βs|y) ∝ p(βs|σ).

slide-15
SLIDE 15

For

  • 1. a monotonic likelihood p(y|β) decreasing in βs,
  • 2. a proper prior distribution p(β|σ), and
  • 3. a large, negative βs,

the posterior distribution of βs is proportional to the prior distribution for βs, so that p(βs|y) ∝ p(βs|σ).

slide-16
SLIDE 16

The prior determines crucial parts of the posterior.

slide-17
SLIDE 17

Key Concepts

for Choosing a Good Prior

slide-18
SLIDE 18

Pr(yi) = Λ(βc + βssi + β1xi1 + ... + βkxik)

slide-19
SLIDE 19

Transforming the Prior Distribution

˜ β ∼ p(β) ˜ πnew = p(ynew|˜ β) ˜ qnew = q(˜ πnew)

slide-20
SLIDE 20

We Already Know Few Things

β1 ≈ ˆ βmle

1

β2 ≈ ˆ βmle

2

. . . βk ≈ ˆ βmle

k

βs < 0

slide-21
SLIDE 21

Partial Prior Distribution

p∗(β|βs < 0, β−s = ˆ βmle

−s ),

where ˆ βmle

s

= −∞

slide-22
SLIDE 22
slide-23
SLIDE 23

Software

for Choosing a Good Prior

slide-24
SLIDE 24

separation

(on GitHub)

slide-25
SLIDE 25

Stan Project

rstanarm

StataStan

slide-26
SLIDE 26

Conclusion

slide-27
SLIDE 27

The prior matters a lot, so choose a good one.

slide-28
SLIDE 28

What should you do?

  • 1. Notice the problem and do something.
  • 2. Recognize the the prior affects the inferences

and choose a good one.

  • 3. Assess the robustness of your conclusions to a

range of prior distributions.

slide-29
SLIDE 29