Bayesian leave-one-out cross-validation for large data Mns Magnusson - - PowerPoint PPT Presentation

bayesian leave one out cross validation for large data
SMART_READER_LITE
LIVE PREVIEW

Bayesian leave-one-out cross-validation for large data Mns Magnusson - - PowerPoint PPT Presentation

Bayesian leave-one-out cross-validation for large data Mns Magnusson (Aalto University) Michael Riis Andersen (Technical University of Denmark) Johan Jonasson (University of Gothenburg) Aki Vehtari (Aalto University) Motivation: Model


slide-1
SLIDE 1

Bayesian leave-one-out cross-validation for large data

Måns Magnusson (Aalto University) Michael Riis Andersen (Technical University of Denmark) Johan Jonasson (University of Gothenburg) Aki Vehtari (Aalto University)

slide-2
SLIDE 2

Motivation: Model selection for large data

  • Bigger data sets and more complex models
  • We still need to evaluate and compare models
  • elpdM quantifies how model M generalizes to unseen data ˜

yi elpdM =

  • log pM(˜

yi|y)pt(˜ yi)d˜ yi ,

Expected log predictive density Posterior predictive distribution True data generating process

2 DTU Compute Bayesian leave-one-out cross-validation for large data 7.6.2019

slide-3
SLIDE 3

Leave-one-out cross-validation

  • Basic idea: Hold out observation i and predict yi based on y−i
  • Estimate elpdM using leave-one-out cross-validation (loo)

elpdloo = 1 n

n

  • i=1

log pM(yi|y−i) = 1 n

n

  • i=1

log

  • pM(yi|θ)pM(θ|y−i)dθ
  • Desirable properties

+ almost unbiased for large n + straight-forward handling of hierarchical structures

  • Two major problems
  • Need to fit the model n times
  • Need to evaluate predictive densities n times

3 DTU Compute Bayesian leave-one-out cross-validation for large data 7.6.2019

slide-4
SLIDE 4

Our contributions: Method

elpdloo = 1 n

n

  • i=1

log pM(yi|y−i)

  • We propose a fast approximation for elpdloo

1 Approximate full data posterior qM(θ|y) using Variational Bayes/Laplace 2 Compute pM(yi|y−i) using importance sampling with qM as proposal 3 Subsample the sum over n using the Hansen-Hurwitz estimator

  • Solves both problems with leave-one-out CV

1 Only need to fit the model once on the full data set 2 Predictive distributions pM(yi|y−i) are only needed for a small subset

4 DTU Compute Bayesian leave-one-out cross-validation for large data 7.6.2019

slide-5
SLIDE 5

Our contributions: Results

  • Theoretical results (under regularity conditions)
  • elpdloo

p

→ elpdloo for n → ∞

  • Extensive empirical results

1 Variational Bayes, Laplace approx., MCMC 2 Bayesian linear regression 3 Hierarchical models

  • For more details, come see us at poster #231
  • Thank you for listening!

5 DTU Compute Bayesian leave-one-out cross-validation for large data 7.6.2019