Bayesian leave-one-out cross-validation for large data Måns Magnusson - PowerPoint PPT Presentation

Sep 15, 2023


Bayesian leave-one-out cross-validation for large data Måns Magnusson (Aalto University) Michael Riis Andersen (Technical University of Denmark) Johan Jonasson (University of Gothenburg) Aki Vehtari (Aalto University)
Motivation: Model selection for large data
• Bigger data sets and more complex models
• We still need to evaluate and compare models
• $\mathrm{elpd}_M$, the expected log predictive density, quantifies how model $M$ generalizes to unseen data $\tilde{y}_i$:

$$\mathrm{elpd}_M = \int \log p_M(\tilde{y}_i \mid y)\, p_t(\tilde{y}_i)\, d\tilde{y}_i$$

Here $p_t(\tilde{y}_i)$ is the true data generating process and $p_M(\tilde{y}_i \mid y)$ is the posterior predictive distribution.

DTU Compute Bayesian leave-one-out cross-validation for large data 7.6.2019
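As an illustration of the elpd definition, the sketch below estimates the integral by Monte Carlo for a toy case where the true data generating process is known. Everything here is an assumption made for the example and not taken from the slides: the model is a normal likelihood with known `sigma` and a normal prior on the mean, and the function names (`posterior_predictive_params`, `mc_elpd`) are hypothetical.

```python
import numpy as np


def log_normal_pdf(x, mean, var):
    """Log density of N(mean, var) evaluated at x."""
    return -0.5 * (np.log(2.0 * np.pi * var) + (x - mean) ** 2 / var)


def posterior_predictive_params(y, sigma=1.0, prior_mean=0.0, prior_sd=10.0):
    """Mean and variance of p_M(y_new | y) for the assumed conjugate model:
    y_i ~ N(mu, sigma^2) with sigma known, and mu ~ N(prior_mean, prior_sd^2)."""
    n = len(y)
    post_prec = 1.0 / prior_sd**2 + n / sigma**2
    post_mean = (prior_mean / prior_sd**2 + y.sum() / sigma**2) / post_prec
    return post_mean, sigma**2 + 1.0 / post_prec


def mc_elpd(y, true_mean, true_sd, n_draws, rng, sigma=1.0):
    """Monte Carlo estimate of elpd_M: average the log posterior predictive
    density over fresh draws from the (here, known) true process p_t."""
    pred_mean, pred_var = posterior_predictive_params(y, sigma=sigma)
    y_new = rng.normal(true_mean, true_sd, size=n_draws)
    return log_normal_pdf(y_new, pred_mean, pred_var).mean()
```

In practice $p_t$ is unknown, which is why the integral has to be estimated from the observed data, for example by cross-validation.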
Leave-one-out cross-validation
• Basic idea: Hold out observation $i$ and predict $y_i$ based on $y_{-i}$
• Estimate $\mathrm{elpd}_M$ using leave-one-out cross-validation (loo):

$$\mathrm{elpd}_{\mathrm{loo}} = \frac{1}{n}\sum_{i=1}^{n} \log p_M(y_i \mid y_{-i}) = \frac{1}{n}\sum_{i=1}^{n} \log \int p_M(y_i \mid \theta)\, p_M(\theta \mid y_{-i})\, d\theta$$

• Desirable properties
  + Almost unbiased for large $n$
  + Straightforward handling of hierarchical structures
• Two major problems
  - Need to fit the model $n$ times
  - Need to evaluate predictive densities $n$ times
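To make the leave-one-out estimator concrete, here is a minimal sketch for a toy conjugate model in which each of the $n$ leave-one-out posteriors is available in closed form. The model (normal likelihood with known `sigma`, normal prior on the mean) and the function name `exact_elpd_loo` are assumptions for illustration, not part of the slides. For models without closed-form posteriors, each of these $n$ fits would be a separate inference run, which is the cost the slide points to.

```python
import numpy as np


def log_normal_pdf(x, mean, var):
    """Log density of N(mean, var) evaluated at x."""
    return -0.5 * (np.log(2.0 * np.pi * var) + (x - mean) ** 2 / var)


def exact_elpd_loo(y, sigma=1.0, prior_mean=0.0, prior_sd=10.0):
    """Exact elpd_loo = (1/n) * sum_i log p_M(y_i | y_{-i}) for the assumed model
    y_i ~ N(mu, sigma^2) with sigma known, and mu ~ N(prior_mean, prior_sd^2)."""
    n = len(y)
    # Posterior of mu given y_{-i}: same precision for every i,
    # and a mean that uses the sum of the other n - 1 observations.
    post_prec = 1.0 / prior_sd**2 + (n - 1) / sigma**2
    post_mean = (prior_mean / prior_sd**2 + (y.sum() - y) / sigma**2) / post_prec
    # Posterior predictive for the held-out point: N(post_mean, sigma^2 + 1/post_prec).
    pred_var = sigma**2 + 1.0 / post_prec
    return log_normal_pdf(y, post_mean, pred_var).mean()
```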
Our contributions: Method

$$\mathrm{elpd}_{\mathrm{loo}} = \frac{1}{n}\sum_{i=1}^{n} \log p_M(y_i \mid y_{-i})$$

• We propose a fast approximation for $\mathrm{elpd}_{\mathrm{loo}}$
  1 Approximate the full data posterior $q_M(\theta \mid y)$ using Variational Bayes/Laplace
  2 Compute $p_M(y_i \mid y_{-i})$ using importance sampling with $q_M$ as proposal
  3 Subsample the sum over $n$ using the Hansen-Hurwitz estimator
• Solves both problems with leave-one-out CV
  1 Only need to fit the model once on the full data set
  2 Predictive distributions $p_M(y_i \mid y_{-i})$ are only needed for a small subset
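The steps above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: the draws `mu_draws` are assumed to come from the exact full-data posterior of a toy normal-mean model (standing in for $q_M$), so the importance ratios reduce to $1 / p_M(y_i \mid \theta_s)$; the raw ratios are used without any smoothing or stabilization; and the sampling probabilities default to uniform. All function names are hypothetical.

```python
import numpy as np


def log_lik(y_i, mu, sigma):
    """Pointwise log likelihood log p(y_i | mu) for the assumed normal model."""
    return -0.5 * (np.log(2.0 * np.pi * sigma**2) + (y_i - mu) ** 2 / sigma**2)


def logmeanexp(a):
    """Numerically stable log(mean(exp(a)))."""
    m = a.max()
    return m + np.log(np.mean(np.exp(a - m)))


def is_loo_log_pred(y_i, mu_draws, sigma):
    """Importance-sampling estimate of log p(y_i | y_{-i}).
    Assumes mu_draws are draws from the full-data posterior, so that
    p(y_i | y_{-i}) is approximated by 1 / mean_s(1 / p(y_i | mu_s))."""
    return -logmeanexp(-log_lik(y_i, mu_draws, sigma))


def subsampled_elpd_loo(y, mu_draws, sigma, m, rng, probs=None):
    """Hansen-Hurwitz estimate of elpd_loo from m observations drawn
    with replacement with probabilities `probs` (uniform if None)."""
    n = len(y)
    if probs is None:
        probs = np.full(n, 1.0 / n)
    idx = rng.choice(n, size=m, replace=True, p=probs)
    terms = np.array([is_loo_log_pred(y[i], mu_draws, sigma) for i in idx])
    # Hansen-Hurwitz: the average of term / probability estimates the sum over all n.
    total_hat = np.mean(terms / probs[idx])
    return total_hat / n
```

Only `m` of the `n` pointwise predictive densities are evaluated, and the model is fit once to obtain `mu_draws`. Non-uniform `probs` can reduce the variance of the subsampled estimate when they track the size of the individual terms.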
Our contributions: Results
• Theoretical results (under regularity conditions):

$$\widehat{\mathrm{elpd}}_{\mathrm{loo}} \xrightarrow{\;p\;} \mathrm{elpd}_{\mathrm{loo}} \quad \text{for } n \to \infty$$

• Extensive empirical results
  1 Variational Bayes, Laplace approx., MCMC
  2 Bayesian linear regression
  3 Hierarchical models
• For more details, come see us at poster #231
• Thank you for listening!
