SLIDE 1

Gaussian Process Regression with Mismatched Models & Can GP Regression Be Made Robust Against Model Mismatch?

Peter Sollich, NeurIPS 2002 & International Workshop on Deterministic and Statistical Methods in Machine Learning (2004)

SLIDE 2

Learning curve

Ideal learning curve: average generalization error as a function of the number of training examples n.

  • Performance is measured on the true input distribution
  • and averaged over multiple training datasets (see the sketch below)
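
To make this concrete, here is a minimal Python sketch of estimating a learning curve by Monte Carlo: for each training-set size n, draw several training datasets, fit some estimator, and average its squared error over test points drawn from the true input distribution. The sinusoidal target and the nearest-neighbour predictor are placeholder choices for illustration, not anything from the talk.

    import numpy as np

    rng = np.random.default_rng(0)
    f = lambda x: np.sin(2 * np.pi * x)   # placeholder target function
    x_test = rng.uniform(0, 1, 500)       # test points ~ true input distribution

    for n in [5, 10, 20, 40, 80]:
        errs = []
        for _ in range(50):               # average over training datasets
            x_tr = rng.uniform(0, 1, n)
            y_tr = f(x_tr) + 0.1 * rng.standard_normal(n)
            # placeholder estimator: 1-nearest-neighbour regression
            idx = np.abs(x_test[:, None] - x_tr[None, :]).argmin(axis=1)
            errs.append(np.mean((y_tr[idx] - f(x_test)) ** 2))
        print(f"n={n:3d}  mean generalization error = {np.mean(errs):.4f}")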

SLIDE 3

What is GP regression?

y = f(x) + Ξ·, where Ξ· ~ N(0, σ²); we want to estimate f. Put a GP prior on f:

  • Cov[f(x_j), f(x_k)] = K(x_j, x_k)
  • E[f(x)] = 0

Why GP regression?

  • Error bars
  • Posterior available analytically (requires O(n³) computation; see the sketch below)
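
For reference, a minimal self-contained sketch of exact GP regression with an RBF kernel (all settings here are illustrative): the posterior mean is the estimate of f, the posterior variance supplies the error bars, and the Cholesky factorization is the O(nΒ³) step.

    import numpy as np

    def rbf(A, B, l=0.3):
        return np.exp(-0.5 * ((A[:, None] - B[None, :]) / l) ** 2)

    def gp_posterior(X, y, X_star, noise_var=0.01):
        # zero-mean GP prior + Gaussian noise: the posterior is Gaussian too
        K = rbf(X, X) + noise_var * np.eye(len(X))
        L = np.linalg.cholesky(K)                    # the O(n^3) step
        alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
        Ks = rbf(X_star, X)
        mean = Ks @ alpha                            # posterior mean
        v = np.linalg.solve(L, Ks.T)
        var = rbf(X_star, X_star).diagonal() - np.sum(v ** 2, axis=0)
        return mean, var                             # var -> error bars

    rng = np.random.default_rng(1)
    X = rng.uniform(0, 1, 20)
    y = np.sin(2 * np.pi * X) + 0.1 * rng.standard_normal(20)
    X_star = np.linspace(0, 1, 5)
    mean, var = gp_posterior(X, y, X_star)
    for x_s, m, s in zip(X_star, mean, np.sqrt(var)):
        print(f"f({x_s:.2f}) β‰ˆ {m:+.3f} Β± {2 * s:.3f}")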
SLIDE 4

Mismatched model?

Input to GP regression: kernel K, noise level σ²

  • What if we use the wrong ones?

Setting:

  • Assume p(x) known: uniform on a line or hypercube
  • Theory exact for d β†’ ∞, otherwise all kinds of approximations
  • Assume a stationary kernel: K(x, xβ€²) = g(x βˆ’ xβ€²)
SLIDE 5

Weird learning curves

  • Plateaus, or an arbitrary number of overfitting maxima

Hypercube, d = 10, assumed noise level too small (1e-4, 1e-3, …) while the true level is 1. On the 1D line, a noise level chosen too low produces a plateau. A simulation sketch of this effect follows.
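The sketch below reproduces the flavour of this experiment in 1D (the hypercube version is analogous): the target is drawn from a rough OU kernel with true noise variance 1, while the GP models it with a smoother RBF kernel and an assumed noise variance of only 1e-4. All kernel parameters are illustrative; with settings like these the averaged error typically stalls near the true noise level, or even degrades over a range of n, instead of decaying steadily.

    import numpy as np

    rng = np.random.default_rng(2)

    def rbf(A, B, l=0.2):
        return np.exp(-0.5 * ((A[:, None] - B[None, :]) / l) ** 2)

    def ou(A, B, l=0.2):
        # Ornstein-Uhlenbeck kernel: much rougher sample paths than RBF
        return np.exp(-np.abs(A[:, None] - B[None, :]) / l)

    x_test = np.linspace(0, 1, 200)
    true_noise = 1.0       # true noise variance
    model_noise = 1e-4     # assumed noise variance: far too small

    for n in [8, 16, 32, 64, 128]:
        errs = []
        for _ in range(30):
            x = np.concatenate([rng.uniform(0, 1, n), x_test])
            # draw the target jointly on train+test points from the OU prior
            f = np.linalg.cholesky(ou(x, x) + 1e-8 * np.eye(len(x))) @ rng.standard_normal(len(x))
            y = f[:n] + np.sqrt(true_noise) * rng.standard_normal(n)
            # mismatched GP: wrong (too smooth) kernel, wrong (too small) noise level
            K = rbf(x[:n], x[:n]) + model_noise * np.eye(n)
            mu = rbf(x_test, x[:n]) @ np.linalg.solve(K, y)
            errs.append(np.mean((mu - f[n:]) ** 2))
        print(f"n={n:4d}  avg test MSE = {np.mean(errs):.3f}")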

SLIDE 6

Asymptotic problems

  • If the true kernel (OU, MB2) is less smooth than the chosen kernel (RBF):
  • No asymptotic O(1/n) decay of the error, as there is for parametric models; the decay is much slower (logarithmically slow); see the sketch below
  • The prior cannot be overwhelmed by the data (it is too strong)
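
One way to probe this empirically is to estimate the local exponent of the learning curve: fit log(error) against log(n) and check whether the slope comes anywhere near -1 (the parametric O(1/n) rate). A minimal sketch, assuming a rough OU truth, a too-smooth RBF model, and a correctly specified noise level so that only the kernel is mismatched; all settings illustrative.

    import numpy as np

    rng = np.random.default_rng(4)

    def rbf(A, B, l=0.2):
        return np.exp(-0.5 * ((A[:, None] - B[None, :]) / l) ** 2)

    def ou(A, B, l=0.2):
        return np.exp(-np.abs(A[:, None] - B[None, :]) / l)

    x_test = np.linspace(0, 1, 200)
    noise = 0.01           # same noise level in truth and model; only the kernel is wrong
    ns = np.array([16, 32, 64, 128, 256])
    avg = []
    for n in ns:
        errs = []
        for _ in range(20):
            x = np.concatenate([rng.uniform(0, 1, n), x_test])
            f = np.linalg.cholesky(ou(x, x) + 1e-8 * np.eye(len(x))) @ rng.standard_normal(len(x))
            y = f[:n] + np.sqrt(noise) * rng.standard_normal(n)
            K = rbf(x[:n], x[:n]) + noise * np.eye(n)   # RBF model for an OU truth
            mu = rbf(x_test, x[:n]) @ np.linalg.solve(K, y)
            errs.append(np.mean((mu - f[n:]) ** 2))
        avg.append(np.mean(errs))
    slope = np.polyfit(np.log(ns), np.log(avg), 1)[0]
    print(f"empirical learning-curve exponent β‰ˆ {slope:.2f} (parametric rate would be -1)")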
SLIDE 7

Fix?

  • But maybe we just chose very bad hyperparameters?
  • A true Bayesian approach is too expensive… what about evidence maximization?
  • Maximize the evidence P(D) = ∫ P(D|θ) P(θ) dθ with respect to the hyperparameters
  • Setting: assume a wrong kernel, but tune σ², a, l (noise level, amplitude, length scale) using the evidence
  • All kinds of approximations are needed to make the analysis tractable… (a brute-force numerical sketch follows)
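
The hard part in the papers is the analytical treatment; a brute-force numerical version of evidence maximization is short to write down. A minimal sketch, assuming an RBF kernel and reading the tuned hyperparameters as noise variance σ², amplitude a, and length scale l; the data and starting values are placeholders. For a GP with Gaussian noise the evidence integral is itself Gaussian, so the negative log evidence can be evaluated exactly via a Cholesky factorization.

    import numpy as np
    from scipy.optimize import minimize

    rng = np.random.default_rng(3)
    X = rng.uniform(0, 1, 40)
    y = np.sin(2 * np.pi * X) + 0.3 * rng.standard_normal(40)  # placeholder data

    def neg_log_evidence(log_params):
        # -log P(D) = 0.5 y^T K^{-1} y + 0.5 log|K| + (n/2) log(2*pi)
        sigma2, a, l = np.exp(log_params)  # log parametrization keeps all three positive
        K = a * np.exp(-0.5 * ((X[:, None] - X[None, :]) / l) ** 2) + sigma2 * np.eye(len(X))
        L = np.linalg.cholesky(K)
        alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
        return 0.5 * y @ alpha + np.log(np.diag(L)).sum() + 0.5 * len(X) * np.log(2 * np.pi)

    res = minimize(neg_log_evidence, x0=np.log([0.1, 1.0, 0.3]), method="Nelder-Mead")
    sigma2, a, l = np.exp(res.x)
    print(f"evidence-optimal hyperparameters: sigma^2={sigma2:.3f}, a={a:.3f}, l={l:.3f}")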
SLIDE 8

Hypercube analysis

  • If the hyperparameters can be tuned to give Bayes-optimal performance, evidence maximization will get it.
  • No overfitting maxima
  • If those hyperparameter values cannot be reached (for example, l β†’ ∞), convergence is still very slow
  • No experiments?
SLIDE 9

1D case

  • True kernel = MB2
  • Kernel used = as shown in the plot
  • No maxima or plateaus
  • Optimal rate achieved