Introduction to Nonparametric Bayesian Modeling and Gaussian Process Regression
Piyush Rai
- Dept. of CSE, IIT Kanpur
(Mini-course: lecture 3) Nov 07, 2015
Piyush Rai (IIT Kanpur) Nonparametric Bayesian Modeling and Gaussian Process Regression 1
All ML problems require estimating parameters given data. There are primarily two views:

The optimization view: the parameter θ is a fixed unknown. We seek a point estimate (single best answer) for θ:
θ̂ = arg min_θ Loss(D; θ)
subject to constraints on θ. Probabilistic methods such as MLE and MAP also fall in this category.

The Bayesian view: the parameter θ is a random variable with a prior distribution P(θ). We seek a posterior distribution over the parameters:
P(θ | D) = P(D | θ) P(θ) / P(D)
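As a toy illustration of the Bayesian update (not from the slides): for a coin with unknown bias θ, a conjugate Beta prior and Bernoulli observations give a closed-form posterior:

```python
# Toy conjugate example (illustration only): coin bias theta ~ Beta(a, b),
# data D = n1 heads and n0 tails. Conjugacy gives the posterior in closed form:
#   P(theta | D) = Beta(a + n1, b + n0)
a, b = 2.0, 2.0      # prior pseudo-counts (prior belief: theta around 0.5)
n1, n0 = 7, 3        # observed heads and tails

post_a, post_b = a + n1, b + n0
post_mean = post_a / (post_a + post_b)   # E[theta | D] = 9/14
mle = n1 / (n1 + n0)                     # point-estimate view: theta_hat = 0.7
print(post_mean, mle)
```

Note how the posterior mean shrinks the point estimate toward the prior mean 0.5; with more data the two estimates converge.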
The prior distribution specifies our prior belief/knowledge about the parameters θ. Bayesian inference updates the prior using the data and gives the posterior.
The posterior P(θ | D) quantifies uncertainty in the parameters.

More robust predictions by averaging over the posterior P(θ | D), rather than plugging in a point estimate θ̂:
P(d_test | θ̂)  vs  P(d_test | D) = ∫ P(d_test | θ) P(θ | D) dθ

Allows inferring hyperparameters of the model and doing model comparison.

Offers a natural way for informed data acquisition (active learning): can use the predictive posterior of unseen data points to guide data selection.

Can do nonparametric Bayesian modeling.
How big/complex should my model be? How many parameters suffice? Model selection or cross-validation can answer this, but is often expensive and impractical.

Nonparametric Bayesian models allow an unbounded number of parameters: the model can grow/shrink adaptively as we observe more and more data, and we “let the data speak” about how complex the model needs to be.
An NPBayes model is NOT a model with no parameters! It has a potentially infinite (unbounded) number of parameters, and the ability to “create” new parameters if the data requires it.

Some non-Bayesian models are also nonparametric, e.g., nearest-neighbor regression/classification, kernel SVMs, and kernel density estimation.

NPBayes models offer the benefits of both Bayesian modeling and nonparametric modeling.
Some modeling problems and NPBayes models of choice (table courtesy: Zoubin Ghahramani):
- Function estimation (regression/classification): Gaussian Process
- Mixture modeling (clustering): Dirichlet Process / Chinese Restaurant Process
- Latent factor modeling: Beta Process / Indian Buffet Process
A Gaussian Process (GP) is a distribution over functions f:
f ∼ GP(µ, Σ)
.. such that f’s values at any finite set of points x1, . . . , xN are jointly Gaussian:
{f(x1), f(x2), . . . , f(xN)} ∼ N(µ, K)

If µ = 0, a GP is fully specified by its covariance (kernel) matrix K. The covariance matrix is defined by a kernel function k(xn, xm). Some examples:
- Gaussian (RBF) kernel: k(xn, xm) = exp(−‖xn − xm‖² / 2σ²)
- A more general family: k(xn, xm) = v0 exp(−(‖xn − xm‖ / r)^α) + v1 + v2 δnm

GP-based modeling also allows learning the kernel hyperparameters from data.
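A short sketch (toy inputs and bandwidth assumed, not from the slides) of drawing random functions from a zero-mean GP prior with the Gaussian kernel:

```python
import numpy as np

# Sketch: evaluate a zero-mean GP prior with Gaussian (RBF) kernel at N points.
# Any finite set of function values is jointly Gaussian: f ~ N(0, K).
def rbf_kernel(x1, x2, sigma=1.0):
    sqdist = (x1[:, None] - x2[None, :]) ** 2
    return np.exp(-sqdist / (2.0 * sigma ** 2))

x = np.linspace(-3.0, 3.0, 50)                 # finite set of input points
K = rbf_kernel(x, x) + 1e-8 * np.eye(50)       # small jitter for numerical stability

rng = np.random.default_rng(0)
f_samples = rng.multivariate_normal(np.zeros(50), K, size=3)  # three random functions
print(f_samples.shape)  # (3, 50)
```

Smaller bandwidths σ produce wigglier sample functions; this is exactly the kind of hyperparameter a GP can learn from data.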
Left: some functions drawn from a GP prior N(0, K). Right: the posterior over these functions after observing 5 examples {xn, yn}.
Training data: {xn, yn}, n = 1, . . . , N. The response is a noisy function of the input:
yn = f(xn) + εn

Assume a zero-mean Gaussian error p(ε | σ²) = N(ε | 0, σ²). This leads to a Gaussian likelihood model for the responses:
p(yn | f(xn)) = N(yn | f(xn), σ²)

Denote y = [y1, . . . , yN]⊤ ∈ R^N and f = [f(x1), . . . , f(xN)]⊤ ∈ R^N, and write
p(y | f) = N(y | f, σ²I_N)

In GP regression, we assume f is drawn from a GP: p(f) = N(f | 0, K)
The likelihood model: p(y | f) = N(y | f, σ²I_N). The prior distribution: p(f) = N(f | 0, K). The marginal distribution over the responses y:
p(y) = ∫ p(y | f) p(f) df = N(y | 0, σ²I_N + K)
Recall, the marginal distribution over the responses y = [y1, . . . , yN]:
p(y) = N(y | 0, σ²I_N + K) = N(y | 0, C_N)

Adding the response y∗ of a new test point x∗:
p([y, y∗]) = N([y, y∗] | 0, C_{N+1})

where the (N + 1) × (N + 1) matrix C_{N+1} is given by
C_{N+1} = [ C_N  k∗ ; k∗⊤  c ]
with k∗ = [k(x∗, x1), . . . , k(x∗, xN)]⊤ and c = k(x∗, x∗) + σ²
Recall p([y, y∗]) = N([y, y∗] | 0, C_{N+1}). The predictive distribution will be
p(y∗ | y) = p([y, y∗]) / p(y) = N(y∗ | m(x∗), σ²(x∗))

m(x∗) = k∗⊤ C_N⁻¹ y
σ²(x∗) = c − k∗⊤ C_N⁻¹ k∗

Note that for GP regression, exact inference is possible at test time!
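The predictive equations above can be sketched in a few lines (toy one-dimensional data; the kernel bandwidth and noise level are arbitrary illustrative choices):

```python
import numpy as np

# GP regression prediction:
#   m(x*)      = k*^T C_N^{-1} y
#   sigma2(x*) = c - k*^T C_N^{-1} k*
# with C_N = sigma^2 I + K, k* = [k(x*, x_1), ..., k(x*, x_N)]^T, c = k(x*, x*) + sigma^2.
def rbf(a, b, bw=1.0):
    return np.exp(-(a[:, None] - b[None, :]) ** 2 / (2.0 * bw ** 2))

X = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])   # training inputs (toy)
y = np.sin(X)                               # training responses (toy)
sigma2 = 0.1                                # noise variance (assumed)

C_N = rbf(X, X) + sigma2 * np.eye(len(X))
x_star = np.array([0.5])
k_star = rbf(X, x_star)[:, 0]               # k* = [k(x*, x_1), ..., k(x*, x_N)]
c = rbf(x_star, x_star)[0, 0] + sigma2      # prior variance at x*, plus noise

mean = k_star @ np.linalg.solve(C_N, y)           # predictive mean m(x*)
var = c - k_star @ np.linalg.solve(C_N, k_star)   # predictive variance sigma^2(x*)
print(mean, var)
```

Solving the linear system C_N v = y is preferred over forming C_N⁻¹ explicitly; exact inference costs O(N³) in the number of training points, which motivates the approximations mentioned later.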
Let’s look at the predictions made by GP regression:
p(y∗ | y) = N(y∗ | m(x∗), σ²(x∗)),  m(x∗) = k∗⊤ C_N⁻¹ y,  σ²(x∗) = c − k∗⊤ C_N⁻¹ k∗

Two interpretations for the mean prediction m(x∗):

An SVM-like interpretation: m(x∗) = k∗⊤ C_N⁻¹ y = k∗⊤ α = Σ_{n=1}^N k(x∗, xn) αn, where α = C_N⁻¹ y is akin to the weights of the support vectors.

A nearest-neighbors interpretation: m(x∗) = k∗⊤ C_N⁻¹ y = w⊤ y = Σ_{n=1}^N wn yn, where w = C_N⁻¹ k∗ is akin to the weights of the neighbors.
Recall, the marginal distribution over the responses y = [y1, . . . , yN]:
p(y | σ², θ) = N(y | 0, σ²I_N + K_θ)

We can maximize the (log) marginal likelihood w.r.t. σ² and the kernel hyperparameters θ to get point estimates of the hyperparameters:
log p(y | σ², θ) = −(1/2) log |σ²I_N + K_θ| − (1/2) y⊤ (σ²I_N + K_θ)⁻¹ y + const

Note: one can also put hyperpriors on the hyperparameters and infer them in a fully Bayesian manner.
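A minimal sketch of evaluating this log marginal likelihood on synthetic data; the grid over bandwidths is an illustrative stand-in for gradient-based optimization of θ and σ²:

```python
import numpy as np

# log p(y | sigma^2, theta) = -1/2 log|C| - 1/2 y^T C^{-1} y + const,
# where C = sigma^2 I_N + K_theta (the constant term is dropped below).
def rbf(a, b, bw):
    return np.exp(-(a[:, None] - b[None, :]) ** 2 / (2.0 * bw ** 2))

def log_marginal(y, X, bw, sigma2):
    C = rbf(X, X, bw) + sigma2 * np.eye(len(X))
    _, logdet = np.linalg.slogdet(C)
    return -0.5 * logdet - 0.5 * y @ np.linalg.solve(C, y)

rng = np.random.default_rng(1)
X = np.linspace(-3.0, 3.0, 30)
y = np.sin(X) + 0.1 * rng.standard_normal(30)   # synthetic data, noise std 0.1

bandwidths = [0.01, 0.1, 1.0, 10.0]
scores = [log_marginal(y, X, bw, sigma2=0.01) for bw in bandwidths]
best_bw = bandwidths[int(np.argmax(scores))]
print(best_bw)
```

The marginal likelihood automatically penalizes both overly wiggly (tiny bandwidth) and overly rigid (huge bandwidth) kernels, trading data fit against model complexity.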
GPs are not limited to regression: non-binary labels (multiclass, counts, etc.) can also be easily handled.
The objective function of a soft-margin SVM looks like
(1/2)‖w‖² + C Σ_{n=1}^N (1 − yn fn)+
where fn = w⊤xn and yn is the true label for xn.

Kernel SVM: fn = Σ_{m=1}^N αm k(xn, xm). Denote f = [f1, . . . , fN]⊤.

We can write ‖w‖² = α⊤Kα = f⊤K⁻¹f, and the kernel SVM objective becomes
(1/2) f⊤K⁻¹f + C Σ_{n=1}^N (1 − yn fn)+

The negative log joint probability of a GP model (the GP prior p(f|X) times the likelihood) can be written as
(1/2) f⊤K⁻¹f − Σ_{n=1}^N log p(yn | fn) + const
Thus GPs can be interpreted as a Bayesian analogue of kernel SVMs.

Both GPs and SVMs need to deal with (store/invert) large kernel matrices. Various approximations have been proposed to address this issue (applicable to both).

The ability to learn the kernel hyperparameters in a GP is very useful, e.g.:
- Learning the kernel bandwidth for Gaussian kernels: k(xn, xm) = exp(−‖xn − xm‖² / 2σ²)
- Learning a separate bandwidth per dimension (automatic relevance determination): k(xn, xm) = exp(−Σ_{d=1}^D (xnd − xmd)² / 2σd²)
- Combining multiple kernels: K = K_{θ1} + K_{θ2} + . . .
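The per-dimension bandwidth (ARD-style) kernel above can be sketched as follows (toy two-dimensional inputs assumed); a very large learned σd effectively switches dimension d off:

```python
import numpy as np

# ARD-style Gaussian kernel: k(xn, xm) = exp(-sum_d (xnd - xmd)^2 / (2 sigma_d^2)).
def ard_kernel(X1, X2, sigmas):
    diff = X1[:, None, :] - X2[None, :, :]                      # shape (N1, N2, D)
    return np.exp(-np.sum(diff ** 2 / (2.0 * np.asarray(sigmas) ** 2), axis=-1))

X = np.array([[0.0, 0.0],
              [1.0, 5.0]])
# Huge bandwidth on dimension 2: the gap of 5 there is effectively ignored,
# so the kernel value is driven by dimension 1 alone (exp(-0.5) ~ 0.61).
K = ard_kernel(X, X, sigmas=[1.0, 1e6])
print(K[0, 1])
```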
Nonlinear Dimensionality Reduction: Gaussian Process Latent Variable Models.
Bayesian Optimization: optimizing functions that have an unknown functional form and are expensive to evaluate.
Deep Gaussian Processes: the data are assumed to be the output of a multivariate GP, the inputs to each GP are the outputs of another GP, and so on.
Many applications: robotics and control, vision, spatial statistics, and so on.
Book: Gaussian Processes for Machine Learning (freely available online)
MATLAB packages, useful to play with, build applications, and extend existing models and inference algorithms for GPs (both regression and classification):
- GPML: http://www.gaussianprocess.org/gpml/code/matlab/doc/
- GPStuff: http://research.cs.aalto.fi/pml/software/gpstuff/
Nonparametric Bayesian models for mixture modeling (clustering): Dirichlet Processes and the Chinese Restaurant Process.
Nonparametric Bayesian models for latent factor modeling (dimensionality reduction): Beta Processes and the Indian Buffet Process.