Reproducing Kernel Hilbert Spaces Lorenzo Rosasco 9.520 Class 03 - PowerPoint PPT Presentation

Reproducing Kernel Hilbert Spaces Lorenzo Rosasco 9.520 Class 03 L. Rosasco RKHS

About this class Goal To introduce a particularly useful family of hypothesis spaces called Reproducing Kernel Hilbert Spaces (RKHS) We will discuss several perspectives on RKHS. In particular in this class we investigate the fundamental definition of RKHS as Hilbert spaces with bounded, continuous evaluation functionals and the intimate connection with symmetric positive definite kernels. L. Rosasco RKHS

Plan Part I: RKHS are Hilbert spaces with bounded, continuous evaluation functionals. Part II: Reproducing Kernels Part III: Mercer Theorem Part IV: Feature Maps Part V: Representer Theorem L. Rosasco RKHS

Regularization The basic idea of regularization (originally introduced independently of the learning problem) is to restore well-posedness of ERM by constraining the hypothesis space H . Regularization A possible way to do this is considering regularized empirical risk minimization, that is we look for solutions minimizing a two term functional ERR ( f ) + λ R ( f ) � �� empirical error regularizer the regularization parameter λ trade-offs the two terms. L. Rosasco RKHS

Tikhonov Regularization Tikhonov regularization amounts to minimize n 1 � V ( f ( x i ) , y i ) + λ R ( f ) λ > 0 (1) n i = 1 V ( f ( x ) , y ) is the loss function, that is the price we pay when we predict f ( x ) in place of y R ( f ) is a regularizer– often R ( f ) = � · � H , the norm in the function space H The regularizer should encode some notion of smoothness of f . L. Rosasco RKHS

The "Ingredients" of Tikhonov Regularization The scheme we just described is very general and by choosing different loss functions V ( f ( x ) , y ) we can recover different algorithms The main point we want to discuss is how to choose a norm encoding some notion of smoothness/complexity of the solution Reproducing Kernel Hilbert Spaces allow us to do this in a very powerful way L. Rosasco RKHS

Different Views on RKHS L. Rosasco RKHS

Part I: Evaluation Functionals L. Rosasco RKHS

Some Functional Analysis A function space F is a space whose elements are functions f , for example f : R d → R . A norm is a nonnegative function � · � such that ∀ f , g ∈ F and α ∈ R � f � ≥ 0 and � f � = 0 iff f = 0; 1 � f + g � ≤ � f � + � g � ; 2 � α f � = | α | � f � . 3 � A norm can be defined via a inner product � f � = � f , f � . A Hilbert space is a complete inner product space. L. Rosasco RKHS

Examples Continuous functions C [ a , b ] : a norm can be established by defining � f � = max a ≤ x ≤ b | f ( x ) | (not a Hilbert space!) Square integrable functions L 2 [ a , b ] : it is a Hilbert space where the norm is induced by the dot product � b � f , g � = f ( x ) g ( x ) dx a L. Rosasco RKHS

Hypothesis Space: Desiderata Hilbert Space. Point-wise defined functions. L. Rosasco RKHS

RKHS An evaluation functional over the Hilbert space of functions H is a linear functional F t : H → R that evaluates each function in the space at the point t , or F t [ f ] = f ( t ) . Definition A Hilbert space H is a reproducing kernel Hilbert space (RKHS) if the evaluation functionals are bounded and continuous, i.e. if there exists a M s.t. |F t [ f ] | = | f ( t ) | ≤ M � f � H ∀ f ∈ H L. Rosasco RKHS

Evaluation functionals Evaluation functionals are not always bounded. Consider L 2 [ a , b ] : Each element of the space is an equivalence class of � | f ( x ) | 2 dx . functions with the same integral An integral remains the same if we change the function in a countable set of points. L. Rosasco RKHS

Norms in RKHS and Smoothness Choosing different kernels one can show that the norm in the corresponding RKHS encodes different notions of smoothness. Band limited functions. Consider the set of functions H := { f ∈ L 2 ( R ) | F ( ω ) ∈ [ − a , a ] , a < ∞} with the usual L 2 inner product. the function at every point is given by the convolution with a sinc function sin ( ax ) / ax . The norm � a � � f � 2 f ( x ) 2 dx = | F ( ω ) | 2 d ω H = a � ∞ −∞ f ( t ) e − i ω t dt is the Fourier Where F ( ω ) = F{ f } ( ω ) = tranform of f . L. Rosasco RKHS

Norms in RKHS and Smoothness Sobolev Space: consider f : [ 0 , 1 ] → R with f ( 0 ) = f ( 1 ) = 0. The norm � � � f � 2 ( f ′ ( x )) 2 dx = ω 2 | F ( ω ) | 2 d ω H = Gaussian Space: the norm can be written as � 1 σ 2 ω 2 � f � 2 | F ( ω ) | 2 exp 2 d ω H = 2 π d L. Rosasco RKHS

Linear RKHS Our function space is 1-dimensional lines f ( x ) = w x where the RKHS norm is simply � f � 2 H = � f , f � H = w 2 so that our measure of complexity is the slope of the line. We want to separate two classes using lines and see how the magnitude of the slope corresponds to a measure of complexity. We will look at three examples and see that each example requires more "complicated functions, functions with greater slopes, to separate the positive examples from negative examples. L. Rosasco RKHS

Linear case (cont.) here are three datasets: a linear function should be used to separate the classes. Notice that as the class distinction becomes finer, a larger slope is required to separate the classes. 2 2 2 1.5 1.5 1.5 1 1 1 0.5 0.5 0.5 f(x) f(X) f(x) 0 0 0 − 0.5 − 0.5 − 0.5 − 1 − 1 − 1 − 1.5 − 1.5 − 1.5 − 2 − 2 − 2 − 2 − 1.5 − 1 − 0.5 0 0.5 1 1.5 2 − 2 − 1.5 − 1 − 0.5 0 0.5 1 1.5 2 − 2 − 1.5 − 1 − 0.5 0 0.5 1 1.5 2 x x x L. Rosasco RKHS

Part II: Kernels L. Rosasco RKHS

Different Views on RKHS L. Rosasco RKHS

Reproducing Kernel Hilbert Spaces Lorenzo Rosasco 9.520 Class 03 - PowerPoint PPT Presentation

Reproducing Kernel Hilbert Spaces Lorenzo Rosasco 9.520 Class 03 L. Rosasco RKHS About this class Goal To introduce a particularly useful family of hypothesis spaces called Reproducing Kernel Hilbert Spaces (RKHS) We will discuss several

Reproducing Kernel Hilbert Spaces for Classification Katarina Domijan and Simon P. Wilson

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Functional Gradient Motion Planning in Reproducing Kernel Hilbert Spaces RSS Robotics Science and

Econ 2148, fall 2019 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Counterfactual Policy Evaluation in Reproducing Kernel Hilbert Spaces Krikamol Muandet Max

Composition operators on some analytic reproducing kernel Hilbert spaces Jan Stochel (Uniwersytet

Stochastic optimization in Hilbert spaces Aymeric Dieuleveut Aymeric Dieuleveut Stochastic

A new weak Hilbert space Jess Surez de la Fuente, UEx Workshop on Banach spaces and Banach

Lecture 1: Introduction to RKHS MLSS Cadiz, 2016 Gatsby Unit, CSML, UCL May 12, 2016 Lecture 1:

Lecture 1: Introduction to RKHS MLSS Tbingen, 2015 Gatsby Unit, CSML, UCL July 22, 2015

Positive kernels and reproducing kernel spaces: a rich tapestry of settings and applications

Scalable Learning in Reproducing Kernel Kre n Spaces Dino Oglic 1 Thomas Grtner 2 1

On Hilbert IVth Problem Marc Troyanov (EPFL) SJTU, June 21, 2019 On Hilbert IVth Abstract

Tyrol Hill Park Phase 4 Elementary Campbell Elementary Campbell Park Spaces Open Park

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Stochastic constrained optimization in Hilbert spaces with applications Georg Ch. Pflug/C.

Program Survey Findings, Lessons Learned, and Next Steps 8/3/17 Michael Garringer Director of

Learning in Intelligent Systems October 14, 2016 Janyl Jumadinova Overview of Learning 2/19

1 Some quick stats 58 70 35GB players days 2 Round Challenge Points Classic Profiling

Welcome New Students! Study and Work in Canada January 2015 Presented by International

Group Dynamics What do we want our groups to DO? Group Dynamics are the Key Only see them

Introduction to the sheaf-theoretic approach to contextuality Samson Abramsky Department of

Clinical Engagement in BI Using clinical engagement to drive analytics solutions into everyday

Introduction to Privacy and Course organization Nataliia Bielova