Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
Lecture 1: Introduction to RKHS
MLSS Cadiz, 2016
Gatsby Unit, CSML, UCL
May 12, 2016
Lecture 1: Introduction to RKHS
Lecture 1: Introduction to RKHS MLSS Cadiz, 2016 Gatsby Unit, CSML, - - PowerPoint PPT Presentation
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression Lecture 1: Introduction to RKHS MLSS Cadiz, 2016 Gatsby Unit, CSML, UCL May 12, 2016 Lecture 1: Introduction to RKHS Feature space Basics of reproducing
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
Gatsby Unit, CSML, UCL
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−5 −4 −3 −2 −1 1 2 3 4 5 −5 −4 −3 −2 −1 1 2 3 4 5
x1 x2
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−0.5 0.5 1 1.5 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 −0.5 0.5 1 1.5 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 −0.5 0.5 1 1.5 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 Hilbert space 2 Kernel (lots of examples: e.g. you can build kernels from
3 Reproducing property Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 Linear: α1f1 + α2f2, gH = α1 f1, gH + α2 f2, gH 2 Symmetric: f , gH = g, f H 3 f , f H ≥ 0 and f , f H = 0 if and only if f = 0. Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 Linear: α1f1 + α2f2, gH = α1 f1, gH + α2 f2, gH 2 Symmetric: f , gH = g, f H 3 f , f H ≥ 0 and f , f H = 0 if and only if f = 0.
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 Linear: α1f1 + α2f2, gH = α1 f1, gH + α2 f2, gH 2 Symmetric: f , gH = g, f H 3 f , f H ≥ 0 and f , f H = 0 if and only if f = 0.
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 (x)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 (x)
2 (x)φ2(x′)
1 (x′)
1 (x′)φ1(x)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
ℓ2 = ∞
i < ∞.
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
ℓ2 = ∞
i < ∞.
∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
e1(x) e2(x) e3(x)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
1 Find a feature map? 1
2
2 A direct property of the function: positive definiteness. Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
n
n
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
n
n
n
n
H
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
n
n
n
n
n
n
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−5 −4 −3 −2 −1 1 2 3 4 5 −5 −4 −3 −2 −1 1 2 3 4 5
x1 x2
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
⊤
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
∞
∞
i < ∞.
e1(x) e2(x) e3(x) Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
m
−6 −4 −2 2 4 6 8 −0.4 −0.2 0.2 0.4 0.6 0.8 1
x f(x)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
m
∞
−6 −4 −2 2 4 6 8 −0.4 −0.2 0.2 0.4 0.6 0.8 1
x f(x)
i=1 αiφℓ(xi)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
m
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
m
f (·)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
m
f (·)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
m
f (·)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
e1(x) e2(x) e3(x) Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
m
∞
i=1 αi
−6 −4 −2 2 4 6 8 −0.4 −0.2 0.2 0.4 0.6 0.8 1
x f(x)
j < ∞.
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
∞
∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 0.1 0.2 0.3 0.4 0.5
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 0.1 0.2 0.3 0.4 0.5
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 −0.2 0.2 0.4 0.6
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 −0.2 0.2 0.4 0.6
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 −0.2 0.2 0.4 0.6
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 −0.2 0.2 0.4 0.6
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
x f (x) Top hat
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 −0.2 0.2 0.4 0.6
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
ϑ is the Jacobi theta function, close to Gaussian when σ2 sufficiently narrower than [−π, π].
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.1 0.1 0.2 0.3 0.4 0.5 0.6
x k(x) Jacobi Theta
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 0.05 0.1 0.15 0.2
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.1 0.1 0.2 0.3 0.4 0.5 0.6
x k(x) Jacobi Theta
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 0.05 0.1 0.15 0.2
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.1 0.1 0.2 0.3 0.4 0.5 0.6
x k(x) Jacobi Theta
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 0.05 0.1 0.15 0.2
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
−4 −2 2 4 −0.1 0.1 0.2 0.3 0.4 0.5 0.6
x k(x) Jacobi Theta
−4 −2 2 4 −1 −0.5 0.5 1
t cos(ℓ × x) Basis function
−10 −5 5 10 0.05 0.1 0.15 0.2
ℓ ˆ fℓ Fourier series coefficients Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
∞
∞
∞
∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
H = f , f H = ∞
H < ∞.
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
H = f , f H = ∞
H < ∞.
∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
H = f , f H = ∞
H < ∞.
∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
H f H
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression What is a kernel? Constructing new kernels Positive definite functions Reproducing kernel Hilbert space
f ∈H
H
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=0.6
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=10, σ=0.6
−0.5 0.5 1 1.5 −1 −0.5 0.5 1 1.5
λ=1e−07, σ=0.6 Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−0.5 0.5 1 1.5 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 −0.5 0.5 1 1.5 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 −0.5 0.5 1 1.5 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
f ∈H
H
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
n
n
−6 −4 −2 2 4 6 8 −0.4 −0.2 0.2 0.4 0.6 0.8 1
x f(x)
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
f ∈H J(f ),
H
i=1 Iyif (xi)≤0
i=1(yi − f (xi))2
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
f ∈H
H
n
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
i=1 αik(xi, ·).
H = fs2 H + f⊥2 H ≥ fs2 H ,
H
H
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
n
n
H
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
∞
∞
H = f , f H = ∞
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
∞
H = ∞
i .
e1(x) e2(x) e3(x) Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
f ∈H
H
Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=0.6 Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=0.6
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=10, σ=0.6
−0.5 0.5 1 1.5 −1 −0.5 0.5 1 1.5
λ=1e−07, σ=0.6 Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=0.6 Lecture 1: Introduction to RKHS
Feature space Basics of reproducing kernel Hilbert spaces Kernel Ridge Regression
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=0.6
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=2
−0.5 0.5 1 1.5 −1 −0.5 0.5 1
λ=0.1, σ=0.1 Lecture 1: Introduction to RKHS