> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
More on kernels
Marcel Lüthi
Graphics and Vision Research Group Department of Mathematics and Computer Science University of Basel
More on kernels Marcel Lthi Graphics and Vision Research Group - - PowerPoint PPT Presentation
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE More on kernels Marcel Lthi Graphics and Vision Research Group Department of Mathematics and Computer Science University of Basel > DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Marcel Lüthi
Graphics and Vision Research Group Department of Mathematics and Computer Science University of Basel
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Integral and differential equations
society (1950): 337-404.
Numerical analysis, Approximation and Interpolation theory
methods." Acta Numerica 15 (2006): 543-639.
Media, 1999.
2
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
machine learning." The annals of statistics (2008): 1171-1220.
Quarterly of applied mathematics 56.4 (1998): 617-694.
3
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
to be able to:
such structure
mathematically “nice”.
4
ML Statistics Image analysis Numerics Differential equations
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Vector-valued (this course)
fields: 𝑣: 𝒴 → ℝ𝑒
Scalar-valued (more common)
functions 𝑔 ∶ 𝒴 → ℝ
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Vector-valued (this course)
𝑣 ∼ 𝐻𝑄 Ԧ 𝜈, 𝒍 Ԧ 𝜈: 𝒴 → ℝ𝑒 𝒍: 𝒴 × 𝒴 → ℝ𝑒×𝑒
Scalar-valued (more common)
𝑔 ∼ 𝐻𝑄 𝜈, 𝑙 𝜈: 𝒴 → ℝ 𝑙: 𝒴 × 𝒴 → ℝ
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Matrix-valued kernels can be reinterpreted as scalar-valued kernels: Matrix valued kernel: 𝒍: 𝒴 × 𝒴 → ℝ𝒆×𝒆 Scalar valued kernel: 𝑙: 𝒴 × 1. . 𝑒 × 𝒴 × 1. . 𝑒 → ℝ Bijection: Define 𝑙( 𝑦, 𝑗 , 𝑦′, 𝑘 = 𝒍 𝑦′, 𝑦′ 𝑗,𝑘
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
8
𝑳 = 𝑙11 𝑦1, 𝑦1 𝑙12 𝑦1, 𝑦1 𝑙21 𝑦1, 𝑦1 𝑙22 𝑦1, 𝑦1 … 𝑙11 𝑦1, 𝑦𝑜 𝑙12 𝑦1, 𝑦𝑜 𝑙21 𝑦1, 𝑦𝑜 𝑙22 𝑦1, 𝑦𝑜 ⋮ ⋮ 𝑙11 𝑦𝑜, 𝑦1 𝑙12 𝑦𝑜, 𝑦1 𝑙21 𝑦𝑜, 𝑦1 𝑙22 𝑦𝑜, 𝑦1 … 𝑙11 𝑦𝑜, 𝑦𝑜 𝑙12 𝑦𝑜, 𝑦𝑜 𝑙21 𝑦𝑜, 𝑦𝑜 𝑙22 𝑦𝑜, 𝑦𝑜
𝐿 = 𝑙 (𝑦1, 1), (𝑦1, 1) 𝑙 (𝑦1, 1), (𝑦1, 2) 𝑙 𝑦1, 2 , (𝑦1, 1) 𝑙 𝑦1, 2 , (𝑦1, 2) … 𝑙 (𝑦1, 1), (𝑦𝑜, 1) 𝑙 (𝑦1, 1), (𝑦𝑜, 2) 𝑙 𝑦1, 2 , (𝑦𝑜, 1) 𝑙 𝑦1, 2 , (𝑦𝑜, 2) ⋮ ⋮ 𝑙 (𝑦𝑜, 1), (𝑦1, 1) 𝑙 (𝑦𝑜, 1), (𝑦1, 2) 𝑙 𝑦𝑜, 2 , (𝑦1, 1) 𝑙 𝑦𝑜, 2 , (𝑦1, 2) … 𝑙 (𝑦𝑜, 1), (𝑦𝑜, 1) 𝑙 (𝑦𝑜, 1), (𝑦𝑜, 2) 𝑙 𝑦𝑜, 2 , (𝑦𝑜, 1) 𝑙 𝑦𝑜, 2 , (𝑦𝑜, 2)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Matrix-valued kernels can be reinterpreted as scalar-valued kernels: Matrix valued kernel: 𝒍: 𝒴 × 𝒴 → ℝ𝒆×𝒆 Scalar valued kernel: 𝑙: 𝒴 × 1. . 𝑒 × 𝒴 × 1. . 𝑒 → ℝ Bijection: Define 𝑙( 𝑦, 𝑗 , 𝑦′, 𝑘 = 𝒍 𝑦′, 𝑦′ 𝑗,𝑘
All the theory developed for the scalar-valued GPs holds also for vector-valued GPs!
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Sampling from 𝐻𝑄 𝜈, 𝑙 is done using the corresponding normal distribution 𝑂( Ԧ 𝜈, K) Algorithm (slightly inefficient)
𝜈 + 𝑉𝐸𝛽
11
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
K𝑉𝐸−1 = 𝑉𝐸
𝑡 = Ԧ 𝜈 + 𝑉𝐸𝛽 = Ԧ 𝜈 + K𝑉𝐸−1𝛽 corresponds to linear combinations of the columns of K.
12
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
13
σ = 1
𝑙 𝑦, 𝑦′ = exp − 𝑦 − 𝑦′ 2 𝜏2
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
14
𝑙 𝑦, 𝑦′ = exp − 𝑦 − 𝑦′ 2 𝜏2
σ = 3
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
𝑦′ 1 2
+ 0.1 exp − 𝑦 −
𝑦′ 0.1 2
15
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
cos 𝑦 sin(𝑦)
‖𝑦 −𝑦′‖ 𝜏2
)
16
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
17
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
1 1+exp( −𝑦)
18
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
f x = x
19
𝑙 𝑦, 𝑦′ = 𝑔 𝑦 𝑔 𝑦′
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
20
𝑙 𝑦, 𝑦′ = 𝑔 𝑦 𝑔 𝑦′ f x = sin(x)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
{f1 x = x, f2 x = sin(x)}
21
𝑙 𝑦, 𝑦′ =
𝑗
𝑔
𝑗 𝑦 𝑔 𝑗(𝑦′)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
𝐼 = {𝑔|𝑔 𝑦 =
𝑗=1 𝑂
𝛽𝑗𝑙 𝑦, 𝑦𝑗 , 𝑜 ∈ ℕ, 𝑦𝑗 ∈ 𝑌, 𝛽𝑗 ∈ ℝ} For 𝑔 𝑦 = σ𝑗 𝛽𝑗𝑙 𝑦𝑗, 𝑦 and 𝑦 = σ𝑘 𝛽𝑘
′𝑙(𝑦𝑘, 𝑦) we define the
inner product 𝑔, 𝑙 =
𝑗,𝑘
𝛽𝑗𝛽𝑘
′𝑙(𝑦𝑗, 𝑦𝑘)
The space H called a Reproducing Kernel Hilbert Space (RKHS).
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
𝑙 𝑦, 𝑦′ = exp − 𝑦 − 𝑦′ 2 9
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
Observations: {(𝑦1, 𝑧1), … , 𝑦𝑜, 𝑧𝑜 }
compute p(𝑧∗|𝑦∗, 𝑦1, … , 𝑦𝑜, 𝑧1, … , 𝑧𝑜)
25
𝑦1 𝑦2 𝑦𝑜 𝑦∗ 𝑧∗
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
𝜈𝑞(𝑦∗) = 𝐿 𝑦∗, 𝑌 𝐿 𝑌, 𝑌 + 𝜏2𝐽 −1𝑧 𝑙𝑞 𝑦∗, 𝑦∗′ = 𝑙 𝑦∗, 𝑦∗′ − 𝐿 𝑦∗, 𝑌 𝐿 𝑌, 𝑌 + 𝜏2𝐽 −1𝐿 𝑌, 𝑦∗
′
26
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
27
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
28
Gaussian kernel (𝜏 = 1)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
29
Gaussian kernel (𝜏 = 5)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
30
Periodic kernel
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
31
Changepoint kernel
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
32
Symmetric kernel
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
33
Linear kernel
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
38
𝑙𝑞 𝑦∗, 𝑦∗′ = 𝑙 𝑦∗, 𝑦∗′ − 𝐿 𝑦∗, 𝑌 𝐿 𝑌, 𝑌 + 𝜏2𝐽 −1𝐿 𝑌, 𝑦∗
′
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
39
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
40