Gaussian Process Summer School
Kernel Design
Nicolas Durrande – PROWLER.io (nicolas@prowler.io)
Sheffield, September 2019
Kernel Design Nicolas Durrande PROWLER.io (nicolas@prowler.io) - - PowerPoint PPT Presentation
Gaussian Process Summer School Kernel Design Nicolas Durrande PROWLER.io (nicolas@prowler.io) Sheffield, September 2019 Second Introduction to GPs and GP Regression 2 / 77 The pdf of a Gaussian random variable is: 0.4 0.3
Sheffield, September 2019
2 / 77
2 4 0.0 0.1 0.2 0.3 0.4 x density
3 / 77
1 2 3
1 2 3 Y1 Y2
5
5 10 Y1 Y2
5
5 Y1 Y2 4 / 77
x
1
x2 density
5 / 77
x1 x2 fY
µc √Σc
6 / 77
3
7 / 77
22 (Y2 − µ2)
22 Σ21
8 / 77
1 1
x
4 4
Y(x)
9 / 77
n
n
10 / 77
squared exp. k(x, y) = σ2 exp
2θ2
k(x, y) = σ2
√ 5|x − y| θ + 5|x − y|2 3θ2
√ 5|x − y| θ
k(x, y) = σ2
√ 3|x − y| θ
√ 3|x − y| θ
k(x, y) = σ2 exp
θ
k(x, y) = σ2 min(x, y) white noise k(x, y) = σ2δx,y constant k(x, y) = σ2 linear k(x, y) = σ2xy
11 / 77
0.2 0.0 0.2 0.4 0.6 0.8 1.0
Matern12 k(x, 0.0) Matern32 k(x, 0.0) Matern52 k(x, 0.0) RBF k(x, 0.0)
0.2 0.0 0.2 0.4 0.6 0.8 1.0
RationalQuadratic k(x, 0.0) Constant k(x, 0.0) White k(x, 0.0) Cosine k(x, 0.0)
2 2 0.2 0.0 0.2 0.4 0.6 0.8 1.0
Periodic k(x, 0.0)
2 2
Linear k(x, 1.0)
2 2
Polynomial k(x, 1.0)
2 2
ArcCosine k(x, 0.0)
12 / 77
3 2 1 1 2 3
Matern12 Matern32 Matern52 RBF
3 2 1 1 2 3
RationalQuadratic Constant White Cosine
2 2 3 2 1 1 2 3
Periodic
2 2
Linear
2 2
Polynomial
2 2
ArcCosine
13 / 77
1 1
x
5 4
f
14 / 77
1 1
x
4 4
Y(x)
15 / 77
16 / 77
17 / 77
1 1
x
4 4
Y(x)|Y(X) = F
18 / 77
1 1
x
4 4
Y(x)|Y(X) = F
19 / 77
20 / 77
21 / 77
0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 x Z(x)|Z(X) + N(X) = F 0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 x Z(x)|Z(X) + N(X) = F 0.0 0.2 0.4 0.6 0.8 1.0
1 2 3 4 x Z(x)|Z(X) + N(X) = F
22 / 77
23 / 77
p
24 / 77
25 / 77
26 / 77
0.0 0.2 0.4 0.6 0.8 1.0
1 2 x Z(x)|Z(X) = F
27 / 77
28 / 77
standardised residuals Density
1 2 3 0.0 0.1 0.2 0.3 0.4 0.5 0.6
1 2
1 2 3
Normal Q-Q Plot
Theoretical Quantiles Sample Quantiles
29 / 77
30 / 77
0.0 0.2 0.4 0.6 0.8 1.0
1
31 / 77
0.0 0.2 0.4 0.6 0.8 1.0
1
32 / 77
0.0 0.2 0.4 0.6 0.8 1.0
1
33 / 77
0.0 0.2 0.4 0.6 0.8 1.0
1
34 / 77
standardised residuals Density
1 2 3 0.0 0.1 0.2 0.3 0.4
0.0 0.5 1.0 1.5
1 2
Normal Q-Q Plot
Theoretical Quantiles Sample Quantiles
35 / 77
36 / 77
37 / 77
38 / 77
39 / 77
n
n
40 / 77
41 / 77
0.0 0.0
42 / 77
1 (1+ω2)p
43 / 77
0.0
− →
0.0
44 / 77
1 2 3 4 5 6 4 2 2 4 6
45 / 77
46 / 77
◮ On the same space k(x, y) = k1(x, y) + k2(x, y) ◮ On the tensor space k(x, y) = k1(x1, y1) + k2(x2, y2)
◮ On the same space k(x, y) = k1(x, y) × k2(x, y) ◮ On the tensor space k(x, y) = k1(x1, y1) × k2(x2, y2)
◮ k(x, y) = k1(f (x), f (y))
47 / 77
3 2 1 1 2 3 0.04 0.02 0.00 0.02 0.04
Matern12 k(x, 0.03)
3 2 1 1 2 3 0.04 0.02 0.00 0.02 0.04
Linear k(x, 0.03)
3 2 1 1 2 3 0.04 0.02 0.00 0.02 0.04
Sum k(x, .03)
48 / 77
3 2 1 1 2 3 2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0
Z1(x)
3 2 1 1 2 3 2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0
Z2(x)
3 2 1 1 2 3 2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0
Z(x)
49 / 77
1960 1970 1980 1990 2000 2010 2020 2030 320 340 360 380 400 420 440
50 / 77
1960 1970 1980 1990 2000 2010 2020 2030 2040 600 400 200 200 400 600 1950 1960 1970 1980 1990 2000 2010 2020 2030 2040 300 320 340 360 380 400 420 440 460 480
51 / 77
1950 1960 1970 1980 1990 2000 2010 2020 2030 2040 300 320 340 360 380 400 420 440 460 480
52 / 77
1950 1960 1970 1980 1990 2000 2010 2020 2030 2040 300 320 340 360 380 400 420 440 460 480
52 / 77
0x2y2 + krbf 1(x, y) + krbf 2(x, y) + kper(x, y)
1950 1960 1970 1980 1990 2000 2010 2020 2030 2040 300 320 340 360 380 400 420 440 460
53 / 77
0x2y2 + krbf 1(x, y) + krbf 2(x, y) + kper(x, y)
1950 1960 1970 1980 1990 2000 2010 2020 2030 2040 300 320 340 360 380 400 420 440 460
53 / 77
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.5 1.0 1.5
54 / 77
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 2 1 1 2 3 4 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 3 2 1 1 2 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5
55 / 77
56 / 77
0.0 0.2 0.4 0.6 0.8 1.0
x1
0.0 0.2 0.4 0.6 0.8 1.0
x2
0.0 0.2 0.4 0.6 0.8 1.0
x1
0.0 0.2 0.4 0.6 0.8 1.0
x2
57 / 77
0.0 0.2 0.4 0.6 0.8 1.0
x1
0.0 0.2 0.4 0.6 0.8 1.0
x2
58 / 77
59 / 77
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
60 / 77
61 / 77
x and a Matérn 3/2 kernel
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
0.0 0.2 0.4 0.6 0.8 1.0 3 2 1 1 2 3
62 / 77
x and
0.0 0.2 0.4 0.6 0.8 1.0 10 20 30 40 50 60 70
0.0 0.2 0.4 0.6 0.8 1.0 40 20 20 40
63 / 77
64 / 77
65 / 77
66 / 77
0.0 0.2 0.4 0.6 0.8 1.0 −2 −1 1 2 3 x Y
0.0 0.2 0.4 0.6 0.8 1.0 −2 −1 1 2 3 x Y
67 / 77
H Hsym f L1f L2f
68 / 77
69 / 77
d
70 / 77
Z Z0
71 / 77
d
d
d
72 / 77
73 / 77
75 / 77
◮ What is the overall evaluation budget? ◮ What is my model for?
◮ Maximum likelihood ◮ Cross-validation ◮ Multi-start
◮ Test set ◮ Leave-one-out to check mean and confidence intervals ◮ Leave-k-out to check predicted covariances
76 / 77
77 / 77