Learning with Approximate Kernel Embeddings
Dino Sejdinovic
Department of Statistics University of Oxford
RegML Workshop, Simula, Oslo, 06/05/2017
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 1 / 18
Learning with Approximate Kernel Embeddings Dino Sejdinovic - - PowerPoint PPT Presentation
Learning with Approximate Kernel Embeddings Dino Sejdinovic Department of Statistics University of Oxford RegML Workshop, Simula, Oslo, 06/05/2017 D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 1 / 18
Department of Statistics University of Oxford
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 1 / 18
1
2
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 1 / 18
1
2
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 1 / 18
1
2
i=1 αik(x, xi) and their pointwise limits.
−6 −4 −2 2 4 6 8 −0.4 −0.2 0.2 0.4 0.6 0.8 1
x f(x)
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 2 / 18
[Cortes & Vapnik, 1995; Schölkopf & Smola, 2001]
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 3 / 18
[Cortes & Vapnik, 1995; Schölkopf & Smola, 2001]
[Smola et al, 2007; Sriperumbudur et al, 2010]
[Gretton et al, 2005; Gretton et al, 2006; Fukumizu et al, 2007; DS et al, 2013; Muandet et al, 2012; Szabo et al, 2015]
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 3 / 18
6 4 2 2 4 6 0.4 0.2 0.0 0.2 0.4 0.6 0.8 1.0
f∈Hk: fHk ≤1
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 4 / 18
6 4 2 2 4 6 0.4 0.2 0.0 0.2 0.4 0.6 0.8 1.0
f∈Hk: fHk ≤1
1 2σ2 x − x′2 2), Matérn family, inverse multiquadrics.
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 4 / 18
within-sample average similarity – between-sample average similarity
k(dogi, fishj) k(fishi, fishj) k(dogi, dogj) k(fishj, dogi)
Figure by Arthur Gretton
[Gretton et al, 2009, Gretton et al, 2012]
Ghahramani, 2015; Kim, Khanna & Koyejo, 2016]
2015+]
DS, 2015]
Wood, 2016]
Roy & Ghahramani, 2015; Sutherland et al, 2017]
k (P, Q) = EX,X′i.i.d. ∼ P k(X, X′) + EY ,Y ′i.i.d. ∼ Qk(Y , Y ′) − 2EX∼P ,Y ∼Qk(X, Y ).
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 5 / 18
1 0.7 0.3 0.1 0.3 0.8 1 1 1 1 1 1 1 0.3 0.1 0.1 0.3 0.2 0.2 0.4 0.2 0.2 0.5 0.3 0.2
cor vs. dcor
X Y Dependence witness and sample −1.5 −1 −0.5 0.5 1 1.5 −1.5 −1 −0.5 0.5 1 1.5 −0.04 −0.03 −0.02 −0.01 0.01 0.02 0.03 0.04 0.05
Figure by Arthur Gretton
Hκ
!" #" !"
#"
!" #" !" #"
!" #" !" #"
[Szekely et al, 2007]: HSIC with Brownian
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 6 / 18
x1
1
x2
1
x3
1
µ1 x1
3
x2
3
x3
3
x4
3
x5
3
µ3 µw
3
µm
3
women men µ2 x1
2
x2
2
x3
2 % vote for Obama feature space
region 1 region 2 region 3 both y1 y2 y3 ? ?
Figure from Flaxman et al, 2015 Figure from Mooij et al, 2014
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 7 / 18
1
2
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 7 / 18
j=1 – assumed to come from a
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 8 / 18
Rp exp
Rp
j=1 ∼ Λ and use a Monte Carlo estimator of
m
j x
j y
j x
j y
m
1 x
1 x
2015].
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 9 / 18
Hk =
Rd |ϕP (ω) − ϕQ (ω)|2 dΛ (ω) ,
j=1 ∼ Λ:
1 x
1 x
mx
mx
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 10 / 18
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 11 / 18
[Delaigle and Hall, 2016] construct density estimators for nonparametric deconvolution,
Rd |ρX (ω) − ρY (ω)|2 dΛ (ω)
j x
j x
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 12 / 18
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 13 / 18
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 13 / 18
i=1 that will give us a
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 14 / 18
i.i.d.
2016].
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 15 / 18
figure from Wang et al, 2012
various levels of noise, using the Fourier and phase neural network and GKKR averaged over 100 runs. Here noise-to-signal ratio σ is varied between 0 and 3.0, and the 5th and the 95th percentile is shown.
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 16 / 18
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 17 / 18
D.Sejdinovic (University of Oxford) Learning with Kernel Embeddings Oslo, 06/05/2017 18 / 18