The landscape of non-convex losses for statistical learning problems
Song Mei
Stanford University
October 19, 2017
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 1 / 32
The landscape of non-convex losses for statistical learning problems - - PowerPoint PPT Presentation
The landscape of non-convex losses for statistical learning problems Song Mei Stanford University October 19, 2017 Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 1 / 32 Deep learning Song Mei
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 1 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 2 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 2 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 3 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 4 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 5 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 6 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 6 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 7 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 8 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 8 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 8 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 9 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 10 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 10 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 10 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 10 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 11 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 11 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 11 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 12 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 12 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 12 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 12 / 32
Number of iterations
20 40 60 80 100 120 140 160 180 200
std
10-6 10-5 10-4 10-3 10-2 10-1
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 12 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 13 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 13 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 13 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 14 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 15 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 16 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 16 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 16 / 32
1 2 3
1 2 3
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 17 / 32
1 2 3
1 2 3
1 2 3
1 2 3
θ0 = [1, 0] ˆ θn = [0.816, −0.268]
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 17 / 32
1 2 3
1 2 3
1 2 3
1 2 3
θ0 = [1, 0] ˆ θn = [0.816, −0.268]
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 17 / 32
1 2 3
1 2 3
1 2 3
1 2 3
θ0 = [1, 0] ˆ θn = [0.816, −0.268]
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 17 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 18 / 32
Risk ¡ Empirical ¡risk Empirical ¡risk Empirical ¡risk Barriers ¡ Many ¡local ¡mins ¡ Risk ¡global ¡min Risk ¡ ERM ¡ Risk ¡global ¡min ERM ¡ Good ¡local ¡mins Smooth ¡far ¡from ¡mins ¡ Risk ¡global ¡min ERM ¡ Uniform ¡smooth ¡ surface ¡
Risk ¡
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 19 / 32
0.5 1 1.5 2 2.5 3 3.5
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 20 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 21 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 22 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 23 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 23 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 23 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 24 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 24 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 24 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 25 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 25 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 25 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 26 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 26 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 26 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 26 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 26 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 26 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 27 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 27 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 27 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 28 / 32
Gap =
1 k−1
SDP(A) + SDP(−A)
nε/2
a saddle point with ε curvature global optimizer a local optimizer
SDP(A) −SDP(−A)
Gap Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 29 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 30 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 30 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 31 / 32
Song Mei (Stanford University) The landscape of non-convex optimization October 19, 2017 32 / 32