SLIDE 43 Experiments: Logistic regression COVERTYPE dataset
100 101 102 103 104 105 106
iteration n
10−4 10−3 10−2 10−1
f(θn) − f(θ * ) Params: r = 1/2, q = 1.5, thresh = 0.4
averaged 1/R2√n averaged C/R2√n distance-based SGD restarts 100 101 102 103 104 105 106
iteration n
10−6 10−5 10−4 10−3 10−2 10−1 100
||θn − θrestart||2 Evolution of the distance-based statistic
restarts
100 101 102 103 104 105 106
iteration n
10−4 10−3 10−2 10−1
f(θn) − f(θ * ) Params: r = 1/4, q = 2, thresh = 0.9
averaged 1/R2√n averaged C/R2√n distance-based SGD restarts 100 101 102 103 104 105 106
iteration n
10−10 10−8 10−6 10−4 10−2 100
||θn − θrestart||2 Evolution of the distance-based statistic
restarts
39 / 47