Advance Stochastic Gradient with Variance Reduction
Jingchang Liu December 7, 2017
University of Science and Technology of China 1
Advance Stochastic Gradient with Variance Reduction Jingchang Liu - - PowerPoint PPT Presentation
Advance Stochastic Gradient with Variance Reduction Jingchang Liu December 7, 2017 University of Science and Technology of China 1 Table of Contents Introductions Control Variates Antithetic Sampling Stratified Sampling Important Sampling
University of Science and Technology of China 1
2
n
3
n
n
4
Var(Y )
5
k→0 E vk2 = 0
6
2(Xi + Xj)
2(Xi + Xj) to estimate µ
2(Xi + Xj)) = Var(µ) = 0 7
′ i w
i w yix ′
i
′ i w
i w yix ′
i ,
′ j w
j w yjx ′
j
′ i w
i w yix ′
i
′ j w
j w yjx ′
j
′
i yjx
′
j , equal hold. 8
n n
2 w2 equals to
n
n n
y,z L (y, z, α)
n
zi {fi (zi) − αizi} + inf y
n
n
i (−αi) − λ
n
9
n
i = − 1
n
i
l =
l
l
l
i − αt−1 i
l
l
l
10
11
bh b = nh n = Wh
WhSh
L
WhSh
NhSh
L
NhSh
12
n
n
∇fi(w) npt
i
i ,
i=1 pt i = 1
13
pt E
it
pt
n
i
i =
j=1 ∇fj(w t)
i =
j=1 Lj 14
15
16
17