SLIDE 1
Adaptive primal-dual stochastic gradient methods
Yangyang Xu Mathematical Sciences, Rensselaer Polytechnic Institute October 26, 2019
1 / 22
Adaptive primal-dual stochastic gradient methods Yangyang Xu - - PowerPoint PPT Presentation
Adaptive primal-dual stochastic gradient methods Yangyang Xu Mathematical Sciences, Rensselaer Polytechnic Institute October 26, 2019 1 / 22 Stochastic gradient method stochastic program: F ( x ; ) min x X f ( x ) = E N
1 / 22
2 / 22
3 / 22
AdaGrad Adam tuned SGD
4 / 22
5 / 22
6 / 22
7 / 22
8 / 22
9 / 22
10 / 22
11 / 22
12 / 22
13 / 22
14 / 22
15 / 22
16 / 22
17 / 22
18 / 22
10 20 30 40 50 number of epochs 10-6 10-4 10-2 100 102
PDSG-nonadp PDSG-adp CSA mirror-prox 10 20 30 40 50 number of epochs 10-6 10-4 10-2 100 102 average feasibility residual PDSG-nonadp PDSG-adp CSA mirror-prox
19 / 22
20 / 22
21 / 22
22 / 22