SLIDE 1
Aaron Mishkin
Research Goal: reliable and easy-to-use optimizers for ML.
1⁄10
Research Goal : reliable and easy-to-use optimizers for ML. 1 10 - - PowerPoint PPT Presentation
Aaron Mishkin Research Goal : reliable and easy-to-use optimizers for ML. 1 10 Challenges in Optimization for ML Stochastic gradient methods are the most popular algorithms for fitting ML models, w k +1 = w k k SGD: f ( w k ) .
Aaron Mishkin
1⁄10
2⁄10
n
i=1 fi(w). We say f satisfies
3⁄10
4⁄10
5⁄10
6⁄10
7⁄10
50 100 150 200 250 300 350
Iterations
10
10
10
4
Training Loss
Synthetic Matrix Fac.
Adam SGD + Armijo Nesterov + Armijo
8⁄10
9⁄10
10⁄10