SLIDE 4 Motivation
Deep Learning
Gradient-based methods create many difficulties [Shalev-Shwartz et al. 2017]:
Vanishing gradients
Getting stuck in local minima
Hyperparameter tuning
Image from: https://towardsdatascience.com/gradient-descent-algorithm-and-its-variants-10f652806a3
Current ways of mitigating these issues:
Noise
Image from: [He et al. 2015]
Architecture Design
Regularization
Image from: [Srivastava et al. 2014]
[Sutskever 2013]
These mitigations do not always work
12/7/2019 Benchmarking the ATM Algorithm on the BBOB 2009 Noiseless Function Testbed 4 of 34