On the Impact of the Activation Function on Deep Neural Networks Training
Soufiane Hayou
University of Oxford soufiane.hayou@stats.ox.ac.uk
Soufiane Hayou (OxCSML) University of Oxford 1 / 16
On the Impact of the Activation Function on Deep Neural Networks - - PowerPoint PPT Presentation
On the Impact of the Activation Function on Deep Neural Networks Training Soufiane Hayou University of Oxford soufiane.hayou@stats.ox.ac.uk Soufiane Hayou (OxCSML) University of Oxford 1 / 16 Overview Neural Networks as Gaussian Processes
Soufiane Hayou (OxCSML) University of Oxford 1 / 16
Soufiane Hayou (OxCSML) University of Oxford 2 / 16
w
Soufiane Hayou (OxCSML) University of Oxford 3 / 16
1 When Nl−1 is large, yl
Soufiane Hayou (OxCSML) University of Oxford 4 / 16
1 When Nl−1 is large, yl
2 Stronger result : when Nl = +∞ for all l (recursively), yl
Soufiane Hayou (OxCSML) University of Oxford 4 / 16
1 Variance propagation : ql = F(ql−1)
2 Correlation propagation : cl+1 = fl(cl)
b+σ2 wE[φ(√
aZ1)φ(√
b(xZ1+
a
b Soufiane Hayou (OxCSML) University of Oxford 5 / 16
Soufiane Hayou (OxCSML) University of Oxford 6 / 16
Soufiane Hayou (OxCSML) University of Oxford 6 / 16
Soufiane Hayou (OxCSML) University of Oxford 6 / 16
Soufiane Hayou (OxCSML) University of Oxford 7 / 16
Soufiane Hayou (OxCSML) University of Oxford 8 / 16
Soufiane Hayou (OxCSML) University of Oxford 9 / 16
w
Soufiane Hayou (OxCSML) University of Oxford 10 / 16
Soufiane Hayou (OxCSML) University of Oxford 11 / 16
Soufiane Hayou (OxCSML) University of Oxford 12 / 16
Soufiane Hayou (OxCSML) University of Oxford 13 / 16
Soufiane Hayou (OxCSML) University of Oxford 14 / 16
Soufiane Hayou (OxCSML) University of Oxford 15 / 16
Soufiane Hayou (OxCSML) University of Oxford 16 / 16