using learning dynamics to understand neural network
play

Using Learning Dynamics to Understand Neural Network Generalisation - PowerPoint PPT Presentation

Using Learning Dynamics to Understand Neural Network Generalisation Arnu Pretorius Supervisor: Dr. Steve Kroon (Computer Science ) Co-supervisor: Dr. Herman Kamper (EE Engineering + ) 20 October 2017 Maties Machine Learning, Stellenbosch


  1. Using Learning Dynamics to Understand Neural Network Generalisation Arnu Pretorius ∗ Supervisor: Dr. Steve Kroon (Computer Science ∗ ) Co-supervisor: Dr. Herman Kamper (EE Engineering + ) 20 October 2017 Maties Machine Learning, Stellenbosch University

  2. The success of Deep Learning Scale of deep learning deployment at Google over time. Large-scale deep learning for intelligent computer systems , Dean. BayLearn keynote speech, 2015. 1

  3. The mysteries of Deep Learning • Generalisation • The role of explicit versus implicit regularisation • Non-convex optimisation • Adversarial examples • and more ... Understanding deep learning requires rethinking generalization , Zhang, Bengio, Hardt, Recht, Vinyals. ICLR, 2017. On large-batch training for deep learning: generalization gap and sharp minima , Keskar, Mudigere, Smelyanskiy, Tang. ICLR, 2017. Intriguing Properties of Neural Networks , Szegedy, Zaremba, Sutskever, Bruna, Erhan, Goodfellow, Fergus. ICLR, 2014. 2

  4. Approaches Towards Understanding Neural Networks Approaches towards understanding neural networks. [1] Understanding the difficulty of training deep feedforward neural networks , Glorot, Bengio. AISTATS, 2010. [2] Sharp minima can generalize for deep nets , Dinh, Pascanu, Bengio, Bengio. arXiv 2017. [3] Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , Saxe, McClelland, Ganguli. ICLR, 2014. [4] On the Expressive Power of Deep Neural Networks , Raghu, Poole, Kleinberg, Ganguli, Dickstein. ICML, 2017. 3

  5. Scalar linear neural network Let w 2 , w 1 , x ∈ R , y = w 2 w 1 x ˆ (1) Scalar linear neural network. 4

  6. Scalar learning dynamics Loss surface (1 − w 2 w 1 ) 2 , GD path (orange), learning dynamics (green). 5

  7. Exact solutions for scalar learning dynamics The learning dynamics for a scalar linear neural network starting from small initial values u 0 ≡ w 2(0) w 1(0) is given by yE u f ( t ) = x ( E − 1) + y / u 0 , (2) where E = e 2 yxt α , α is the learning rate and t is measured in epochs. Simulated versus theoretical learning dynamics. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , Saxe, McClelland, Ganguli. ICLR, 2014. 6

  8. Learning dynamics for deep linear networks Linear neural network learning dynamics. Learning hierarchical category structure in deep neural networks , Saxe, McClelland, Ganguli. CogSci, 2013. 7

  9. Our current focus 3.5 No regularisation Simulated 3.0 Theory 2.5 Mode strength 2.0 1.5 • The role of regularisation 1.0 0.5 – Weight decay 0.0 0 200 400 600 800 1000 Epoch (t) ( s − λ ∗ ) e 2( s − λ ∗ ) t /τ u ( t , s , u 0 , λ ∗ ) = e 2( s − λ ∗ ) t /τ − 1 + ( s − λ ∗ ) / u 0 – Dropout 3.5 No regularisation � s � τ Dropout τ � � 3.0 Ed ∼ Bern ( p )[ t ] ≤ ln = O ǫ sp sp 2.5 Mode strength 2.0 • Autoencoder networks 1.5 • Generalisation 1.0 0.5 0.0 0 200 400 600 800 1000 Epoch (t) 8

  10. Summary • Deep learning has been hugely successful in solving large and complex machine learning task, however many mysteries remain. • Better understanding deep neural networks might be achieved via several different routes. • Studying the learning dynamics of neural networks may help us understand how neural networks learn. • We hope to use this learning dynamics approach to study generalisation in deep neural networks. 9

  11. Thank you for listening! 9

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend