Learning Step Size Controllers for Robust Neural Network Training - PowerPoint PPT Presentation

Aug 22, 2022 •268 likes •438 views

Learning Step Size Controllers for Robust Neural Network Training Christian Daniel et al. Recent Trends in Automated Machine Learning Abeeha Shafiq 18.07.2019 Motivation Optimizers are sensitive to initial learning rate Good

Learning Step Size Controllers for Robust Neural Network Training Christian Daniel et al. Recent Trends in Automated Machine Learning Abeeha Shafiq 18.07.2019
Motivation • Optimizers are sensitive to initial learning rate • Good learning rate is problem specific • Manual search required Image taken from I2DL lecture slide Abeeha Shafiq | Recent Trends in Automated Machine Learning 2
Previous Work • Waterfall scheme • Exponential/power scheme • TONGA Abeeha Shafiq | Recent Trends in Automated Machine Learning 3
Goal Develop an adaptive controller for the learning rate used in training algorithms such as Stochastic Gradient Descent (SGD) with Reinforcement Learning Abeeha Shafiq | Recent Trends in Automated Machine Learning 4
Contributions • Identifying informative features for controller • Proposing a learning setup for a controller • Showing that the resulting controller generalizes across different tasks and architectures. Abeeha Shafiq | Recent Trends in Automated Machine Learning 5
Problem statement for controller • Find the minimizer • F ( · ) sums over the function values induced by the individual inputs T ( · ) is an optimization operator which yields a weight update vector to find ω ∗ • • SGD weight update Abeeha Shafiq | Recent Trends in Automated Machine Learning 6
Learning a Controller Relative Entropy Policy Search (REPS) Concept similar to Proximal Policy Optimization Abeeha Shafiq | Recent Trends in Automated Machine Learning 7
Features • Informative about current state • Generalize across different tasks and architectures • Constrained by computation and memory limits
Features • Predictive change in function value. • Disagreement of function values. Abeeha Shafiq | Recent Trends in Automated Machine Learning 9
Mini Batch Setting • Discounted Average. • Smooths outliers • Serve as memory • Uncertainty Estimate • Estimate of noise in the system Abeeha Shafiq | Recent Trends in Automated Machine Learning 10
Experimental Setup • Datasets: MNIST, CIFAR-10 • Learning Algorithms: SGD and RMSProp • Model: CNN • For Learning Controller parameters: • Subset of MNIST • Small CNN architecture • π ( θ ) to a Gaussian with isotropic covariance Abeeha Shafiq | Recent Trends in Automated Machine Learning 11
Results • overhead of 36% for controller training • Generalized to different variants of CNN • Did not generalize to different training methods Abeeha Shafiq | Recent Trends in Automated Machine Learning 12
Static RMSProp vs Controlled RMSProp Abeeha Shafiq | Recent Trends in Automated Machine Learning 13
Static SGD vs Controlled SGD Abeeha Shafiq | Recent Trends in Automated Machine Learning 14
Discussion • Strengths: • Features • Not sensitive to initial learning rate • Effort to generalize • Weakness: • Tested on only 2 dataset • CNN only • Lacks comparison with • learning rate decay techniques • Grid search for initial learning rate This is a prior technique to learning the complete optimizer Abeeha Shafiq | Recent Trends in Automated Machine Learning 15
Questions?

Recommend

Demo (Step 1, Selection) Demo (Step 1, Optimization) Demo (Step 2, Selection) Demo (Step 2,

Demo (Step 1, Selection) Demo (Step 1, Optimization) Demo (Step 2, Selection) Demo (Step 2, Optimization) Demo (Step 3, Selection) Demo (Step 3, Optimization) Demo (Step 4, Selection) Demo (Step 4, Optimization) Demo (Step 5, Selection)

530 views • 19 slides

Quick guide Step 1: Purchasing an RSEvents! membership Step 2: Downloading RSEvents! Step 3:

Quick guide Step 1: Purchasing an RSEvents! membership Step 2: Downloading RSEvents! Step 3: Installing RSEvents! Step 4: Configure RSEvents! Step 5: Add user permissions Step 6: Create event categories Step 7: Create event locations Step 8:

629 views • 35 slides

Step by step guide Step 1: Purchasing an RSBlog! membership Step 2: Downloading RSBlog! Step 3:

Step by step guide Step 1: Purchasing an RSBlog! membership Step 2: Downloading RSBlog! Step 3: Installing RSBlog! 3.1 Installing the component 3.2 Installing a new language file Step 4: Import plugins (optional) 4.1 Import Joomla! articles

891 views • 67 slides

Step by step guide Step 1: Purchasing an RSEvents! membership Step 2: Downloading RSEvents! Step

Step by step guide Step 1: Purchasing an RSEvents! membership Step 2: Downloading RSEvents! Step 3: Installing RSEvents! Step 4: Configure RSEvents! 4.1 General settings 4.2 Dashboard settings 4.3 Events 4.3.1 Event general settings 4.3.2

1.29k views • 80 slides

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural Networks can represent complex decision boundaries decision boundaries Variable size. Any boolean function can be Variable size. Any boolean

359 views • 14 slides

Step Size Matters in Deep Learning Kamil Nar Shankar Sastry Neural Information Processing

Step Size Matters in Deep Learning Kamil Nar Shankar Sastry Neural Information Processing Systems December 4, 2018 Nar & Sastry Step Size Matters 1 Gradient Descent: Effect of Step Size Example min x R ( x 2 + 1)( x 1) 2 ( x

755 views • 18 slides

Step by step guide Step 1: Accessing the account Step 2: Download RSFiles! 2.1 Download the

Step by step guide Step 1: Accessing the account Step 2: Download RSFiles! 2.1 Download the component 2.2 Download Language files Step 3: Installing RSFiles! 3.1 Installing the component 3.2 Installing the language files Step 4: Update

508 views • 47 slides

Step 1 Step 2 Step 3 Step 4 Step 5 Preparation of a sketch Submission of birth map of all

Step 1 Step 2 Step 3 Step 4 Step 5 Preparation of a sketch Submission of birth map of all customary information forms to Preparation and adoption of Convening and submission Submission of Application Ste land owned by ILG obtain

1.23k views • 4 slides

Quick guide Step 1: Purchasing RSMail! Step 2: Download RSMail! Step 3: Installing RSMail! Step

Quick guide Step 1: Purchasing RSMail! Step 2: Download RSMail! Step 3: Installing RSMail! Step 4: RSMail! settings Step 5: Add Subscribers 5.1. Create subscriber lists 5.2. Add subscribers 5.2.1 Manual add 5.2.2 Import from CSV Step 6

628 views • 14 slides

Credential Assessment Mapping Privilege Escalation at Scale Matt Weeks @scriptjunkie1 Adversary

Credential Assessment Mapping Privilege Escalation at Scale Matt Weeks @scriptjunkie1 Adversary access (# boxes owned) 10000 1000 100 10 1 Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8 Step 9 Step 10 Adversary access (#

734 views • 45 slides

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural IR tasks Neural IR architecture Feature Representations Neural IR query auto completion Neural IR query suggestion Neural IR document

1.48k views • 18 slides

Lecture 16 Introduction to Controllers and PID Controllers Process Control Prof. Kannan M.

Lecture 16 Introduction to Controllers and PID Controllers Process Control Prof. Kannan M. Moudgalya IIT Bombay Tuesday, 27 August 2013 1/34 Process Control Introduction to controllers Outline 1. Recalling control loop components 2.

580 views • 34 slides

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

GESG seminar, 16 October 2015, UFM Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification identification identification of identification of of of switching regimes: switching regimes: switching regimes:

587 views • 27 slides

Short Course in Supervised Learning Robust Optimization and Machine Learning Robust Supervised

Robust Optimization & Machine Learning 6. Robust Optimization Short Course in Supervised Learning Robust Optimization and Machine Learning Robust Supervised Learning Motivations Examples Thresholding and robustness Boolean data

728 views • 48 slides

Step by step guide Step 1: Purchasing a RSMembership! membership Step 2: Download RSMembership!

Step by step guide Step 1: Purchasing a RSMembership! membership Step 2: Download RSMembership! 2.1. Download the component 2.2. Download RSMembership! language files Step 3: Installing RSMembership! 3.1: Installing the component 3.2:

798 views • 50 slides

Selection of Design Team Step 3 Design Step 4 June 2013 Project Management Concept

Selection of Design Team Step 3 Design Step 4 June 2013 Project Management Concept Project Management Concepts Step 1: Needs Development Step 2: Scope Development Step 3: Procurement of Design Team Step 4: Design Step 5:

835 views • 21 slides

Introduction to Machine Learning Linear Regression Prof. Andreas Krause Learning and Adaptive

Introduction to Machine Learning Linear Regression Prof. Andreas Krause Learning and Adaptive Systems (las.ethz.ch) Basic Supervised Learning Pipeline Training Data Test Data spam ? Learning Predic- ham Model method ?

569 views • 23 slides

Math 211 Math 211 Lecture #14 M ATLAB s ODE Solvers September 26, 2003 2 Matlab Solvers

1 Math 211 Math 211 Lecture #14 M ATLAB s ODE Solvers September 26, 2003 2 Matlab Solvers Matlab Solvers M ATLAB has several solvers. Return 2 Matlab Solvers Matlab Solvers M ATLAB has several solvers. ode45 Return 2 Matlab Solvers

855 views • 42 slides

On the steplength selection in Stochastic Gradient Methods Giorgia Franchini

Introduction Stochastic Gradient Methods and their properties A numerical experiment: the test problem Future developments On the steplength selection in Stochastic Gradient Methods Giorgia Franchini giorgia.franchini@unimore.it Universit

528 views • 19 slides

Generalized Approach for Analysing Quantum Key Distribution Experiments Arpita Maitra and Suvra

Generalized Approach for Analysing Quantum Key Distribution Experiments Arpita Maitra and Suvra Sekhar Das C R Rao AIMSCS & Indian Institute of Technology Kharagpur December 18, 2019 Generalized Approach for Analysing Quantum Key

501 views • 28 slides

CS257 Linear and Convex Optimization Lecture 10 Bo Jiang John Hopcroft Center for Computer

CS257 Linear and Convex Optimization Lecture 10 Bo Jiang John Hopcroft Center for Computer Science Shanghai Jiao Tong University November 9, 2020 Recap Strong convexity. f is m -strongly convex if 2 x 2 is convex f ( x ) m

540 views • 25 slides

CS 6316 Machine Learning Gradient Descent Yangfeng Ji Department of Computer Science University

CS 6316 Machine Learning Gradient Descent Yangfeng Ji Department of Computer Science University of Virginia Overview 1. Gradient Descent 2. Stochastic Gradient Descent 3. SGD with Momentum 4. Adaptive Learning Rates 1 Gradient Descent

772 views • 66 slides

Leamer Monoids and the Huneke-Wiegand Conjecture Roberto Carlos Pelayo Christopher ONeill

The Huneke-Wiegand Conjecture and Leamer Monoids Finding Irreducible Arithmetic Sequences Leamer Monoids and the Huneke-Wiegand Conjecture Roberto Carlos Pelayo Christopher ONeill Brian Wissman March 23, 2019 Roberto Carlos Pelayo

549 views • 39 slides

Machine Learning: Chenhao Tan University of Colorado Boulder LECTURE 5 Slides adapted from

Machine Learning: Chenhao Tan University of Colorado Boulder LECTURE 5 Slides adapted from Jordan Boyd-Graber, Tom Mitchell, Ziv Bar-Joseph Machine Learning: Chenhao Tan | Boulder | 1 of 27 Quiz question For a test instance ( x , y ) and a

1.73k views • 86 slides