Time-Bounded Sequential Parameter Optimization
Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown, Kevin P. Murphy
Department of Computer Science, University of British Columbia, Canada
{hutter, hoos, kevinlb, murphyk}@cs.ubc.ca
Automated Parameter Optimization
Most algorithms have parameters
◮ Decisions that are left open during algorithm design
◮ Instantiate them to optimize empirical performance
◮ E.g., local search:
  – neighbourhoods, restarts, types of perturbation, tabu length (or a range for it), etc.
◮ E.g., tree search:
  – branching heuristics, no-good learning, restarts, pre-processing, etc.

Automatically find good instantiations of parameters
◮ Eliminates the most tedious part of algorithm design and end use
◮ Saves development time & improves performance
Parameter Optimization Methods
◮ Lots of work on numerical parameters, e.g.:
  – CALIBRA [Adenso-Diaz & Laguna, '06]
  – Population-based methods, e.g., CMA-ES [Hansen et al., '95-present]
◮ Categorical parameters:
  – Racing algorithms, F-Race [Birattari et al., '02-present]
  – Iterated local search, ParamILS [Hutter et al., AAAI '07 & JAIR '09]
◮ Successes of parameter optimization:
  – Many parameters (e.g., CPLEX with 63 parameters)
  – Large speedups (sometimes orders of magnitude!)
  – Across many problems: SAT, MIP, timetabling, protein folding, ...
Limitations of Model-Free Parameter Optimization
Model-free methods only return the best parameter setting
◮ Often that is all you need
  – e.g., an end user customizing an algorithm
◮ But sometimes we would like to know more:
  – How important is each parameter?
  – Which parameters interact?
  – For which types of instances is a parameter setting good?
  These answers inform the algorithm designer.

Response surface models can help
◮ Predictive models of algorithm performance under given parameter settings
Sequential Parameter Optimization (SPO)
◮ Original SPO [Bartz-Beielstein et al., '05-present]
  – SPO toolbox: a set of interactive tools for parameter optimization
◮ Studied SPO's components [Hutter et al., GECCO-09]
  – Goal: a completely automated tool
  – More robust version: SPO+
◮ This work: TB-SPO, which reduces the computational overheads
◮ Ongoing work: extending TB-SPO to handle
  – categorical parameters
  – multiple benchmark instances
  (very promising results for both)
Outline
- 1. Sequential Model-Based Optimization
- 2. Reducing the Computational Overhead Due To Models
- 3. Conclusions
Sequential Model-Based Optimization (SMBO)
Blackbox function optimization; function = algo. performance
- 0. Run algorithm with initial parameter settings
- 1. Fit a model to the data
- 2. Use model to pick promising parameter setting
- 3. Perform an algorithm run with that parameter setting
◮ Repeat 1-3 until time is up
[Figure: model after the first step — DACE mean prediction ± 2·stddev, true function, function evaluations, and EI (scaled), plotted as response y over parameter x]
[Figure: the same after the second step]
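To make the loop concrete, here is a minimal, runnable Python sketch of SMBO on a synthetic 1-D problem. Everything in it is illustrative: run_algorithm is a hypothetical stand-in for a real target-algorithm run, and scikit-learn's GP replaces the DACE model shown in the plots.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(0)

def run_algorithm(x):
    """Hypothetical stand-in for one algorithm run: a noisy response at x."""
    return (x - 0.3) ** 2 * 30 + rng.normal(scale=1.0)

def expected_improvement(mu, sigma, best):
    """EI for minimization, given model mean/stddev and the incumbent value."""
    sigma = np.maximum(sigma, 1e-9)
    z = (best - mu) / sigma
    return (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

# 0. Run the algorithm with a few initial (random) parameter settings
X = rng.uniform(0, 1, size=(4, 1))
y = np.array([run_algorithm(x[0]) for x in X])

for _ in range(20):                      # repeat 1-3 until the budget is spent
    # 1. Fit a model to the data gathered so far
    gp = GaussianProcessRegressor(RBF() + WhiteKernel()).fit(X, y)
    # 2. Use the model to pick a promising setting (maximize EI)
    cand = rng.uniform(0, 1, size=(1000, 1))
    mu, sigma = gp.predict(cand, return_std=True)
    theta = cand[np.argmax(expected_improvement(mu, sigma, y.min()))]
    # 3. Perform an algorithm run with that setting
    X = np.vstack([X, theta])
    y = np.append(y, run_algorithm(theta[0]))

print("incumbent parameter:", X[np.argmin(y)][0])
```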
Computational Overhead due to Models: Example
Example times
- 0. Run algorithm with initial parameter settings 1000s
- 1. Fit a model to the data 50s
- 2. Use model to pick promising parameter setting 20s
- 3. Perform an algorithm run with that parameter setting 10s
◮ Repeat 1-3 until time is up
Per iteration, the model phases (1 and 2) take 70s while the algorithm run (phase 3) takes only 10s — most of the time goes into model overhead.
[Figures: first- and second-step model plots, as on the previous slide]
Outline
- 1. Sequential Model-Based Optimization
- 2. Reducing the Computational Overhead Due To Models
Do More Algorithm Runs To Bound Model Overhead
Using a Cheaper (and Better!) Model
- 3. Conclusions
Removing the costly initial design (phase 0)
◮ How to choose the number of parameter settings in the initial design?
  – Too large: evaluating all of the settings takes too long
  – Too small: a poor first model, from which the search might not recover
◮ Our solution: simply drop the initial design
  – Instead: interleave random settings during the search (a minimal sketch follows)
  – Much better anytime performance
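A sketch of the interleaving idea, under illustrative assumptions: `promising` is a model-ranked list of settings and `sample_random` draws a fresh random setting (both hypothetical names). TB-SPO's exact scheme, described later, places a random setting at every second position (2nd, 4th, ...).

```python
def interleaved_candidates(promising, sample_random):
    """Alternate model-suggested and random settings in the candidate list."""
    out = []
    for theta in promising:
        out.append(theta)             # model-suggested setting (1st, 3rd, ...)
        out.append(sample_random())   # interleaved random setting (2nd, 4th, ...)
    return out
```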
Overhead due to Models
Central SMBO algorithm loop
◮ Repeat (example times):
- 1. Fit model using performance data gathered so far (50s)
- 2. Use model to select a promising parameter setting (20s)
- 3. Perform algorithm run(s) with that parameter setting (10s)
Only a small fraction of the time is spent actually running algorithms.

Solution 1
◮ Do more algorithm runs to bound the model overhead
  – Select not one but many promising settings (little extra overhead)
  – Perform runs for at least as long as phases 1 and 2 took (see the sketch below)
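A hedged sketch of this time-bounding idea, assuming hypothetical helpers fit_model, select_candidates, and run_algorithm; the effect is that the model phases can consume at most about half of each iteration's wall-clock time.

```python
import time

def time_bounded_iteration(history):
    """One SMBO iteration in which algorithm runs get at least as much
    time as model fitting + candidate selection (phases 1 and 2)."""
    t0 = time.time()
    model = fit_model(history)               # phase 1 (e.g., 50s)
    candidates = select_candidates(model)    # phase 2: many settings (e.g., 20s)
    model_time = time.time() - t0

    t1 = time.time()
    for theta in candidates:                 # phase 3: real algorithm runs
        history.append((theta, run_algorithm(theta)))
        if time.time() - t1 >= model_time:   # ran at least as long as phases 1+2
            break
    return history
```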
How Many Runs to Perform for Which Setting

Heuristic mechanism
◮ Compare one configuration θ at a time to the incumbent θinc
  – Use the mechanism from SPO+: incrementally perform runs for θ until either
    + θ's empirical performance is worse than θinc's ⇒ drop θ
    + θ has as many runs as θinc ⇒ θ becomes the new θinc
◮ Stop once the time bound is reached

Algorithms
◮ TB-SPO (sketched below)
  – Get an ordered list of promising parameter settings from the model
  – Interleave random settings: 2nd, 4th, etc.
  – Compare one parameter setting at a time to the incumbent
  – Nice side effect: additional runs on good random settings
◮ "Strawman" algorithm: TB-Random
  – Use only random settings
  – Compare one parameter setting at a time to the incumbent
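A minimal sketch of the comparison mechanism, assuming run costs are to be minimized and that run_challenger (a hypothetical callable) performs one run of the challenger θ; this paraphrases the SPO+ rule rather than reproducing its exact implementation.

```python
def mean_cost(costs):
    return sum(costs) / len(costs)

def challenge(inc_costs, run_challenger):
    """Incrementally run a challenger against the incumbent's recorded costs.
    Returns the challenger's costs if it matches the incumbent's number of
    runs without ever looking worse; None means the challenger is dropped."""
    costs = []
    while len(costs) < len(inc_costs):
        costs.append(run_challenger())
        # compare on the same number of runs as performed for the challenger
        if mean_cost(costs) > mean_cost(inc_costs[:len(costs)]):
            return None            # empirically worse than incumbent -> drop
    return costs                   # as many runs, never worse -> new incumbent
```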
Experimental validation: setup
◮ Optimizing the SLS algorithm SAPS
  – A prominent SAT solver with 4 continuous parameters
  – Previously used to evaluate parameter optimization approaches
◮ Seven different SAT instances
  – 1 quasigroups-with-holes (QWH) instance used previously
  – 3 quasigroup completion (QCP) instances
  – 3 graph colouring instances based on small-world graphs (SWGCP)
Experimental validation: results
SAPS-QWH instance
[Plot: performance p_t vs. CPU time t spent for configuration [s] (log scale); SPO+ vs. TB-SPO (w/ LHD) — both methods with the same LHD]
[Plot: the same, adding TB-SPO started with an empty LHD]
Scenario              | SPO+         | TB-SPO      | TB-Random   | pval1    | pval2
Saps-QCP-med [·10−2]  | 4.50 ± 0.31  | 4.32 ± 0.21 | 4.23 ± 0.15 | 4·10−3   | 0.17
Saps-QCP-q075         | 3.77 ± 9.72  | 0.19 ± 0.02 | 0.19 ± 0.01 | 2·10−6   | 0.78
Saps-QCP-q095         | 49.91 ± 0.00 | 2.20 ± 1.17 | 2.64 ± 1.24 | 1·10−10  | 0.12
Saps-QWH [·103]       | 10.7 ± 0.76  | 10.1 ± 0.58 | 9.88 ± 0.41 | 6·10−3   | 0.14
Saps-SWGCP-med        | 49.95 ± 0.00 | 0.18 ± 0.03 | 0.17 ± 0.02 | 1·10−10  | 0.37
Saps-SWGCP-q075       | 50 ± 0       | 0.24 ± 0.04 | 0.22 ± 0.03 | 1·10−10  | 0.08
Saps-SWGCP-q095       | 50 ± 0       | 0.25 ± 0.05 | 0.28 ± 0.10 | 1·10−10  | 0.89
(pval1: SPO+ vs. TB-SPO; pval2: TB-SPO vs. TB-Random)
Outline
- 1. Sequential Model-Based Optimization
- 2. Reducing the Computational Overhead Due To Models
Do More Algorithm Runs To Bound Model Overhead
Using a Cheaper (and Better!) Model
- 3. Conclusions
2 Different GP Models for Noisy Optimization
◮ Model I
  – Fit a standard GP, assuming Gaussian observation noise
◮ Model II (used in SPO, SPO+, and TB-SPO)
  – Compute the empirical mean of the responses at each parameter setting
  – Fit a noise-free GP to those means
  – But this treats the empirical means as perfect (even when based on just 1 run!)
  – Cheaper (here: 11 means vs. 110 raw data points)

[Figure — Model I: noisy GP fit of the original responses (mean ± 2·stddev, true function, function evaluations, scaled EI)]
[Figure — Model II: noise-free DACE fit of the empirical means]
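An illustrative contrast of the two variants on toy data, using scikit-learn GPs as stand-ins for the DACE model (an assumption; the original code differs): Model I fits all 110 raw noisy responses with an estimated noise term, Model II fits the 11 per-setting empirical means with no noise term.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(1)
settings = np.linspace(0, 1, 11)[:, None]     # 11 distinct parameter settings
X_raw = np.repeat(settings, 10, axis=0)       # 10 runs each -> 110 data points
y_raw = np.sin(6 * X_raw[:, 0]) + rng.normal(scale=0.5, size=110)

# Model I: standard GP on all 110 noisy observations
# (WhiteKernel estimates the Gaussian observation noise).
model1 = GaussianProcessRegressor(RBF() + WhiteKernel()).fit(X_raw, y_raw)

# Model II: noise-free GP on the 11 empirical means -- cheaper,
# but it treats the means as exact observations.
y_mean = y_raw.reshape(11, 10).mean(axis=1)
model2 = GaussianProcessRegressor(RBF(), alpha=1e-10).fit(settings, y_mean)
```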
How much faster is the approximate Gaussian Process?
Complexity of Gaussian process regression (GPR)
◮ n data points
◮ Basic GPR equations: inverting an n × n matrix
◮ Numerical optimization of the hyper-parameters: h steps
  – O(h · n³) for model fitting
  – O(n²) for each model prediction

Complexity of the projected process (PP) approximation
◮ Active set of p data points → only invert a p × p matrix
◮ Throughout: p = 300
  – O(n · p² + h · p³) for model fitting
  – O(p²) for each model prediction
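The PP mean predictor fits in a few lines; the following numpy sketch (based on the form given in Rasmussen & Williams' GP book, with an illustrative RBF kernel and a random active set) makes the stated costs visible: fitting forms and solves a p × p system in O(n·p² + p³) per hyper-parameter step, and each mean prediction touches only p kernel values (the predictive variance, not shown, costs O(p²)).

```python
import numpy as np

def rbf(A, B, ell=0.2):
    """Illustrative squared-exponential kernel between row sets A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ell**2)

def pp_fit(X, y, active_idx, noise=1.0):
    Xp = X[active_idx]
    Kpn = rbf(Xp, X)                       # p x n cross-kernel
    Kpp = rbf(Xp, Xp)                      # p x p active-set kernel
    A = noise * Kpp + Kpn @ Kpn.T          # p x p system: O(n p^2) to form
    alpha = np.linalg.solve(A, Kpn @ y)    # O(p^3) solve instead of O(n^3)
    return Xp, alpha

def pp_predict_mean(Xp, alpha, Xstar):
    return rbf(Xstar, Xp) @ alpha          # O(p) per test point

# toy usage with n = 1000 points and p = 300 (the paper's active-set size)
rng = np.random.default_rng(0)
X = rng.uniform(size=(1000, 2))
y = np.sin(3 * X[:, 0]) + rng.normal(scale=0.1, size=1000)
Xp, alpha = pp_fit(X, y, rng.choice(1000, size=300, replace=False))
print(pp_predict_mean(Xp, alpha, X[:5]))
```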
Empirical Evaluation of the Model
Empirical time performance (1,000 data points)
[Boxplots: log10 of CPU time (in seconds) for the PP and NF models on QCP-med, QCP-q075, QCP-q095, QWH, SWGCP-med, SWGCP-q075, and SWGCP-q095]

Empirical model quality
◮ Measures the correlation between
  – how promising the model judges a parameter setting to be, and
  – the true performance of that setting (evaluated offline)
  (a toy computation follows below)
[Boxplots: this correlation for the PP and NF models on the same seven scenarios (high is good, 1 is optimal)]
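Such a quality measure could, for instance, be a rank correlation between model predictions and offline-measured true performance; a toy computation with scipy (the paper's exact measure may differ):

```python
from scipy.stats import spearmanr

predicted = [0.2, 0.5, 0.1, 0.9]   # model's predicted cost per setting
true_cost = [0.3, 0.6, 0.2, 1.1]   # offline-measured true cost
rho, _ = spearmanr(predicted, true_cost)
print(rho)                          # 1.0: the model ranks the settings perfectly
```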
Final Evaluation
◮ Comparing:
  – R: TB-Random
  – S: TB-SPO
  – P: TB-SPO(PP)
  – F: FocusedILS (a variant of ParamILS; limited by discretization)

Scenario              | TB-Random   | TB-SPO      | TB-SPO(PP)  | FocusedILS
Saps-QCP-med [·10−2]  | 4.23 ± 0.15 | 4.32 ± 0.21 | 4.13 ± 0.14 | 5.12 ± 0.41
Saps-QCP-q075         | 0.19 ± 0.01 | 0.19 ± 0.02 | 0.18 ± 0.01 | 0.24 ± 0.02
Saps-QCP-q095         | 2.64 ± 1.24 | 2.20 ± 1.17 | 1.44 ± 0.53 | 2.99 ± 3.20
Saps-QWH [·103]       | 9.88 ± 0.41 | 10.1 ± 0.58 | 9.42 ± 0.32 | 10.6 ± 0.49
Saps-SWGCP-med        | 0.17 ± 0.02 | 0.18 ± 0.03 | 0.16 ± 0.02 | 0.27 ± 0.12
Saps-SWGCP-q075       | 0.22 ± 0.03 | 0.24 ± 0.04 | 0.21 ± 0.02 | 0.35 ± 0.08
Saps-SWGCP-q095       | 0.28 ± 0.10 | 0.25 ± 0.05 | 0.23 ± 0.05 | 0.37 ± 0.16

◮ TB-SPO(PP) is best on all 7 instances
◮ Good models do help
Outline
- 1. Sequential Model-Based Optimization
- 2. Reducing the Computational Overhead Due To Models
- 3. Conclusions
Conclusions
Parameter optimization
◮ Can be performed by automated approaches
  – Sometimes much better than by human experts
  – Automation can cut development time & improve results

Sequential Parameter Optimization (SPO)
◮ Uses predictive models of algorithm performance
◮ Can inform the algorithm designer about the parameter space

Time-Bounded SPO
◮ Eliminates the computational overheads of SPO:
  – No need for a costly initial design
  – Bounds the time spent building and using the model
  – Uses an efficient approximate Gaussian process model
  ⇒ Practical for parameter optimization within a time budget
◮ Clearly outperforms previous SPO versions and ParamILS
Current & Future Work
◮ Generalizations of TB-SPO to handle
– Categorical parameters – Multiple benchmark instances
24
Current & Future Work
◮ Generalizations of TB-SPO to handle
– Categorical parameters – Multiple benchmark instances
◮ Applications of Automated Parameter Optimization
– Optimization of MIP solvers [to be submitted to CP-AI-OR]
24
Current & Future Work
◮ Generalizations of TB-SPO to handle
– Categorical parameters – Multiple benchmark instances
◮ Applications of Automated Parameter Optimization
– Optimization of MIP solvers [to be submitted to CP-AI-OR]
◮ Use models to gain scientific insights
– Importance of each parameter – Interaction of parameters – Interaction of parameters and instances features
24
Current & Future Work
◮ Generalizations of TB-SPO to handle
  – categorical parameters
  – multiple benchmark instances
◮ Applications of automated parameter optimization
  – Optimization of MIP solvers [to be submitted to CP-AI-OR]
◮ Use models to gain scientific insights
  – Importance of each parameter
  – Interactions between parameters
  – Interactions between parameters and instance features
◮ Per-instance approaches
  – Build a joint model of instance features and parameters
  – Given a new, unseen instance:
    + Compute its instance features (fast)
    + Use the parameter setting predicted to be best for those features