SLIDE 1 Tuning numerical parameters of algorithms: sampling and stochasticity handling
Z. Yuan, T. Stützle, M. Birattari, M. Montes de Oca
IRIDIA, CoDE, Université Libre de Bruxelles, Brussels, Belgium
zyuan@ulb.ac.be iridia.ulb.ac.be/~zyuan
SLIDE 2 Outline
- 1. The tuning problem
- 2. Tuning algorithm
  - Sampling in parameter space
  - Budget allocation for ranking and selection: F-Race
  - Combining F-Race with sampling methods: iterated F-Race (Birattari et al. 2010) and the post-selection mechanism
SLIDE 4 Configuration of parameterized algorithms
Algorithm components
◮ categorical parameters
  ◮ choice of neighborhood in local search
  ◮ choice of crossover and mutation in EAs
  ◮ type of perturbation in iterated local search
◮ numerical parameters (real-valued or integer)
  ◮ crossover and mutation rates
  ◮ tabu list length
  ◮ perturbation strength
SLIDE 5
Importance of the tuning problem
◮ improvement over default settings and manual tuning
◮ reduction of development time and human intervention
◮ empirical studies, comparisons of algorithms
◮ support for end users of algorithms
SLIDES 6-13 Tuning problem: formal definition (Birattari et al. 2002)
The tuning problem can be defined as a tuple ⟨Θ, I, P_I, P_C, C⟩:
◮ Θ: set of candidate configurations.
◮ I: set of instances. P_I: probability measure over I.
◮ c(θ, i): random variable representing the cost measure of a configuration θ ∈ Θ on instance i ∈ I.
◮ C ⊂ ℜ: range of c. P_C: probability measure over the set C.
◮ C(θ) = C(θ | Θ, I, P_I, P_C): performance expectation:

    C(θ) = E_{I,C}[c] = ∫_I ∫_C c dP_C(c | θ, i) dP_I(i)    (1)

◮ The objective is to find a performance-optimizing configuration θ̄:

    θ̄ = arg min_{θ ∈ Θ} C(θ)    (2)

◮ An analytical solution is not possible; hence the expected cost is estimated in a Monte Carlo fashion.
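The Monte Carlo estimation mentioned in the last bullet can be sketched as follows; `sample_instance` and `run` are hypothetical stand-ins for drawing an instance i ~ P_I and observing one realization of the cost c(θ, i):

```python
import random

def estimate_cost(theta, sample_instance, run, n_runs=100, rng=None):
    """Monte Carlo estimate of C(theta): average the observed cost of
    configuration theta over instances drawn from P_I, one run each."""
    rng = rng or random.Random(0)
    total = 0.0
    for _ in range(n_runs):
        i = sample_instance(rng)        # draw an instance i ~ P_I
        total += run(theta, i, rng)     # observe one realization of c(theta, i)
    return total / n_runs

# toy stand-ins: an "instance" is a target value, cost is noisy distance to it
sample_instance = lambda rng: rng.uniform(0.0, 1.0)
run = lambda theta, i, rng: abs(theta - i) + rng.gauss(0.0, 0.01)

cost = estimate_cost(0.5, sample_instance, run)
```

More independent runs tighten the estimate, but each run costs budget; this is exactly the trade-off the later slides address.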
SLIDE 14 Tuning problem is an optimization problem
Variables:
mixed discrete-continuous, conditional variables
Objective:
◮ black-box
◮ stochastic
  ◮ due to stochasticity of the algorithm
  ◮ due to sampling of instances
SLIDE 16
Solving the tuning problem: our approach
◮ sampling in parameter space
◮ budget allocation for ranking and selection under stochasticity: F-Race
◮ combining the budget allocator with sampling methods
Open question: the trade-off between allocating budget to sampling new points versus evaluating already-sampled points.
SLIDE 17 Sampling in parameter space
◮ focus on numerical parameters
◮ usually low dimension, low budget
◮ sampling in established tuners: ad-hoc, factorial design, Kriging approximation
◮ our work studies state-of-the-art derivative-free optimizers: BOBYQA, CMA-ES, and MADS (Yuan et al. 2010, 2012a)
[Figures: average rank of CMAES, MADS, IRS, URS, and BOBYQA across numbers of parameters, tuning MMAS (left) and cPSO (right)]
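To make the role of a derivative-free optimizer concrete, here is a minimal compass-search sketch for a single parameter; it merely stands in for the far more sophisticated optimizers named above (BOBYQA, CMA-ES, MADS), and the `objective` and averaging scheme are illustrative assumptions:

```python
import random

def compass_search(objective, x0, bounds, step=0.25, min_step=0.02,
                   n_reps=5, budget=300, rng=None):
    """Compass-search sketch for one parameter: poll left and right of the
    incumbent, move on improvement, shrink the mesh otherwise. Noise in the
    tuning objective is smoothed by averaging n_reps evaluations per point."""
    rng = rng or random.Random(0)
    lo, hi = bounds
    evals = 0

    def avg(x):
        nonlocal evals
        evals += n_reps
        return sum(objective(x, rng) for _ in range(n_reps)) / n_reps

    x, fx = x0, avg(x0)
    while step >= min_step and evals < budget:
        improved = False
        for cand in (x - step, x + step):      # poll both directions
            if lo <= cand <= hi:
                fc = avg(cand)
                if fc < fx:
                    x, fx, improved = cand, fc, True
        if not improved:
            step *= 0.5                        # shrink the mesh

    return x

# toy tuning objective: optimum at 0.3, with small evaluation noise
x = compass_search(lambda p, rng: (p - 0.3) ** 2 + rng.gauss(0.0, 0.002),
                   x0=0.9, bounds=(0.0, 1.0))
```

Even this crude scheme must spend budget on repeated evaluations to cope with noise, which motivates the more careful budget-allocation mechanisms that follow.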
SLIDES 18-40 F-Race (Birattari et al. 2002)
[Figure: racing profile over the candidate set Θ and the instance stream i]
◮ start with a set of initial candidates
◮ consider a stream of instances
◮ sequentially evaluate candidates
◮ discard statistically worse candidates, as detected by the Friedman test
◮ ... repeat until a winner is selected or until the computation budget is consumed
Open question: What are the power and the actual type I error of sequential hypothesis testing?
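The racing loop above can be sketched as follows. To stay self-contained, this sketch replaces the Friedman test and its post-tests with a simple mean-rank cutoff after each block of instances; the elimination rule is therefore an illustrative simplification, not the actual F-Race criterion:

```python
import random

def ranks(costs):
    """Within-instance ranks: lowest cost gets rank 1; ties share the mean rank."""
    order = sorted(range(len(costs)), key=lambda j: costs[j])
    out = [0.0] * len(costs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and costs[order[j + 1]] == costs[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = mean_rank
        i = j + 1
    return out

def f_race(candidates, run, instances, keep_fraction=0.5, block=5):
    """F-Race sketch: stream instances, rank the surviving candidates on each
    one, and after every block of instances drop the worse-ranked half.
    The real algorithm instead keeps every candidate whose Friedman post-test
    against the current best is not statistically significant."""
    alive = list(range(len(candidates)))
    rank_sums = {j: 0.0 for j in alive}
    for step, inst in enumerate(instances, 1):
        costs = [run(candidates[j], inst) for j in alive]
        for j, r in zip(alive, ranks(costs)):
            rank_sums[j] += r
        if step % block == 0 and len(alive) > 1:
            alive.sort(key=lambda j: rank_sums[j])
            alive = alive[: max(1, int(len(alive) * keep_fraction))]
            rank_sums = {j: 0.0 for j in alive}   # ranks restart among survivors
        if len(alive) == 1:
            break
    return candidates[alive[0]]

# toy: candidates are true mean costs, observed with evaluation noise
rng = random.Random(1)
candidates = [0.1, 0.4, 0.8, 1.2]
winner = f_race(candidates, lambda theta, inst: theta + rng.gauss(0.0, 0.05),
                instances=range(30))
```

The point of the racing structure is visible even in this sketch: bad candidates stop consuming budget early, so the surviving candidates are evaluated on many more instances than a uniform allocation would allow.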
SLIDE 42
Iterated F-Race
◮ sample configurations iteratively
◮ in each iteration, use F-Race to rank the configurations and select the best ones to bias the sampling
◮ I/F-Race, devised in Birattari et al. 2010, tunes numerical, categorical, and conditional parameters
◮ F-Race has also been hybridized with existing sampling methods, such as MADS/F-Race and CMAES/F-Race (Yuan et al. 2012a)
◮ such hybrids increase the probability of type I error; this is handled by incumbent protection (Yuan et al. 2012a)
◮ open question: what if the sampling method does not need ranking and selection, e.g. response surface methodologies?
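The iterated sampling idea from the first two bullets can be sketched for a single numerical parameter as follows; the deterministic `evaluate` and the halving schedule for the standard deviation are illustrative assumptions, and selecting the elite by a single comparison stands in for the F-Race that I/F-Race would actually run:

```python
import random

def iterated_sampling(bounds, evaluate, n_iterations=5, n_samples=20, rng=None):
    """Iterated-sampling sketch for one numerical parameter: each iteration
    samples around the current elite with a shrinking standard deviation,
    biasing the search toward good regions of the parameter space."""
    rng = rng or random.Random(0)
    lo, hi = bounds
    elite = (lo + hi) / 2.0                 # start from the centre of the range
    sigma = (hi - lo) / 2.0
    for _ in range(n_iterations):
        samples = [min(hi, max(lo, rng.gauss(elite, sigma)))
                   for _ in range(n_samples)]
        samples.append(elite)               # the incumbent always competes
        elite = min(samples, key=evaluate)  # stand-in for a race among samples
        sigma *= 0.5                        # tighten sampling around the elite
    return elite

# toy objective with its optimum at 0.3
best = iterated_sampling((0.0, 1.0), lambda x: (x - 0.3) ** 2)
```

Shrinking sigma trades exploration for exploitation: early iterations cover the whole range, later ones refine around the incumbent.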
SLIDE 43
Post-selection (Yuan et al. 2012a, 2012b)
◮ use few instances during the sampling phase to identify a number of elite configurations
◮ in the final post-selection phase, use F-Race to carefully select the best from the set of elite configurations
◮ can be applied together with iterated F-Race
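The two phases can be sketched as follows; the slides use F-Race for the final phase, so the plain mean over repeated runs below is an illustrative stand-in, and the noise model is a toy assumption:

```python
import random

def post_selection(configurations, run, n_elites=3, n_final_runs=20, rng=None):
    """Post-selection sketch: screen every configuration with a single noisy
    run (n_r = 1), keep the best n_elites, then spend the remaining budget
    re-evaluating only those elites to pick the final winner."""
    rng = rng or random.Random(0)
    # screening phase: one run per configuration, cheap but noisy
    screened = sorted(configurations, key=lambda c: run(c, rng))
    elites = screened[:n_elites]
    # final phase: average many runs per elite configuration
    mean_cost = lambda c: sum(run(c, rng) for _ in range(n_final_runs)) / n_final_runs
    return min(elites, key=mean_cost)

# toy: a configuration's true mean cost, observed with evaluation noise
configs = [0.2, 0.25, 0.3, 0.6, 0.7, 0.8, 0.9, 1.0]
best = post_selection(configs, lambda c, rng: c + rng.gauss(0.0, 0.1))
```

The design rationale: single-run screening is enough to weed out clearly bad configurations, so the expensive careful comparison is reserved for the few that plausibly compete for first place.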
SLIDE 45
Experimental setup
◮ case studies of tuning numerical parameters of MMAS: α, β, ρ, m, γ, nn, q0
◮ 3 case studies for each number of tuned parameters from 2 to 6, resulting in 3 · 5 = 15 case studies
◮ in each case study, 7 budget levels are studied, ranging from tens to thousands of runs
SLIDE 46 Repeated evaluation: case studies in URS
◮ fixed numbers of repetitions nr ∈ {1, 3, 5, 10, 20, 40}
◮ the study uses uniform random sampling (URS) and also includes U/F-Race
◮ the best nr differs considerably depending on the budget level
◮ U/F-Race outperforms any fixed number of repeated evaluations
[Figure: average rank of U1, U3, U5, U10, U20, U40, and UF across the 7 budget levels]
SLIDE 47 Post-selection: case studies in MADS
◮ Post-selection with low nr outperforms repeated evaluation
◮ Post-selection with nr = 1 results in the best performance
◮ Post-selection with nr = 1 outperforms the F-Race hybrid
[Figures: comparison of M1-M40 against MP1-MP40, and average rank of MP1, MP3, MP5, and MF across budget levels]
SLIDE 48 Comparisons of all tuners
◮ BOBYQA, CMA-ES, and MADS use post-selection and nr = 1
◮ I/F-Race and U/F-Race are also included for comparison
◮ BOBYQA with post-selection and nr = 1 appears to be the best setting
[Figure: average rank of BP1, CP1, MP1, IF, and UF across numbers of parameters]
SLIDE 49
Conclusions and future work
Conclusions
◮ state-of-the-art derivative-free optimizers, e.g. BOBYQA and CMA-ES, show good performance for sampling in parameter space
◮ post-selection using F-Race is a simple and effective mechanism for handling stochasticity
Future work
◮ further investigation into the post-selection mechanism
◮ a detailed survey of derivative-free continuous optimizers and statistical ranking and selection techniques
◮ sampling techniques also for categorical parameters
◮ better understanding and addressing the trade-off between allocating budget to sampling new configurations and evaluating sampled configurations
SLIDE 50 Some References
◮ Birattari, M., Stützle, T., Paquete, L., Varrentrapp, K. (2002): A racing algorithm for configuring metaheuristics. In Langdon, W.B., et al., eds.: GECCO 2002, 11-18.
◮ Birattari, M., Yuan, Z., Balaprakash, P., Stützle, T. (2010): F-Race and iterated F-Race: An overview. In Bartz-Beielstein, T., et al., eds.: Experimental Methods for the Analysis of Optimization Algorithms. Springer, Berlin, Germany, 311-336.
◮ Yuan, Z., Montes de Oca, M., Birattari, M., Stützle, T. (2012): Continuous optimization algorithms for tuning real and integer parameters of swarm intelligence algorithms. Swarm Intelligence 6(1), 49-75.
◮ Yuan, Z., Stützle, T., Montes de Oca, M., Birattari, M. (2012): An analysis of a post-selection mechanism for handling stochasticity in tuning numerical parameters of MAX-MIN Ant System. Technical Report TR/IRIDIA/2012-007, IRIDIA, ULB, Brussels, Belgium.