Introduction to Machine Learning: Hyperparameter Tuning - Problem Definition


SLIDE 1

Introduction to Machine Learning Hyperparameter Tuning - Problem Definition

compstat-lmu.github.io/lecture_i2ml

SLIDE 2

TUNING

Recall: Hyperparameters λ are parameters that are inputs to the training problem, in which a learner I minimizes the empirical risk on a training data set in order to find optimal model parameters θ, which define the fitted model f̂. (Hyperparameter) tuning is the process of finding good model hyperparameters λ.


  • Introduction to Machine Learning – 1 / 6
SLIDE 3

TUNING: A BI-LEVEL OPTIMIZATION PROBLEM

We face a bi-level optimization problem: the well-known risk minimization problem to find f̂ is nested within the outer hyperparameter optimization (also called the second-level problem), which we state formally on the following slides.

SLIDE 4

TUNING: A BI-LEVEL OPTIMIZATION PROBLEM

For a learning algorithm I (also inducer) with d hyperparameters, the hyperparameter configuration space is:

Λ = Λ1 × Λ2 × ... × Λd

where Λi is the domain of the i-th hyperparameter. The domains can be continuous, discrete, or categorical. For practical reasons, the domain of a continuous or integer-valued hyperparameter is typically bounded. A vector in this configuration space is denoted as λ ∈ Λ.

A learning algorithm I takes a (training) dataset D and a hyperparameter configuration λ ∈ Λ and returns a trained model (through risk minimization):

I : (X × Y)^n × Λ → H,   (D, λ) ↦ I(D, λ) = f̂_{D,λ}
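The two objects above, a configuration space Λ built as a Cartesian product of domains, and an inducer that maps (D, λ) to a fitted model, can be sketched in code. This is a minimal toy, not from the lecture: the learner is a hypothetical k-nearest-neighbours classifier on 1-D inputs with two hyperparameters, the neighbourhood size k and the distance power p.

```python
import itertools

# Configuration space Lambda = Lambda_1 x Lambda_2 (Cartesian product):
# hypothetical domains for k (discrete) and p (discrete).
Lambda = list(itertools.product([1, 3, 5], [1, 2]))  # six (k, p) pairs

def inducer(D, lam):
    """I : (X x Y)^n x Lambda -> H, (D, lam) -> fitted model f_hat."""
    k, p = lam
    def f_hat(x):
        # Predict the majority label among the k training points
        # closest to x under the distance |x - x_i|^p.
        nearest = sorted(D, key=lambda xy: abs(xy[0] - x) ** p)[:k]
        labels = [y for _, y in nearest]
        return max(set(labels), key=labels.count)
    return f_hat

D_train = [(0.0, 0), (0.2, 0), (0.8, 1), (1.0, 1)]
model = inducer(D_train, (3, 2))   # pick one lambda in Lambda
print(model(0.1), model(0.9))      # -> 0 1
```

Note that λ is fixed before training starts: the inducer receives it as an input, while the model parameters (here trivially the stored training points) are what training itself determines.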

SLIDE 5

TUNING: A BI-LEVEL OPTIMIZATION PROBLEM

We formally state the nested hyperparameter tuning problem as:

min_{λ ∈ Λ}  ĜE_{Dtest}( I(Dtrain, λ) )

The learner I(Dtrain, λ) takes a training dataset as well as hyperparameter settings λ (e.g., the maximal depth of a classification tree) as input. I(Dtrain, λ) performs empirical risk minimization on the training data and returns the optimal model f̂ for the given hyperparameters. Note that for the estimation of the generalization error, more sophisticated resampling strategies like cross-validation can be used.
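The nesting can be made concrete with a small sketch (toy learner and data are illustrative assumptions, not from the lecture): the inner level performs empirical risk minimization on Dtrain for a fixed λ, and the outer level searches over Λ for the λ with the lowest estimated generalization error on Dtest. The "learner" here is a hypothetical shrinkage estimator of the mean, f̂ = mean(y_train) / (1 + λ), under squared-error loss.

```python
def inducer(D_train, lam):
    m = sum(D_train) / len(D_train)   # inner level: ERM gives the sample mean
    return m / (1.0 + lam)            # shrunk towards zero by lambda

def ge_estimate(f_hat, D_test):
    # Estimated generalization error: mean squared error on held-out data.
    return sum((y - f_hat) ** 2 for y in D_test) / len(D_test)

D_train = [2.9, 3.1, 3.0, 9.0]        # one outlier inflates the sample mean
D_test  = [3.0, 2.8, 3.2, 3.0]

# Outer level: search over the hyperparameter space Lambda.
Lambda = [0.0, 0.1, 0.5, 1.0]
best_lam = min(Lambda,
               key=lambda lam: ge_estimate(inducer(D_train, lam), D_test))
print(best_lam)  # -> 0.5
```

Every outer-level candidate λ triggers a full inner-level fit, which is exactly what makes tuning expensive for real learners.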

SLIDE 6

TUNING: A BI-LEVEL OPTIMIZATION PROBLEM

The components of a tuning problem are:

  • The dataset.
  • The learner (possibly: several competing learners?) that is tuned.
  • The learner's hyperparameters and their respective regions of interest over which we optimize.
  • The performance measure, as determined by the application. It is not necessarily identical to the loss function that defines the risk minimization problem for the learner!
  • A (resampling) procedure for estimating the predictive performance.
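These five components can be made explicit as the inputs of a generic tuning routine. All names below are an illustrative sketch, not a real library API; the toy learner, measure, and data are assumptions for the example.

```python
def tune(dataset, learner, search_space, measure, resampling):
    # Return the configuration in `search_space` with the best
    # (here: lowest) estimated predictive performance.
    return min(search_space,
               key=lambda lam: resampling(dataset, lam, learner, measure))

def holdout(dataset, lam, learner, measure):
    # Resampling procedure: fit on the first half, evaluate on the second.
    split = len(dataset) // 2
    f_hat = learner(dataset[:split], lam)
    return measure(f_hat, dataset[split:])

# Toy learner: shrinkage-mean estimator; toy measure: mean squared error.
learner = lambda D, lam: sum(D) / len(D) / (1.0 + lam)
mse = lambda f_hat, D: sum((y - f_hat) ** 2 for y in D) / len(D)

data = [2.9, 3.1, 3.0, 9.0, 3.0, 2.8, 3.2, 3.0]
best = tune(data, learner, [0.0, 0.1, 0.5, 1.0], mse, holdout)
print(best)  # -> 0.5
```

Keeping the measure and the resampling procedure as separate arguments mirrors the point above: the performance measure used for tuning need not be the loss the learner minimizes internally.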

SLIDE 7

WHY IS TUNING SO HARD?

Tuning is derivative-free ("black-box problem"): it is usually impossible to compute derivatives of the objective (i.e., the resampled performance measure) that we optimize with regard to the hyperparameters. All we can do is evaluate the performance for a given hyperparameter configuration.

  • Every evaluation requires one or multiple train-and-predict steps of the learner, i.e., every evaluation is very expensive.
  • Even worse: the answer we get from an evaluation is not exact, but stochastic in most settings, as we use resampling.
  • Categorical and dependent hyperparameters aggravate our difficulties: the space of hyperparameters we optimize over has a non-metric, complicated structure.
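The stochasticity of the black box can be seen directly: evaluating the *same* hyperparameter configuration under different random resampling splits yields different performance estimates. The learner and data below are the same kind of hypothetical toy as on the previous slides.

```python
import random

def evaluate(lam, data, seed):
    # One noisy black-box evaluation: random 50/50 holdout split,
    # fit the toy shrinkage-mean learner, return test MSE.
    rng = random.Random(seed)
    idx = list(range(len(data)))
    rng.shuffle(idx)
    train = [data[i] for i in idx[: len(data) // 2]]
    test  = [data[i] for i in idx[len(data) // 2 :]]
    f_hat = sum(train) / len(train) / (1.0 + lam)
    return sum((y - f_hat) ** 2 for y in test) / len(test)

data = [2.9, 3.1, 3.0, 9.0, 3.0, 2.8, 3.2, 3.0]
# Five evaluations of the identical configuration lambda = 0.5:
scores = [evaluate(0.5, data, seed) for seed in range(5)]
print(scores)  # five differing estimates for the same lambda
```

An optimizer therefore sees a noisy objective: comparing two configurations on single evaluations can easily rank them the wrong way, which is one reason repeated resampling (e.g., cross-validation) is used.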
