SLIDE 1

Hyperparameter Search in Machine Learning

Marc Claesen and Bart De Moor
marc.claesen@esat.kuleuven.be

ESAT-STADIUS, KU Leuven
iMinds Medical IT Department
STADIUS: Center for Dynamical Systems, Signal Processing and Data Analytics

SLIDE 2

Outline

1 Introduction
2 Example: optimizing hyperparameters for an SVM classifier
3 Challenges in hyperparameter search
4 State-of-the-art


SLIDE 5

Machine learning

Methods capable of learning patterns of interest from data, by formulating the learning task as an optimization problem.

Machine learning sits at the intersection of various fields: statistics, computer science, optimization, (biology), ...

The field encompasses learning methods with various origins, e.g.:
  • biology, e.g. neural networks [1]
  • convex optimization, e.g. support vector machines [2]
  • statistics, e.g. hidden Markov models [3]
  • tensor decompositions, e.g. recommender systems [4]


SLIDE 8

Hyperparameter search

Most machine learning methods are (hyper)parameterized, e.g. Occam's razor: model complexity and overfitting.

Hyperparameters can significantly impact performance, so suitable hyperparameters must be determined for each task:
  • occurs in both supervised and unsupervised learning

→ need for disciplined, automated optimization methods

Some examples:
  • SVM: regularization and kernel hyperparameters
  • ANN: regularization, network architecture, transfer functions


SLIDE 14

Formalizing hyperparameter tuning

In a general sense, tuning involves these components:
  • a learning algorithm A, parameterized by hyperparameters λ
  • training and test data X(tr), X(te)
  • a model M = A(X(tr) | λ)
  • a loss function L to assess the quality of M, typically using X(te): L(M | X(te))

In optimization terms, we aim to find λ∗ (assuming minimization):

  λ∗ = arg min_λ L( A(X(tr) | λ) | X(te) )
     = arg min_λ F(λ | A, X(tr), X(te), L)

where F is the objective function.
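The formalization above can be sketched in code. This is a minimal illustration, not an implementation from the talk: the learning algorithm A, loss L, induced objective F, the toy data, and the random search over λ are all hypothetical stand-ins.

```python
import random

# Hypothetical stand-ins for the components above: a "learning algorithm" A
# with a single hyperparameter lam that shrinks the predicted training mean,
# a squared-error loss L, and the induced objective F(lam | A, X_tr, X_te, L).
def A(X_tr, lam):
    ys = [y for _, y in X_tr]
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y / (1.0 + lam)   # model M = A(X_tr | lam)

def L(M, X_te):                             # loss L(M | X_te)
    return sum((M(x) - y) ** 2 for x, y in X_te) / len(X_te)

def F(lam, X_tr, X_te):                     # train, then evaluate
    return L(A(X_tr, lam), X_te)

# lam* = arg min_lam F, here via plain random search over [0, 5]
rng = random.Random(0)
X_tr = [(float(x), 2.0) for x in range(10)]
X_te = [(float(x), 2.0) for x in range(10)]
candidates = [rng.uniform(0.0, 5.0) for _ in range(100)]
lam_star = min(candidates, key=lambda lam: F(lam, X_tr, X_te))
```

Any optimizer over λ fits this template; only the way candidates are proposed changes.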


SLIDE 16

Tuning in practice

Most often done using a combination of grid and manual search:
  • grid search suffers from the curse of dimensionality
  • manual tuning leads to poor reproducibility

Better solutions exist but lack adoption because:
  • potential performance improvements are underestimated
  • lack of availability and/or ease of use
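The curse of dimensionality mentioned above is easy to quantify: a grid with r candidate values per hyperparameter and d hyperparameters requires r^d evaluations. A small illustration (the resolution of 10 values per axis is arbitrary):

```python
from itertools import product

# A full grid over d hyperparameters with r candidate values each
# contains r**d points; grid search must evaluate every one of them.
def grid_points(values_per_dim, dims):
    axes = [range(values_per_dim)] * dims
    return list(product(*axes))

sizes = {d: len(grid_points(10, d)) for d in (1, 2, 3, 4)}
# 10 values per axis: 10, 100, 1000, 10000 evaluations
```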


SLIDE 18

Support vector machine (SVM) classifiers

(figure slides: SVM classifier illustration; no text content recovered)

SLIDE 20

Support vector machine (SVM) classifiers

  min_{α,ξ,b}  (1/2) Σ_{i∈SV} Σ_{j∈SV} α_i α_j y_i y_j κ(x_i, x_j) + C Σ_{i=1}^{n} ξ_i,

  subject to  y_i ( Σ_{j∈SV} α_j y_j κ(x_i, x_j) + b ) ≥ 1 − ξ_i,
              ξ_i ≥ 0, ∀i.


SLIDE 22

Task: optimize hyperparameters for an SVM

Tune an SVM classifier with RBF kernel κ(u, v) = exp(−γ ‖u − v‖²):

  min_{α,b,ξ}  (1/2) Σ_{i∈SV} Σ_{j∈SV} α_i α_j y_i y_j exp(−γ ‖x_i − x_j‖²) + C Σ_{i∈SV} ξ_i

  • optimize the regularization parameter C and kernel parameter γ
  • evaluate each (C, γ) pair using 2× iterated 10-fold cross-validation via Optunity's particle swarm optimizer [5]
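Optunity's actual solver is more elaborate; the following is a minimal, generic particle swarm sketch of the idea, minimizing a smooth toy stand-in for the cross-validation error surface over (log10 C, log10 γ). The inertia and acceleration constants, swarm size, and the toy surface are all illustrative assumptions.

```python
import random

# Minimal particle swarm optimizer over a 2-D box (a crude stand-in for
# Optunity's PSO solver); w, c1, c2 are common illustrative constants.
def pso(f, bounds, n_particles=20, n_iters=50, seed=0):
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [f(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (0.7 * vel[i][d]
                             + 1.5 * r1 * (pbest[i][d] - pos[i][d])
                             + 1.5 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = f(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val

# Hypothetical smooth error surface over (log10 C, log10 gamma),
# with its minimum at C = 10, gamma = 0.1 (purely illustrative).
def toy_cv_error(x):
    logC, logg = x
    return (logC - 1.0) ** 2 + (logg + 1.0) ** 2

best, err = pso(toy_cv_error, bounds=[(-3, 3), (-4, 2)])
```

In a real run, `toy_cv_error` would be replaced by the cross-validated loss of an SVM trained with the candidate (C, γ), which is exactly the expensive part.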


SLIDE 24

Response surface I

(figure: response surface plot; no further text recovered)

SLIDE 25

Response surface II

(figure: response surface plot, second view; no further text recovered)



SLIDE 29

Expensive function evaluations

A single objective function evaluation consists of:
  1 training a model via the learning method
    → can be very time consuming (days up to weeks! [6, 7, 8])
  2 predicting on a test set (for supervised methods)
  3 computing some evaluation metric for the model / its predictions

All of the above is often done in cross-validation [9, 10]:
  • used to reliably estimate generalization performance
  • involves many repetitions → exacerbates computation time

Training/evaluation time is a function of hyperparameter choice!
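The repetition cost compounds multiplicatively. A back-of-the-envelope sketch (the candidate count of 500 is an arbitrary example):

```python
# One objective function evaluation with r-times iterated k-fold
# cross-validation requires training and evaluating r * k models;
# total tuning cost scales with the number of candidates tried.
def models_trained(num_candidates, k=10, repeats=2):
    return num_candidates * repeats * k

total = models_trained(500)   # 500 (C, gamma) candidates -> 10000 model fits
```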


SLIDE 33

Randomness

The objective function measures empirical performance based on a finite sample (data set) → induces discrete, non-smooth jumps.

This gives rise to a stochastic component, inherent to:
  • the learning method (e.g. resampling methods [11, 12, 13])
  • random sampling (e.g. cross-validation, bootstrap [9, 10])

The objective function F is not a strict mathematical function
→ evaluating F(x) multiple times yields multiple results.

The empirical optimum might not really be best!
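The stochastic component can be illustrated directly (hypothetical numbers, not an experiment from the talk): re-evaluating the same hyperparameter value with different random cross-validation splits yields different empirical scores.

```python
import random

# Hypothetical noisy objective: an underlying performance curve plus
# split-dependent noise, so F(x) evaluated twice gives two results.
def F_noisy(lam, split_seed):
    rng = random.Random(split_seed)
    true_score = 0.9 - 0.1 * (lam - 1.0) ** 2   # underlying performance
    return true_score + rng.gauss(0.0, 0.02)    # split-dependent noise

scores = [F_noisy(1.0, seed) for seed in range(5)]
spread = max(scores) - min(scores)              # nonzero: F is stochastic
```

This is why the empirically best candidate need not be the truly best one: its score may simply have drawn favourable noise.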


SLIDE 35

Exotic search spaces

Hyperparameter search spaces can be extremely complex:
  • mixed integer-continuous (e.g. regularization & kernel)
  • often domain constrained (e.g. positive regularization)
  • combinatorial (e.g. feature selection)
  • conditional dimensions (*)

(*) Consider the architecture of an artificial neural network:
  • number of hidden layers
  • size per hidden layer
  • (transfer functions per layer)
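The conditional-dimension case can be sketched as a sampler (illustrative, not any specific library's API): the per-layer size dimensions only exist once the number of hidden layers has been drawn.

```python
import random

# Sketch of sampling from a conditional search space: "units_l{i}"
# dimensions are created only for the sampled number of layers.
def sample_architecture(rng):
    config = {"n_layers": rng.randint(1, 3)}          # integer dimension
    for i in range(config["n_layers"]):               # conditional dimensions
        config[f"units_l{i}"] = rng.choice([16, 32, 64, 128])
    return config

arch = sample_architecture(random.Random(0))
```

Such spaces have no fixed dimensionality, which rules out naive grid search entirely.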

SLIDE 36

Desiderata for hyperparameter optimizers

Optimization routines for hyperparameter search are ideally:
  • efficient in terms of function evaluations,
  • appropriate for wildly varying objective functions,
  • able to account for randomness,
  • flexible in terms of search space,
  • parallelizable.

The practical performance bottleneck is evaluating F
→ deciding on the next point to evaluate need not be fast.



SLIDE 39

Sequential model-based optimization (SMBO)

Commonly used for time-consuming objective functions F [14, 15]. SMBO is an iterative approach, in which each iteration involves:
  1 modeling the response surface M, based on previous evaluations
    → evaluating M is cheap, so use M as a surrogate for F
  2 finding the optimal test point x∗ based on M
    → optimize some criterion, e.g. expected improvement [16]

Approaches differ in terms of model and criterion [14, 15, 17].

But: inherently sequential!
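The two-step loop can be sketched end to end. The surrogate and criterion below are deliberately crude stand-ins (1-nearest-neighbour prediction with a distance-based exploration bonus, and a grid scan) for the Gaussian-process models and expected-improvement criterion used in practice; the target function and all constants are illustrative.

```python
import random

def expensive_f(x):
    return (x - 0.3) ** 2            # stand-in for F; pretend each call is costly

def smbo(n_init=3, n_iters=15, seed=0):
    rng = random.Random(seed)
    X = [rng.uniform(0.0, 1.0) for _ in range(n_init)]
    y = [expensive_f(x) for x in X]
    for _ in range(n_iters):
        def criterion(x):
            # surrogate M: value of the nearest evaluated point,
            # minus a bonus for being far from all evaluations (exploration)
            d = [abs(x - xi) for xi in X]
            return y[d.index(min(d))] - 0.05 * min(d)
        # optimizing the cheap criterion: dense grid scan over [0, 1]
        cand = min((i / 1000.0 for i in range(1001)), key=criterion)
        X.append(cand)
        y.append(expensive_f(cand))  # the only expensive step per iteration
    i_best = min(range(len(y)), key=y.__getitem__)
    return X[i_best], y[i_best]

x_best, f_best = smbo()
```

Note that each iteration needs the previous evaluation before proposing the next point, which is exactly the sequential bottleneck the slide flags.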

SLIDE 40

Metaheuristic optimization techniques

A large variety of metaheuristic methods have been used, such as:
  • particle swarm optimization [18, 19, 20]
  • genetic algorithms [21, 22]
  • artificial bee colony [23]
  • harmony search [24]
  • simulated annealing [25]
  • Nelder-Mead simplex [26]

Advantages:
  • ease of implementation and parallelization
  • general purpose solvers → few implicit assumptions


SLIDE 42

Software

Several packages offer Bayesian SMBO approaches:
  • Hyperopt [27], Spearmint [17]
  • ParamILS [28], Auto-WEKA [29]
  • BayesOpt [30], DiceKriging [31]

Optunity offers fundamentally distinct methods [5]:
  • focus on metaheuristic techniques not offered elsewhere
  • PSO, CMA-ES, random search, Sobol sequences, ...
  • multiplatform: Python, R, MATLAB, Octave

General purpose optimization libraries are also applicable
→ but often difficult to integrate in a machine learning pipeline

SLIDE 43

Metaheuristic methods are competitive with SMBO

Optunity's standard PSO [5] versus Hyperopt's tree-structured Parzen estimator [15, 27] on the two-dimensional Rastrigin function.

(figure: error versus function evaluation number, over 500 evaluations, comparing random search, the tree of Parzen estimators, and particle swarm optimization)
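For reference, the Rastrigin benchmark used in this comparison is a standard, highly multimodal test function; a direct transcription of its two-dimensional form (A = 10 is the conventional constant):

```python
import math

# Two-dimensional Rastrigin function: many regularly spaced local minima,
# with the global minimum f(0, 0) = 0.
def rastrigin(x, y, A=10.0):
    return (A * 2
            + (x * x - A * math.cos(2 * math.pi * x))
            + (y * y - A * math.cos(2 * math.pi * y)))

val = rastrigin(0.0, 0.0)  # global optimum
```

Its dense grid of local minima is what makes it a useful stress test for optimizers that must not get stuck near their starting point.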


SLIDE 46

Conclusion

Hyperparameter search in machine learning:
  • requires disciplined optimization methods
  • is receiving a lot of research attention, e.g. ChaLearn AutoML

The main challenges are:
  • expensive function evaluations with a stochastic component
  • exotic search spaces

Hyperparameter search is an interesting optimization problem
→ metaheuristic optimization methods are good candidates

SLIDE 47

Acknowledgements

Research Council KU Leuven: GOA/10/09 MaNet

Flemish Government:
  • FWO: project G.0871.12N (Neural circuits)
  • IWT: TBM Logic Insulin (100793), TBM Rectal Cancer (100783), TBM IETA (130256); PhD grant (111065)
  • Industrial Research Fund (IOF): IOF/HB/13/027 Logic Insulin
  • iMinds Medical Information Technologies SBO 2014
  • VLK Stichting E. van der Schueren: rectal cancer

Federal Government:
  • FOD: Cancer Plan 2012-2015 KPC-29-023 (prostate)
  • COST: Action BM1104: Mass Spectrometry Imaging

SLIDE 48

References

[1] Simon Haykin. Neural Networks: A Comprehensive Foundation. 2004.
[2] Corinna Cortes and Vladimir Vapnik. Support-vector networks. Machine Learning, 20(3):273–297, 1995.
[3] Lawrence Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257–286, 1989.
[4] Alexandros Karatzoglou, Xavier Amatriain, Linas Baltrunas, and Nuria Oliver. Multiverse recommendation: n-dimensional tensor factorization for context-aware collaborative filtering. In Proceedings of the Fourth ACM Conference on Recommender Systems, pages 79–86. ACM, 2010.
[5] Marc Claesen, Jaak Simm, Dusan Popovic, Yves Moreau, and Bart De Moor. Easy hyperparameter search using Optunity. arXiv preprint arXiv:1412.1114, 2014.
[6] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pages 1097–1105, 2012.
[7] Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Andrew Senior, Paul Tucker, Ke Yang, Quoc V. Le, et al. Large scale distributed deep networks. In Advances in Neural Information Processing Systems, pages 1223–1231, 2012.
[8] Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems, pages 3104–3112, 2014.
[9] Bradley Efron and Gail Gong. A leisurely look at the bootstrap, the jackknife, and cross-validation. The American Statistician, 37(1):36–48, 1983.
[10] Ron Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. In International Joint Conference on Artificial Intelligence, volume 14, pages 1137–1145, 1995.
[11] Leo Breiman. Random forests. Machine Learning, 45(1):5–32, 2001.
[12] Marc Claesen, Frank De Smet, Johan A.K. Suykens, and Bart De Moor. EnsembleSVM: A library for ensemble learning using support vector machines. Journal of Machine Learning Research, 15:141–145, 2014.
[13] Marc Claesen, Frank De Smet, Johan A.K. Suykens, and Bart De Moor. A robust ensemble approach to learn from positive and unlabeled data using SVM base models. Neurocomputing, 160:73–84, 2015.
[14] Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Sequential model-based optimization for general algorithm configuration. In Learning and Intelligent Optimization, pages 507–523. Springer, 2011.
[15] James S. Bergstra, Rémi Bardenet, Yoshua Bengio, and Balázs Kégl. Algorithms for hyper-parameter optimization. In Advances in Neural Information Processing Systems, pages 2546–2554, 2011.
[16] Donald R. Jones, Matthias Schonlau, and William J. Welch. Efficient global optimization of expensive black-box functions. Journal of Global Optimization, 13(4):455–492, 1998.
[17] Jasper Snoek, Hugo Larochelle, and Ryan P. Adams. Practical Bayesian optimization of machine learning algorithms. In Advances in Neural Information Processing Systems, pages 2951–2959, 2012.
[18] Michael Meissner, Michael Schmuker, and Gisbert Schneider. Optimized particle swarm optimization (OPSO) and its application to artificial neural network training. BMC Bioinformatics, 7(1):125, 2006.
[19] X.C. Guo, J.H. Yang, C.G. Wu, C.Y. Wang, and Y.C. Liang. A novel LS-SVMs hyper-parameter selection based on particle swarm optimization. Neurocomputing, 71(16):3211–3215, 2008.
[20] Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen, and Zne-Jung Lee. Particle swarm optimization for parameter determination and feature selection of support vector machines. Expert Systems with Applications, 35(4):1817–1824, 2008.
[21] Jinn-Tsong Tsai, Jyh-Horng Chou, and Tung-Kuan Liu. Tuning the structure and parameters of a neural network by using hybrid Taguchi-genetic algorithm. IEEE Transactions on Neural Networks, 17(1):69–80, 2006.
[22] Carlos Ansótegui, Meinolf Sellmann, and Kevin Tierney. A gender-based genetic algorithm for the automatic configuration of algorithms. In Principles and Practice of Constraint Programming - CP 2009, pages 142–157. Springer, 2009.
[23] Dervis Karaboga, Bahriye Akay, and Celal Ozturk. Artificial bee colony (ABC) optimization algorithm for training feed-forward neural networks. In Modeling Decisions for Artificial Intelligence, pages 318–329. Springer, 2007.
[24] João P. Papa, Gustavo H. Rosa, Aparecido N. Marana, Walter Scheirer, and David D. Cox. Model selection for discriminative restricted Boltzmann machines through meta-heuristic techniques. Journal of Computational Science, 9:14–18, 2015.
[25] Samuel Xavier-de Souza, Johan A.K. Suykens, Joos Vandewalle, and Désiré Bollé. Coupled simulated annealing. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 40(2):320–335, 2010.
[26] Gavin C. Cawley and Nicola L.C. Talbot. Fast exact leave-one-out cross-validation of sparse least-squares support vector machines. Neural Networks, 17(10):1467–1475, 2004.
[27] James Bergstra, Dan Yamins, and David D. Cox. Hyperopt: A Python library for optimizing the hyperparameters of machine learning algorithms. In Proceedings of the 12th Python in Science Conference, pages 13–20. SciPy, 2013.
[28] Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown, and Thomas Stützle. ParamILS: An automatic algorithm configuration framework. Journal of Artificial Intelligence Research, 36(1):267–306, 2009.
[29] Chris Thornton, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Auto-WEKA: Automated selection and hyper-parameter optimization of classification algorithms. CoRR, abs/1208.3719, 2012.
[30] Ruben Martinez-Cantin. BayesOpt: A Bayesian optimization library for nonlinear optimization, experimental design and bandits. arXiv preprint arXiv:1405.7430, 2014.
[31] Olivier Roustant, David Ginsbourger, Yves Deville, et al. DiceKriging, DiceOptim: Two R packages for the analysis of computer experiments by kriging-based metamodeling and optimization. 2012.