Using Meta Learning to Initialize Bayesian Optimization
Albert-Ludwigs-Universität Freiburg
Matthias Feurer1 Jost Tobias Springenberg2 Frank Hutter1
1Research Group on Learning, Optimization, and Automated Algorithm Design 2Machine Learning Lab
Using Meta Learning to Initialize Bayesian Optimization - - PowerPoint PPT Presentation
Using Meta Learning to Initialize Bayesian Optimization Albert-Ludwigs-Universitt Freiburg Matthias Feurer 1 Jost Tobias Springenberg 2 Frank Hutter 1 1 Research Group on Learning, Optimization, and Automated Algorithm Design 2 Machine Learning
Albert-Ludwigs-Universität Freiburg
1Research Group on Learning, Optimization, and Automated Algorithm Design 2Machine Learning Lab
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 2 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0.
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 2 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0.
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 2 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0.
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 2 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0.
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 2 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0.
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 3 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0; Bottom right: Iris Japonica is licensed by KENPEI under CC BY-SA 3.0
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 3 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0; Bottom right: Iris Japonica is licensed by KENPEI under CC BY-SA 3.0
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 3 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0; Bottom right: Iris Japonica is licensed by KENPEI under CC BY-SA 3.0
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 3 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0; Bottom right: Iris Japonica is licensed by KENPEI under CC BY-SA 3.0
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 4 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 4 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 4 / 18
Configuration Task ML Algorithm A Configuration Space Λ of A Dataset Dnew Fit regression model on pairs of (λ,Aλ(Dnew)) Select promising configuration λ ∈ Λ Evaluate Aλ(Dnew) Configuration λ ∗
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 5 / 18
Configuration Task ML Algorithm A Configuration Space Λ of A Dataset Dnew Fit regression model on pairs of (λ,Aλ(Dnew)) Select promising configuration λ ∈ Λ Evaluate Aλ(Dnew) Configuration λ ∗ Find Datasets Di similar to Dnew Initialize Search with λ ∗
Di
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 5 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 6 / 18 The iris pictures on this slide are from wikimedia commons and used under the following licenses: Top left: Iris Versicolor is public domain; Bottom left: Iris setosa is licensed by Radomil under CC BY-SA 3.0; Top right: Iris Virginica is licensed by C T Johansson under CC BY 3.0.
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 7 / 18
w a v e f
m
i r i s p e n d i g i t s y e a s t s a t i m a g e b a l a n c e
c a l e e c
i c m c p a g e
l
k s t a e g l a s s n u r s e r y s p a m b a s e v e h i c l e s e g m e n t c a r
t d i g i t s p r i m a r y
u m
m f e a t
r i e r m f e a t
a r h u n e n m f e a t
e r n i k e a b a l
e v
e l m f e a t
a c t
s d e r m a t
y l e t t e r m f e a t
p h
i c a l m f e a t
i x e l e u c a l y p t u s a r r h y t h m i a z
u d i
y a u t
s
b e a n c y l i n d e r
a n d s v
e i
p h e r e t i c
a c
b r a z i l t
r i s m h e p a t i t i s l a b
h e a r t
t a t l
b r e a s t
m u s h r
h e a r t
h e a r t
p
t
e r a t i v e
a t i e n t
a t a l y m p h b r e a s t
a n c e r d i a b e t e s l i v e r
i s
d e r s k r
s
p c r e d i t
a n n e a l . O R I G c r e d i t
s
a r h a b e r m a n
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 8 / 18
w a v e f
m
i r i s p e n d i g i t s y e a s t s a t i m a g e b a l a n c e
c a l e e c
i c m c p a g e
l
k s t a e g l a s s n u r s e r y s p a m b a s e v e h i c l e s e g m e n t c a r
t d i g i t s p r i m a r y
u m
m f e a t
r i e r m f e a t
a r h u n e n m f e a t
e r n i k e a b a l
e v
e l m f e a t
a c t
s d e r m a t
y l e t t e r m f e a t
p h
i c a l m f e a t
i x e l e u c a l y p t u s a r r h y t h m i a z
u d i
y a u t
s
b e a n c y l i n d e r
a n d s v
e i
p h e r e t i c
a c
b r a z i l t
r i s m h e p a t i t i s l a b
h e a r t
t a t l
b r e a s t
m u s h r
h e a r t
h e a r t
p
t
e r a t i v e
a t i e n t
a t a l y m p h b r e a s t
a n c e r d i a b e t e s l i v e r
i s
d e r s k r
s
p c r e d i t
a n n e a l . O R I G c r e d i t
s
a r h a b e r m a n
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 8 / 18
w a v e f
m
i r i s p e n d i g i t s y e a s t s a t i m a g e b a l a n c e
c a l e e c
i c m c p a g e
l
k s t a e g l a s s n u r s e r y s p a m b a s e v e h i c l e s e g m e n t c a r
t d i g i t s p r i m a r y
u m
m f e a t
r i e r m f e a t
a r h u n e n m f e a t
e r n i k e a b a l
e v
e l m f e a t
a c t
s d e r m a t
y l e t t e r m f e a t
p h
i c a l m f e a t
i x e l e u c a l y p t u s a r r h y t h m i a z
u d i
y a u t
s
b e a n c y l i n d e r
a n d s v
e i
p h e r e t i c
a c
b r a z i l t
r i s m h e p a t i t i s l a b
h e a r t
t a t l
b r e a s t
m u s h r
h e a r t
h e a r t
p
t
e r a t i v e
a t i e n t
a t a l y m p h b r e a s t
a n c e r d i a b e t e s l i v e r
i s
d e r s k r
s
p c r e d i t
a n n e a l . O R I G c r e d i t
s
a r h a b e r m a n
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 8 / 18
w a v e f
m
i r i s p e n d i g i t s y e a s t s a t i m a g e b a l a n c e
c a l e e c
i c m c p a g e
l
k s t a e g l a s s n u r s e r y s p a m b a s e v e h i c l e s e g m e n t c a r
t d i g i t s p r i m a r y
u m
m f e a t
r i e r m f e a t
a r h u n e n m f e a t
e r n i k e a b a l
e v
e l m f e a t
a c t
s d e r m a t
y l e t t e r m f e a t
p h
i c a l m f e a t
i x e l e u c a l y p t u s a r r h y t h m i a z
u d i
y a u t
s
b e a n c y l i n d e r
a n d s v
e i
p h e r e t i c
a c
b r a z i l t
r i s m h e p a t i t i s l a b
h e a r t
t a t l
b r e a s t
m u s h r
h e a r t
h e a r t
p
t
e r a t i v e
a t i e n t
a t a l y m p h b r e a s t
a n c e r d i a b e t e s l i v e r
i s
d e r s k r
s
p c r e d i t
a n n e a l . O R I G c r e d i t
s
a r h a b e r m a n
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 8 / 18
w a v e f
m
i r i s p e n d i g i t s y e a s t s a t i m a g e b a l a n c e
c a l e e c
i c m c p a g e
l
k s t a e g l a s s n u r s e r y s p a m b a s e v e h i c l e s e g m e n t c a r
t d i g i t s p r i m a r y
u m
m f e a t
r i e r m f e a t
a r h u n e n m f e a t
e r n i k e a b a l
e v
e l m f e a t
a c t
s d e r m a t
y l e t t e r m f e a t
p h
i c a l m f e a t
i x e l e u c a l y p t u s a r r h y t h m i a z
u d i
y a u t
s
b e a n c y l i n d e r
a n d s v
e i
p h e r e t i c
a c
b r a z i l t
r i s m h e p a t i t i s l a b
h e a r t
t a t l
b r e a s t
m u s h r
h e a r t
h e a r t
p
t
e r a t i v e
a t i e n t
a t a l y m p h b r e a s t
a n c e r d i a b e t e s l i v e r
i s
d e r s k r
s
p c r e d i t
a n n e a l . O R I G c r e d i t
s
a r h a b e r m a n
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 8 / 18
w a v e f
m
i r i s p e n d i g i t s y e a s t s a t i m a g e b a l a n c e
c a l e e c
i c m c p a g e
l
k s t a e g l a s s n u r s e r y s p a m b a s e v e h i c l e s e g m e n t c a r
t d i g i t s p r i m a r y
u m
m f e a t
r i e r m f e a t
a r h u n e n m f e a t
e r n i k e a b a l
e v
e l m f e a t
a c t
s d e r m a t
y l e t t e r m f e a t
p h
i c a l m f e a t
i x e l e u c a l y p t u s a r r h y t h m i a z
u d i
y a u t
s
b e a n c y l i n d e r
a n d s v
e i
p h e r e t i c
a c
b r a z i l t
r i s m h e p a t i t i s l a b
h e a r t
t a t l
b r e a s t
m u s h r
h e a r t
h e a r t
p
t
e r a t i v e
a t i e n t
a t a l y m p h b r e a s t
a n c e r d i a b e t e s l i v e r
i s
d e r s k r
s
p c r e d i t
a n n e a l . O R I G c r e d i t
s
a r h a b e r m a n
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 8 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 9 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 10 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 10 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 10 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 10 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 10 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 10 / 18
Max Features SVM gamma C(SVM) Classifier Random Forest LinearSVM Criterion Min Samples Split loss C(LinearSVM) MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 11 / 18
Max Features SVM gamma C(SVM) Classifier Random Forest LinearSVM Criterion Min Samples Split loss C(LinearSVM)
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 11 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 12 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 12 / 18
10 20 30 40 50 #Function evaluations 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 Difference to min function value
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 13 / 18
10 20 30 40 50 #Function evaluations 0.00 0.02 0.04 0.06 0.08 0.10 Difference to min function value
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 14 / 18
10 20 30 40 50 1.8 2.0 2.2 2.4 2.6 2.8 3.0 SMAC random TPE MI-SMAC(10,L1 ,landmarking) MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 15 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 10 20 30 40 50 #Function evaluations 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0
MI-SMAC(10,L1,landmarking) vs SMAC
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 15 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 10 20 30 40 50 #Function evaluations 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0
MI-SMAC(10,L1,landmarking) vs SMAC
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 15 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 10 20 30 40 50 #Function evaluations 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0
MI-SMAC(10,L1,landmarking) vs SMAC MI-SMAC(10,L1,landmarking) vs TPE
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 15 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 10 20 30 40 50 #Function evaluations 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0
MI-SMAC(10,L1,landmarking) vs SMAC MI-SMAC(10,L1,landmarking) vs TPE MI-SMAC(10,L1,landmarking) vs random
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 15 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 16 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 17 / 18
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 18 / 18
10 20 30 40 50 #Function evaluations 0.00 0.02 0.04 0.06 0.08 0.10 Min function value
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 18 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 10 20 30 40 50 Function evaluations −0.6 −0.5 −0.4 −0.3 −0.2 −0.1 0.0
MI-SMAC(10,L1,all) vs MI-SMAC(10,L1,landmarking) MI-SMAC(10,L1,all) vs SMAC MI-SMAC(10,L1,all) vs TPE MI-SMAC(10,L1,all) vs random
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 18 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 10 20 30 40 50 #Function evaluations 0.6 0.5 0.4 0.3 0.2 0.1 0.0
SMAC vs MI-SMAC(10,L1,landmarking) SMAC vs TPE SMAC vs random
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 18 / 18
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 10 20 30 40 50 Function evaluations −0.7 −0.6 −0.5 −0.4 −0.3 −0.2 −0.1 0.0
MI-SMAC(25,L1,landmarking) vs MI-SMAC(10,L1,landmarking) MI-SMAC(25,L1,landmarking) vs MI-SMAC(20,L1,landmarking) MI-SMAC(25,L1,landmarking) vs MI-SMAC(5,L1,landmarking) MI-SMAC(25,L1,landmarking) vs SMAC MI-SMAC(25,L1,landmarking) vs TPE MI-SMAC(25,L1,landmarking) vs random
MetaSel ’14 Feurer, Springenberg and Hutter – MI-SMBO 18 / 18