Optimization for Machine Learning
Lecture 3: Bundle Methods S.V . N. (vishy) Vishwanathan
Purdue University vishy@purdue.edu
July 11, 2012
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 1 / 30
Optimization for Machine Learning Lecture 3: Bundle Methods S.V . - - PowerPoint PPT Presentation
Optimization for Machine Learning Lecture 3: Bundle Methods S.V . N. (vishy) Vishwanathan Purdue University vishy@purdue.edu July 11, 2012 S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 1 / 30 Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 1 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 2 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 3 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 4 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 4 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 4 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 4 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 5 / 30
Motivation
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 6 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 7 / 30
Cutting Plane Methods
. N. Vishwanathan (Purdue University) Optimization for Machine Learning 8 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 9 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 10 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 10 / 30
Cutting Plane Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 10 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 11 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 12 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 13 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 13 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 13 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 14 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 14 / 30
Non Smooth Functions
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 15 / 30
Bundle Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 16 / 30
Bundle Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 17 / 30
Bundle Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 17 / 30
Bundle Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 17 / 30
Bundle Methods
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 17 / 30
BMRM
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 18 / 30
BMRM
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 19 / 30
BMRM
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 19 / 30
BMRM
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 19 / 30
Convergence Analysis
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 20 / 30
Convergence Analysis
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 21 / 30
Convergence Analysis
t
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 22 / 30
Convergence Analysis
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 22 / 30
Convergence Analysis
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 22 / 30
Convergence Analysis
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 23 / 30
Experiments
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 24 / 30
Experiments
100 101 102 103 104 Iteration t 10−4 10−3 10−2 10−1 100 ǫt
λ = 1e-3 λ = 1e-4 λ = 1e-5 λ = 1e-6 O(1/ǫ) O(log(1/ǫ)) S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 25 / 30
Experiments
100 101 102 103 104 Iteration t 10−4 10−3 10−2 10−1 100 ǫt
λ = 1e-3 λ = 1e-4 λ = 1e-5 λ = 1e-6 O(1/ǫ) O(log(1/ǫ)) S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 25 / 30
Experiments
100 101 102 103 104 Iteration t 10−4 10−3 10−2 10−1 100 ǫt
λ = 1e-3 λ = 1e-4 λ = 1e-5 λ = 1e-6 O(1/ǫ) O(log(1/ǫ)) S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 25 / 30
Lower Bounds
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 26 / 30
Lower Bounds
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 27 / 30
Lower Bounds
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 27 / 30
Lower Bounds
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 27 / 30
Lower Bounds
Input 1 2 3 4 Input 1 2 3 4 N u m b e r
i t e r a t i
s 1 2 3 4 5 6 7
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 28 / 30
References
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 29 / 30
References
S.V . N. Vishwanathan (Purdue University) Optimization for Machine Learning 30 / 30