Introduction to Convex Optimization for Machine Learning
John Duchi
University of California, Berkeley
Practical Machine Learning, Fall 2009
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 1 / 53
Introduction to Convex Optimization for Machine Learning John Duchi - - PowerPoint PPT Presentation
Introduction to Convex Optimization for Machine Learning John Duchi University of California, Berkeley Practical Machine Learning, Fall 2009 Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 1 / 53 Outline What is
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 1 / 53
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 2 / 53
What is Optimization
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 3 / 53
What is Optimization
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 4 / 53
What is Optimization
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 4 / 53
What is Optimization
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 5 / 53
What is Optimization
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 6 / 53
What is Optimization
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3 −50 50 100 150 200 250
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 7 / 53
What is Optimization
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3 −50 50 100 150 200 250
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 7 / 53
Convex Sets
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 8 / 53
Convex Sets
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 9 / 53
Convex Sets
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 10 / 53
Convex Sets
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 10 / 53
Convex Sets
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 10 / 53
Convex Sets
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 11 / 53
Convex Sets
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 12 / 53
Convex Sets
0.2 0.4 0.6 0.8 1 −1 −0.5 0.5 1 0.2 0.4 0.6 0.8 1
x y z
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 13 / 53
Convex Functions
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 14 / 53
Convex Functions
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 15 / 53
Convex Functions
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 16 / 53
Convex Functions
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 17 / 53
Convex Functions
−2 −1 1 2 −2 −1 1 2 2 4 6 8 10
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 18 / 53
Convex Functions
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 19 / 53
Convex Functions Examples
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 20 / 53
Convex Functions Examples
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 20 / 53
Convex Functions Examples
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 21 / 53
Convex Functions Examples
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 21 / 53
Convex Functions Examples
Convex Optimization for Machine Learning Fall 2009 21 / 53
Convex Functions Examples
3 3
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 22 / 53
Convex Optimization Problems
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 23 / 53
Convex Optimization Problems
0.5 1 1.5 2 2.5 3 3.5 0.5 1 1.5 2 2.5 3 3.5 4
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 24 / 53
Convex Optimization Problems
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 25 / 53
Convex Optimization Problems
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 26 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 27 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 28 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 29 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 30 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 31 / 53
Lagrange Duality
1There are conditions called constraint qualification for which this is true Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 32 / 53
Lagrange Duality
−2 −1.5 −1 −0.5 0.5 1 1.5 2 −2 2 4 6 8 10 12 14
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 −0.4 −0.2 0.2 0.4 0.6
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 33 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 34 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 34 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 34 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 35 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 36 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 37 / 53
Lagrange Duality
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 37 / 53
Optimization Algorithms
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 38 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 39 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 40 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 41 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 42 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 42 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 43 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 44 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 45 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 46 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 47 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 47 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 48 / 53
Optimization Algorithms Gradient Methods
−3 1 −2 10
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 49 / 53
Optimization Algorithms Gradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 50 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 51 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 52 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 53 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 53 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 54 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 54 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 55 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 56 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 57 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 57 / 53
Optimization Algorithms Subgradient Methods
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 57 / 53
Take Home Messages
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 58 / 53
Take Home Messages
Duchi (UC Berkeley) Convex Optimization for Machine Learning Fall 2009 59 / 53