cs675 convex and combinatorial optimization fall 2014
play

CS675: Convex and Combinatorial Optimization Fall 2014 Convex - PowerPoint PPT Presentation

CS675: Convex and Combinatorial Optimization Fall 2014 Convex Functions Instructor: Shaddin Dughmi Outline Convex Functions 1 Examples of Convex and Concave Functions 2 Convexity-Preserving Operations 3 Convex Functions A function f : R n


  1. CS675: Convex and Combinatorial Optimization Fall 2014 Convex Functions Instructor: Shaddin Dughmi

  2. Outline Convex Functions 1 Examples of Convex and Concave Functions 2 Convexity-Preserving Operations 3

  3. Convex Functions A function f : R n → R is convex if the line segment between any points on the graph of f lies above f . i.e. if x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≤ θf ( x ) + (1 − θ ) f ( y ) Convex Functions 0/23

  4. Convex Functions A function f : R n → R is convex if the line segment between any points on the graph of f lies above f . i.e. if x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≤ θf ( x ) + (1 − θ ) f ( y ) Inequality called Jensen’s inequality (basic form) Convex Functions 0/23

  5. Convex Functions A function f : R n → R is convex if the line segment between any points on the graph of f lies above f . i.e. if x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≤ θf ( x ) + (1 − θ ) f ( y ) Inequality called Jensen’s inequality (basic form) f is convex iff its restriction to any line { x + tv : t ∈ R } is convex Convex Functions 0/23

  6. Convex Functions A function f : R n → R is convex if the line segment between any points on the graph of f lies above f . i.e. if x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≤ θf ( x ) + (1 − θ ) f ( y ) Inequality called Jensen’s inequality (basic form) f is convex iff its restriction to any line { x + tv : t ∈ R } is convex f is strictly convex if inequality strict when x � = y . Convex Functions 0/23

  7. Convex Functions A function f : R n → R is convex if the line segment between any points on the graph of f lies above f . i.e. if x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≤ θf ( x ) + (1 − θ ) f ( y ) Inequality called Jensen’s inequality (basic form) f is convex iff its restriction to any line { x + tv : t ∈ R } is convex f is strictly convex if inequality strict when x � = y . Analogous definition when the domain of f is a convex subset D of R n Convex Functions 0/23

  8. Concave and Affine Functions A function is f : R n → R is concave if − f is convex. Equivalently: Line segment between any points on the graph of f lies below f . If x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≥ θf ( x ) + (1 − θ ) f ( y ) Convex Functions 1/23

  9. Concave and Affine Functions A function is f : R n → R is concave if − f is convex. Equivalently: Line segment between any points on the graph of f lies below f . If x, y ∈ R n and θ ∈ [0 , 1] , then f ( θx + (1 − θ ) y ) ≥ θf ( x ) + (1 − θ ) f ( y ) f : R n → R is affine if it is both concave and convex. Equivalently: Line segment between any points on the graph of f lies on the graph of f . f ( x ) = a ⊺ x + b for some a ∈ R n and b ∈ R . Convex Functions 1/23

  10. We will now look at some equivalent definitions of convex functions First Order Definition A differentiable f : R n → R is convex if and only if the first-order approximation centered at any point x underestimates f everywhere. f ( y ) ≥ f ( x ) + ( ▽ f ( x )) ⊺ ( y − x ) Convex Functions 2/23

  11. We will now look at some equivalent definitions of convex functions First Order Definition A differentiable f : R n → R is convex if and only if the first-order approximation centered at any point x underestimates f everywhere. f ( y ) ≥ f ( x ) + ( ▽ f ( x )) ⊺ ( y − x ) Local information → global information If ▽ f ( x ) = 0 then x is a global minimizer of f Convex Functions 2/23

  12. Second Order Definition A twice differentiable f : R n → R is convex if and only if its derivative is nondecreasing in all directions. Formally, ▽ 2 f ( x ) � 0 for all x . Convex Functions 3/23

  13. Second Order Definition A twice differentiable f : R n → R is convex if and only if its derivative is nondecreasing in all directions. Formally, ▽ 2 f ( x ) � 0 for all x . Intepretation Recall definition of PSD: z ⊺ ▽ 2 f ( x ) z > 0 for all z ∈ R n At x + δ� z , infitisimal change in gradient is in roughly in the same direction as � z Graph of f curves upwards along any line When n = 1 , this is f ′′ ( x ) ≥ 0 . Convex Functions 3/23

  14. Epigraph The epigraph of f is the set of points above the graph of f . Formally, epi ( f ) = { ( x, t ) : t ≥ f ( x ) } Convex Functions 4/23

  15. Epigraph The epigraph of f is the set of points above the graph of f . Formally, epi ( f ) = { ( x, t ) : t ≥ f ( x ) } Epigraph Definition f is a convex function if and only if its epigraph is a convex set. Convex Functions 4/23

  16. Jensen’s Inequality (General Form) f : R n → R is convex if and only if For every x 1 , . . . , x k in the domain of f , and θ 1 , . . . , θ k ≥ 0 such that � i θ i = 1 , we have � � f ( θ i x i ) ≤ θ i f ( x i ) i i Given a probability measure D on the domain of f , and x ∼ D , f ( E [ x ]) ≤ E [ f ( x )] Convex Functions 5/23

  17. Jensen’s Inequality (General Form) f : R n → R is convex if and only if For every x 1 , . . . , x k in the domain of f , and θ 1 , . . . , θ k ≥ 0 such that � i θ i = 1 , we have � � f ( θ i x i ) ≤ θ i f ( x i ) i i Given a probability measure D on the domain of f , and x ∼ D , f ( E [ x ]) ≤ E [ f ( x )] Adding noise to x can only increase f ( x ) in expectation. Convex Functions 5/23

  18. Local and Global Optimality Local minimum x is a local minimum of f if there is a an open ball B containing x where f ( y ) ≥ f ( x ) for all y ∈ B . Local and Global Optimality When f is convex, x is a local minimum of f if and only if it is a global minimum. Convex Functions 6/23

  19. Local and Global Optimality Local minimum x is a local minimum of f if there is a an open ball B containing x where f ( y ) ≥ f ( x ) for all y ∈ B . Local and Global Optimality When f is convex, x is a local minimum of f if and only if it is a global minimum. This fact underlies much of the tractability of convex optimization. Convex Functions 6/23

  20. Sub-level sets � x 2 + y 2 Level sets of f ( x, y ) = Sublevel set The α -sublevel set of f is { x ∈ domain ( f ) : f ( x ) ≤ α } . Convex Functions 7/23

  21. Sub-level sets � x 2 + y 2 Level sets of f ( x, y ) = Sublevel set The α -sublevel set of f is { x ∈ domain ( f ) : f ( x ) ≤ α } . Fact Every sub-level set of a convex function is a convex set. This fact also underlies tractability of convex optimization Convex Functions 7/23

  22. Sub-level sets � x 2 + y 2 Level sets of f ( x, y ) = Sublevel set The α -sublevel set of f is { x ∈ domain ( f ) : f ( x ) ≤ α } . Fact Every sub-level set of a convex function is a convex set. This fact also underlies tractability of convex optimization Note: converse false, but nevertheless useful check. Convex Functions 7/23

  23. Other Basic Properties Continuity Convex functions are continuous. Convex Functions 8/23

  24. Other Basic Properties Continuity Convex functions are continuous. Extended-value extension If a function f : D → R is convex on its domain, and D is convex, then it can be extended to a convex function on R n . by setting f ( x ) = ∞ whenever x / ∈ D . f : D → R � ∞ is “convex” This simplifies notation. Resulting function � with respect to the ordering on R � ∞ Convex Functions 8/23

  25. Outline Convex Functions 1 Examples of Convex and Concave Functions 2 Convexity-Preserving Operations 3

  26. Functions on the reals Affine: ax + b Exponential: e ax convex for any a ∈ R Powers: x a convex on R ++ when a ≥ 1 or a ≤ 0 , and concave for 0 ≤ a ≤ 1 Logarithm: log x concave on R ++ . Examples of Convex and Concave Functions 9/23

  27. Norms Norms are convex. || θx + (1 − θ ) y || ≤ || θx || + || (1 − θ ) y || = θ || x || + (1 − θ ) || y || Uses both norm axioms: triangle inequality, and homogeneity. Applies to matrix norms, such as the spectral norm (radius of induced ellipsoid) Examples of Convex and Concave Functions 10/23

  28. Norms Norms are convex. || θx + (1 − θ ) y || ≤ || θx || + || (1 − θ ) y || = θ || x || + (1 − θ ) || y || Uses both norm axioms: triangle inequality, and homogeneity. Applies to matrix norms, such as the spectral norm (radius of induced ellipsoid) Max max i x i is convex max i ( θx + (1 − θ ) y ) i = max i ( θx i + (1 − θ ) y i ) ≤ max θx i + max i (1 − θ ) y i i = θ max x i + (1 − θ ) max y i i i If i’m allowed to pick the maximum entry of θx and θy independently, I can do only better. Examples of Convex and Concave Functions 10/23

  29. Log-sum-exp: log( e x 1 + e x 2 + . . . + e x n ) is convex Geometric mean: ( � n 1 n is concave i =1 x i ) Log-determinant: log det X is concave Quadratic form: x ⊺ Ax is convex iff A � 0 Other examples in book f ( x, y ) = log( e x + e y ) Examples of Convex and Concave Functions 11/23

  30. Log-sum-exp: log( e x 1 + e x 2 + . . . + e x n ) is convex Geometric mean: ( � n 1 n is concave i =1 x i ) Log-determinant: log det X is concave Quadratic form: x ⊺ Ax is convex iff A � 0 Other examples in book f ( x, y ) = log( e x + e y ) Proving convexity often comes down to case-by-case reasoning, involving: Definition: restrict to line and check Jensen’s inequality Write down the Hessian and prove PSD Express as a combination of other convex functions through convexity-preserving operations (Next) Examples of Convex and Concave Functions 11/23

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend