math 4211 6211 optimization convex optimization problems
play

MATH 4211/6211 Optimization Convex Optimization Problems Xiaojing - PowerPoint PPT Presentation

MATH 4211/6211 Optimization Convex Optimization Problems Xiaojing Ye Department of Mathematics & Statistics Georgia State University Xiaojing Ye, Math & Stat, Georgia State University 0 A set R n is called convex if for any


  1. MATH 4211/6211 – Optimization Convex Optimization Problems Xiaojing Ye Department of Mathematics & Statistics Georgia State University Xiaojing Ye, Math & Stat, Georgia State University 0

  2. A set Ω ⊂ R n is called convex if for any x , y ∈ Ω , there is Definition. α x + (1 − α ) y ∈ Ω for all α ∈ [0 , 1] Definition. A function f : Ω → R , where Ω is a convex set, is called convex if for any x , y ∈ Ω and α ∈ [0 , 1] , there is f ( α x + (1 − α ) y ) ≤ αf ( x ) + (1 − α ) f ( y ) . Moreover, f is called strictly convex if for any distinct x , y ∈ Ω and α ∈ (0 , 1) , there is f ( α x + (1 − α ) y ) < αf ( x ) + (1 − α ) f ( y ) A function f is called (strictly) concave if − f is (strictly) convex. Xiaojing Ye, Math & Stat, Georgia State University 1

  3. There is an alternative definition based on the convexity of the epigraph of f . Definition. The graph of f : Ω → R is defined by { [ x ; f ( x )] ∈ R n +1 : x ∈ Ω } Definition. The epigraph of f : Ω → R is defined by epi( f ) := { [ x ; β ] ∈ R n +1 : x ∈ Ω , β ≥ f ( x ) } Definition. A function f : Ω → R , where Ω is a convex set, is called convex if epi( f ) is a convex set. Xiaojing Ye, Math & Stat, Georgia State University 2

  4. Example. Let f ( x ) = x 1 x 2 be defined on Ω := { x : x ≥ 0 } . Is f convex? Solution. f is not convex. The set Ω ⊂ R 2 is convex. But if we choose x = [1; 2] and y = [2; 1] , then α x + (1 − α ) y = [2 − α ; 1 + α ] . On the one hand f ( α x + (1 − α ) y ) = 2 + α − α 2 . On the other hand, αf ( x ) + (1 − α ) f ( y ) = 2 . Choosing α = 1 / 2 yields f ( α x + (1 − α ) y ) > αf ( x ) + (1 − α ) f ( y ) which means that f is not convex. Xiaojing Ye, Math & Stat, Georgia State University 3

  5. There are several sufficient and necessary conditions for the convexity of f . Theorem. If f : R n → R is C 1 and Ω is convex, then f is convex on Ω iff for all x , y ∈ Ω , f ( y ) ≥ f ( x ) + ∇ f ( x ) ⊤ ( y − x ) Proof. ( ⇒ ) Suppose f is convex, then for any x , y ∈ Ω and α ∈ (0 , 1] , f ((1 − α ) x + α y ) ≤ (1 − α ) f ( x ) + αf ( y ) Rearrange terms to obtain f ( x + α ( y − x )) − f ( x ) ≤ f ( y ) − f ( x ) α Taking the limit as α → 0 yields f ( y ) ≥ f ( x ) + ∇ f ( x ) ⊤ ( y − x ) Xiaojing Ye, Math & Stat, Georgia State University 4

  6. Proof (cont.) ( ⇐ ) For any x , y ∈ Ω and α ∈ [0 , 1] , define x α = α x + (1 − α ) y . Then f ( x ) ≥ f ( x α ) + ∇ f ( x α ) ⊤ ( x α − x ) f ( y ) ≥ f ( x α ) + ∇ f ( x α ) ⊤ ( x α − y ) Multiplying the two inequalities by α and 1 − α respectively, and adding to- gether yields f ( α x + (1 − α ) y ) ≤ αf ( x ) + (1 − α ) f ( y ) Xiaojing Ye, Math & Stat, Georgia State University 5

  7. Theorem. Let f : R n → R be C 2 and Ω be convex, then f is convex on Ω iff ∇ 2 f ( x ) � 0 for all x ∈ Ω . Proof. ( ⇒ ) If not, then exist x ∈ Ω and d ∈ R n , such that d ⊤ ∇ 2 f ( x ) d < 0 Since ∇ 2 f ( x ) is continuous, there exists s > 0 sufficiently small, such that for y = x + s d ∈ Ω , there is f ( y ) = f ( x ) + ∇ f ( x ) ⊤ ( y − x ) + 1 2( y − x ) ⊤ ∇ 2 f ( x + t d )( y − x ) < f ( x ) + ∇ f ( x ) ⊤ ( y − x ) for some t ∈ (0 , s ) since ( y − x ) ⊤ ∇ 2 f ( x + t d )( y − x ) = s 2 d ⊤ ∇ 2 f ( x + t d ) d < 0 . Hence f is not convex, a contradiction. Xiaojing Ye, Math & Stat, Georgia State University 6

  8. Proof (cont.) ( ⇐ ) For any x , y ∈ Ω , there is f ( y ) = f ( x ) + ∇ f ( x ) ⊤ ( y − x ) + 1 2( y − x ) ⊤ ∇ 2 f ( x + t d )( y − x ) ≥ f ( x ) + ∇ f ( x ) ⊤ ( y − x ) where d := y − x and t ∈ (0 , 1) . Note that we used the fact that ∇ 2 f ( x + t d ) � 0 . Hence f is convex. Xiaojing Ye, Math & Stat, Georgia State University 7

  9. Examples. Determine if any of the following functions is convex. f 1 ( x ) = − 8 x 2 f 2 ( x ) = 4 x 2 1 + 3 x 2 2 + 5 x 2 3 + 6 x 1 x 2 + x 1 x 3 − 3 x 1 − 2 x 2 + 15 f 3 ( x ) = 2 x 1 x 2 − x 2 1 − x 2 2 Solution. f ′′ 1 ( x ) = − 16 < 0 , so f 1 is concave. For f 2 , we have   8 6 1 ∇ 2 f 2 = 6 6 0     1 0 10 whose leading principal minors are 8 , 12 , 114 . Hence f 2 is convex. For f 3 , we have � � − 2 2 ∇ 2 f 3 = 2 − 2 whose eigenvalues are − 4 and 0 , hence f 3 is negative semidefinite. Xiaojing Ye, Math & Stat, Georgia State University 8

  10. Theorem. Suppose f : Ω → R is convex. Then x is a global minimizer of f on Ω iff it is a local minimizer of f . Proof. The necessity is trivial. Suppose x is a local minimizer, then ∃ r > 0 such that f ( x ) ≤ f ( z ) for all z ∈ B ( x , r ) . If ∃ y , such that f ( x ) > f ( y ) , r then let α = � y − x � and r x α = (1 − α ) x + α y = x + � y − x � ( y − x ) . Then x α ∈ B ( x , r ) and f ( x α ) ≥ f ( x ) > (1 − α ) f ( x ) + αf ( y ) , which is a contradiction. Hence x must be a global minimizer. Xiaojing Ye, Math & Stat, Georgia State University 9

  11. Lemma. Suppose f : Ω → R is convex. Then the sub-level set of f Γ c = { x ∈ Ω : f ( x ) ≤ c } is empty or convex for any c ∈ R . Proof. If x , y ∈ Γ c , then f ( x ) , f ( y ) ≤ c . Since f is convex, we have f ( α x + (1 − α ) y ) ≤ αf ( x ) + (1 − α ) f ( y ) ≤ c i.e., α x + (1 − α ) y ∈ Γ c for all α ∈ [0 , 1] . Hence Γ c is a convex set. Xiaojing Ye, Math & Stat, Georgia State University 10

  12. Corollary. Suppose f : Ω → R is convex. Then the set of all global minimiz- ers of f over Ω is convex. Proof. Let f ∗ = min x ∈ Ω f ( x ) . Then Γ f ∗ is the set of all global minimizers. By the lemma above, we knwo Γ c is a convex set. Xiaojing Ye, Math & Stat, Georgia State University 11

  13. Lemma. Suppose f : Ω → R is convex and C 1 . Then x ∗ is a global minimizer of f over Ω iff ∇ f ( x ∗ ) ⊤ ( x − x ∗ ) ≥ 0 , ∀ x ∈ Ω . Proof. ( ⇒ ) If not, then ∃ x ∈ Ω , s.t. ∇ f ( x ∗ ) ⊤ ( x − x ∗ ) < 0 Denote x α = (1 − α ) x ∗ + α x = x ∗ + α ( x − x ∗ ) for α ∈ (0 , 1) . Since f ∈ C 1 , we know there exists α small enough, s.t. ∀ α ′ ∈ (0 , α ) ∇ f ( x α ′ ) ⊤ ( x − x ∗ ) < 0 , Xiaojing Ye, Math & Stat, Georgia State University 12

  14. Proof (cont.) Moreover, there exists α ′ ∈ (0 , α ) s.t. f ( x α ) = f ( x ∗ ) + ∇ f ( x α ′ ) ⊤ ( x α − x ∗ ) = f ( x ∗ ) + α ∇ f ( x α ′ ) ⊤ ( x − x ∗ ) < f ( x ∗ ) which contradicts to x ∗ being a global minimizer. ( ⇐ ) For all x ∈ Ω , there is f ( x ) ≥ f ( x ∗ ) + ∇ f ( x ∗ ) ⊤ ( x − x ∗ ) ≥ f ( x ∗ ) Hence x ∗ is a global minimizer. Xiaojing Ye, Math & Stat, Georgia State University 13

  15. Then x ∗ is a global Suppose f : Ω → R is convex and C 1 . Theorem. minimizer of f over Ω iff for any feasible direction d at x ∗ there is d ⊤ ∇ f ( x ∗ ) ≥ 0 . Proof. ( ⇒ ) Let d be feasible, then ∃ x ∈ Ω s.t. x − x ∗ = α d for some α > 0 . Hence by the Lemma above, we have ∇ f ( x ∗ ) ⊤ ( x − x ∗ ) = α ∇ f ( x ∗ ) ⊤ d ≥ 0 . So ∇ f ( x ∗ ) ⊤ d ≥ 0 . ( ⇐ ) For any x ∈ Ω , we know x α = (1 − α ) x ∗ + α x ∈ Ω for all α ∈ (0 , 1) . Hence d = x − x ∗ = ( x α − x ∗ ) /α is a feasible direction. Therefore ∇ f ( x ∗ ) ⊤ ( x − x ∗ ) = ∇ f ( x ∗ ) ⊤ d ≥ 0 . As x ∈ Ω is arbitrary, we know x ∗ is a global minimizer. Xiaojing Ye, Math & Stat, Georgia State University 14

  16. Corollary. Suppose f : Ω → R is convex and C 1 . If x ∗ ∈ Ω is such that ∇ f ( x ∗ ) = 0 , then x ∗ is a global minimizer of f . Proof. For any feasible d there is ∇ f ( x ∗ ) ⊤ d = 0 . Hence x ∗ is a global minimizer. Xiaojing Ye, Math & Stat, Georgia State University 15

  17. Theorem. Let f : R n → R and f ∈ C 1 be convex, and Ω = { x ∈ R n : h ( x ) = 0 } where h : R n → R m such that Ω is convex. Then x ∗ ∈ Ω is a global minimizer of f over Ω iff there exists λ ∗ ∈ R m such that ∇ f ( x ∗ ) + D h ( x ∗ ) ⊤ λ ∗ = 0 . Proof. ( ⇒ ) By the KKT condition. ( ⇐ ) Note that f being convex implies f ( x ) ≥ f ( x ∗ ) + ∇ f ( x ∗ ) ⊤ ( x − x ∗ ) , ∀ x ∈ Ω Also note that ∇ f ( x ∗ ) = − D h ( x ∗ ) ⊤ λ ∗ , we know f ( x ) ≥ f ( x ∗ ) − λ ∗⊤ D h ( x ∗ )( x − x ∗ ) Xiaojing Ye, Math & Stat, Georgia State University 16

  18. Proof (cont.) For any x ∈ Ω , we know x ∗ + α ( x − x ∗ ) ∈ Ω for all α ∈ (0 , 1) . Hence h ( x ∗ + α ( x − x ∗ )) = 0 and h ( x ∗ + α ( x − x ∗ )) − h ( x ∗ ) D h ( x ∗ )( x − x ∗ ) = lim = 0 α α → 0 Hence f ( x ) ≥ f ( x ∗ ) for all x ∈ Ω . Therefore x ∗ is a global minimizer. Xiaojing Ye, Math & Stat, Georgia State University 17

  19. Theorem. Let f : R n → R and f ∈ C 1 be convex, and Ω = { x ∈ R n : h ( x ) = 0 , g ( x ) ≤ 0 } where h : R n → R m and g : R n → R p are C 1 and such that Ω is convex. Then x ∗ ∈ Ω is a global minimizer of f over Ω iff there exist λ ∗ ∈ R m , µ ∗ ∈ R p + such that ∇ f ( x ∗ ) ⊤ + λ ∗⊤ D h ( x ∗ ) + µ ∗⊤ D g ( x ∗ ) = 0 ⊤ , g ( x ∗ ) ⊤ µ ∗ = 0 . Proof. ( ⇒ ) By the KKT condition. Xiaojing Ye, Math & Stat, Georgia State University 18

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend