Lecture 3: Convex Function CK Cheng Dept. of Computer Science and - - PowerPoint PPT Presentation

โ–ถ
lecture 3 convex function
SMART_READER_LITE
LIVE PREVIEW

Lecture 3: Convex Function CK Cheng Dept. of Computer Science and - - PowerPoint PPT Presentation

CSE203B Convex Optimization: Lecture 3: Convex Function CK Cheng Dept. of Computer Science and Engineering University of California, San Diego 1 Outlines 1. Definitions: Convexity, Examples & Views 2. Conditions of Optimality 1. First


slide-1
SLIDE 1

1

CSE203B Convex Optimization: Lecture 3: Convex Function

CK Cheng

  • Dept. of Computer Science and Engineering

University of California, San Diego

slide-2
SLIDE 2

Outlines

  • 1. Definitions: Convexity, Examples & Views
  • 2. Conditions of Optimality
  • 1. First Order Condition
  • 2. Second Order Condition
  • 3. Operations that Preserve the Convexity
  • 1. Pointwise Maximum
  • 2. Partial Minimization
  • 4. Conjugate Function
  • 5. Log-Concave, Log-Convex Functions

2

slide-3
SLIDE 3

Outlines

  • 1. Definitions
  • 1. Convex Function vs Convex Set
  • 2. Examples

1. Norm 2. Entropy 3. Affine 4. Determinant 5. Maximum

  • 3. Views of Functions and Related Hyperplanes

3

slide-4
SLIDE 4
  • 1. Definitions: Convex Function vs Convex Set

4

Theorem: Given ๐‘‡ = ๐‘ฆ ๐‘” ๐‘ฆ โ‰ค ๐‘ If function ๐‘” ๐‘ฆ is convex, then ๐‘‡ is a convex set. Proof: We prove by the definition of convex set. For every ๐‘ฃ, ๐‘ค โˆˆ ๐‘‡, i. e. ๐‘” ๐‘ฃ โ‰ค ๐‘, ๐‘” ๐‘ค โ‰ค ๐‘, We want to show that ฮฑ๐‘ฃ + ๐›พ๐‘ค โˆˆ ๐‘‡, โˆ€ฮฑ + ๐›พ = 1, ๐›ฝ, ๐›พ โ‰ฅ 0. We have ๐‘” ๐›ฝ๐‘ฃ + ๐›พ๐‘ค โ‰ค ๐›ฝ๐‘” ๐‘ฃ + ๐›พ๐‘” ๐‘ค (๐‘” ๐‘—๐‘ก ๐‘‘๐‘๐‘œ๐‘ค๐‘“๐‘ฆ) โ‰ค ๐›ฝ๐‘ + ๐›พ๐‘ (๐›ฝ, ๐›พ โ‰ฅ 0) = ๐›ฝ + ๐›พ โˆ™ ๐‘ = ๐‘ (๐›ฝ + ๐›พ = 1) Thus ฮฑ๐‘ฃ + ๐›พ๐‘ค โˆˆ ๐‘‡ Remark: Convex function => Convex Set ๐‘”(๐‘ฆ) โ‰ค ๐‘ => Convex Set ๐‘”(๐‘ฆ) โ‰ฅ ๐‘ => ?

slide-5
SLIDE 5
  • 1. Convex Function Definitions: Examples

5

๐‘”: ๐‘†๐‘œ โ†’ ๐‘† is convex if ๐‘’๐‘๐‘› ๐‘” is a convex set and ๐‘” ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค ๐œ„๐‘” ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘”(๐‘ง) โˆ€๐‘ฆ, ๐‘ง โˆˆ ๐‘’๐‘๐‘› ๐‘”, 0 โ‰ค ๐œ„ โ‰ค 1 Example on R: Convex Functions Affine: ๐‘๐‘ฆ + ๐‘ ๐‘๐‘œ ๐‘† for any ๐‘, ๐‘ โˆˆ ๐‘† Exponential: ๐‘“๐‘๐‘ฆ for any ๐‘ โˆˆ ๐‘† Power: ๐‘ฆ๐›ฝ ๐‘๐‘œ ๐‘†++ for ๐›ฝ โ‰ฅ 1 or ๐›ฝ โ‰ค 0 ๐‘ฆ ๐‘ž ๐‘๐‘œ ๐‘† for ๐‘ž โ‰ฅ 1 Concave Functions Affine: ๐‘๐‘ฆ + ๐‘ ๐‘๐‘œ ๐‘† for any ๐‘, ๐‘ โˆˆ ๐‘† Power: ๐‘ฆ๐›ฝ ๐‘๐‘œ ๐‘†++ for 0 โ‰ค ๐›ฝ โ‰ค 1 Logarithm: ๐‘š๐‘๐‘•๐‘ฆ ๐‘๐‘œ ๐‘†++

slide-6
SLIDE 6

6

Example on ๐‘†๐‘œ: Affine: ๐‘” ๐‘ฆ = ๐‘๐‘ˆ๐‘ฆ + ๐‘ Norms: ๐‘ฆ ๐‘ž = ฯƒ๐‘—=1

๐‘œ

๐‘ฆ ๐‘ž

เต—

1 ๐‘ž ๐‘”๐‘๐‘  ๐‘ž โ‰ฅ 1;

๐‘ฆ โˆž = max

๐‘™

|๐‘ฆ๐‘™| Example on ๐‘†๐‘›ร—๐‘œ: Affine: ๐‘” ๐‘Œ = ๐‘ข๐‘  ๐ต๐‘ˆ๐‘Œ = ฯƒ๐‘—=1

๐‘› ฯƒ๐‘˜=1 ๐‘œ

๐ต๐‘—๐‘˜๐‘ฆ๐‘—๐‘˜ Spectral (max singular value): ๐‘” ๐‘Œ = ๐‘Œ 2 = ๐œ๐‘›๐‘๐‘ฆ ๐‘Œ = (๐œ‡๐‘›๐‘๐‘ฆ ๐‘Œ๐‘ˆ๐‘Œ ) ฮค

1 2

  • 1. Convex Function Definitions: Examples
slide-7
SLIDE 7

7

Concave Functions: Log Determinant: ๐‘” ๐‘Œ = log det ๐‘Œ , ๐‘’๐‘๐‘› ๐‘” = S++

๐‘œ

Proof: Let ๐‘• ๐‘ข = ๐‘” ๐‘Œ + ๐‘ข๐‘Š ๐‘Š โˆˆ ๐‘‡๐‘œ ๐‘• ๐‘ข = ๐‘š๐‘๐‘• ๐‘’๐‘“๐‘ข (๐‘Œ + ๐‘ข๐‘Š) = ๐‘š๐‘๐‘• ๐‘’๐‘“๐‘ข ๐‘Œ + ๐‘š๐‘๐‘•๐‘’๐‘“๐‘ข(๐ฝ + ๐‘ข๐‘Œโˆ’1

2๐‘Š๐‘Œโˆ’1 2)

= ๐‘š๐‘๐‘• ๐‘’๐‘“๐‘ข ๐‘Œ + ฯƒ๐‘—=1

๐‘œ

๐‘š๐‘๐‘•(1 + ๐‘ข๐œ‡๐‘—) ๐œ‡๐‘—: ๐‘“๐‘—๐‘•๐‘“๐‘œ๐‘ค๐‘๐‘š๐‘ฃ๐‘“ ๐‘๐‘” ๐‘Œโˆ’1

2๐‘Š๐‘Œโˆ’1 2

๐‘• is concave in ๐‘ข โ‡’ ๐‘” is concave

  • 1. Convex Function Definitions: Examples
slide-8
SLIDE 8

8

Convex function examples: norm, max, expectation norm: If ๐‘”: ๐‘†๐‘œ โ†’ ๐‘† is a norm and 0 โ‰ค ๐œ„ โ‰ค 1 ๐‘” ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค ๐‘” ๐œ„๐‘ฆ + ๐‘” 1 โˆ’ ๐œ„ ๐‘ง = ๐œ„๐‘”(๐‘ฆ) + (1 โˆ’ ๐œ„)๐‘”(๐‘ง) Max function: ๐‘” ๐‘ฆ = max

๐‘—

๐‘ฆ๐‘— , ๐‘ฆ = ๐‘ฆ1, ๐‘ฆ2, โ€ฆ , ๐‘ฆ๐‘œ ๐‘ˆ ๐‘” ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง = max

๐‘—

๐œ„๐‘ฆ๐‘— + 1 โˆ’ ๐œ„ ๐‘ง๐‘— โ‰ค ๐œ„ max

๐‘—

๐‘ฆ๐‘— + 1 โˆ’ ๐œ„ max

๐‘—

๐‘ง๐‘— = ๐œ„๐‘” ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘” ๐‘ง for 0 โ‰ค ๐œ„ โ‰ค 1 Probability: (Expectation) If ๐‘” ๐‘ฆ is convex with ๐‘ž ๐‘ฆ a probability at ๐‘ฆ,

  • i. e. ๐‘ž ๐‘ฆ โ‰ฅ 0, โˆ€๐‘ฆ and ืฌ ๐‘ž(๐‘ฆ) ๐‘’๐‘ฆ = 1

Then ๐‘” ๐น๐‘ฆ โ‰ค ๐น๐‘” ๐‘ฆ , where ๐น๐‘ฆ = ืฌ๐‘ฆ ๐‘ž ๐‘ฆ ๐‘’๐‘ฆ ๐น๐‘”(๐‘ฆ) = ืฌ ๐‘”(๐‘ฆ) ๐‘ž ๐‘ฆ ๐‘’๐‘ฆ triangle inequality scalability

slide-9
SLIDE 9

1.3 Views of Functions and Related Hyperplanes

9

Given ๐‘” ๐‘ฆ , ๐‘ฆ โˆˆ ๐‘†๐‘œ, we plot the function in ๐‘†๐‘œ and ๐‘†๐‘œ+1 spaces.

  • 1. Draw function in ๐‘†๐‘œ space

Equipotential surface: tangent plane ๐›ผ๐‘” เทค ๐‘ฆ ๐‘ˆ ๐‘ฆ โˆ’ เทค ๐‘ฆ = 0 at เทค ๐‘ฆ

  • 2. Draw function in ๐‘†๐‘œ+1 space

2.1 Graph of function: {(๐‘ฆ, โ„Ž)|๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘”, โ„Ž = ๐‘” ๐‘ฆ } ๐ข๐ณ๐ช๐Ÿ๐ฌ๐ช๐ฆ๐›๐จ๐Ÿ (h = ๐›ผ๐‘” เทค ๐‘ฆ ๐‘ˆ ๐‘ฆ โˆ’ เทค ๐‘ฆ + ๐‘”(เทค ๐‘ฆ)) ๐›ผ๐‘” เทค ๐‘ฆ ๐‘ˆ โˆ’ 1 ๐‘ฆ โ„Ž โˆ’ เทค ๐‘ฆ ๐‘” เทค ๐‘ฆ = 0 Example: ๐‘” ๐‘ฆ = ๐‘ฆ2. We show the hyperplane with ๐›ผ๐‘” ๐‘ฆ 2.2. Epigraph: epi ๐‘”: {(x, ๐‘ข)|๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘”, ๐‘” ๐‘ฆ โ‰ค ๐‘ข} A function is convex iff its epigraph is a convex set. Example: ๐‘” ๐‘ฆ = max ๐‘”

๐‘— ๐‘ฆ | ๐‘— = 1 โ€ฆ ๐‘  , ๐‘” ๐‘— ๐‘ฆ ๐‘๐‘ ๐‘“ ๐‘‘๐‘๐‘œ๐‘ค๐‘“๐‘ฆ.

Since epi ๐‘” is the intersect of epi ๐‘”

๐‘—, epi ๐‘” is convex.

Thus, function ๐‘” is convex.

slide-10
SLIDE 10
  • 2. Conditions of Optimality: First Order Condition

10

Defintion: ๐‘” is differentiable if ๐‘’๐‘๐‘›๐‘” is open and ๐›ผ๐‘”(๐‘ฆ) โ‰ก (

๐œ–๐‘” ๐‘ฆ ๐œ–๐‘ฆ1 , ๐œ–๐‘” ๐‘ฆ ๐œ–๐‘ฆ2 , โ€ฆ , ๐œ–๐‘” ๐‘ฆ ๐œ–๐‘ฆ๐‘œ ) exists at each ๐‘ฆ โˆˆ ๐‘’๐‘๐‘›๐‘”

Theorem: Differentiable ๐‘” with convex domain is convex iff ๐‘” ๐‘ง โ‰ฅ ๐‘” ๐‘ฆ + ๐›ผ๐‘” ๐‘ฆ T ๐‘ง โˆ’ ๐‘ฆ , โˆ€๐‘ฆ, ๐‘ง โˆˆ ๐‘’๐‘๐‘›๐‘” Proof => If ๐‘” is convex ๐‘ˆโ„Ž๐‘“๐‘œ 1 โˆ’ ๐‘ข ๐‘” ๐‘ฆ + ๐‘ข๐‘” ๐‘ง โ‰ฅ ๐‘” 1 โˆ’ ๐‘ข ๐‘ฆ + ๐‘ข๐‘ง , โˆ€0 โ‰ค ๐‘ข โ‰ค 1 ๐‘ข ๐‘” ๐‘ง โˆ’ ๐‘” ๐‘ฆ โ‰ฅ ๐‘” ๐‘ฆ + ๐‘ข ๐‘ง โˆ’ ๐‘ฆ โˆ’ ๐‘”(๐‘ฆ) ๐‘” ๐‘ง โˆ’ ๐‘” ๐‘ฆ โ‰ฅ

1 ๐‘ข (๐‘” ๐‘ฆ + ๐‘ข ๐‘ง โˆ’ ๐‘ฆ

โˆ’ ๐‘” ๐‘ฆ ) = ๐›ผ๐‘” ๐‘ฆ ๐‘ง โˆ’ ๐‘ฆ ๐‘ฅโ„Ž๐‘“๐‘œ ๐‘ข โ†’ 0 <= ๐ป๐‘—๐‘ค๐‘“๐‘œ ๐‘” ๐‘ง โ‰ฅ ๐‘” ๐‘ฆ + ๐›ผ๐‘” ๐‘ฆ T ๐‘ง โˆ’ ๐‘ฆ , โˆ€๐‘ฆ, ๐‘ง โˆˆ ๐‘’๐‘๐‘›๐‘” ๐‘€๐‘“๐‘ข ๐‘จ = 1 โˆ’ ๐‘ข ๐‘ฆ + ๐‘ข๐‘ง where เต๐‘” ๐‘ฆ โ‰ฅ ๐‘” ๐‘จ + ๐›ผ๐‘” ๐‘จ T ๐‘ฆ โˆ’ ๐‘จ ๐‘” ๐‘ง โ‰ฅ ๐‘” ๐‘จ + ๐›ผ๐‘” ๐‘จ T ๐‘ง โˆ’ ๐‘จ Thus 1 โˆ’ ๐‘ข ๐‘” ๐‘ฆ + ๐‘ข๐‘” ๐‘ง โ‰ฅ ๐‘”(๐‘จ)

slide-11
SLIDE 11
  • 2. Conditions: Second Order Condition

11

Definition: ๐‘” is twice differentiable if ๐‘’๐‘๐‘›๐‘” is open and the Hessian ๐›ผ2๐‘” ๐‘ฆ โˆˆ ๐‘‡๐‘œ ๐›ผ2๐‘” ๐‘ฆ ๐‘—๐‘˜ โ‰ก

๐œ–2๐‘” ๐‘ฆ ๐œ–๐‘ฆ๐‘—๐œ–๐‘ฆ๐‘˜ , ๐‘—, ๐‘˜ = 1, โ€ฆ , ๐‘œ exists at each ๐‘ฆ โˆˆ ๐‘’๐‘๐‘›๐‘”

Theorem: Twice Differentiable ๐‘” with convex domain is convex iff ๐›ผ2๐‘” ๐‘ฆ โ‰ฝ 0, โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘›๐‘” Proof: Using Lagrange remainder, we can find a z

๐‘” ๐‘ฆ + ๐‘ข(๐‘ง โˆ’ ๐‘ฆ) = ๐‘” ๐‘ฆ + ๐›ผ๐‘” ๐‘ฆ ๐‘ˆ๐‘ข ๐‘ง โˆ’ ๐‘ฆ + 1 2 ๐‘ข2 ๐‘ง โˆ’ ๐‘ฆ ๐‘ˆ๐›ผ2๐‘” ๐‘จ ๐‘ง โˆ’ ๐‘ฆ ,

โˆ€0 โ‰ค ๐‘ข โ‰ค 1, ๐‘จ is between ๐‘ฆ and ๐‘ฆ + ๐‘ข(๐‘ง โˆ’ ๐‘ฆ) Since the last term is always positive by assumption, the first order condition is satisfied.

slide-12
SLIDE 12
  • 2. Conditions: Second Order Condition

12

Example: Negative Entropy: ๐‘” ๐‘ฆ = ๐‘ฆ log ๐‘ฆ , ๐‘ฆ โˆˆ ๐‘†++ ๐‘”โ€ฒ ๐‘ฆ = ๐‘ฆ ๐‘ฆ + log ๐‘ฆ = 1 + log ๐‘ฆ , ๐‘”โ€ฒโ€ฒ ๐‘ฆ = 1 ๐‘ฆ Since ๐‘ฆ โˆˆ ๐‘†++, ๐‘”โ€ฒโ€ฒ ๐‘ฆ > 0 โ‡’ ๐‘” ๐‘ฆ is convex Show the plot of ๐‘ฆ log ๐‘ฆ Remark:

  • 1st order condition can be used to design and prove the

property of opt. algorithms.

  • 2nd order condition implies the 1st order condition
  • 2nd order condition can be used to prove the convexity of

the functions.

slide-13
SLIDE 13
  • 2. Conditions: Examples

13

  • Quadratic Function: ๐‘” ๐‘ฆ =

1 2 ๐‘ฆ๐‘ˆ๐‘„๐‘ฆ + ๐‘Ÿ๐‘ˆ๐‘ฆ + ๐‘ , ๐‘„ โˆˆ ๐‘‡๐‘œ

๐›ผ๐‘” ๐‘ฆ = ๐‘„๐‘ฆ + ๐‘Ÿ, ๐›ผ2๐‘” ๐‘ฆ = ๐‘„

  • Least Square: ๐‘” ๐‘ฆ =

๐ต๐‘ฆ โˆ’ ๐‘ 2

2

๐›ผ๐‘” ๐‘ฆ = 2๐ต๐‘ˆ ๐ต๐‘ฆ โˆ’ ๐‘ , ๐›ผ2๐‘” ๐‘ฆ = ๐ต๐‘ˆ๐ต

  • Quadratic over linear: ๐‘” ๐‘ฆ, ๐‘ง =

๐‘ฆ2 ๐‘ง , ๐‘ง > 0

๐›ผ๐‘” ๐‘ฆ, ๐‘ง = 2๐‘ฆ ๐‘ง , โˆ’ ๐‘ฆ2 ๐‘ง2

๐‘ˆ

, , ๐›ผ2๐‘” ๐‘ฆ = 2 ๐‘ง โˆ’ 2๐‘ฆ ๐‘ง2 โˆ’ 2๐‘ฆ ๐‘ง2 2๐‘ฆ2 ๐‘ง3 = 2 ๐‘ง3 ๐‘ง โˆ’๐‘ฆ ๐‘ง โˆ’๐‘ฆ

slide-14
SLIDE 14
  • 2. Conditions: Examples

14

  • Log-sum-exp: ๐‘” ๐‘ฆ = log ฯƒ๐ฟ=1

๐‘œ

๐‘“๐‘ฆ๐‘™ (Smooth max of softmax function) ๐›ผ2๐‘” ๐‘ฆ =

1 1๐‘ˆ๐‘จ ๐‘’๐‘—๐‘๐‘• ๐‘จ โˆ’ 1 1๐‘ˆ๐‘จ ๐‘จ๐‘จ๐‘ˆ, ๐‘จ๐‘™ = ๐‘“๐‘ฆ๐‘™

๐‘ค๐‘ˆ๐›ผ2๐‘” ๐‘ฆ ๐‘ค =

1 1๐‘ˆ๐‘จ 2 [ ฯƒ๐‘—=1 ๐‘œ

๐‘จ๐‘— ฯƒ๐‘—=1

๐‘œ

๐‘ค๐‘—

2๐‘จ๐‘— โˆ’ ฯƒ๐‘—=1 ๐‘œ

๐‘ค๐‘—๐‘จ๐‘— 2] โ‰ฅ 0, for all ๐‘ค โˆˆ ๐‘†๐‘œ (Cauchy-Schwarz inequality) Thus, ๐‘”(๐‘ฆ) is a convex function Cauchy-Schwarz inequality: ๐‘๐‘ˆ๐‘

๐‘๐‘ˆ๐‘ โ‰ฅ ๐‘๐‘ˆ๐‘ 2, ๐‘๐‘— = ๐‘จ๐‘—, ๐‘๐‘— = ๐‘ค๐‘— ๐‘จ๐‘—

Proof 1: Let ๐‘จ = ๐‘ โˆ’

๐‘๐‘ˆ ๐‘ ๐‘๐‘ˆ ๐‘ ๐‘, or ๐‘ = ๐‘จ + ๐‘๐‘ˆ๐‘ ๐‘๐‘ˆ๐‘ ๐‘

We have a๐‘ˆa = zTz + ๐‘๐‘ˆ๐‘ 2 ๐‘๐‘ˆ๐‘ 2 ๐‘๐‘ˆ๐‘ โ‰ฅ ๐‘๐‘ˆ๐‘ 2 ๐‘๐‘ˆ๐‘ 2 ๐‘๐‘ˆ๐‘ = ๐‘๐‘ˆ๐‘ 2 ๐‘๐‘ˆ๐‘ Proof 2: By induction

slide-15
SLIDE 15
  • 3. Operations that preserve convexity

15

  • Nonnegative multiple: ๐›ฝ๐‘”, where ๐›ฝ โ‰ฅ 0, ๐‘” is convex
  • Sum: ๐‘”

1 + ๐‘” 2, where ๐‘” 1, ๐‘๐‘œ๐‘’ ๐‘” 2 are convex

  • Composition with affine function: ๐‘” ๐ต๐‘ฆ + ๐‘ , where ๐‘” is

convex Proof: ๐›ผ

๐‘ฆ 2๐‘” ๐ต๐‘ฆ + ๐‘ = ๐ต๐‘ˆ๐›ผ ๐‘ง 2๐‘” ๐‘ง|๐‘ง = ๐ต๐‘ฆ + ๐‘ ๐ต

E.g. ๐‘” ๐‘ฆ = โˆ’ ฯƒ๐‘—=1

๐‘› log ๐‘๐‘— โˆ’ ๐‘๐‘— ๐‘ˆ๐‘ฆ๐‘— ,

๐‘’๐‘๐‘› ๐‘” = {๐‘ฆ|๐‘๐‘—

๐‘ˆ๐‘ฆ < ๐‘๐‘—, ๐‘— = 1, โ€ฆ , ๐‘›}

๐‘” ๐‘ฆ = ๐ต๐‘ฆ + ๐‘ (if ๐‘” is twice differentiable)

slide-16
SLIDE 16
  • 3. Operations that preserve convexity

16

  • Pointwise maximum: ๐‘” ๐‘ฆ = max{๐‘”

1 ๐‘ฆ , โ€ฆ , ๐‘” ๐‘ (๐‘ฆ)} , ๐‘” ๐‘— are

convex

  • Pointwise supremum:

๐‘• ๐‘ฆ = sup

๐‘งโˆˆ๐ท

๐‘”(๐‘ฆ, ๐‘ง) , where ๐‘” ๐‘ฆ, ๐‘ง is convex in ๐‘ฆ and ๐ท is an arbitrary set Examples

  • ๐‘‡๐‘‘ ๐‘ฆ = sup

๐‘งโˆˆ๐ท

๐‘ง๐‘ˆ๐‘ฆ, for an arbitrary set ๐ท

  • ๐‘” ๐‘ฆ = sup

๐‘งโˆˆ๐ท

๐‘ฆ โˆ’ ๐‘ง , for an arbitrary set ๐ท

  • ๐œ‡๐‘›๐‘๐‘ฆ ๐‘Œ =

sup

๐‘ง 2=1

๐‘ง๐‘ˆ๐‘Œ๐‘ง, ๐‘Œ โˆˆ ๐‘‡๐‘œ

slide-17
SLIDE 17
  • 3. Operations that preserve convexity: Dual norm

17

Example: ๐‘” ๐‘ฆ = max

๐‘ง 2โ‰ค1 ๐‘ง๐‘ˆ๐‘ฆ

๐‘” ๐‘ฆ = max

๐‘ง 1โ‰ค1 ๐‘ง๐‘ˆ๐‘ฆ

๐‘” ๐‘ฆ = max

๐‘ง ๐‘žโ‰ค1 ๐‘ง๐‘ˆ๐‘ฆ

slide-18
SLIDE 18

18

Theorem: Pointwise maximum of convex functions is convex Given ๐‘” ๐‘ฆ = max ๐‘”

1 ๐‘ฆ , ๐‘” 2 ๐‘ฆ

, where ๐‘”

1 and ๐‘” 2 are convex

and ๐‘’๐‘๐‘› ๐‘” = ๐‘’๐‘๐‘› ๐‘”

1

โˆฉ ๐‘’๐‘๐‘› ๐‘”

2 is convex, then ๐‘” ๐‘ฆ is

convex. Proof: For 0 โ‰ค ๐œ„ โ‰ค 1, ๐‘ฆ, ๐‘ง โˆˆ ๐‘’๐‘๐‘› ๐‘” ๐‘” ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง = max{๐‘”

1 ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง , ๐‘” 2 ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง }

โ‰ค max{๐œ„๐‘”

1 ๐‘ฆ) + 1 โˆ’ ๐œ„ ๐‘” 1(๐‘ง , ๐œ„๐‘” 2 ๐‘ฆ) + 1 โˆ’ ๐œ„ ๐‘” 2(๐‘ง }

โ‰ค ๐œ„ max{๐‘”

1 ๐‘ฆ), ๐‘” 2(๐‘ฆ)} + 1 โˆ’ ๐œ„ max {๐‘” 1(๐‘ง , ๐‘” 2(๐‘ง)}

= ๐œ„๐‘” ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘” ๐‘ง i.e. ๐‘” ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค ๐œ„๐‘” ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘” ๐‘ง Thus, function ๐‘” ๐‘ฆ is convex.

  • 3. Operations that preserve convexity: max function
slide-19
SLIDE 19

19

Theorem: Partial minimization If ๐‘• ๐‘ฆ, ๐‘ง is convex in ๐‘ฆ and ๐‘ง, and a set ๐ท is convex Then ๐‘” ๐‘ฆ = min

๐‘งโˆˆ๐ท ๐‘• ๐‘ฆ, ๐‘ง is convex.

Proof: Let ๐‘ง1 โˆˆ {๐‘ง| min

๐‘งโˆˆ๐ท ๐‘•(๐‘ฆ1, ๐‘ง)} and ๐‘ง2 โˆˆ {๐‘ง| min ๐‘งโˆˆ๐ท (๐‘•(๐‘ฆ2, ๐‘ง)},

we can write ๐œ„๐‘” ๐‘ฆ1 + 1 โˆ’ ๐œ„ ๐‘” ๐‘ฆ2 = ๐œ„๐‘• ๐‘ฆ1, ๐‘ง1 + 1 โˆ’ ๐œ„ ๐‘•(๐‘ฆ2, ๐‘ง2) โ‰ฅ ๐‘•(๐œ„๐‘ฆ1 + 1 โˆ’ ๐œ„ ๐‘ฆ2, ๐œ„๐‘ง1 + 1 โˆ’ ๐œ„ ๐‘ง2) ๐‘• ๐‘—๐‘ก ๐‘‘๐‘๐‘œ๐‘ค๐‘“๐‘ฆ โ‰ฅ min

๐‘งโˆˆ๐ท ๐‘• ๐œ„๐‘ฆ1 + 1 โˆ’ ๐œ„ ๐‘ฆ2, ๐‘ง

๐ท ๐‘—๐‘ก ๐‘‘๐‘๐‘œ๐‘ค๐‘“๐‘ฆ = ๐‘”(๐œ„๐‘ฆ1 + 1 โˆ’ ๐œ„ ๐‘ฆ2) i.e. we have ๐œ„๐‘” ๐‘ฆ1 + 1 โˆ’ ๐œ„ ๐‘” ๐‘ฆ2 โ‰ฅ ๐‘” ๐œ„๐‘ฆ1 + 1 โˆ’ ๐œ„ ๐‘ฆ2 Therefore, ๐‘” ๐‘ฆ = min

๐‘งโˆˆ๐ท ๐‘• ๐‘ฆ, ๐‘ง is convex.

  • 3. Operations that preserve convexity: minimization
slide-20
SLIDE 20

Examples for Partial Minimization Given ๐‘” ๐‘ฆ, ๐‘ง = ๐‘ฆ๐‘ˆ ๐‘ง๐‘ˆ ๐ต ๐ถ ๐ถ๐‘ˆ ๐ท ๐‘ฆ ๐‘ง ๐‘ฆ โˆˆ ๐‘†๐‘œ, ๐‘ง โˆˆ ๐‘†๐‘›, ๐ต โˆˆ ๐‘‡+

๐‘œ, ๐ท โˆˆ ๐‘‡+ ๐‘›,

๐ต ๐ถ ๐ถ๐‘ˆ ๐ท โˆˆ ๐‘‡+

๐‘œ+๐‘›

Let ๐‘• ๐‘ฆ = min

๐‘ง ๐‘”(๐‘ฆ, ๐‘ง) = ๐‘ฆ๐‘ˆ ๐ต โˆ’ ๐ถ๐ท+๐ถ๐‘ˆ ๐‘ฆ,

๐ท+: ๐ช๐ญ๐Ÿ๐ฏ๐ž๐ฉ ๐ฃ๐จ๐ฐ๐Ÿ๐ฌ๐ญ๐Ÿ of matrix ๐ท. (Drazin inverse, or generalized inverse) We can claim that function ๐‘• ๐‘ฆ is convex. Proof: (1) ๐‘” ๐‘ฆ, ๐‘ง is convex (2) ๐‘ง โˆˆ ๐‘†๐‘› where ๐‘†๐‘› is a convex non-empty set (3) Therefore, ๐‘•(๐‘ฆ) is convex, i.e. ๐ต โˆ’ ๐ถ๐ท+๐ถ๐‘ˆ โ‰ฝ 0

  • 3. Operations that preserve convexity

20

slide-21
SLIDE 21

Composition: Given ๐‘•: ๐‘†๐‘œ โ†’ ๐‘† ๐‘๐‘œ๐‘’ โ„Ž: ๐‘† โ†’ ๐‘†, we set ๐‘” ๐‘ฆ = โ„Ž(๐‘• ๐‘ฆ ) f is convex if g convex, h convex, เทจ โ„Ž nondecreasing g concave, h convex, เทจ โ„Ž nonincreasing f is concave if g convex, h concave, เทจ โ„Ž nonincreasing g concave, h concave, เทจ โ„Ž nondecreasing Proof : for ๐‘œ=1 ๐‘”โ€ฒโ€ฒ ๐‘ฆ = โ„Žโ€ฒโ€ฒ ๐‘• ๐‘ฆ ๐‘•โ€ฒ ๐‘ฆ 2 + โ„Žโ€ฒ ๐‘• ๐‘ฆ ๐‘•โ€ฒโ€ฒ ๐‘ฆ Ex1: exp ๐‘•(๐‘ฆ) is convex if g is convex Ex2: 1/๐‘• ๐‘ฆ is convex if g is concave and positive Note that we set เทจ โ„Ž ๐‘ฆ = โˆž if ๐‘ฆ โˆ‰ ๐‘’๐‘๐‘› โ„Ž, h is convex เทจ โ„Ž ๐‘ฆ = โˆ’โˆž if ๐‘ฆ โˆ‰ ๐‘’๐‘๐‘› โ„Ž, h is concave

  • 3. Operations that preserve convexity

21

slide-22
SLIDE 22

Show that โ„Ž ๐‘• ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค ๐œ„โ„Ž ๐‘• ๐‘ฆ + 1 โˆ’ ๐œ„ โ„Ž(๐‘• ๐‘ง ) for the case that g, h are convex, and เทจ โ„Ž is nondecreasing (1) g is convex ๐‘• ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค ๐œ„๐‘• ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘•(๐‘ง) (2) h is nondecreasing: From (1), we have โ„Ž ๐‘• ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค โ„Ž ๐œ„๐‘• ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘• ๐‘ง (3) h is convex โ„Ž ๐œ„๐‘• ๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘• ๐‘ง โ‰ค ๐œ„โ„Ž ๐‘• ๐‘ฆ + 1 โˆ’ ๐œ„ โ„Ž ๐‘• ๐‘ง (4) From (2) & (3) โ„Ž ๐‘• ๐œ„๐‘ฆ + 1 โˆ’ ๐œ„ ๐‘ง โ‰ค ๐œ„โ„Ž ๐‘• ๐‘ฆ + 1 โˆ’ ๐œ„ โ„Ž ๐‘• ๐‘ง

  • 3. Operations that preserve convexity

22

slide-23
SLIDE 23

The setting of conjugate functions starts from the following problem (which may not be convex) ๐‘›๐‘—๐‘œ ๐‘”(๐‘ฆ) subject to ๐‘ฆ โ‰ค 0 We convert to a function of ๐‘ง ๐‘—๐‘œ๐‘”

๐‘ฆ

๐‘” ๐‘ฆ โˆ’ ๐‘ง๐‘ˆ๐‘ฆ The conjugate function is ๐‘”โˆ— ๐‘ง = sup

๐‘ฆ

๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘”(๐‘ฆ) In the class, we interchange min and inf; max and sup to simplify the notation.

  • 4. Conjugate Functions

23

slide-24
SLIDE 24

Given ๐‘”: ๐‘†๐‘œ โ†’ ๐‘†, we have ๐‘”โˆ—: ๐‘†๐‘œ โ†’ ๐‘† ๐‘”โˆ— ๐‘ง = sup

๐‘ฆโˆˆ๐‘’๐‘๐‘› ๐‘”

๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘” ๐‘ฆ ; (โˆ’๐‘”โˆ— ๐‘ง = min

๐‘ฆโˆˆ๐‘’๐‘๐‘› ๐‘” โˆ’ ๐‘ง๐‘ˆ๐‘ฆ + ๐‘”(๐‘ฆ))

Constraint: ๐‘ง โˆˆ ๐‘†๐‘œ for which the supremum is finite (bounded) ๐‘”โˆ— ๐‘ง is called the conjugate of function f Theorem : ๐‘”โˆ—(๐‘ง) is convex (pointwise maximum) Proof : ๐‘”โˆ— ๐œ„๐‘ง1 + 1 โˆ’ ๐œ„ ๐‘ง2 = sup

๐‘ฆ

๐œ„๐‘ง1 + 1 โˆ’ ๐œ„ ๐‘ง2 ๐‘ˆ๐‘ฆ โˆ’ ๐‘”(๐‘ฆ) โ‰ค sup

๐‘ฆ

๐œ„๐‘ง1

๐‘ˆ๐‘ฆ + ๐œ„๐‘” ๐‘ฆ

+ sup

๐‘ฆ

( 1 โˆ’ ๐œ„ ๐‘ง2

๐‘ˆ ๐‘ฆ โˆ’ 1 โˆ’ ๐œ„ ๐‘”(๐‘ฆ))

= ๐œ„๐‘”โˆ— ๐‘ง1 + 1 โˆ’ ๐œ„ ๐‘”โˆ— ๐‘ง2 Remark: ๐‘”โˆ—(๐‘ง) is convex even if ๐‘”(๐‘ฆ) is not convex

  • 4. Conjugate Functions

24

slide-25
SLIDE 25

Suppose we have a pair าง ๐‘ฆ, เดค ๐‘ง, such that ๐‘”โˆ— เดค ๐‘ง = เดค ๐‘ง๐‘ˆ าง ๐‘ฆ โˆ’ ๐‘” าง ๐‘ฆ , we can show that เดค ๐‘ง = โˆ‡๐‘ฆ๐‘” าง ๐‘ฆ (exercise 3.40) And the supporting hyperplane : เดค ๐‘ง๐‘ˆ๐‘ฆ โˆ’ โ„Ž = ๐‘”โˆ— เดค ๐‘ง เดค ๐‘ง๐‘ˆ โˆ’1 ๐‘ฆ โ„Ž = ๐‘”โˆ—(เดค ๐‘ง)

  • Ex. ๐‘” ๐‘ฆ = ๐‘ฆ2 โˆ’ 2๐‘ฆ, ๐‘ฆ โˆˆ ๐‘†

๐‘”โˆ— ๐‘ง = sup

๐‘ฆ

๐‘ง๐‘ฆ โˆ’ ๐‘ฆ2 + 2๐‘ฆ, ๐‘ง โˆˆ ๐‘†

  • 4. Conjugate Functions

25

slide-26
SLIDE 26

One way to view conjugate function ๐‘”โˆ— ๐‘ง = sup

๐‘ฆโˆˆ๐‘’๐‘๐‘› ๐‘”

๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘”(๐‘ฆ) x : negative slack y : shadow price (loss) to accommodate the slack f*(y) : balance between price slack product (๐‘ง๐‘ˆ๐‘ฆ) and objective function ๐‘” ๐‘ฆ . Remark: When ๐‘”โˆ— ๐‘ง is unbounded, the shadow price ๐‘ง is not reasonable.

  • 4. Conjugate Functions

26

slide-27
SLIDE 27

Ex: ๐‘” ๐‘ฆ = ๐‘๐‘ฆ + ๐‘, ๐‘ฆ โˆˆ ๐‘† ๐‘”โˆ— ๐‘ง = sup

๐‘ฆ

(๐‘ง๐‘ฆ โˆ’ ๐‘๐‘ฆ โˆ’ ๐‘) (1) If ๐‘ง โ‰  ๐‘, ๐‘”โˆ— ๐‘ง = โˆž (2) If ๐‘ง = ๐‘, ๐‘”โˆ— ๐‘ง = โˆ’๐‘ โ†’ ๐‘’๐‘๐‘› ๐‘”โˆ— = ๐‘, ๐‘”โˆ— ๐‘ง = โˆ’๐‘

  • 4. Conjugate Functions: Examples (single variable)

27

slide-28
SLIDE 28

Ex: ๐‘” ๐‘ฆ = โˆ’๐‘š๐‘๐‘•๐‘ฆ, ๐‘ฆ โˆˆ R++ ๐‘”โˆ— ๐‘ง = sup

๐‘ฆโˆˆ๐‘†++

๐‘ง๐‘ฆ + ๐‘š๐‘๐‘•๐‘ฆ (1) If ๐‘ง โ‰ฅ 0, ๐‘”โˆ— ๐‘ง = โˆž (2) If ๐‘ง < 0, ๐‘”โˆ— ๐‘ง = max

๐‘ฆโˆˆ๐‘†++๐‘ฆ๐‘ง + ๐‘š๐‘๐‘•๐‘ฆ

Let ๐‘• ๐‘ฆ = ๐‘ฆ๐‘ง + ๐‘š๐‘๐‘•๐‘ฆ, ๐‘•โ€ฒ ๐‘ฆ = ๐‘ง +

1 ๐‘ฆ

If ๐‘•โ€ฒ ๐‘ฆ = 0, ๐‘ฆ = โˆ’

1 ๐‘ง

Thus, ๐‘”โˆ— ๐‘ง = โˆ’1 + log โˆ’

1 ๐‘ง = โˆ’1 โˆ’ log โˆ’๐‘ง

โ†’ ๐‘’๐‘๐‘› ๐‘”โˆ— = โˆ’๐‘†++, ๐‘”โˆ— ๐‘ง = โˆ’1 โˆ’ log(โˆ’๐‘ง)

  • 4. Conjugate Functions: Examples (single variable)

28

slide-29
SLIDE 29

Ex: ๐‘” ๐‘ฆ = ๐‘“๐‘ฆ, ๐‘ฆ โˆˆ ๐‘† ๐‘”โˆ— ๐‘ง = ๐‘ก๐‘ฃ๐‘ž

๐‘ฆ

๐‘ฆ๐‘ง โˆ’ ๐‘“๐‘ฆ (1) ๐‘ง < 0 โˆถ ๐‘”โˆ— ๐‘ง = โˆž (2) ๐‘ง > 0 โˆถ Let ๐‘• ๐‘ฆ = ๐‘ฆ๐‘ง โˆ’ ๐‘“๐‘ฆโ†’ ๐‘•โ€ฒ ๐‘ฆ = ๐‘ง โˆ’ ๐‘“๐‘ฆ If ๐‘•โ€ฒ ๐‘ฆ = 0, then ๐‘ฆ = ๐‘š๐‘๐‘•๐‘ง Thus ๐‘”โˆ— ๐‘ง = ๐‘ง๐‘š๐‘๐‘•๐‘ง โˆ’ ๐‘ง (3) ๐‘ง = 0 โˆถ ๐‘”โˆ— ๐‘ง = 0 โ†’ ๐‘’๐‘๐‘› ๐‘”โˆ— = ๐‘†+, ๐‘”โˆ— ๐‘ง = ๐‘ง๐‘š๐‘๐‘•๐‘ง โˆ’ ๐‘ง Therefore, we have ๐‘”โˆ— ๐‘ง = ๐‘ง๐‘š๐‘๐‘•๐‘ง โˆ’ ๐‘ง, where ๐‘ง โ‰ฅ 0.

  • 4. Conjugate Functions

29

slide-30
SLIDE 30

Ex: ๐‘” ๐‘ฆ = ๐‘ฆ๐‘š๐‘๐‘•๐‘ฆ, ๐‘ฆ โˆˆ ๐‘†+, ๐‘” 0 = 0 ๐‘”โˆ— ๐‘ง = ๐‘ก๐‘ฃ๐‘ž

๐‘ฆ

๐‘ฆ๐‘ง โˆ’ ๐‘ฆ๐‘š๐‘๐‘•๐‘ฆ Let ๐‘• ๐‘ฆ = ๐‘ฆ๐‘ง โˆ’ ๐‘ฆ๐‘š๐‘๐‘•๐‘ฆ โ†’ ๐‘•โ€ฒ ๐‘ฆ = ๐‘ง โˆ’ ๐‘š๐‘๐‘•๐‘ฆ โˆ’ 1 Suppose ๐‘•โ€ฒ ๐‘ฆ = 0, we have ๐‘ง = 1 + ๐‘š๐‘๐‘•๐‘ฆ or ๐‘ฆ = ๐‘“๐‘งโˆ’1 Thus ๐‘”โˆ— ๐‘ง = ๐‘ง๐‘“๐‘งโˆ’1 โˆ’ ๐‘“๐‘งโˆ’1(๐‘ง โˆ’ 1) = ๐‘“๐‘งโˆ’1 where ๐‘ง โˆˆ ๐‘†

  • 4. Conjugate Functions

30

slide-31
SLIDE 31

Ex: ๐‘” ๐‘ฆ =

1 2 ๐‘ฆ๐‘ˆ๐‘…๐‘ฆ, ๐‘ฆ โˆˆ ๐‘†๐‘œ, ๐‘… โˆˆ ๐‘‡++ ๐‘œ

๐‘”โˆ— ๐‘ง = ๐‘ก๐‘ฃ๐‘ž

๐‘ฆ

๐‘ฆ๐‘ˆ๐‘ง โˆ’

1 2 ๐‘ฆ๐‘ˆ๐‘…๐‘ฆ

Let ๐‘• ๐‘ฆ = ๐‘ฆ๐‘ˆ๐‘ง โˆ’

1 2 ๐‘ฆ๐‘ˆ๐‘…๐‘ฆ โ†’ ๐›ผ๐‘• ๐‘ฆ = ๐‘ง โˆ’ ๐‘…๐‘ฆ

If ๐›ผ๐‘• ๐‘ฆ = 0, we have ๐‘ฆ = ๐‘…โˆ’1๐‘ง Thus, ๐‘”โˆ— ๐‘ง =

1 2 ๐‘ง๐‘ˆ๐‘…โˆ’1๐‘ง

Remark: Suppose that ๐‘”โˆ— เดค ๐‘ง = เดค ๐‘ง๐‘ˆ าง ๐‘ฆ โˆ’ ๐‘” าง ๐‘ฆ and โˆ‡2๐‘” าง ๐‘ฆ โ‰ป 0 We have โˆ‡๐‘”โˆ— เดค ๐‘ง = าง ๐‘ฆ and โˆ‡2๐‘”โˆ— เดค ๐‘ง = โˆ‡2๐‘” าง ๐‘ฆ

โˆ’1 (exercise 3.40)

  • 4. Conjugate Functions

31

slide-32
SLIDE 32

Basic Properties (1) ๐‘” ๐‘ฆ + ๐‘”โˆ— ๐‘ง โ‰ฅ ๐‘ฆ๐‘ˆ๐‘ง Fenchelโ€™s inequality. Thus, in the above example ๐‘ฆ๐‘ˆ๐‘ง โ‰ค

1 2 ๐‘ฆ๐‘ˆ๐‘…๐‘ฆ + 1 2 ๐‘ง๐‘ˆ๐‘…โˆ’1๐‘ง, โˆ€๐‘ฆ, ๐‘ง โˆˆ ๐‘†๐‘œ, ๐‘… โˆˆ ๐‘‡++ ๐‘œ

(2) ๐‘”โˆ—โˆ— = ๐‘”, if f is convex & f is closed (i.e. epi f is a closed set) (3) If f is convex & differentiable, ๐‘’๐‘๐‘› ๐‘” = ๐‘†๐‘œ For max ๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘”(๐‘ฆ), we have ๐‘ง = ๐›ผ๐‘”(๐‘ฆโˆ—) Thus, ๐‘”โˆ— ๐‘ง = ๐‘ฆโˆ—๐‘ˆ๐›ผ๐‘” ๐‘ฆโˆ— โˆ’ ๐‘” ๐‘ฆโˆ— , ๐‘ง = ๐›ผ๐‘”(๐‘ฆโˆ—)

  • 4. Conjugate Functions

32

slide-33
SLIDE 33

Ex : ๐‘” ๐‘ฆ = ๐‘š๐‘๐‘• ฯƒ๐‘—=1

๐‘œ

๐‘“๐‘ฆ๐‘— โ†” ๐‘”โˆ— ๐‘ง = ฯƒ๐‘—=1

๐‘œ

๐‘ง๐‘—๐‘š๐‘๐‘•๐‘ง๐‘— ๐‘”โˆ— ๐‘ง = ๐‘ก๐‘ฃ๐‘ž

๐‘ฆ

๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘” ๐‘ฆ = ๐‘ก๐‘ฃ๐‘ž

๐‘ฆ

๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘š๐‘๐‘• ฯƒ๐‘—=1

๐‘œ

๐‘“๐‘ฆ๐‘— Let ๐‘• ๐‘ฆ = ๐‘ง๐‘ˆ๐‘ฆ โˆ’ ๐‘š๐‘๐‘• ฯƒ๐‘—=1

๐‘œ

๐‘“๐‘ฆ๐‘—

๐œ–๐‘• ๐‘ฆ ๐œ–๐‘ฆ๐‘— = ๐‘ง๐‘— โˆ’ ๐‘“๐‘ฆ๐‘— ฯƒ๐‘—=1

๐‘œ

๐‘“๐‘ฆ๐‘— = 0

Thus, ๐‘ง๐‘— =

๐‘“๐‘ฆ๐‘— ฯƒ๐‘—=1

๐‘œ

๐‘“๐‘ฆ๐‘—, i.e. 1๐‘ˆ๐‘ง = 1

(1) 1๐‘ˆ๐‘ง โ‰  1 โ†’ unbounded (2) ๐‘ง๐‘— < 0 โ†’ unbounded (3) ๐‘”โˆ— ๐‘ง = ฯƒ๐‘—=1

๐‘œ

๐‘ง๐‘—๐‘š๐‘๐‘•๐‘ง๐‘— , ๐‘ง โ‰ฅ 0, 1๐‘ˆ๐‘ง = 1

  • 4. Conjugate Functions

33

slide-34
SLIDE 34

Log function : ๐‘š๐‘๐‘•๐‘” ๐‘ฆ , ๐‘”: ๐‘†๐‘œ โ†’ ๐‘†, ๐‘” ๐‘ฆ > 0, โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘” Suppose f is twice differentiable, ๐‘’๐‘๐‘› ๐‘” is convex. ๐›ผ2๐‘š๐‘๐‘•๐‘” ๐‘ฆ =

1 ๐‘” ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ โˆ’ 1 ๐‘” ๐‘ฆ 2 ๐›ผ๐‘” ๐‘ฆ ๐›ผ๐‘” ๐‘ฆ ๐‘ˆ

Then f is log-convex iff โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘” ๐‘” ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ โ‰ฅ ๐›ผ๐‘” ๐‘ฆ ๐›ผ๐‘” ๐‘ฆ ๐‘ˆ f is log-concave iff โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘” ๐‘” ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ โ‰ค ๐›ผ๐‘” ๐‘ฆ ๐›ผ๐‘” ๐‘ฆ ๐‘ˆ

  • 5. Log-Concave, Log-Convex Functions

34

slide-35
SLIDE 35

๐‘” โˆถ ๐‘†๐‘œ โ†’ ๐‘†, ๐‘” ๐‘ฆ > 0, โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘” Definition: If log ๐‘” is concave, f is log-concave. Definition: If log ๐‘” is convex, f is log-convex. Ex : ๐‘” ๐‘ฆ = ๐‘๐‘ˆ๐‘ฆ + ๐‘, ๐‘’๐‘๐‘› ๐‘” = ๐‘ฆ ๐‘๐‘ˆ๐‘ฆ + ๐‘ : log-concave ๐‘” ๐‘ฆ = ๐‘ฆ๐‘, ๐‘ฆ โˆˆ ๐‘†++, ๐‘ โ‰ค 0 : log-convex ๐‘ > 0 โˆถ log-concave ๐‘” ๐‘ฆ = ๐‘“๐›ฝ๐‘ฆ : log convex & log-concave ๐‘” ๐‘ฆ =

1 2๐œŒ ืฌ โˆ’โˆž ๐‘ฆ ๐‘“โˆ’๐‘ฃ2

2 ๐‘’๐‘ฃ : cumulative distribution function of

Gaussian density log-concave ๐‘” ๐‘ฆ =

1 2๐œŒ ๐‘œ๐‘’๐‘“๐‘ขฯƒ ๐‘“โˆ’1

2 ๐‘ฆโˆ’ าง

๐‘ฆ ๐‘ˆ ฯƒโˆ’1 ๐‘ฆโˆ’ าง ๐‘ฆ

: log-concave

  • 5. Log-Concave, Log-Convex Functions

35

slide-36
SLIDE 36

Properties ๐›ผ2๐‘š๐‘๐‘•๐‘” ๐‘ฆ = 1 ๐‘” ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ โˆ’ 1 ๐‘” ๐‘ฆ 2 ๐›ผ๐‘” ๐‘ฆ ๐›ผf x T ๐‘” ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ โ‰ฅ ๐›ผ๐‘” ๐‘ฆ ๐›ผ๐‘” ๐‘ฆ ๐‘ˆ, โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘” : log-convex ๐‘” ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ โ‰ค ๐›ผ๐‘” ๐‘ฆ ๐›ผ๐‘” ๐‘ฆ ๐‘ˆ, โˆ€๐‘ฆ โˆˆ ๐‘’๐‘๐‘› ๐‘” : log-concave

  • 5. Log-Concave, Log-Convex Functions

36

slide-37
SLIDE 37

Outlines

  • 1. Definitions: Convexity, Examples & Views
  • 2. Conditions of Optimality
  • 1. First Order Condition
  • 2. Second Order Condition
  • 3. Operations that Preserve the Convexity
  • 1. Pointwise Maximum
  • 2. Partial Minimization
  • 4. Conjugate Function
  • 5. Log-Concave, Log-Convex Functions

37