CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 8: - - PowerPoint PPT Presentation

cs599 convex and combinatorial optimization fall 2013
SMART_READER_LITE
LIVE PREVIEW

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 8: - - PowerPoint PPT Presentation

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 8: Convex Functions Wrapup Instructor: Shaddin Dughmi Outline Quasiconvex Functions 1 Log-Concave Functions 2 Quasiconvex Functions A function f : R n R is quasiconvex if


slide-1
SLIDE 1

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 8: Convex Functions Wrapup

Instructor: Shaddin Dughmi

slide-2
SLIDE 2

Outline

1

Quasiconvex Functions

2

Log-Concave Functions

slide-3
SLIDE 3

Quasiconvex Functions

A function f : Rn → R is quasiconvex if all its sublevel sets are convex. i.e. if Sα = {x|f(x) ≤ α} is convex for each α ∈ R.

Quasiconvex Functions 0/13

slide-4
SLIDE 4

Quasiconvex Functions

A function f : Rn → R is quasiconvex if all its sublevel sets are convex. i.e. if Sα = {x|f(x) ≤ α} is convex for each α ∈ R. f is quasiconcave if −f is quasiconvex

Equivalently, all its superlevel sets are convex.

f is quasilinear if it is both quasiconvex and quasiconcave

Equivalently, all its sublevel and superlevel sets are halfspaces, and all its level sets are affine

Quasiconvex Functions 0/13

slide-5
SLIDE 5

Examples

log x is quasilinear on R+ All functions f : R → R that are “unimodal” x1x2 is quasiconcave on R2

+ a⊺x+b c⊺x+d is quasilinear

||x||0 is quasiconcave on Rn

+.

Quasiconvex Functions 1/13

slide-6
SLIDE 6

Alternative Definitions

We will now look at two equivalent definitions of quasiconvex functions

Inequality Definition

f is quasiconvex if the following relaxation of Jensen’s inequality holds: f(θx + (1 − θ)y) ≤ max {f(x), f(y)} for 0 ≤ θ ≤ 1

Quasiconvex Functions 2/13

slide-7
SLIDE 7

Alternative Definitions

We will now look at two equivalent definitions of quasiconvex functions

Inequality Definition

f is quasiconvex if the following relaxation of Jensen’s inequality holds: f(θx + (1 − θ)y) ≤ max {f(x), f(y)} for 0 ≤ θ ≤ 1 Like Jensen’s inequality, a property of f on lines in its domain

Quasiconvex Functions 2/13

slide-8
SLIDE 8

Alternative Definitions

First Order Definition

A differentiable f : Rn → R is quasiconvex if and only if whenever f(y) ≤ f(x), we have ▽f(x)⊺(y − x) ≤ 0

Quasiconvex Functions 3/13

slide-9
SLIDE 9

Alternative Definitions

First Order Definition

A differentiable f : Rn → R is quasiconvex if and only if whenever f(y) ≤ f(x), we have ▽f(x)⊺(y − x) ≤ 0 ▽f(x) defines a supporting hyperplane for sublevel set with α = f(x)

Quasiconvex Functions 3/13

slide-10
SLIDE 10

Operations Preserving Quasiconvexity

Scaling

If f is quasiconvex and w > 0, then wf is also quasiconvex. f and wf have the same sublevel sets: wf(x) ≤ α iff f(x) ≤ α/w,

Quasiconvex Functions 4/13

slide-11
SLIDE 11

Operations Preserving Quasiconvexity

Scaling

If f is quasiconvex and w > 0, then wf is also quasiconvex. f and wf have the same sublevel sets: wf(x) ≤ α iff f(x) ≤ α/w,

Composition with Nondecreasing Function

If f : Rn → R is quasiconvex h : R → R is non-decreasing, then h ◦ f is quasiconvex. h ◦ f and f have the same sublevel sets: h(f(x)) ≤ α iff f(x) ≤ h−1(α)

Quasiconvex Functions 4/13

slide-12
SLIDE 12

Operations Preserving Quasiconvexity

Composition with Affine Function

If f : Rn → R is quasiconvex, and A ∈ Rn×m, b ∈ Rn, then g(x) = f(Ax + b) is a quasiconvex function from Rm to R.

Quasiconvex Functions 5/13

slide-13
SLIDE 13

Operations Preserving Quasiconvexity

Composition with Affine Function

If f : Rn → R is quasiconvex, and A ∈ Rn×m, b ∈ Rn, then g(x) = f(Ax + b) is a quasiconvex function from Rm to R.

Proof

The α sublevel of f(Ax + b) ≤ α is the inverse image of the α-sublevel

  • f f under the affine map x → Ax + b.

Quasiconvex Functions 5/13

slide-14
SLIDE 14

Operations Preserving Quasiconvexity

Composition with Affine Function

If f : Rn → R is quasiconvex, and A ∈ Rn×m, b ∈ Rn, then g(x) = f(Ax + b) is a quasiconvex function from Rm to R.

Proof

The α sublevel of f(Ax + b) ≤ α is the inverse image of the α-sublevel

  • f f under the affine map x → Ax + b.

Note: extends to linear fractional maps x → Ax+b

cT x+d.

Quasiconvex Functions 5/13

slide-15
SLIDE 15

Operations Preserving Quasiconvexity

Maximum

If f1, f2 are quasiconvex, then g(x) = max {f1(x), f2(x)} is also quasiconvex. Generalizes to the maximum of any number of functions, maxk

i=1 fi(x),

and also to the supremum of an infinite set of functions supy fy(x).

Quasiconvex Functions 6/13

slide-16
SLIDE 16

Operations Preserving Quasiconvexity

Maximum

If f1, f2 are quasiconvex, then g(x) = max {f1(x), f2(x)} is also quasiconvex. Generalizes to the maximum of any number of functions, maxk

i=1 fi(x),

and also to the supremum of an infinite set of functions supy fy(x).

  • Quasiconvex Functions

6/13

slide-17
SLIDE 17

Operations Preserving Quasiconvexity

Minimization

If f(x, y) is quasiconvex and C is convex and nonempty, then g(x) = infy∈C f(x, y) is quasiconvex.

Quasiconvex Functions 7/13

slide-18
SLIDE 18

Operations Preserving Quasiconvexity

Minimization

If f(x, y) is quasiconvex and C is convex and nonempty, then g(x) = infy∈C f(x, y) is quasiconvex.

Proof (for C = Rk)

Sα(g) is the projection of Sα(f) onto hyperplane y = 0.

Quasiconvex Functions 7/13

slide-19
SLIDE 19

Operations NOT preserving quasiconvexity

Sum

f1 + f2 is NOT necessarily quasiconvex when f1 and f2 are quasiconvex.

Quasiconvex Functions 8/13

slide-20
SLIDE 20

Operations NOT preserving quasiconvexity

Sum

f1 + f2 is NOT necessarily quasiconvex when f1 and f2 are quasiconvex.

Composition Rules

The composition rules for convex functions do NOT carry over in general.

Quasiconvex Functions 8/13

slide-21
SLIDE 21

Outline

1

Quasiconvex Functions

2

Log-Concave Functions

slide-22
SLIDE 22

We now briefly look at a class of quasiconcave functions which pops up in “multiplication” and “volume” maximization problems.

Log-concave Functions

A function f : Rn → R+ is log-concave if log f(x) is a concave function. Equivalently: f(θx + (1 − θ)y) ≥ f(x)θf(y)1−θ for x, y ∈ Rn and θ ∈ [0, 1].

Log-Concave Functions 9/13

slide-23
SLIDE 23

We now briefly look at a class of quasiconcave functions which pops up in “multiplication” and “volume” maximization problems.

Log-concave Functions

A function f : Rn → R+ is log-concave if log f(x) is a concave function. Equivalently: f(θx + (1 − θ)y) ≥ f(x)θf(y)1−θ for x, y ∈ Rn and θ ∈ [0, 1]. i.e. concave if plotted on log-scale paper

Log-Concave Functions 9/13

slide-24
SLIDE 24

We now briefly look at a class of quasiconcave functions which pops up in “multiplication” and “volume” maximization problems.

Log-concave Functions

A function f : Rn → R+ is log-concave if log f(x) is a concave function. Equivalently: f(θx + (1 − θ)y) ≥ f(x)θf(y)1−θ for x, y ∈ Rn and θ ∈ [0, 1]. i.e. concave if plotted on log-scale paper Concave functions are log-concave, and both are quasiconcave.

Log-Concave Functions 9/13

slide-25
SLIDE 25

We now briefly look at a class of quasiconcave functions which pops up in “multiplication” and “volume” maximization problems.

Log-concave Functions

A function f : Rn → R+ is log-concave if log f(x) is a concave function. Equivalently: f(θx + (1 − θ)y) ≥ f(x)θf(y)1−θ for x, y ∈ Rn and θ ∈ [0, 1]. i.e. concave if plotted on log-scale paper Concave functions are log-concave, and both are quasiconcave. Taking the logarithm of a non-concave (yet quasiconcave) function can “concavify” it

Log-Concave Functions 9/13

slide-26
SLIDE 26

We now briefly look at a class of quasiconcave functions which pops up in “multiplication” and “volume” maximization problems.

Log-concave Functions

A function f : Rn → R+ is log-concave if log f(x) is a concave function. Equivalently: f(θx + (1 − θ)y) ≥ f(x)θf(y)1−θ for x, y ∈ Rn and θ ∈ [0, 1]. i.e. concave if plotted on log-scale paper Concave functions are log-concave, and both are quasiconcave. Taking the logarithm of a non-concave (yet quasiconcave) function can “concavify” it Most common form of “concavification” and “convexification” of

  • bjective functions, which to a large extent is an art.

Log-Concave Functions 9/13

slide-27
SLIDE 27

Examples

All concave functions xa for a ≥ 0 ex

  • i xi

Determinant of a PSD matrix The pdf of many common distributions such as Gaussian and exponential

Intuitively, those distributions which decay faster than exponential (i.e. e−λx))

Log-Concave Functions 10/13

slide-28
SLIDE 28

Operations Preserving Log-Concavity

Scaling

If f is logconcave and w ∈ R then wf is also logconcave.

Log-Concave Functions 11/13

slide-29
SLIDE 29

Operations Preserving Log-Concavity

Scaling

If f is logconcave and w ∈ R then wf is also logconcave.

Composition with Affine

If f is logconcave then so is f(Ax + b).

Log-Concave Functions 11/13

slide-30
SLIDE 30

Operations Preserving Log-Concavity

Scaling

If f is logconcave and w ∈ R then wf is also logconcave.

Composition with Affine

If f is logconcave then so is f(Ax + b).

Multiplication

If f1, f2 are log-concave, then so is f1f2

Log-Concave Functions 11/13

slide-31
SLIDE 31

Operations Preserving Log-Concavity

Scaling

If f is logconcave and w ∈ R then wf is also logconcave.

Composition with Affine

If f is logconcave then so is f(Ax + b).

Multiplication

If f1, f2 are log-concave, then so is f1f2 Log-concavity NOT preserved by sums.

Log-Concave Functions 11/13

slide-32
SLIDE 32

Operations Preserving Log-Concavity

Theorem (Prekopa & Liendler)

If f(x, y) is log-concave, then g(x) =

  • y f(x, y) is also log-concave.

Log-Concave Functions 12/13

slide-33
SLIDE 33

Operations Preserving Log-Concavity

Theorem (Prekopa & Liendler)

If f(x, y) is log-concave, then g(x) =

  • y f(x, y) is also log-concave.

Example (Yield)

Design parameters x ∈ Rn of a product

Log-Concave Functions 12/13

slide-34
SLIDE 34

Operations Preserving Log-Concavity

Theorem (Prekopa & Liendler)

If f(x, y) is log-concave, then g(x) =

  • y f(x, y) is also log-concave.

Example (Yield)

Design parameters x ∈ Rn of a product Noise δ ∈ Rn drawn from a log-concave distribution with pdf f Resulting product has configuration x + δ.

Log-Concave Functions 12/13

slide-35
SLIDE 35

Operations Preserving Log-Concavity

Theorem (Prekopa & Liendler)

If f(x, y) is log-concave, then g(x) =

  • y f(x, y) is also log-concave.

Example (Yield)

Design parameters x ∈ Rn of a product Noise δ ∈ Rn drawn from a log-concave distribution with pdf f Resulting product has configuration x + δ. Set S ⊆ Rn of “desired” configurations

Log-Concave Functions 12/13

slide-36
SLIDE 36

Operations Preserving Log-Concavity

Theorem (Prekopa & Liendler)

If f(x, y) is log-concave, then g(x) =

  • y f(x, y) is also log-concave.

Example (Yield)

Design parameters x ∈ Rn of a product Noise δ ∈ Rn drawn from a log-concave distribution with pdf f Resulting product has configuration x + δ. Set S ⊆ Rn of “desired” configurations Probability of desirable configuration is

  • y∈S f(y − x)

Log-Concave Functions 12/13

slide-37
SLIDE 37

Operations Preserving Log-Concavity

Theorem (Prekopa & Liendler)

If f(x, y) is log-concave, then g(x) =

  • y f(x, y) is also log-concave.

Example (Yield)

Design parameters x ∈ Rn of a product Noise δ ∈ Rn drawn from a log-concave distribution with pdf f Resulting product has configuration x + δ. Set S ⊆ Rn of “desired” configurations Probability of desirable configuration is

  • y∈S f(y − x)

By above theorem, choosing x to optimize this probability is convex optimization problem

Log-Concave Functions 12/13

slide-38
SLIDE 38

Next Week

Convex Optimization Problems!

Log-Concave Functions 13/13