MATH529 Fundamentals of Optimization Fundamentals of Constrained - - PowerPoint PPT Presentation

math529 fundamentals of optimization fundamentals of
SMART_READER_LITE
LIVE PREVIEW

MATH529 Fundamentals of Optimization Fundamentals of Constrained - - PowerPoint PPT Presentation

MATH529 Fundamentals of Optimization Fundamentals of Constrained Optimization II Marco A. Montes de Oca Mathematical Sciences, University of Delaware, USA 1 / 33 Multiple equality constraints: Example A motor company makes cars, trucks and


slide-1
SLIDE 1

MATH529 – Fundamentals of Optimization Fundamentals of Constrained Optimization II

Marco A. Montes de Oca

Mathematical Sciences, University of Delaware, USA

1 / 33

slide-2
SLIDE 2

Multiple equality constraints: Example

A motor company makes cars, trucks and vans. Its revenue is R(c, t, v), where c, t, v are the number of cars, trucks, vans (respectively) it produces per year. Suppose production is constrained by the amount of steel available, and by the amount of aluminum available. Assume each car, truck, van needs sc, st, sv units of steel, respectively, and ac, at, av units of aluminum,

  • respectively. Suppose S units of steel and A of aluminum are

available. Write the constraints as equations. Use Lagrange multipliers to determine optimality conditions. What are the interpretations of each Lagrange multiplier?

2 / 33

slide-3
SLIDE 3

Multiple equality constraints

Illustrations by Steuard Jensen. (http://www.slimy.com/~steuard/teaching/tutorials/Lagrange.html)

3 / 33

slide-4
SLIDE 4

Multiple equality constraints

The Lagrangian can be extended to simultaneously consider multiple constraints. For example, let f (x) be subject to: g(x) = c and h(x) = d. The Lagrangian function may be defined as follows: L(x, λ, µ) = f (x) + λ(c − g(x)) + µ(d − h(x)). The new first-order condition is now: c − g(x) = 0, d − h(x) = 0, ∇f (x) − λ∇g(x) − µ∇h(x) = 0.

4 / 33

slide-5
SLIDE 5

Second-order conditions

Let z be the objective function, that is, let z = f (x, y). Let also g(x, y) = c be a constraint that a solution to the optimization problem must satisfy. Then, dg = gxdx + gydy = 0, which means that dx and dy are not

  • independent. So, we can find dy by fixing dx, that is,

dy = − gx

gy dx. Note that since gx and gy both depend on x

and y, so does dy. Now, d2z = d(dz) = ∂(dz)

∂x dx + ∂(dz) ∂y dy =

5 / 33

slide-6
SLIDE 6

Second-order conditions

Let z be the objective function, that is, let z = f (x, y). Let also g(x, y) = c be a constraint that a solution to the optimization problem must satisfy. Then, dg = gxdx + gydy = 0, which means that dx and dy are not

  • independent. So, we can find dy by fixing dx, that is,

dy = − gx

gy dx. Note that since gx and gy both depend on x

and y, so does dy. Now, d2z = d(dz) = ∂(dz)

∂x dx + ∂(dz) ∂y dy = ∂(fxdx+fydy) ∂x

dx + ∂(fxdx+fydy)

∂y

dy =

6 / 33

slide-7
SLIDE 7

Second-order conditions

Let z be the objective function, that is, let z = f (x, y). Let also g(x, y) = c be a constraint that a solution to the optimization problem must satisfy. Then, dg = gxdx + gydy = 0, which means that dx and dy are not

  • independent. So, we can find dy by fixing dx, that is,

dy = − gx

gy dx. Note that since gx and gy both depend on x

and y, so does dy. Now, d2z = d(dz) = ∂(dz)

∂x dx + ∂(dz) ∂y dy = ∂(fxdx+fydy) ∂x

dx + ∂(fxdx+fydy)

∂y

dy =

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy

7 / 33

slide-8
SLIDE 8

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy =

8 / 33

slide-9
SLIDE 9

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2+

9 / 33

slide-10
SLIDE 10

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2+(fyxdy + fy

∂(dy) ∂x )dx+

10 / 33

slide-11
SLIDE 11

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2+(fyxdy + fy

∂(dy) ∂x )dx+fxydxdy+

11 / 33

slide-12
SLIDE 12

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2+(fyxdy + fy

∂(dy) ∂x )dx+fxydxdy+(fyydy + fy ∂(dy) ∂y )dy =

12 / 33

slide-13
SLIDE 13

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2+(fyxdy + fy

∂(dy) ∂x )dx+fxydxdy+(fyydy + fy ∂(dy) ∂y )dy =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy( ∂(dy)

∂x dx + ∂(dy) ∂y dy) =

13 / 33

slide-14
SLIDE 14

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2+(fyxdy + fy

∂(dy) ∂x )dx+fxydxdy+(fyydy + fy ∂(dy) ∂y )dy =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy( ∂(dy)

∂x dx + ∂(dy) ∂y dy) =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy(d(dy)) = (dx dy) fxx fxy fyx fyy dx dy

  • + fy(d2y)

14 / 33

slide-15
SLIDE 15

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2 + (fyxdy + fy

∂(dy) ∂x )dx + fxydxdy + (fyydy + fy ∂(dy) ∂y )dy =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy( ∂(dy)

∂x dx + ∂(dy) ∂y dy) =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy(d(dy)) = (dx dy) fxx fxy fyx fyy dx dy

  • Quadratic form associated with unconstrained problem

+fy(d2y)

15 / 33

slide-16
SLIDE 16

Second-order conditions

∂(fxdx) ∂x

dx + ∂(fydy)

∂x

dx + ∂(fxdx)

∂y

dy + ∂(fydy)

∂y

dy = fxx(dx)2 + (fyxdy + fy

∂(dy) ∂x )dx + fxydxdy + (fyydy + fy ∂(dy) ∂y )dy =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy( ∂(dy)

∂x dx + ∂(dy) ∂y dy) =

fxx(dx)2 + 2fxydxdy + fyy(dy)2 + fy(d(dy)) = (dx dy) fxx fxy fyx fyy dx dy

  • +

fy(d2y)

New 1st degree term in constrained case

16 / 33

slide-17
SLIDE 17

Second-order conditions

To eliminate this 1st degree term, recall that dy = − gx

gy dx, so

d2y =

∂ ∂x (− gx gy dx)dx + ∂ ∂y (− gx gy dx)dy =

17 / 33

slide-18
SLIDE 18

Second-order conditions

To eliminate this 1st degree term, recall that dy = − gx

gy dx, so

d2y =

∂ ∂x (− gx gy dx)dx + ∂ ∂y (− gx gy dx)dy =

− gygxx

(gy)2 dxdx + gyxgx (gy)2 dxdx − gygxy (gy)2 dxdy + gyygx (gy)2 dxdy =

18 / 33

slide-19
SLIDE 19

Second-order conditions

To eliminate this 1st degree term, recall that dy = − gx

gy dx, so

d2y =

∂ ∂x (− gx gy dx)dx + ∂ ∂y (− gx gy dx)dy =

− gygxx

(gy)2 dxdx + gyxgx (gy)2 dxdx − gygxy (gy)2 dxdy + gyygx (gy)2 dxdy =

− gxx

gy dxdx + gyxgx (gy)2 dxdx − gxy gy dxdy + gyygx (gy)2 dxdy =

19 / 33

slide-20
SLIDE 20

Second-order conditions

To eliminate this 1st degree term, recall that dy = − gx

gy dx, so

d2y =

∂ ∂x (− gx gy dx)dx + ∂ ∂y (− gx gy dx)dy =

− gygxx

(gy)2 dxdx + gyxgx (gy)2 dxdx − gygxy (gy)2 dxdy + gyygx (gy)2 dxdy =

− gxx

gy dxdx + gyxgx (gy)2 dxdx − gxy gy dxdy + gyygx (gy)2 dxdy =

− gxx

gy dxdx − 2gxy gy dxdy − gyy gy dydy.

20 / 33

slide-21
SLIDE 21

Second-order conditions

Substituting d2y back in the expression for d2z: d2z =

  • fxx − fy

gy gxx

  • dxdx + 2
  • fxy − fy

gy gxy

  • dxdy +
  • fyy − fy

gy gyy

  • dydy

21 / 33

slide-22
SLIDE 22

Second-order conditions

Substituting d2y back in the expression for d2z: d2z =

  • fxx − fy

gy gxx

  • dxdx + 2
  • fxy − fy

gy gxy

  • dxdy +
  • fyy − fy

gy gyy

  • dydy

Finally, since fy

gy = λ (recall the derivation of the Lagrange

multiplier equation), we have: Lxx = fxx − λgxx Lxy = Lyx = fxy − λgxy Lyy = fyy − λgyy d2z = Lxxdxdx + 2Lxydxdy + Lyydydy Since this is a quadratic form, we can now establish a determinantal test.

22 / 33

slide-23
SLIDE 23

Second-order conditions

What we are looking for: Necessary conditions For a maximum of z: d2z should be negative semidefinite, subject to dg = 0. For a minimum of z: d2z should be positive semidefinite, subject to dg = 0. Sufficient conditions For a maximum of z: d2z should be negative definite, subject to dg = 0. For a minimum of z: d2z should be positive definite, subject to dg = 0.

23 / 33

slide-24
SLIDE 24

Second-order conditions

Problem: What are the conditions for the sign of definiteness of q = au2 + 2huv + bv2 subject to αu + βv = 0 ?

24 / 33

slide-25
SLIDE 25

Second-order conditions

Since αu + βv = 0, v = − αu

β , thus

q = au2 + 2hu(− αu

β ) + b(− αu β )2 = (aβ2 − 2hαβ + bα2) u2 β2 .

Therefore, the sign of q depends on the sign of aβ2 − 2hαβ + bα2, which is equal to the negative of:

  • α

β α a h β h b

  • 25 / 33
slide-26
SLIDE 26

Second-order conditions

Thus, q is positive definite subject to αu + βv = 0, iff

  • α

β α a h β h b

  • < 0 and

q is negative definite subject to αu + βv = 0, iff

  • α

β α a h β h b

  • > 0

26 / 33

slide-27
SLIDE 27

Second-order conditions

When applied to d2z, the variable u = dx, and v = dy. Also, α = gx and β = gy. Thus, the determinantal test for the sign of definiteness of dz is: Determinantal test for relative constrained extremum (2 variables) d2z is positive (negative) definite subject to dg = 0 iff

  • gx

gy gx Lxx Lxy gy Lyx Lyy

  • < 0 (> 0)

This determinant is called the bordered Hessian and is denoted by |¯ H|.

27 / 33

slide-28
SLIDE 28

Example

Let us use again the problem of finding the extrema of f (x) = x2

1 + x2 2, subject to x2 1 + x2 = 1.

We now know that the critical points are (0, 1) (λ = 2), (−1/ √ 2, 1/2) and (1/ √ 2, 1/2) (λ = 1). Let us the determinantal test to classify these points.

28 / 33

slide-29
SLIDE 29

Second-order conditions

Extension to n variables: |¯ H| =

  • g1

g2 . . . gn g1 L11 L12 . . . L1n g2 L21 L22 . . . L2n . . . . . . . . . ... . . . gn Ln1 Ln2 . . . Lnn

  • 29 / 33
slide-30
SLIDE 30

Determinantal test for relative constrained extremum (n variables)

Condition Maximum Minimum 1st order necessary condition ∇L(x, λ) = 0 ∇L(x, λ) = 0 2nd order sufficient condition (−1)n|¯ Hn| > 0, n ≥ 2 |¯ Hn| < 0

30 / 33

slide-31
SLIDE 31

Multiconstraint Case

Extension to m constraints: |¯ H| =

  • . . .

g1

1

g1

2

. . . g1

n

. . . g2

1

g2

2

. . . g2

n

. . . . . . ... . . . . . . . . . ... . . . . . . gm

1

gm

2

. . . gm

n

g1

1

g2

1

. . . gm

1

L11 L12 . . . L12 g1

2

g2

2

. . . gm

2

L21 L22 . . . L2n . . . . . . ... . . . . . . . . . ... . . . g1

n

g2

n

. . . gm

n

Ln1 Ln2 . . . Lnn

  • 31 / 33
slide-32
SLIDE 32

Multiconstraint Case

Determinantal test in the multiconstrained case: Suppose you want to maximize f (x1, x2, x3, . . . , xn) subject to hi(x) = 0, i ∈ 1, . . . , k. The bordered Hessian (shown in the previous slide) has now (k + n) × (k + n) elements, and k + n leading principal minors. Of these, only the last n − k leading principal minors contain information about the constraints and the

  • bjective function. Exactly these last n − k leading principal

minors are the ones that will give us information about the nature

  • f a point that satisfies the 1st order conditions.

32 / 33

slide-33
SLIDE 33

Multiconstraint Case

Determinantal test in the multiconstrained case: For a maximum, the last n − k leading principal minors |¯ H2k+1|, |¯ H2k+2|, . . . , |¯ Hk+n| = |¯ H| alternate in sign, where the last minor |¯ H| has the sign (−1)n. For a minimum, the last n − k leading principal minors |¯ H2k+1|, |¯ H2k+2|, . . . , |¯ Hk+n| = |¯ H| all have the sign (−1)k.

33 / 33