Nonlinear Optimization: Optimality conditions
INSEAD, Spring 2006
Jean-Philippe Vert Ecole des Mines de Paris
Jean-Philippe.Vert@mines.org
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.1/62
Nonlinear Optimization: Optimality conditions INSEAD, Spring 2006 - - PowerPoint PPT Presentation
Nonlinear Optimization: Optimality conditions INSEAD, Spring 2006 Jean-Philippe Vert Ecole des Mines de Paris Jean-Philippe.Vert@mines.org 2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) p.1/62 Nonlinear optimization c
Jean-Philippe Vert Ecole des Mines de Paris
Jean-Philippe.Vert@mines.org
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.1/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.2/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.3/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.4/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.5/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.6/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.7/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.8/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.9/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.10/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.11/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.12/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.13/62
ǫ→0
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.14/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.15/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.16/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.17/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.18/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.19/62
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.20/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.21/62
1 + 1
1 + 2x1x2 + 1
2 − x2 + 1 .
1 + x1 + 2x2
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.22/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.23/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.24/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.25/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.26/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.27/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.28/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.29/62
2 ,
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.30/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.31/62
n
n
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.32/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.33/62
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.34/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.35/62
Left: at a local minimum, the gradient ∇f (x∗) makes an angle less than or equal to 90 degrees with all feasible variations x − x∗. Right: the optimality condition fails if X is not convex: x∗ is a local minimum, but ∇f (x∗)⊤ (x − x∗) < 0.
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.36/62
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.37/62
n
i ) ≥ 0
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.38/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.39/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.40/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.41/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.42/62
1, . . . , λ∗ m ∈ R called Lagrange multipliers such
m
i ∇hi (x∗) = 0 .
m
i ∇2hi(x∗)
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.43/62
1 + x2 2 = 2 .
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.44/62
2 = 1 ,
2 = 4 .
x∗ = 0 ∇f(x) = 1 1 ∇h1(x) = −2 ∇h2(x) = −4
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.45/62
Introduce, for k = 1, 2, . . ., the cost function: F k(x) = f(x) + k 2 h(x) 2 + α 2 x − x∗ 2 , where α > 0 and x∗ is a local minimum, and let xk = arg min
x∈S
Fk(x) , where S is a small ball around x∗ s.t. f(x∗) < f(x) for all feasible points of S. Observe that: F k xk = f(xk) + k 2 h(xk) 2 + α 2 xk − x∗ 2 ≤ F k (x∗) = f (x∗) .
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.46/62
Taking the limit when k → ∞, this shows that any limit point ¯ x of
k=1,... satisfies h (¯
x) = 0, f (¯ x) = f (x∗) and ¯ x = x∗. Therefore x∗ is the only limit point: lim
k→+∞ xk = x∗
. As a result, for k large enough, xk is an interior point of S and is an unconstrained local minimum of F k(x). From the first-order optimality condition we therefore have, for sufficiently large k: 0 = ∇F k xk = ∇f
+ k∇h
h
+ α
.
(1)
Since ∇h (x∗) has rank m, the same is true for ∇h
if k is sufficiently large, and therefore ∇h
is invertible.
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.47/62
We therefore obtain: kh
= −
∇h
∇f
+ α
. By taking the limit when k → +∞: lim
k→+∞ kh
= −
−1 ∇h (x∗)⊤ ∇f (x∗)
∆
= λ∗ . Take now the limit in (1) to obtain: ∇f (x∗) + ∇h (x∗) λ∗ = 0 . The second-order condition is also obtained by taking a limit from the second-order optimality condition of xk [Bersteskas p.288].
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.48/62
m
xxL (x∗, λ∗) y ≥ 0 ,
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.49/62
1 + x2 2 + x2 3
1 + λ∗ = 0 ,
2 + λ∗ = 0 ,
3 + λ∗ = 0 ,
1 = x∗ 2 = x∗ 3 = 1 .
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.50/62
n
n
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.51/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.52/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.53/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.54/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.55/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.56/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.57/62
1, . . . , λ∗ m) and µ∗ = (µ∗ 1, . . . , µ∗ r) such that the following
j ≥ 0 ,
j = 0 ,
m
r
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.58/62
r
j (x)
j (x) = max (0, gj(x)) ,
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.59/62
1 + x2 2 + x2 3
1 + µ∗ = 0 ,
2 + µ∗ = 0 ,
3 + µ∗ = 0 .
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.60/62
1 + x∗ 2 + x∗ 3 < −3 ,
1 = x∗ 2 = x∗ 3 = 0
1 + x∗ 2 + x∗ 3 = −3 .
1 = x∗ 2 = x∗ 3 = −1 and µ∗ = 1, which
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.61/62
Nonlinear optimization c
2003-2006 Jean-Philippe Vert, (Jean-Philippe.Vert@mines.org) – p.62/62