
CS675: Convex and Combinatorial Optimization Fall 2019 Duality of Convex Optimization Problems

Instructor: Shaddin Dughmi


Outline

1. The Lagrange Dual Problem
2. Duality
3. Optimality Conditions

Recall: Optimization Problem in Standard Form

minimize f0(x)
subject to fi(x) ≤ 0, for i = 1, . . . , m,
hi(x) = 0, for i = 1, . . . , k.

For convex optimization problems in standard form, each fi is convex and each hi is affine. Let D denote the common domain of all these functions (i.e., the set where their values are finite).

This Lecture + Next

We will develop duality theory for convex optimization problems, generalizing linear programming duality.


Running Example: Linear Programming

We have already seen the standard form LP below:

maximize c⊺x subject to Ax ≤ b, x ≥ 0

or, written as a minimization problem in standard form,

−minimize −c⊺x subject to Ax − b ≤ 0, −x ≤ 0.

Along the way, we will recover the following standard form dual:

minimize y⊺b subject to A⊺y ≥ c, y ≥ 0.


The Lagrangian

minimize f0(x) subject to fi(x) ≤ 0 for i = 1, . . . , m; hi(x) = 0 for i = 1, . . . , k.

The basic idea of Lagrangian duality is to relax/soften the constraints by replacing each with a linear “penalty term” or “cost” in the objective.

The Lagrangian Function

L(x, λ, ν) = f0(x) + ∑_{i=1}^m λi fi(x) + ∑_{i=1}^k νi hi(x)

λi is the Lagrange multiplier for the i’th inequality constraint, and is required to be nonnegative.
νi is the Lagrange multiplier for the i’th equality constraint, and is allowed to be of arbitrary sign.
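To make the penalty view concrete, here is a minimal sketch on a toy instance of my own (minimize x² subject to 1 − x ≤ 0), not taken from the slides:

```python
# Toy instance (my own, not from the slides): minimize f0(x) = x^2
# subject to f1(x) = 1 - x <= 0 (i.e., x >= 1); no equality constraints.

def f0(x):
    return x * x

def f1(x):
    return 1.0 - x

def lagrangian(x, lam):
    # L(x, lambda) = f0(x) + lambda * f1(x); lambda must be nonnegative.
    return f0(x) + lam * f1(x)

# For any feasible x (f1(x) <= 0) and lambda >= 0, the penalty term is
# nonpositive, so L(x, lambda) <= f0(x).
x_feasible = 1.5
for lam in [0.0, 1.0, 2.0, 5.0]:
    assert lagrangian(x_feasible, lam) <= f0(x_feasible)
```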


The Lagrange Dual Function

minimize f0(x) subject to fi(x) ≤ 0 for i = 1, . . . , m; hi(x) = 0 for i = 1, . . . , k.

The Lagrange dual function gives the optimal value of the primal problem subject to the softened constraints:

g(λ, ν) = inf_{x∈D} L(x, λ, ν) = inf_{x∈D} ( f0(x) + ∑_{i=1}^m λi fi(x) + ∑_{i=1}^k νi hi(x) )

Observe: g is a concave function of the Lagrange multipliers, being a pointwise infimum of functions affine in (λ, ν).
We will see: it's quite common for the Lagrange dual function to be unbounded (−∞) for some λ and ν. By convention, the domain of g is the set of (λ, ν) such that g(λ, ν) > −∞.
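For a toy instance of my own (minimize x² subject to 1 − x ≤ 0), the dual function has the closed form g(λ) = λ − λ²/4 (the unconstrained minimizer of x² + λ(1 − x) is x = λ/2), and a brute-force infimum over a grid can check it numerically:

```python
# Dual function for the toy problem min x^2 s.t. 1 - x <= 0 (assumed
# example, not from the slides). Closed form: the unconstrained minimizer
# of x^2 + lam*(1 - x) is x = lam/2, giving g(lam) = lam - lam^2/4.

def g_closed_form(lam):
    return lam - lam * lam / 4.0

def g_grid(lam, lo=-10.0, hi=10.0, steps=20001):
    # Crude numerical infimum of the Lagrangian over a grid of x values.
    h = (hi - lo) / (steps - 1)
    return min(
        (lo + i * h) ** 2 + lam * (1.0 - (lo + i * h)) for i in range(steps)
    )

p_star = 1.0  # primal optimum: x = 1, f0(1) = 1
for lam in [0.0, 0.5, 1.0, 2.0, 3.0]:
    assert abs(g_grid(lam) - g_closed_form(lam)) < 1e-6
    assert g_closed_form(lam) <= p_star + 1e-12  # weak duality
# g is maximized at lam = 2, where g(2) = 1 = p*: no duality gap here.
```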



Lagrange Dual of LP

−minimize −c⊺x subject to Ax − b ≤ 0, −x ≤ 0

First, the Lagrangian function, with multipliers λ1 for the constraints Ax − b ≤ 0 and λ2 for −x ≤ 0:

L(x, λ) = −c⊺x + λ1⊺(Ax − b) − λ2⊺x = (A⊺λ1 − c − λ2)⊺x − λ1⊺b

And the Lagrange dual:

g(λ) = inf_x L(x, λ) = −λ1⊺b if A⊺λ1 − c − λ2 = 0, and −∞ otherwise.

So we restrict the domain of g to λ satisfying A⊺λ1 − c − λ2 = 0.
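A quick numeric sanity check of this derivation on a one-variable instance (the numbers are my own): once the coefficient of x vanishes, the Lagrangian is constant in x.

```python
# Tiny numeric check (1 variable, 1 constraint; my own numbers): for the
# LP  -min -c*x  s.t.  a*x - b <= 0, -x <= 0  with c=3, a=2, b=4,
# pick lam1 = 2 and lam2 = a*lam1 - c = 1 >= 0, so the coefficient of x
# in the Lagrangian vanishes and L is the constant -lam1*b.
c, a, b = 3.0, 2.0, 4.0
lam1 = 2.0
lam2 = a * lam1 - c          # = 1, nonnegative as required
assert lam2 >= 0

def L(x):
    return -c * x + lam1 * (a * x - b) - lam2 * x

vals = [L(x) for x in (-5.0, 0.0, 1.0, 100.0)]
assert all(abs(v - (-lam1 * b)) < 1e-9 for v in vals)  # constant = -8
```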


Interpretation: “Soft” Lower Bound

min f0(x) subject to fi(x) ≤ 0 for i = 1, . . . , m; hi(x) = 0 for i = 1, . . . , k.

The Lagrange Dual Function

g(λ, ν) = inf_{x∈D} L(x, λ, ν) = inf_{x∈D} ( f0(x) + ∑_{i=1}^m λi fi(x) + ∑_{i=1}^k νi hi(x) )

Fact

g(λ, ν) is a lower bound on OPT(primal) for every λ ≥ 0 and ν ∈ R^k.

Proof

For every primal feasible x, the penalty terms of L(x, λ, ν) are nonpositive: λi fi(x) ≤ 0 since λi ≥ 0 and fi(x) ≤ 0, and νi hi(x) = 0. In particular, for a primal optimal x∗, L(x∗, λ, ν) ≤ f0(x∗). Therefore g(λ, ν) ≤ L(x∗, λ, ν) ≤ f0(x∗) = OPT(primal).

Interpretation

A “hard” feasibility constraint can be thought of as imposing a penalty of +∞ if violated, and a penalty/reward of 0 if satisfied. The Lagrangian instead imposes a “soft” linear penalty for violating a constraint, and a reward for slack. The Lagrange dual function finds the optimal value subject to these soft constraints.


Interpretation: Geometric

Most easily visualized in the presence of a single inequality constraint: minimize f0(x) subject to f1(x) ≤ 0. Let G be the set of attainable (constraint value, objective value) tuples, i.e., (u, t) ∈ G if there is an x such that f1(x) = u and f0(x) = t.

p∗ = inf { t : (u, t) ∈ G, u ≤ 0 }
g(λ) = inf { λu + t : (u, t) ∈ G }

The line λu + t = g(λ) is a supporting hyperplane to G pointing northeast. It must intersect the vertical axis at or below p∗; therefore g(λ) ≤ p∗.
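This picture can be checked numerically on a toy problem (my own choice: f0(x) = x², f1(x) = 1 − x): sample G, form the supporting line, and verify it lies below G and below p∗.

```python
# Numeric illustration of the geometric picture (toy problem, assumed:
# f0(x) = x^2, f1(x) = 1 - x). Sample G = {(f1(x), f0(x))}, then check
# that the line lam*u + t = g(lam) supports G from below.
xs = [-5.0 + i * 0.001 for i in range(10001)]          # x in [-5, 5]
G = [(1.0 - x, x * x) for x in xs]                     # (u, t) pairs

lam = 2.0
g_lam = min(lam * u + t for (u, t) in G)               # g(2), = 1 here
p_star = min(t for (u, t) in G if u <= 0)              # = 1 (at x = 1)

assert all(lam * u + t >= g_lam - 1e-12 for (u, t) in G)  # supporting line
assert g_lam <= p_star + 1e-12                            # weak duality
```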

The Lagrange Dual Problem

This is the problem of finding the best lower bound on OPT(primal) implied by the Lagrange dual function:

maximize g(λ, ν) subject to λ ≥ 0

Note: this is a convex optimization problem (maximizing a concave function), regardless of whether the primal problem was convex. By convention, we sometimes add “dual feasibility” constraints to impose nontrivial lower bounds (i.e., g(λ, ν) > −∞). A pair (λ∗, ν∗) solving the above is referred to as a dual optimal solution.


Lagrange Dual Problem of LP

maximize c⊺x subject to Ax ≤ b, x ≥ 0, written in standard form as −minimize −c⊺x subject to Ax − b ≤ 0, −x ≤ 0.

Recall: our Lagrange dual function for this minimization LP, defined over the domain A⊺λ1 − c − λ2 = 0, is g(λ) = −λ1⊺b.

The Lagrange dual problem can then be written as

−maximize −λ1⊺b subject to A⊺λ1 − c − λ2 = 0, λ ≥ 0.

Since λ2 ≥ 0 serves only as a slack variable, we can eliminate it and replace the equality constraint by A⊺λ1 ≥ c. Renaming y = λ1, this is exactly the standard form dual

minimize y⊺b subject to A⊺y ≥ c, y ≥ 0.
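A sanity check of the recovered dual on a one-variable LP (the numbers are my own): the primal and dual optima coincide.

```python
# Sanity check of the recovered LP dual on a 1-variable instance (my own
# numbers): maximize c*x s.t. a*x <= b, x >= 0 with c, a, b > 0.
c, a, b = 3.0, 2.0, 4.0

x_star = b / a                 # primal optimum at the binding constraint
primal_opt = c * x_star        # = 6

y_star = c / a                 # dual: min y*b s.t. a*y >= c, y >= 0
dual_opt = y_star * b          # = 6

assert a * x_star <= b + 1e-12 and x_star >= 0          # primal feasible
assert a * y_star >= c - 1e-12 and y_star >= 0          # dual feasible
assert abs(primal_opt - dual_opt) < 1e-9                # strong duality
```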


Another Example: Conic Optimization Problem

minimize c⊺x subject to Ax = b, x ∈ K

The constraint x ∈ K can equivalently be written as z⊺x ≤ 0 for all z ∈ K◦, the polar cone of K. With a multiplier λz ≥ 0 for each such constraint,

L(x, λ, ν) = c⊺x + ν⊺(b − Ax) + ∑_{z∈K◦} λz z⊺x = (c − A⊺ν + ∑_{z∈K◦} λz z)⊺x + ν⊺b

We can think of λ ≥ 0 as choosing some s ∈ K◦ (a nonnegative combination of polar vectors), giving L(x, s, ν) = (c − A⊺ν + s)⊺x + ν⊺b. The Lagrange dual function g(s, ν) is bounded only when the coefficient of x is zero, in which case it has value ν⊺b. Eliminating s (we need s = A⊺ν − c ∈ K◦), the dual problem is

maximize ν⊺b subject to A⊺ν − c ∈ K◦
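As a sketch of the polar-cone reformulation in the special case K = R²₊ (where, as I understand it, K◦ is the nonpositive orthant), membership in K matches the test z⊺x ≤ 0 against polar directions:

```python
# Special-case check (assumed instance: K = nonnegative orthant in R^2,
# so the polar cone K° is the nonpositive orthant): x is in K iff
# z^T x <= 0 for all z in K°.
import itertools

def in_K(x):
    return all(xi >= 0 for xi in x)

# Finitely many sampled polar directions (enough for this orthant case).
polar_samples = [(-1.0, 0.0), (0.0, -1.0), (-1.0, -1.0), (-0.5, -2.0)]

def passes_polar_test(x):
    return all(z[0] * x[0] + z[1] * x[1] <= 0 for z in polar_samples)

for x in itertools.product([-1.0, 0.0, 2.0], repeat=2):
    assert in_K(x) == passes_polar_test(x)
```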



Weak Duality

Primal Problem: min f0(x) s.t. fi(x) ≤ 0 ∀i = 1, . . . , m; hi(x) = 0 ∀i = 1, . . . , k.
Dual Problem: max g(λ, ν) s.t. λ ≥ 0.

Weak Duality

OPT(dual) ≤ OPT(primal). We have already argued this holds for every optimization problem. The duality gap is the difference between the optimal primal and dual values.


Recall: Geometric Interpretation of Weak Duality

minimize f0(x) subject to f1(x) ≤ 0. Let G be the set of attainable (constraint value, objective value) tuples: (u, t) ∈ G if there is an x such that f1(x) = u and f0(x) = t.

p∗ = inf { t : (u, t) ∈ G, u ≤ 0 }
g(λ) = inf { λu + t : (u, t) ∈ G }

Fact

The equation λu + t = g(λ) defines a supporting hyperplane to G, intersecting the t axis at g(λ) ≤ p∗.

Strong Duality

We say strong duality holds if OPT(dual) = OPT(primal). Equivalently: there exists a setting of the Lagrange multipliers so that g(λ, ν) gives a tight lower bound on the primal optimal value. In general, strong duality does not hold for non-convex optimization problems. It usually, but not always, holds for convex optimization problems; mild assumptions, such as Slater’s condition, are needed.


Geometric Proof of Strong Duality

minimize f0(x) subject to f1(x) ≤ 0. Let A be everything northeast of (i.e., “worse” than) G: (u, t) ∈ A if there is an x such that f1(x) ≤ u and f0(x) ≤ t.

p∗ = inf { t : (0, t) ∈ A }
g(λ) = inf { λu + t : (u, t) ∈ A }

Fact

The equation λu + t = g(λ) defines a supporting hyperplane to A, intersecting the t axis at g(λ) ≤ p∗.


Geometric Proof of Strong Duality

minimize f0(x) subject to f1(x) ≤ 0

Fact

When f0 and f1 are convex, A is convex.

Proof

Assume (u, t) and (u′, t′) are in A. Then there exist x, x′ with (f1(x), f0(x)) ≤ (u, t) and (f1(x′), f0(x′)) ≤ (u′, t′), componentwise. By Jensen’s inequality (convexity of f0 and f1), for any α ∈ [0, 1],

(f1(αx + (1 − α)x′), f0(αx + (1 − α)x′)) ≤ (αu + (1 − α)u′, αt + (1 − α)t′).

Therefore the segment connecting (u, t) and (u′, t′) is also in A.


Geometric Proof of Strong Duality

Theorem (Informal)

There is a choice of λ so that g(λ) = p∗; therefore, strong duality holds.

Proof

Recall that (0, p∗) is on the boundary of A. By the supporting hyperplane theorem, there is a supporting hyperplane to A at (0, p∗). The direction (normal) of this supporting hyperplane gives us an appropriate λ.


I Lied (A Little)

In our proof, we ignored a technicality that can prevent strong duality from holding. What if our supporting hyperplane H at (0, p∗) is vertical? Then the normal to H is perpendicular to the t axis, and no finite λ exists such that (λ, 1) is normal to H. Somewhat counterintuitively, this can happen even in simple convex optimization problems (though it's somewhat rare in practice).


Violation of Strong Duality

minimize e^{−x} subject to x²/y ≤ 0

where the domain of the constraint is restricted to the region y ≥ 1. The problem is convex, with feasible region given by x = 0 (and y ≥ 1). The optimal value is 1, attained at x = 0 and y = 1.

Here A = R²₊₊ ∪ ({0} × [1, ∞)). Therefore, any supporting hyperplane to A at (0, 1) must be vertical. The optimal dual value is 0: a duality gap of 1.
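The gap can be demonstrated numerically (the specific evaluation points below are my own): for any λ ≥ 0, the Lagrangian can be driven arbitrarily close to 0, so the dual value is 0 while the primal optimum is 1.

```python
# Numeric illustration (evaluation points are my own): for the problem
# min e^{-x} s.t. x^2 / y <= 0 with y >= 1, the dual function
# g(lam) = inf_{x, y>=1} e^{-x} + lam * x^2 / y is 0 for every lam >= 0:
# take x large and y = x^3, so both terms vanish.
import math

def lagrangian(x, y, lam):
    return math.exp(-x) + lam * x * x / y

for lam in [0.5, 1.0, 10.0]:
    x = 1e6
    val = lagrangian(x, x ** 3, lam)   # e^{-1e6} + lam / 1e6
    assert 0 <= val < 1e-4             # so g(lam) <= val, far below p* = 1
# Hence OPT(dual) = 0 < 1 = OPT(primal): a duality gap of 1.
```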


Slater’s Condition

There exists a point x ∈ D where all inequality constraints are strictly satisfied (i.e., fi(x) < 0); that is, the optimization problem is strictly feasible. This is a sufficient condition for strong duality: it forces the supporting hyperplane to be non-vertical. It can be weakened to require strict feasibility only of the non-affine constraints.


Recall: Lagrangian Duality

Primal Problem: min f0(x) s.t. fi(x) ≤ 0 ∀i = 1, . . . , m; hi(x) = 0 ∀i = 1, . . . , k.
Dual Problem: max g(λ, ν) s.t. λ ≥ 0.

Weak Duality: OPT(dual) ≤ OPT(primal).
Strong Duality: OPT(dual) = OPT(primal).


Dual Solution as a Certificate

A dual solution serves as a certificate of optimality: if f0(x) = g(λ, ν) and both are feasible, then both are optimal. If f0(x) − g(λ, ν) ≤ ε, then both are within ε of optimality, since OPT(primal) and OPT(dual) both lie in the interval [g(λ, ν), f0(x)]. Primal-dual algorithms use dual certificates to recognize optimality, or to bound sub-optimality.
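A minimal certificate check on an assumed toy problem (minimize x² subject to 1 − x ≤ 0, whose dual function works out to g(λ) = min_x x² + λ(1 − x) = λ − λ²/4):

```python
# Certificate check for the toy problem min x^2 s.t. 1 - x <= 0 (assumed
# example): x = 1 with lambda = 2 certifies optimality, since
# f0(1) = 1 equals g(2) = 2 - 2^2/4 = 1.
def f0(x):
    return x * x

def g(lam):
    # Dual function of the toy problem: min_x x^2 + lam*(1 - x).
    return lam - lam * lam / 4.0

x, lam = 1.0, 2.0
assert 1.0 - x <= 0 and lam >= 0          # primal and dual feasibility
assert abs(f0(x) - g(lam)) < 1e-12        # zero duality gap: both optimal
```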


Implications of Strong Duality

Primal Problem: min f0(x) s.t. fi(x) ≤ 0 ∀i = 1, . . . , m; hi(x) = 0 ∀i = 1, . . . , k. Dual Problem: max g(λ, ν) s.t. λ ≥ 0.

Facts

If strong duality holds, and x∗ and (λ∗, ν∗) are feasible and optimal, then:
x∗ minimizes L(x, λ∗, ν∗) over all x.
λi∗ fi(x∗) = 0 for all i = 1, . . . , m (complementary slackness).

Proof

f0(x∗) = g(λ∗, ν∗) = min_x L(x, λ∗, ν∗)
≤ L(x∗, λ∗, ν∗) = f0(x∗) + ∑_{i=1}^m λi∗ fi(x∗) + ∑_{i=1}^k νi∗ hi(x∗)
≤ f0(x∗)

The last inequality holds because each λi∗ fi(x∗) ≤ 0 and each hi(x∗) = 0. Since the chain starts and ends at f0(x∗), every inequality is tight: x∗ minimizes L(x, λ∗, ν∗), and each term λi∗ fi(x∗) must equal 0.

Interpretation

The Lagrange multipliers (λ∗, ν∗) “simulate” the primal feasibility constraints. Interpreting λi as the “value” of the i’th constraint: at optimality, only the binding constraints are “valuable.” Recall the economic interpretation of LP duality.


Primal: min f0(x) s.t. fi(x) ≤ 0 ∀i = 1, . . . , m; hi(x) = 0 ∀i = 1, . . . , k. Dual: max g(λ, ν) s.t. λ ≥ 0.

KKT Conditions

Suppose the primal problem is convex and defined on an open domain, and moreover the objective and constraint functions are differentiable everywhere in the domain. If strong duality holds, then x∗ and (λ∗, ν∗) are optimal iff:
x∗ and (λ∗, ν∗) are feasible;
λi∗ fi(x∗) = 0 for all i (complementary slackness);
∇x L(x∗, λ∗, ν∗) = ∇f0(x∗) + ∑_{i=1}^m λi∗ ∇fi(x∗) + ∑_{i=1}^k νi∗ ∇hi(x∗) = 0 (stationarity).

Why are KKT Conditions Useful?

They let us derive an analytical solution to some convex optimization problems, and gain structural insights.

Example: Equality-Constrained Quadratic Program

minimize (1/2) x⊺Px + q⊺x + r subject to Ax = b

KKT conditions: Ax∗ = b and Px∗ + q + A⊺ν∗ = 0. Finding an optimal solution is simply a matter of solving a linear system in the variables x∗ and ν∗: m + n equations and m + n unknowns.
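A worked instance (the numbers are my own): with P = I, q = 0, A = [1 1], b = 1, the KKT system can be solved by hand, and the solution checked against both conditions.

```python
# Worked instance of the KKT system (my own numbers): minimize
# (1/2)(x1^2 + x2^2) subject to x1 + x2 = 1, i.e., P = I, q = 0,
# A = [1 1], b = 1. The KKT system [P A^T; A 0][x; nu] = [-q; b] gives,
# by symmetry, x1 = x2 = 1/2, and stationarity then forces nu = -1/2.
x = [0.5, 0.5]
nu = -0.5

# Feasibility: A x = b
assert abs(x[0] + x[1] - 1.0) < 1e-12
# Stationarity: P x + q + A^T nu = 0, componentwise (here: x_i + nu = 0)
assert all(abs(xi + nu) < 1e-12 for xi in x)
```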


Example: Market Equilibria (Fisher’s Model)

Buyers B and goods G. Buyer i has utility uij for each unit of good j ∈ G. Buyer i has budget mi, and there is one divisible unit of each good.

Does there exist a market equilibrium? That is, prices pj on the items such that each buyer can buy his favorite bundle among those he can afford, and the market clears (supply = demand).

Eisenberg–Gale Convex Program

maximize ∑_i mi log ( ∑_j uij xij )
subject to ∑_i xij ≤ 1, for j ∈ G,
x ≥ 0.

Using the KKT conditions, we can prove that the dual variables corresponding to the item supply constraints are market-clearing prices!
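A sketch check on a trivial instance of my own devising (two buyers, each valuing only one distinct good): the candidate prices clear the market, exhaust the budgets, and satisfy the Eisenberg–Gale stationarity condition mi·uij / (∑_j uij xij) = pj wherever xij > 0 (my reading of the KKT conditions for this program; not worked out on the slides).

```python
# Trivial Fisher-market instance (my own): buyer 1 only values good 1,
# buyer 2 only values good 2. Claimed equilibrium: each buyer takes
# "their" good, priced at that buyer's budget.
m = [3.0, 5.0]                     # budgets
u = [[1.0, 0.0], [0.0, 1.0]]       # utilities u[i][j]
x = [[1.0, 0.0], [0.0, 1.0]]       # allocation x[i][j]
p = [3.0, 5.0]                     # candidate equilibrium prices

for j in range(2):                 # market clearing: supply = demand
    assert abs(sum(x[i][j] for i in range(2)) - 1.0) < 1e-12
for i in range(2):                 # budgets exactly spent
    assert abs(sum(p[j] * x[i][j] for j in range(2)) - m[i]) < 1e-12
for i in range(2):                 # stationarity wherever x[i][j] > 0
    util = sum(u[i][j] * x[i][j] for j in range(2))
    for j in range(2):
        if x[i][j] > 0:
            assert abs(m[i] * u[i][j] / util - p[j]) < 1e-12
```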