7 separating hyperplane theorems ii
play

7. Separating Hyperplane Theorems II Daisuke Oyama Mathematics II - PowerPoint PPT Presentation

7. Separating Hyperplane Theorems II Daisuke Oyama Mathematics II May 7, 2020 Farkas Lemma Proposition 7.16 (Farkas Lemma) Let A R M N and b R N . The following conditions are equivalent: 1. There exists x R M such that A


  1. 7. Separating Hyperplane Theorems II Daisuke Oyama Mathematics II May 7, 2020

  2. Farkas’ Lemma Proposition 7.16 (Farkas’ Lemma) Let A ∈ R M × N and b ∈ R N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x = b and x ≥ 0 . 2. For any y ∈ R N , if Ay ≥ 0 , then b T y ≥ 0 . For proof, we will use the following: Lemma 7.17 { A T x ∈ R N | x ∈ R M + } is a closed set. 1 / 32

  3. Proof of Farkas’ Lemma ▶ (1) ⇒ (2): Immediate. ▶ (2) ⇒ (1): Suppose that (1) does not hold. Let K = { A T x ∈ R N | x ∈ R M + } . Then b / ∈ K . ▶ K is convex, and by Lemma 7.17 is closed. ▶ Then by the Separating Hyperplane Theorem, there exist y ∈ R N with y ̸ = 0 and c ∈ R such that y T b < c ≤ y T z for all z ∈ K, and therefore, y T b < inf z ∈ K y T z . ▶ Since K is a cone, it follows that inf z ∈ K y T z = 0 . ( → Homework) ▶ Thus we have y T b < 0 , and y T A T x ≥ 0 for all x ≥ 0 , which implies that y T A T ≥ 0 T . 2 / 32

  4. Proof of Lemma 7.17 Show that K = { A T x ∈ R N | x ∈ R M + } is closed. ▶ Denote the column vectors in A T by a 1 , . . . , a M , so that K = Cone { a 1 , . . . , a M } . ▶ Let { z m } be a sequence in K , and suppose that z m → ¯ z . We want to show that ¯ z ∈ K . eodory’s Theorem, for each m , z m is written as ▶ By Carath´ a conic combination of a linearly independent subset of { a 1 , . . . , a M } . ▶ Since there are finitely many such subsets, there is a linearly independent subset { a i 1 , . . . , a i L } such that infinitely many elements of { z m } are written as its conic combinations. ▶ Denote B = ( a i 1 a i L ) ∈ R N × L , and denote the · · · corresponding subsequence again by { z m } . 3 / 32

  5. ▶ Denote z m = Bλ m , where λ m ∈ R L + . ▶ We have B T z m = B T Bλ m , where B T B ∈ R L × L is non-singular: ▶ Let B T Bx = 0 . ▶ Then x T B T Bx = 0 , where x T B T Bx = ∥ Bx ∥ 2 . ▶ Therefore, x T B T Bx = 0 if and only if Bx = 0 . ▶ Since the columns of B are linearly independent, this holds if and only if x = 0 . ▶ Therefore, we have λ m = ( B T B ) − 1 B T z m . ▶ By the continuity of ( B T B ) − 1 B T z in z , λ m converges to ¯ z , where ¯ λ = ( B T B ) − 1 B T ¯ λ ∈ R L + . ▶ Thus, by the continuity of Bλ in λ , we have z = lim m →∞ Bλ m = B ¯ ¯ λ , so that ¯ z ∈ K . 4 / 32

  6. Variants of Farkas’ Lemma Proposition 7.18 (Farkas’ Lemma: Inequality version) Let A ∈ R M × N and b ∈ R N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x ≤ b and x ≥ 0 . 2. For any y ∈ R N , if y ≥ 0 and Ay ≥ 0 , then b T y ≥ 0 . 5 / 32

  7. Proof ▶ (1) ⇒ (2): Immediate. ▶ (2) ⇒ (1): Assume (2). ( A ) It implies that for all y ∈ R N , if y ≥ 0 , then b T y ≥ 0 . I ▶ By Farkas’ Lemma, there exist x ∈ R M and z ∈ R N such that x ≥ 0 , z ≥ 0 , and ) ( x ) ( A T I = b, z or A T x + z = b , and therefore, A T x ≤ b . 6 / 32

  8. Linear Programming Let A ∈ R K × N , f ∈ R N , c ∈ R K . Primal problem: x ∈ R N f T x (P) max s. t. Ax ≤ c x ≥ 0 . Dual problem: λ ∈ R K c T λ (D) min A T λ ≥ f s. t. λ ≥ 0 . The Lagrangians for the two problems coincide (the nonnegativity constraints aside): L ( x, λ ) = f T x − λ T ( Ax − c ) = c T λ − x T ( A T λ − f ) . 7 / 32

  9. Weak Duality Proposition 7.19 If x ∈ R N and λ ∈ R K are feasible for (P) and (D) , respectively, then f T x ≤ c T λ . Proof ▶ If x ∈ R N and λ ∈ R K are feasible for (P) and (D), then f T x ≤ ( A T λ ) T x = λ T ( Ax ) ≤ λ T c. x ∈ R N and ¯ λ ∈ R K are feasible and if f T ¯ x = c T ¯ Therefore, if ¯ λ , x and ¯ then ¯ λ are solutions to (P) and (D), respectively. 8 / 32

  10. Strong Duality Proposition 7.20 Suppose that both (P) and (D) are feasible. Then both (P) and (D) have solutions, and max { f T x | Ax ≤ c, x ≥ 0 } = min { c T λ | A T λ ≥ f, λ ≥ 0 } . 9 / 32

  11. Proof ▶ Suppose that (P) and (D) are feasible. We want to show that there exist x ∈ R N and λ ∈ R K such that Ax ≤ c , A T λ ≥ f , f T x ≥ c T λ , x ≥ 0 , and λ ≥ 0 , or  A O   c  ( x )  , x ≥ 0 , λ ≥ 0 . − A T O ≤ − f    λ − f T c T 0 ▶ By Proposition 7.18, this is equivalent to the condition that for all p ∈ R K , q ∈ R N , and r ∈ R ,   A O  ≥ 0 , p ≥ 0 , q ≥ 0 , r ≥ 0 ( p T q T ) − A T r O  − f T c T  c   ≥ 0 . ( p T q T ) ⇒ r − f  0 10 / 32

  12. ▶ That is, (1) A T p ≥ rf, Aq ≤ rc, p ≥ 0 , q ≥ 0 , r ≥ 0 implies (2) c T p − f T q ≥ 0 . We want to show that this holds whenever (P) and (D) are feasible. ▶ For r > 0 , (1) implies that q/r and p/r are feasible solutions to (P) and (D), so that we have c T p − f T q = r [ c T ( p/r ) − f T ( q/r )] ≥ 0 by Weak Duality. ▶ For r = 0 , let x and λ be feasible solutions to (P) and (D). From (1), we have c T p − f T q ≥ x T A T p − λ T Aq ≥ 0 . 11 / 32

  13. Strong Duality Proposition 7.21 1. Suppose that (D) has a solution. Then (P) has a solution, and max { f T x | Ax ≤ c, x ≥ 0 } = min { c T λ | A T λ ≥ f, λ ≥ 0 } . 2. Suppose that (P) has a solution. Then (D) has a solution, and max { f T x | Ax ≤ c, x ≥ 0 } = min { c T λ | A T λ ≥ f, λ ≥ 0 } . 12 / 32

  14. Proof ▶ Suppose that (D) has a solution. In light of Proposition 7.20, it suffices to show that (P) has a feasible solution. ▶ Let λ ∗ ∈ R K be a solution to (D). To apply Proposition 7.18, let z ∈ R K be such that A T z ≥ 0 and z ≥ 0 . ▶ Then λ ∗ + z ≥ 0 , and A T ( λ ∗ + z ) = A T λ ∗ + A T z ≥ f , which means that λ ∗ + z is feasible in (D). ▶ Therefore, by the optimality of λ ∗ , we have 0 ≤ c T ( λ ∗ + z ) − c T λ ∗ = c T z . ▶ By Proposition 7.18, there exists x ∈ R N such that Ax ≤ c and x ≥ 0 . 13 / 32

  15. Variants of Farkas’ Lemma Proposition 7.22 (Gale’s Theorem/Fan’s Theorem) Let A ∈ R M × N and b ∈ R N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x ≤ b . 2. For any y ∈ R N , if y ≥ 0 and Ay = 0 , then b T y ≥ 0 . 14 / 32

  16. Proof ▶ (1) ⇒ (2): Immediate. ▶ (2) ⇒ (1): Assume (2). ( A ) It implies that for all y ∈ R N , if y ≥ 0 and y ≥ 0 , then − A b T y ≥ 0 . ▶ By Proposition 7.18, there exist x ∈ R M and z ∈ R M such that x ≥ 0 , z ≥ 0 , and − A T ) ( x ) ( A T ≤ b, z or A T ( x − z ) ≤ b . 15 / 32

  17. Variants of Farkas’ Lemma Proposition 7.23 (Gordan’s Theorem) Let A ∈ R M × N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x ≫ 0 . 2. For any y ∈ R N , if y ≥ 0 and Ay = 0 , then y = 0 . 16 / 32

  18. Proof ▶ (1) ⇒ (2): Immediate. ▶ (2) ⇒ (1): Assume (2). It implies that for all y ∈ R N , if y ≥ 0 and ( − A ) y = 0 , then ( − 1 T ) y ≥ 0 . ▶ By Gale’s Theorem (Proposition 7.22), there exists x ∈ R M such that ( − A T ) x ≤ − 1 , or A T x ≥ 1 ≫ 0 . 17 / 32

  19. Variants of Farkas’ Lemma Proposition 7.24 (Ville/von Neumann-Morgenstern I) Let A ∈ R M × N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x ≫ 0 and x ≫ 0 . 2. For any y ∈ R N , if y ≥ 0 and Ay ≤ 0 , then y = 0 . 18 / 32

  20. Variants of Farkas’ Lemma ▶ In fact, “there exists x ∈ R M such that A T x ≫ 0 and x ≫ 0 ” is equivalent to “there exists x ∈ R M such that A T x ≫ 0 and x ≥ 0 ”. ▶ Given an x ≥ 0 in the latter, consider x + ε 1 for sufficiently small ε > 0 . Proposition 7.25 (Ville/von Neumann-Morgenstern II) Let A ∈ R M × N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x ≫ 0 and x ≥ 0 . 2. For any y ∈ R N , if y ≥ 0 and Ay ≤ 0 , then y = 0 . 19 / 32

  21. Proof of Proposition 7.24 ▶ (1) ⇒ (2): Immediate. ▶ (2) ⇒ (1): Assume (2). ▶ For y ∈ R N and z ∈ R M , assume that ) ( y ) ( A I = 0 , y ≥ 0 , z ≥ 0 . z ▶ Then we have Ay + z = 0 , and hence Ay = − z ≤ 0 by z ≥ 0 . ▶ Then by (2), we have y = 0 , and hence z = 0 . ▶ By Gordan’s Theorem (Proposition 7.23), there exists x ∈ R M such that ( A T ) x ≫ 0 , I or A T x ≫ 0 and x ≫ 0 . 20 / 32

  22. Variants of Farkas’ Lemma Proposition 7.26 Let A ∈ R M × N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x ≤ 0 and x ≫ 0 . 2. For any y ∈ R N , if y ≥ 0 and Ay ≥ 0 , then Ay = 0 . 21 / 32

  23. Proof ▶ (1) ⇒ (2): Immediate. ▶ (2) ⇒ (1): Assume (2). Then there do not exist y ∈ R N and z ∈ R M such that Ay = z, 1 T z = 1 , y ≥ 0 , z ≥ 0 , or ( − A ) ( y ) ( 0 M ) I = , y ≥ 0 , z ≥ 0 . 0 T 1 T z 1 N ▶ Then by “not (1) ⇒ not (2)” in Farkas’ Lemma, there exist x ∈ R M and α ∈ R such that ( − A T ) ( x ) ) ( x ) 0 N ( 0 T ≥ 0 , 1 < 0 , M I α α 1 or A T x ≤ 0 , x ≥ − α 1 , α < 0 , as desired. 22 / 32

  24. Variants of Farkas’ Lemma Proposition 7.27 (Stiemke’s Lemma) Let A ∈ R M × N . The following conditions are equivalent: 1. There exists x ∈ R M such that A T x = 0 and x ≫ 0 . 2. For any y ∈ R N , if Ay ≥ 0 , then Ay = 0 . 23 / 32

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend