of Polynomials over a Box Georgina Hall Decision Sciences, INSEAD - - PowerPoint PPT Presentation

โ–ถ
of polynomials over a box
SMART_READER_LITE
LIVE PREVIEW

of Polynomials over a Box Georgina Hall Decision Sciences, INSEAD - - PowerPoint PPT Presentation

On Convexity of Polynomials over a Box Georgina Hall Decision Sciences, INSEAD Joint work with Amir Ali Ahmadi ORFE, Princeton University 1 Convexity over a box A box is a set of the form: =


slide-1
SLIDE 1

On Convexity

  • f Polynomials over a Box

Georgina Hall Decision Sciences, INSEAD Joint work with Amir Ali Ahmadi ORFE, Princeton University

1

slide-2
SLIDE 2

Convexity over a box

  • A box ๐‘ช is a set of the form:

๐ถ = ๐‘ฆ โˆˆ โ„๐‘œ ๐‘š๐‘— โ‰ค ๐‘ฆ๐‘— โ‰ค ๐‘ฃ๐‘—, ๐‘— = 1, โ€ฆ , ๐‘œ} where ๐‘š1, โ€ฆ , ๐‘š๐‘œ, ๐‘ฃ1, โ€ฆ , ๐‘ฃ๐‘œ โˆˆ โ„ with ๐‘š๐‘— โ‰ค ๐‘ฃ๐‘—.

  • A function ๐’ˆ is convex over ๐‘ช if

๐‘” ๐œ‡๐‘ฆ + 1 โˆ’ ๐œ‡ ๐‘ง โ‰ค ๐œ‡๐‘” ๐‘ฆ + 1 โˆ’ ๐œ‡ ๐‘”(๐‘ง) for any ๐‘ฆ, ๐‘ง โˆˆ ๐ถ and ๐œ‡ โˆˆ [0,1].

  • If ๐‘ช is full dimensional (i.e., ๐‘š๐‘— < ๐‘ฃ๐‘—, ๐‘— = 1, โ€ฆ , ๐‘œ),

this is equivalent to ๐›ผ2๐‘” ๐‘ฆ โ‰ฝ 0, โˆ€๐‘ฆ โˆˆ ๐ถ.

2

slide-3
SLIDE 3

Complexity questions

  • Restrict ourselves to polynomial functions.
  • Related work:

Theorem [Ahmadi, Olshevsky, Parrilo, Tsitsiklis] It is strongly NP-hard to test (global) convexity of polynomials of degree 4.

  • One may hope that adding the restriction to a box could make things easier.

3

Goal: study the complexity of testing convexity of a function over a box

slide-4
SLIDE 4

Our theorem

Why are we interested in convexity over a box?

4

Theorem [Ahmadi, H.] It is strongly NP-hard to test convexity of polynomials of degree 3 over a box.

  • Nonconvex optimization: branch-and-bound
  • Prior work:
  • Sufficient conditions for convexity [Orban et

al.], [Grant et al.]

  • In practice, BARON, CVX, Gurobi check

convexity of quadratics and computationally tractable sufficient conditions for convexity

Detecting Imposing

  • Control theory: convex Lyapunov functions
  • Statistics: convex regression

[Ahmadi and Jungers] [Chesi and Hung]

slide-5
SLIDE 5

Question: What to do a reduction from? Idea: A cubic polynomial ๐‘” is convex

  • ver a (full-dimensional) box ๐ถ if and
  • nly if ๐›ผ2๐‘” ๐‘ฆ โ‰ฝ 0, โˆ€๐‘ฆ โˆˆ ๐ถ

๐›ผ2๐‘”(๐‘ฆ) is a matrix with entries affine in ๐’š

Proof of the theorem

5

Theorem [Ahmadi, H.] It is strongly NP-hard to test convexity of polynomials of degree 3 over a box. How to prove this? In general: Theorem [Nemirovski]: Let ๐‘€(๐‘ฆ) be a matrix with entries affine in ๐‘ฆ. It is NP-hard to test whether ๐‘€ ๐‘ฆ โ‰ฝ 0 for all ๐‘ฆ in a full-dimensional box ๐ถ.

Generic instance I

  • f a known

NP-hard problem Instance J of problem we are interested in Construct J from I Reduction

slide-6
SLIDE 6

Are we done?

No!

6

Issue 1: We want to show strong NP-hardness. Nemirovskiโ€™s result shows weak NP- hardness. Issue 2: Not every affine polynomial matrix is a valid Hessian! Example: ๐‘€ ๐‘ฆ1, ๐‘ฆ2 = 10 2๐‘ฆ1 + 1 2๐‘ฆ1 + 1 10 . We have

๐œ–๐‘€11(๐‘ฆ) ๐œ–๐‘ฆ2

โ‰ 

๐œ–๐‘€12(๐‘ฆ) ๐œ–๐‘ฆ1 .

slide-7
SLIDE 7

Dealing with Issue 1 (1/5)

Reminder: weak vs strong NP-hardness

  • Distinction only concerns problems where input is numerical
  • Max(I): largest number in magnitude that appears in the input of instance I

(numerator or denominator)

  • Length(I): number of bits it takes to write down input of instance I

7

Strong Weak

  • There are instances ๐ฝ that are hard with

Max(๐ฝ)โ‰ค ๐‘ž(Length(๐ฝ)) (๐‘ž is a polynomial)

  • No pseudo-polynomial algorithm possible
  • Examples:

Max-Cut Sat

  • The instances that are hard may contain

numbers of large magnitude (e.g., 2๐‘œ).

  • Pseudo-polynomial algorithms possible
  • Examples:

Partition Knapsack

slide-8
SLIDE 8

Dealing with Issue 1 (2/5)

Why weakly NP-hard?

8

Theorem [Nemirovski]: INTERVAL-PSDNESS Let ๐‘€(๐‘ฆ) be a matrix with entries affine in ๐‘ฆ. It is (weakly) NP-hard to test whether ๐‘€ ๐‘ฆ โ‰ฝ 0 for all ๐‘ฆ in a full-dimensional box ๐ถ.

PARTITION: Input: ๐‘ โˆˆ โ„๐‘œ such that ๐‘ 2 โ‰ค 0.1 Test: does there exist ๐‘ข โˆˆ โˆ’1,1 ๐‘œ such that ฯƒ๐‘— ๐‘๐‘—๐‘ข๐‘— = 0? INTERVAL PSDNESS Construct: ๐ท = ๐ฝ๐‘œ โˆ’ ๐‘๐‘๐‘ˆ โˆ’1, ๐œˆ = ๐‘œ โˆ’ ๐‘’โˆ’2 ๐‘ , where ๐‘’ ๐‘ = smallest cd of ๐‘. Take: ๐ถ = โˆ’1,1 ๐‘œ and ๐‘€ ๐‘ฆ = ๐ท ๐‘ฆ ๐‘ฆ๐‘ˆ ๐œˆ . Test: Is ๐‘€ ๐‘ฆ โ‰ฝ 0 โˆ€๐‘ฆ โˆˆ ๐ถ? Show: No to PARTITION โ‡” Yes to INTERVAL PSDNESS

REDUCTION

Weakly NP-hard Operation that can make the numbers in the instance blow up Example: ๐ต = 1 โˆ’1 1 โ‹ฎ โˆ’1 โ‹ฑ โˆ’1 โ‹ฑ โˆ’1 1 but one of the entries of ๐ตโˆ’1 is 2๐‘œโˆ’2!

=

๐‘1 ๐‘3 ๐‘10 ๐‘2 ๐‘4 ๐‘8

slide-9
SLIDE 9

Dealing with Issue 1 (3/5)

9

Theorem [Ahmadi, H.]: INTERVAL-PSDNESS Let ๐‘€(๐‘ฆ) be a matrix with entries affine in ๐‘ฆ. It is strongly NP-hard to test whether ๐‘€ ๐‘ฆ โ‰ฝ 0 for all ๐‘ฆ in a full-dimensional box ๐ถ.

MAX-CUT: Input: simple graph G=(V,E) with ๐‘Š = ๐‘œ and adj. matrix A, and a positive integer ๐‘™ โ‰ค ๐‘œ2 Test: does there exist a cut in the graph of size greater or equal to ๐‘™? INTERVAL PSDNESS Construct: ๐›ฝ =

1 ๐‘œ+1 3 , ๐ท = 4๐›ฝ(๐ฝ๐‘œ + ๐›ฝ๐ต)

๐œˆ = ๐‘œ 4๐›ฝ + ๐‘™ โˆ’ 1 โˆ’ 1 4 ๐‘“๐‘ˆ๐ต๐‘“ Take: ๐ถ = โˆ’1,1 ๐‘œ and ๐‘€ ๐‘ฆ = ๐ท ๐‘ฆ ๐‘ฆ๐‘ˆ ๐œˆ . Test: Is ๐‘€ ๐‘ฆ โ‰ฝ 0 โˆ€๐‘ฆ โˆˆ ๐ถ? Show: No to MAX-CUT โ‡” Yes to INTERVAL PSDNESS

REDUCTION

Strongly NP-hard Taylor series of 4๐›ฝ ๐ฝ โˆ’ ๐›ฝ๐ต โˆ’1 truncated at the first term Scaling needed so that ๐ฝ๐‘œ โˆ’ ๐›ฝ๐ต โˆ’1 โ‰ˆ ๐ฝ๐‘œ + ๐›ฝ๐ต

Preserves strong NP-hardness

slide-10
SLIDE 10

Dealing with Issue 1 (4/5)

In more detail: No to MAX-CUT โ‡’ Yes to INTERVAL PSDNESS

10

No cut in ๐ป of size โ‰ฅ ๐‘™ [max

๐‘ฆโˆˆ โˆ’1,1 ๐‘œ 1 4 ฯƒ๐‘—,๐‘˜ ๐ต๐‘—๐‘˜(1 โˆ’ ๐‘ฆ๐‘—๐‘ฆ๐‘˜)] โ‰ค ๐‘™ โˆ’ 1

Size of largest cut in ๐ป

โ‡” โ‡”

[ max

๐‘ฆโˆˆ โˆ’1,1 ๐‘œ โˆ’ 1 4 ๐‘ฆ๐‘ˆ๐ต๐‘ฆ] โ‰ค โˆ’ 1 4 ๐‘“๐‘ˆ๐ต๐‘“ + ๐‘™ โˆ’ 1

โ‡”

[ max

๐‘ฆโˆˆ โˆ’1,1 ๐‘œ 1 4 ๐‘ฆ๐‘ˆ

๐‘œ + 1 3๐ฝ๐‘œ โˆ’ ๐ต ๐‘ฆ] โ‰ค

๐‘œ ๐‘œ+1 3 4

โˆ’

1 4 ๐‘“๐‘ˆ๐ต๐‘“ + ๐‘™ โˆ’ 1 โ‰” ๐œˆ

๐›ฝ = ๐‘œ + 1 3

Convex

โ‡”

[ max

๐‘ฆโˆˆ[โˆ’1,1]๐‘œ 1 4 ๐‘ฆ๐‘ˆ ๐›ฝ๐ฝ๐‘œ โˆ’ ๐ต ๐‘ฆ] โ‰ค ๐œˆ

โ‡”

1 4 ๐‘ฆ๐‘ˆ ๐›ฝ๐ฝ๐‘œ โˆ’ ๐ต ๐‘ฆ โ‰ค ๐œˆ, โˆ€๐‘ฆ โˆˆ โˆ’1,1 ๐‘œ

โ‡’

๐‘ฆ๐‘ˆ๐ทโˆ’1๐‘ฆ โ‰ค ๐œˆ +

1 4, โˆ€๐‘ฆ โˆˆ โˆ’1,1 ๐‘œ

Approximation ๐ทโˆ’1 โ‰ˆ

1 4 (๐›ฝ๐ฝ โˆ’ ๐ต)

Approximation error

โ‡’

Schur complement

๐‘€ ๐‘ฆ = ๐ท ๐‘ฆ ๐‘ฆ๐‘ˆ ๐œˆ +

1 4

โ‰ฝ 0, โˆ€๐‘ฆ โˆˆ โˆ’1,1 ๐‘œ

slide-11
SLIDE 11

Dealing with Issue 1 (5/5)

For converse: Yes to MAX-CUT โ‡’ No to INTERVAL PSDNESS

  • Initial problem studied by Nemirovski
  • Of independent interest in robust control

11

There is a cut of size โ‰ฅ ๐‘™: Let เทœ ๐‘ฆ๐‘— = แ‰Š 1 if node ๐‘— on one side of cut โˆ’1 if node ๐‘— on other side of cut

โ‡’

Similar steps to previously

โ‡’

เทœ ๐‘ฆ๐‘ˆ๐ทโˆ’1 เทœ ๐‘ฆ โ‰ฅ ๐œˆ + 3 4 > ๐œˆ + 1 4 โˆƒ เทœ ๐‘ฆ โˆˆ โˆ’1,1 ๐‘œ s.t. ๐‘€ เทœ ๐‘ฆ

โ‡’ Corollary [Ahmadi, H.]: Let ๐‘œ be an integer and let เทœ ๐‘Ÿ๐‘—๐‘˜, เดค ๐‘Ÿ๐‘—๐‘˜ be rational numbers with เทœ ๐‘Ÿ๐‘—๐‘˜ โ‰ค เดค ๐‘Ÿ๐‘—๐‘˜ and เทœ ๐‘Ÿ๐‘—๐‘˜ = เทœ ๐‘Ÿ๐‘˜๐‘— and เดค ๐‘Ÿ๐‘—๐‘˜ = เดค ๐‘Ÿ๐‘˜๐‘— for all ๐‘— = 1, โ€ฆ , ๐‘œ and ๐‘˜ = 1, โ€ฆ , ๐‘œ. It is strongly NP-hard to test whether all symmetric matrices with entries in [เทœ ๐‘Ÿ๐‘—๐‘˜; เดค ๐‘Ÿ๐‘—๐‘˜] are positive semidefinite.

slide-12
SLIDE 12

Dealing with Issue 2 (1/3)

Proof: Reduction from INTERVAL PSDNESS

12

Theorem [Ahmadi, H.] CONV3BOX It is strongly NP-hard to test convexity of polynomials of degree 3 over a box.

INTERVAL PSDNESS Input: ๐‘€ ๐‘ฆ , เท  ๐ถ Test: Is ๐‘€ ๐‘ฆ โ‰ฝ 0, โˆ€๐‘ฆ โˆˆ เท  ๐ถ? Problem: How to construct a cubic polynomial ๐‘” from ๐‘€(๐‘ฆ)? Idea: Want ๐›ผ2๐‘” ๐‘ฆ = ๐‘€ ๐‘ฆ . Issue: Not all ๐‘€(๐‘ฆ) are valid Hessians! Key ideas for the construction of ๐’ˆ:

  • Start with ๐’ˆ ๐’š, ๐’› =

๐Ÿ ๐Ÿ‘ ๐’›๐‘ผ๐‘ด ๐’š ๐’›

  • For ๐›ผ2๐‘” ๐‘ฆ, ๐‘ง to be able to be psd when ๐‘€ ๐‘ฆ โ‰ฝ 0 , we need to have

a nonzero diagonal: add

๐œท ๐Ÿ‘ ๐’š๐‘ผ๐’š to ๐‘” ๐‘ฆ, ๐‘ง .

  • ๐‘€ ๐‘ฆ and ๐ผ(๐‘ง) do not depend on the same variable: what if

โˆƒ(๐‘ฆ, ๐‘ง) s.t. ๐‘€ ๐‘ฆ = 0 but ๐ผ ๐‘ง is not? The matrix cannot be psd: add

๐œƒ 2 ๐‘ง๐‘ˆ๐‘ง to ๐‘” ๐‘ฆ, ๐‘ง .

๐›ผ2๐‘” ๐‘ฆ, ๐‘ง = 1 2 ๐ผ(๐‘ง) 1 2 ๐ผ ๐‘ง ๐‘ˆ ๐‘€ ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ, ๐‘ง = ๐œท๐‘ฑ๐’ 1 2 ๐ผ(๐‘ง) 1 2 ๐ผ ๐‘ง ๐‘ˆ ๐‘€ ๐‘ฆ ๐›ผ2๐‘” ๐‘ฆ, ๐‘ง = ๐œท๐‘ฑ๐’ 1 2 ๐ผ(๐‘ง) 1 2 ๐ผ ๐‘ง ๐‘ˆ ๐‘€ ๐‘ฆ + ๐œƒ๐ฝ๐‘œ+1 โ‡’ ๐‘” ๐‘ฆ = 1 2 ๐‘ง๐‘ˆ๐‘€ ๐‘ฆ ๐‘ง + ๐›ฝ 2 ๐‘ฆ๐‘ˆ๐‘ฆ + ๐œƒ 2 ๐‘ง๐‘ˆ๐‘ง, ๐ถ = โˆ’1,1 2๐‘œ+1

slide-13
SLIDE 13

Dealing with Issue 2 (2/3)

Show NO to INTERVAL PSDNESS โ‡’ NO to CONV3BOX. This is equivalent to: Need to leverage extra structure of ๐‘€ ๐‘ฆ : ๐‘€ ๐‘ฆ = ๐ท ๐‘ฆ ๐‘ฆ๐‘ˆ ๐œˆ +

1 4

13

โˆƒ าง ๐‘ฆ โˆˆ โˆ’1,1 ๐‘œ s.t. ๐‘€ าง ๐‘ฆ โ‰ฝ 0 โ‡’ โˆƒ เทœ ๐‘ฆ, เทœ ๐‘ง โˆˆ โˆ’1,1 2๐‘œ+1 , ๐‘จ s.t. ๐‘จ๐‘ˆ๐›ผ2๐‘” เทœ ๐‘ฆ, เทœ ๐‘ง ๐‘จ < 0 ๐›ผ2๐‘” ๐‘ฆ, ๐‘ง = ๐›ฝ๐ฝ๐‘œ ๐ผ(๐‘ง) ๐ผ ๐‘ง ๐‘ˆ ๐ท + ๐œƒ๐ฝ๐‘œ ๐‘ฆ ๐‘ฆ๐‘ˆ ๐œˆ + 1 4 + ๐œƒ ๐›ผ2๐‘” เทœ ๐‘ฆ, เทœ ๐‘ง = ๐›ฝ๐ฝ๐‘œ ๐Ÿ ๐Ÿ ๐Ÿ ๐‘ซ + ๐œƒ๐ฝ๐‘œ เดฅ ๐’š ๐Ÿ เดฅ ๐’š๐‘ผ ๐‚ + ๐Ÿ ๐Ÿ“ + ๐œƒ ๐‘จ๐‘ˆ๐›ผ2๐‘” เทœ ๐‘ฆ, เทœ ๐‘ง ๐‘จ = โˆ’๐ทโˆ’1 าง ๐‘ฆ 1

๐‘ˆ

๐›ฝ๐ฝ๐‘œ ๐Ÿ ๐Ÿ ๐Ÿ ๐‘ซ + ๐œƒ๐ฝ๐‘œ เดฅ ๐’š ๐Ÿ เดฅ ๐’š๐‘ผ ๐‚ + ๐Ÿ ๐Ÿ“ + ๐œƒ โˆ’๐ทโˆ’1 าง ๐‘ฆ 1 = ๐œˆ + 1 4 โˆ’ าง ๐‘ฆ๐‘ˆ๐ทโˆ’1 าง ๐‘ฆ + ๐œƒ(1 + ๐ทโˆ’1 าง ๐‘ฆ 2

2)

< ๐Ÿ as ๐‘ด เดฅ ๐’š โ‰ฝ ๐Ÿ Appropriately scaled so that ๐‘จ๐‘ˆ๐›ผ2๐‘” เทœ ๐‘ฆ, เทœ ๐‘ง ๐‘จ remains <0.

Candidates: เทœ ๐‘ฆ = าง ๐‘ฆ, เทœ ๐‘ง = 0, ๐‘จ = โˆ’๐ทโˆ’1 าง ๐‘ฆ 1 Candidates: เท ๐’š = เดฅ ๐’š, เท ๐’› = ๐Ÿ, ๐‘จ = โˆ’๐ทโˆ’1 าง ๐‘ฆ 1 Candidates: เทœ ๐‘ฆ = าง ๐‘ฆ, เทœ ๐‘ง = 0, ๐’œ = ๐Ÿ โˆ’๐‘ซโˆ’๐Ÿเดฅ ๐’š ๐Ÿ

slide-14
SLIDE 14

Dealing with Issue 2 (3/3)

Show YES to INTERVAL PSDNESS โ‡’ YES to CONV3BOX. This is equivalent to: Butโ€ฆ

14

๐‘€ ๐‘ฆ โ‰ฝ 0 โˆ€๐‘ฆ โˆˆ โˆ’1,1 ๐‘œ โ‡’ ๐›ผ2๐‘” ๐‘ฆ, ๐‘ง = ๐›ฝ๐ฝ๐‘œ 1 2 ๐ผ(๐‘ง) 1 2 ๐ผ ๐‘ง ๐‘ˆ ๐‘€ ๐‘ฆ + ๐œƒ๐ฝ๐‘œ+1 โ‰ฝ 0, โˆ€ ๐‘ฆ, ๐‘ง โˆˆ โˆ’1,1 2๐‘œ+1

โ‡”

โ‰ฝ ๐Ÿ โˆ€๐’š โˆˆ โˆ’๐Ÿ, ๐Ÿ ๐’ (Assumption)

๐‘ด ๐’š + ๐œฝ๐‘ฑ๐’+๐Ÿ โˆ’

๐Ÿ ๐Ÿ“๐œท ๐‘ฐ ๐’› ๐‘ผ๐‘ฐ ๐’› โ‰ฝ 0, โˆ€ ๐‘ฆ, ๐‘ง โˆˆ โˆ’1,1 2๐‘œ+1

๐œท chosen large enough so that โ‰ฝ ๐Ÿ โˆ€๐’› โˆˆ โˆ’๐Ÿ, ๐Ÿ ๐’+๐Ÿ

๐›ผ2๐‘” ๐‘ฆ, ๐‘ง โ‰ฝ 0, โˆ€ ๐‘ฆ, ๐‘ง โˆˆ โˆ’1,1 2๐‘œ+1 Schur

slide-15
SLIDE 15

Corollary

Completely classifies the complexity of testing convexity of a polynomial ๐‘” of degree ๐‘’ over a box for any integer ๐‘’ โ‰ฅ 1.

15

๐’†

๐‘’ = 1

๐‘” is always convex

๐‘’ = 2

๐›ผ2๐‘”(๐‘ฆ) constant

๐‘’ = 3

Previous theorem (strongly NP-hard)

๐‘’ = 4 and above

Strongly NP-hard

Proof sketch:

  • ๐‘• ๐‘ฆ1, โ€ฆ , ๐‘ฆ๐‘œ =cubic polynomial for which testing

convexity over a box ๐ถ is hard

  • ๐‘” ๐‘ฆ1, โ€ฆ , ๐‘ฆ๐‘œ, ๐‘ฆ๐‘œ+1 = ๐‘• ๐‘ฆ1, โ€ฆ , ๐‘ฆ๐‘œ + ๐‘ฆ๐‘œ+1

๐‘’

  • เทจ

๐ถ = ๐ถ ร— [0,1] We have ๐›ผ2๐‘” ๐‘ฆ, ๐‘ฆ๐‘œ+1 = ๐›ผ2๐‘•(๐‘ฆ) ๐‘’ ๐‘’ โˆ’ 1 ๐‘ฆ๐‘œ+1

๐‘’โˆ’2

โ‡’ ๐›ผ2๐‘” ๐‘ฆ, ๐‘ฆ๐‘œ+1 โ‰ฝ 0 on เทจ ๐ถ โ‡” ๐›ผ2๐‘• ๐‘ฆ โ‰ฝ 0 on ๐ถ

slide-16
SLIDE 16

Summary

  • Interested in testing convexity of a polynomial over a box.
  • Showed that strongly NP-hard to test convexity of cubics over a box.
  • Gave a complete characterization of the complexity of testing convexity
  • ver a box depending on the degree of the polynomial.
  • In the process, strengthened a result on the complexity of testing

positive semidefiniteness of symmetric matrices with entries belonging to intervals.

16

slide-17
SLIDE 17

Thank you for listening

Questions? Want to learn more? https://scholar.princeton.edu/ghall

17