Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline - - PowerPoint PPT Presentation

β–Ά
lecture 5 sos proofs and the
SMART_READER_LITE
LIVE PREVIEW

Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline - - PowerPoint PPT Presentation

Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline Part I: SOS proofs and examples Part II: Motzkin Polynomial Part I: SOS proofs and examples SOS proofs Fundamental question: What can we say about the


slide-1
SLIDE 1

Lecture 5: SOS Proofs and the Motzkin Polynomial

slide-2
SLIDE 2

Lecture Outline

  • Part I: SOS proofs and examples
  • Part II: Motzkin Polynomial
slide-3
SLIDE 3

Part I: SOS proofs and examples

slide-4
SLIDE 4

SOS proofs

  • Fundamental question: What can we say about

the pseudo-expectation values SOS gives us?

  • In other words, which statements that are true

for any expectation of an actual distribution of solutions must also be true for pseudo- expectation values?

slide-5
SLIDE 5

Non-negativity of Squares

  • Trivial but extremely useful: If 𝑔 is a sum of

squares i.e. 𝑔 = Οƒπ‘˜ π‘•π‘˜

2 then ΰ·¨

𝐹 𝑔 β‰₯ 0

  • Example: If 𝑔 = 𝑦2 βˆ’ 4𝑦 + 5 then ΰ·¨

𝐹 𝑔 β‰₯ 0 as 𝑔 = 𝑦 βˆ’ 2 2 + 1. In fact, ΰ·¨ 𝐹 𝑔 β‰₯ 1

slide-6
SLIDE 6

Single Variable Polynomials

  • Theorem: For a single-variable polynomial p(x),

π‘ž(𝑦) is non-negative ⬄ π‘ž(𝑦) is a sum of squares.

  • Proof: By induction on the degree 𝑒
  • Base case 𝑒 = 0 is trivial
  • If 𝑒 > 0, let 𝑑 β‰₯ 0 be the minimal value of π‘ž(𝑦) and

let 𝑏 be a zero of π‘ž 𝑦 βˆ’ 𝑑. Since π‘ž 𝑦 βˆ’ 𝑑 is non- negative, it has a zero of order 2𝑙 at 𝑏 for some integer 𝑙 β‰₯ 1 (the order must be even).

  • Write π‘ž = 𝑦 βˆ’ 𝑏 2π‘™π‘žβ€² + 𝑑 where π‘žβ€² =

π‘žβˆ’π‘‘ π‘¦βˆ’π‘ 2𝑙 is

non-negative and thus a sum of squares.

slide-7
SLIDE 7

Degree 2 Polynomials

  • Given a degree 2 polynomial 𝑔, we can write

𝑔 𝑦1, 𝑦2, … , π‘¦π‘œ = σ𝑗,π‘˜ π‘‘π‘—π‘˜π‘¦π‘—π‘¦π‘˜ where cπ‘˜π‘— = π‘‘π‘—π‘˜ for all 𝑗 and π‘˜.

  • Taking 𝑁 to be the coefficient matrix where

π‘π‘—π‘˜ = π‘‘π‘—π‘˜, we can write 𝑁 = σ𝑗 πœ‡π‘—π‘€π‘—π‘€π‘—

π‘ˆ where

the {𝑀𝑗} are orthonormal. Now

1. 𝑔 𝑦 = π‘¦π‘ˆπ‘π‘¦. 2. 𝑔(𝑦) = σ𝑗 πœ‡π‘— π‘¦π‘ˆπ‘€π‘—π‘€π‘—

π‘ˆπ‘¦ = σ𝑗 πœ‡π‘— Οƒπ‘˜=1 π‘œ

π‘€π‘—π‘˜π‘¦π‘˜

2

slide-8
SLIDE 8

Degree 2 Polynomials

  • We have that

1. 𝑁 = σ𝑗 πœ‡π‘—π‘€π‘—π‘€π‘—

π‘ˆ where the {𝑀𝑗} are orthonormal.

2. 𝑔 𝑦 = π‘¦π‘ˆπ‘π‘¦ 3. 𝑔 = σ𝑗 πœ‡π‘— Οƒπ‘˜=1

π‘œ

π‘€π‘—π‘˜π‘¦π‘˜

2

  • If 𝑁 ≽ 0 then βˆ€π‘—, πœ‡π‘— β‰₯ 0 so 𝑔 is a sum of squares
  • If 𝑁 is not PSD then πœ‡π‘— < 0 for some 𝑗. Taking

𝑦 = 𝑀𝑗, 𝑔 𝑦 = 𝑀𝑗

π‘ˆπ‘π‘€π‘— < 0 so 𝑔 is not non-

negative.

  • Thus if deg 𝑔 = 2, 𝑔 is non-negative ⬄𝑔 is SOS
slide-9
SLIDE 9

Cauchy Schwarz Inequality

  • Cauchy-Schwarz inequality:

σ𝑗 𝑔

𝑗𝑕𝑗 2 ≀ σ𝑗 𝑔 𝑗 2

σ𝑗 𝑕𝑗

2

  • Extremely useful
  • Proof: Consider 𝑔 and 𝑕 as vectors. Cauchy-

Schwarz is equivalent to 𝑔 β‹… 𝑕 2 ≀ 𝑔 2 𝑕 2

  • This is true as 𝑔 β‹… 𝑕 2 = 𝑔 2 𝑕 2 cos2 Θ

where Θ is the angle between 𝑔 and 𝑕.

  • How about an SOS proof?
slide-10
SLIDE 10

Cauchy Schwarz: SOS Proof

  • Cauchy-Schwarz: σ𝑗 𝑔

𝑗𝑕𝑗 2 ≀ σ𝑗 𝑔 𝑗 2

σ𝑗 𝑕𝑗

2

  • Building block: For all 𝑗 and π‘˜,

𝑔

π‘—π‘•π‘˜ βˆ’ 𝑔 π‘˜π‘•π‘— 2 = 𝑔 𝑗 2π‘•π‘˜ 2 + 𝑔 π‘˜ 2𝑕𝑗 2 βˆ’ 2𝑔 𝑗𝑕𝑗𝑔 π‘˜π‘•π‘˜ β‰₯ 0

  • Note that:

1. σ𝑗<π‘˜(𝑔

𝑗 2π‘•π‘˜ 2 + 𝑔 π‘˜ 2𝑕𝑗 2) = σ𝑗 𝑔 𝑗 2

σ𝑗 𝑕𝑗

2 βˆ’ σ𝑗 𝑔 𝑗 2𝑕𝑗 2

2. 2 σ𝑗<π‘˜(𝑔

𝑗𝑕𝑗𝑔 π‘˜π‘•π‘˜) = σ𝑗 𝑔 𝑗𝑕𝑗 2 βˆ’ σ𝑗 𝑔 𝑗 2𝑕𝑗 2

  • Final proof: σ𝑗,π‘˜:𝑗<π‘˜ 𝑔

π‘—π‘•π‘˜ βˆ’ 𝑔 π‘˜π‘•π‘— 2 =

σ𝑗 𝑔

𝑗 2

σ𝑗 𝑕𝑗

2 βˆ’ σ𝑗 𝑔 𝑗𝑕𝑗 2 β‰₯ 0

slide-11
SLIDE 11

SOS Proofs With Constraints

  • What if we also have constraints

𝑑1 𝑦1, … , π‘¦π‘œ = 0, 𝑑2 𝑦1, … , π‘¦π‘œ = 0, etc.?

  • An SOS proof that β„Ž β‰₯ 𝑑 now takes the form

β„Ž = 𝑑 + σ𝑗 𝑔

𝑗𝑑𝑗 + Οƒπ‘˜ π‘•π‘˜ 2

  • Example: If 𝑦2 = 1 then x β‰₯ βˆ’1. Proof:

𝑦 + 1 = 𝑦2

2 + 𝑦 + 1 2 = 1 2 𝑦 + 1 2 β‰₯ 0

slide-12
SLIDE 12

Combining Proofs

  • If there is an SOS proof of degree 𝑒1 that 𝑔 β‰₯ 0

and an SOS proof of degree 𝑒2 that 𝑕 β‰₯ 0 then:

  • 1. There is an SOS proof of degree 𝑛𝑏𝑦{𝑒1, 𝑒2} that

𝑔 + 𝑕 β‰₯ 0

  • 2. There is an SOS proof of degree 𝑒1 + 𝑒2 that

𝑔𝑕 β‰₯ 0

slide-13
SLIDE 13

Products of Pseudo-expectation Values

  • What if our statements involve products of

pseudo-expectation values?

  • Example: We showed that

ΰ·¨ 𝐹 σ𝑗 𝑔

𝑗𝑕𝑗 2 ≀ ΰ·¨

𝐹 σ𝑗 𝑔

𝑗 2

σ𝑗 𝑕𝑗

2

What if we instead want to show that ΰ·¨ 𝐹 σ𝑗 𝑔

𝑗𝑕𝑗 2 ≀ ΰ·¨

𝐹 σ𝑗 𝑔

𝑗 2 ΰ·¨

𝐹 σ𝑗 𝑕𝑗

2 ?

  • Requires modified proof, see problem set
  • Can often prove such statements by using ΰ·¨

𝐹 values as constants in the proof.

slide-14
SLIDE 14

Example: Variance

  • For any random variable 𝑦, 𝐹 𝑦2 β‰₯ 𝐹 𝑦

2

  • Also true for pseudo-expectation values, i.e. for

any polynomial 𝑔, ΰ·¨ 𝐹 𝑔2 β‰₯ ΰ·¨ 𝐹 𝑔

2

  • Proof: Given ΰ·¨

𝐹, let 𝑑 = ΰ·¨ 𝐹[𝑔] and observe that ΰ·¨ 𝐹 𝑔 βˆ’ 𝑑 2 = ΰ·¨ 𝐹 𝑔2 βˆ’ 2𝑑 ΰ·¨ 𝐹 𝑔 + c2 = ΰ·¨ 𝐹 𝑔2 βˆ’ ΰ·¨ 𝐹 𝑔

2 β‰₯ 0

slide-15
SLIDE 15

In-class exercises

  • 1. Prove that ΰ·¨

𝐹 𝑦4 βˆ’ 4𝑦 + 3 β‰₯ 0

  • 2. Prove that

ΰ·¨ 𝐹 𝑦2 + 2𝑧2 + 6𝑨2 + 2𝑦𝑧 + 2𝑦𝑨 + 6𝑧𝑨 β‰₯ 0

  • 3. Prove that if 𝑦2 + 𝑧2 = 1 then 𝑦 + 𝑧 ≀

2

  • 4. Prove that if ΰ·¨

𝐹 𝑦2 = 0 then for any function 𝑔

  • f degree at most 𝑒

2, ΰ·¨

𝐹 𝑦𝑔 = 0.

slide-16
SLIDE 16

In-class exercise answers

  • 1. Prove that ΰ·¨

𝐹 𝑦4 βˆ’ 4𝑦 + 3 β‰₯ 0 Answer: 𝑦4 βˆ’ 4𝑦 + 3 = 𝑦 βˆ’ 1 2 𝑦2 + 2𝑦 + 3 = 𝑦 βˆ’ 1 2( 𝑦 + 1 2 + 2)

slide-17
SLIDE 17

In-class exercise answers

  • 2. Prove that

ΰ·¨ 𝐹 𝑦2 + 2𝑧2 + 6𝑨2 + 2𝑦𝑧 + 2𝑦𝑨 + 6𝑧𝑨 β‰₯ 0 Answer: The coefficient matrix for this polynomial is M = 1 1 1 1 2 3 1 3 6 One non-orthonormal factorization is 𝑁 = 𝑀1𝑀1

π‘ˆ + 𝑀2𝑀2 π‘ˆ + 𝑀3𝑀3 π‘ˆ where 𝑀1 π‘ˆ = [1

1 1], 𝑀2

π‘ˆ = [0

1 2], 𝑀3

π‘ˆ = [0

1],

slide-18
SLIDE 18

In-class exercise answers

This gives us that 𝑦2 + 2𝑧2 + 6𝑨2 + 2𝑦𝑧 + 2𝑦𝑨 + 6𝑧𝑨 = 𝑦 + 𝑧 + 𝑨 2 + 𝑧 + 2𝑨 2 + 𝑨2

slide-19
SLIDE 19

In-class exercise answers

  • 3. Prove that if we have the constraint 𝑦2 + 𝑧2 = 1

then ΰ·¨ 𝐹 𝑦 + 𝑧 ≀ 2 Answer: 2 βˆ’ 𝑦 βˆ’ 𝑧 =

𝑦2+𝑧2 2

βˆ’ 𝑦 βˆ’ 𝑧 +

1 2 = π‘¦βˆ’π‘§ 2 2 2 + 𝑦+𝑧 2 2 2 βˆ’ 𝑦 βˆ’ 𝑧 + 1 2 = π‘¦βˆ’π‘§ 2 2 2 + 1 2 2 𝑦 + 𝑧 βˆ’

2

2 β‰₯ 0

slide-20
SLIDE 20

In-class exercise answers

  • 4. Prove that if ΰ·¨

𝐹 𝑦2 = 0 then for any function 𝑔

  • f degree at most

𝑒 2 βˆ’ 1, ΰ·¨

𝐹 𝑦𝑔 = 0. Answer: Observe that for any constant 𝐷, ΰ·¨ 𝐹 𝑔 βˆ’ 𝐷𝑦 2 = ΰ·¨ 𝐹 𝑔2 βˆ’ 2𝐷 ΰ·¨ 𝐹 𝑦𝑔 + ΰ·¨ 𝐹 𝑦2 = ΰ·¨ 𝐹 𝑔2 βˆ’ 2𝐷 ΰ·¨ 𝐹 𝑦𝑔 β‰₯ 0 The only way this can be true for all 𝐷 us if ΰ·¨ 𝐹 𝑦𝑔 = 0.

slide-21
SLIDE 21

Part II: Motzkin Polynomial

slide-22
SLIDE 22

Non-negative vs. SOS polynomials

  • Unfortunately, not all non-negative polynomials

are SOS.

  • Are equivalent in the special cases where π‘œ = 1

(single-variable polynomials), 𝑒 = 2 (quadratic polynomials), or π‘œ = 2, 𝑒 = 4 (quartic polynomials with two variables)

  • Hilbert [Hil1888]: In all other cases, there are

non-negative polynomials which are not sums of squares of polynomials.

  • Motzkin [Mot67] found the first explicit example.
slide-23
SLIDE 23

Motzkin Polynomial

  • Motzkin Polynomial:

π‘ž 𝑦, 𝑧 = 𝑦4𝑧2 + 𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 1

  • Question 1: Why is it non-negative?
  • Question 2: How can we show it is not a sum of

squares of polynomials?

slide-24
SLIDE 24

AM-GM inequality

  • Arithmetic mean/Geometric mean Inequality:

π‘œ ς𝑗=1

π‘œ

𝑦𝑗 ≀ 1

π‘œ σ𝑗=1 π‘œ

𝑦𝑗 if βˆ€π‘—, 𝑦𝑗 β‰₯ 0 with equality if and only if all of the 𝑦𝑗 are equal.

  • Proof: Minimize

1 π‘œ σ𝑗=1 π‘œ

𝑦𝑗 βˆ’

π‘œ ς𝑗=1

π‘œ

𝑦𝑗

  • Derivative with respect to π‘¦π‘˜ is

1 π‘œ

1 βˆ’

π‘œ Ο‚π‘—β‰ π‘˜ 𝑦𝑗 π‘œ π‘¦π‘˜ π‘œβˆ’1

  • Setting this to 0 for all π‘˜, βˆ€π‘˜, π‘¦π‘˜ =

π‘œ ς𝑗=1

π‘œ

𝑦𝑗

slide-25
SLIDE 25

Motzkin Polynomial Non-negativity

  • Motzkin Polynomial:

π‘ž 𝑦, 𝑧 = 𝑦4𝑧2 + 𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 1

  • Applying AM-GM with 𝑦4𝑧2, 𝑧2𝑦4, 1,

𝑦2𝑧2 =

3

𝑦4𝑧2 β‹… 𝑧2𝑦4 β‹… 1 ≀ 𝑦4𝑧2+ 𝑧2𝑦4+ 1 3

  • Multiplying this by 3, π‘ž 𝑦, 𝑧 β‰₯ 0
slide-26
SLIDE 26

Newton Polytope

  • Given a polynomial, assign a point to each

monomial based on the degree of each variable. Examples:

1. 𝑦2𝑧 is assigned the point (2,1) 2. 𝑧5 is assigned the point (0,5) 3. 𝑦𝑧2𝑨3 is assigned the point (1,2,3)

  • The Newton polytope of a polynomial is the

convex hull of the points assigned to each monomial.

slide-27
SLIDE 27

Newton Polytope Example

  • Example: Newton Polytope for the polynomial

π‘ž 𝑦 = 3𝑦2𝑧4 βˆ’ 𝑦4𝑧3 βˆ’ 2𝑦3𝑧 + 4

  • Note that the coefficients in front of the

monomials don’t change the polytope.

0 1 2 3 4 5 6 1 2 3 4 5 6

slide-28
SLIDE 28

Newton Polytope of a Sum of Squares

  • Let 𝑔 be a sum of squares, i.e. 𝑔 = Οƒπ‘˜ π‘•π‘˜

2

  • Claim: The Newton polytope of 𝑔 is 2π‘Œ where π‘Œ

is the convex hull of all the points corresponding to some monomial in some π‘•π‘˜

  • Proposition: If π‘ž, π‘Ÿ are monomials with

corresponding points 𝑏, 𝑐 then π‘žπ‘Ÿ corresponds to the point 𝑏 + 𝑐

  • One direction: Let π‘Œ

π‘˜ be the Newton polytope of

π‘•π‘˜. The Newton polytope of π‘•π‘˜

2 βŠ† 2π‘Œ π‘˜ βŠ† 2π‘Œ.

Thus, the Newton polytope of 𝑔 βŠ† 2π‘Œ.

slide-29
SLIDE 29

Newton Polytope of a Sum of Squares

  • Other direction: If π‘ž, π‘Ÿ, 𝑠 are monomials where

π‘žπ‘  = π‘Ÿ2 and 𝑏, 𝑐, 𝑑 are the corresponding points, 𝑏 + 𝑑 = 2𝑐

  • Corollary: If 𝑐 is a vertex of π‘Œ corresponding to a

monomial π‘Ÿ then if

1. π‘ž, 𝑠 are monomials appearing in some π‘•π‘˜ (and thus their corresponding points 𝑏, 𝑑 are in π‘Œ) 2. π‘žπ‘  = π‘Ÿ2

then π‘ž = 𝑠 = π‘Ÿ.

slide-30
SLIDE 30

Newton Polytope of a Sum of Squares

  • Corollary: If 𝑐 is a vertex of π‘Œ corresponding

to a monomial π‘Ÿ then π‘Ÿ2 appears with positive coefficient in 𝑔 = Οƒπ‘˜ π‘•π‘˜

2.

  • This implies that 2π‘Œ βŠ† the Newton polytope
  • f 𝑔
  • Putting everthing together, the Newton

polytope of 𝑔is 2π‘Œ.

slide-31
SLIDE 31

Motzkin Polynomial Newton Polytope

  • Motzkin polynomial:

π‘ž 𝑦 = 𝑦4𝑧2 + 𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 1

0 1 2 3 4 5 6 1 2 3 4 5 6

slide-32
SLIDE 32

Motzkin Polynomial Newton Polytope

  • If π‘ž(𝑦) were a sum of squares of polynomials,

their corresponding points would have to be inside the following polytope.

0 1 2 3 4 5 6 1 2 3 4 5 6

slide-33
SLIDE 33

Motzkin is not a Sum of Squares

  • If π‘ž 𝑦 = 𝑦4𝑧2 + 𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 1 were a

sum of squares of polynomials, it would have to be a sum of terms of the form 𝑏𝑦2𝑧 + 𝑐𝑦𝑧2 + 𝑑𝑦𝑧 + 𝑒

2

  • However, no such term has a negative coefficient
  • f 𝑦2𝑧2. Contradiction.
slide-34
SLIDE 34

Showing Polynomials are not SOS

  • Is there a more general way to show a

polynomial is not a sum of squares?

  • Observation: By definition, if 𝑔 = Οƒπ‘˜ π‘•π‘˜

2 then

for any valid pseudo-expectation values, ΰ·¨ 𝐹 𝑔 = Οƒπ‘˜ ΰ·¨ 𝐹 π‘•π‘˜

2 β‰₯ 0

  • Thus, if we can find pseudo-expectation values

such that ΰ·¨ 𝐹 𝑔 < 0, then 𝑔 is not a sum of squares of polynomials.

slide-35
SLIDE 35

Motzkin is a Rational Function of Sums of Squares

  • π‘ž 𝑦 = 𝑦4𝑧2 + 𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 1
  • 𝑦2 + 𝑧2 + 1 π‘ž 𝑦 = 𝑦6𝑧2 + 2𝑧4𝑦4 + 𝑦2𝑧6 βˆ’

2𝑦4𝑧2 βˆ’ 2𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 𝑦2 + 𝑧2 + 1

  • This is a sum of squares. The components are:

1. 2

1 2 𝑦3𝑧 + 1 2 𝑦𝑧3 βˆ’ 𝑦𝑧 2

=

1 2 𝑦6𝑧2 + 2𝑧4𝑦4 + 𝑦2𝑧6 βˆ’ 2𝑦4𝑧2 βˆ’ 2𝑦2𝑧4 +

2𝑦2𝑧2 2. 𝑦2𝑧 βˆ’ 𝑧

2 = 𝑦4𝑧2 βˆ’ 2𝑦2𝑧2 + 𝑧2

3. 𝑦𝑧2 βˆ’ 𝑦

2 = 𝑦2𝑧4 βˆ’ 2𝑦2𝑧2 + 𝑦2

4.

1 2 𝑦3𝑧 βˆ’ 𝑦𝑧 2 = 1 2 𝑦6𝑧2 βˆ’ 𝑦4𝑧2 + 1 2 𝑦2𝑧2

5.

1 2 𝑦𝑧3 βˆ’ 𝑦𝑧 2 = 1 2 𝑦2𝑧6 βˆ’ 𝑦2𝑧4 + 1 2 𝑦2𝑧2

6. 𝑦2𝑧2 βˆ’ 1

2 = 𝑦4𝑧4 βˆ’ 2𝑦2𝑧2 + 1

slide-36
SLIDE 36

Can SOS use Rational Functions?

  • π‘ž 𝑦 = 𝑦4𝑧2 + 𝑦2𝑧4 βˆ’ 3𝑦2𝑧2 + 1
  • π‘ž 𝑦 =

Οƒπ‘˜ π‘•π‘˜

2

𝑦2+𝑧2+1 β‰₯ 0

  • Can the SOS hierarchy use such reasoning?
  • Yes and no… (see problem set)
slide-37
SLIDE 37

References

  • [Hil1888] D. Hilbert. Uber die darstellung definiter formen als summe von formen-
  • quadraten. Annals of Mathematics 32:342–350, 1888.
  • [Mot67] T. Motzkin. The arithmetic-geometric inequality. In Proc. Symposium on

Inequalities p. 205–224, 1967.