Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline - - PowerPoint PPT Presentation
Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline - - PowerPoint PPT Presentation
Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline Part I: SOS proofs and examples Part II: Motzkin Polynomial Part I: SOS proofs and examples SOS proofs Fundamental question: What can we say about the
Lecture Outline
- Part I: SOS proofs and examples
- Part II: Motzkin Polynomial
Part I: SOS proofs and examples
SOS proofs
- Fundamental question: What can we say about
the pseudo-expectation values SOS gives us?
- In other words, which statements that are true
for any expectation of an actual distribution of solutions must also be true for pseudo- expectation values?
Non-negativity of Squares
- Trivial but extremely useful: If π is a sum of
squares i.e. π = Οπ ππ
2 then ΰ·¨
πΉ π β₯ 0
- Example: If π = π¦2 β 4π¦ + 5 then ΰ·¨
πΉ π β₯ 0 as π = π¦ β 2 2 + 1. In fact, ΰ·¨ πΉ π β₯ 1
Single Variable Polynomials
- Theorem: For a single-variable polynomial p(x),
π(π¦) is non-negative β¬ π(π¦) is a sum of squares.
- Proof: By induction on the degree π
- Base case π = 0 is trivial
- If π > 0, let π β₯ 0 be the minimal value of π(π¦) and
let π be a zero of π π¦ β π. Since π π¦ β π is non- negative, it has a zero of order 2π at π for some integer π β₯ 1 (the order must be even).
- Write π = π¦ β π 2ππβ² + π where πβ² =
πβπ π¦βπ 2π is
non-negative and thus a sum of squares.
Degree 2 Polynomials
- Given a degree 2 polynomial π, we can write
π π¦1, π¦2, β¦ , π¦π = Οπ,π ππππ¦ππ¦π where cππ = πππ for all π and π.
- Taking π to be the coefficient matrix where
πππ = πππ, we can write π = Οπ πππ€ππ€π
π where
the {π€π} are orthonormal. Now
1. π π¦ = π¦πππ¦. 2. π(π¦) = Οπ ππ π¦ππ€ππ€π
ππ¦ = Οπ ππ Οπ=1 π
π€πππ¦π
2
Degree 2 Polynomials
- We have that
1. π = Οπ πππ€ππ€π
π where the {π€π} are orthonormal.
2. π π¦ = π¦πππ¦ 3. π = Οπ ππ Οπ=1
π
π€πππ¦π
2
- If π β½ 0 then βπ, ππ β₯ 0 so π is a sum of squares
- If π is not PSD then ππ < 0 for some π. Taking
π¦ = π€π, π π¦ = π€π
πππ€π < 0 so π is not non-
negative.
- Thus if deg π = 2, π is non-negative β¬π is SOS
Cauchy Schwarz Inequality
- Cauchy-Schwarz inequality:
Οπ π
πππ 2 β€ Οπ π π 2
Οπ ππ
2
- Extremely useful
- Proof: Consider π and π as vectors. Cauchy-
Schwarz is equivalent to π β π 2 β€ π 2 π 2
- This is true as π β π 2 = π 2 π 2 cos2 Ξ
where Ξ is the angle between π and π.
- How about an SOS proof?
Cauchy Schwarz: SOS Proof
- Cauchy-Schwarz: Οπ π
πππ 2 β€ Οπ π π 2
Οπ ππ
2
- Building block: For all π and π,
π
πππ β π πππ 2 = π π 2ππ 2 + π π 2ππ 2 β 2π ππππ πππ β₯ 0
- Note that:
1. Οπ<π(π
π 2ππ 2 + π π 2ππ 2) = Οπ π π 2
Οπ ππ
2 β Οπ π π 2ππ 2
2. 2 Οπ<π(π
ππππ πππ) = Οπ π πππ 2 β Οπ π π 2ππ 2
- Final proof: Οπ,π:π<π π
πππ β π πππ 2 =
Οπ π
π 2
Οπ ππ
2 β Οπ π πππ 2 β₯ 0
SOS Proofs With Constraints
- What if we also have constraints
π‘1 π¦1, β¦ , π¦π = 0, π‘2 π¦1, β¦ , π¦π = 0, etc.?
- An SOS proof that β β₯ π now takes the form
β = π + Οπ π
ππ‘π + Οπ ππ 2
- Example: If π¦2 = 1 then x β₯ β1. Proof:
π¦ + 1 = π¦2
2 + π¦ + 1 2 = 1 2 π¦ + 1 2 β₯ 0
Combining Proofs
- If there is an SOS proof of degree π1 that π β₯ 0
and an SOS proof of degree π2 that π β₯ 0 then:
- 1. There is an SOS proof of degree πππ¦{π1, π2} that
π + π β₯ 0
- 2. There is an SOS proof of degree π1 + π2 that
ππ β₯ 0
Products of Pseudo-expectation Values
- What if our statements involve products of
pseudo-expectation values?
- Example: We showed that
ΰ·¨ πΉ Οπ π
πππ 2 β€ ΰ·¨
πΉ Οπ π
π 2
Οπ ππ
2
What if we instead want to show that ΰ·¨ πΉ Οπ π
πππ 2 β€ ΰ·¨
πΉ Οπ π
π 2 ΰ·¨
πΉ Οπ ππ
2 ?
- Requires modified proof, see problem set
- Can often prove such statements by using ΰ·¨
πΉ values as constants in the proof.
Example: Variance
- For any random variable π¦, πΉ π¦2 β₯ πΉ π¦
2
- Also true for pseudo-expectation values, i.e. for
any polynomial π, ΰ·¨ πΉ π2 β₯ ΰ·¨ πΉ π
2
- Proof: Given ΰ·¨
πΉ, let π = ΰ·¨ πΉ[π] and observe that ΰ·¨ πΉ π β π 2 = ΰ·¨ πΉ π2 β 2π ΰ·¨ πΉ π + c2 = ΰ·¨ πΉ π2 β ΰ·¨ πΉ π
2 β₯ 0
In-class exercises
- 1. Prove that ΰ·¨
πΉ π¦4 β 4π¦ + 3 β₯ 0
- 2. Prove that
ΰ·¨ πΉ π¦2 + 2π§2 + 6π¨2 + 2π¦π§ + 2π¦π¨ + 6π§π¨ β₯ 0
- 3. Prove that if π¦2 + π§2 = 1 then π¦ + π§ β€
2
- 4. Prove that if ΰ·¨
πΉ π¦2 = 0 then for any function π
- f degree at most π
2, ΰ·¨
πΉ π¦π = 0.
In-class exercise answers
- 1. Prove that ΰ·¨
πΉ π¦4 β 4π¦ + 3 β₯ 0 Answer: π¦4 β 4π¦ + 3 = π¦ β 1 2 π¦2 + 2π¦ + 3 = π¦ β 1 2( π¦ + 1 2 + 2)
In-class exercise answers
- 2. Prove that
ΰ·¨ πΉ π¦2 + 2π§2 + 6π¨2 + 2π¦π§ + 2π¦π¨ + 6π§π¨ β₯ 0 Answer: The coefficient matrix for this polynomial is M = 1 1 1 1 2 3 1 3 6 One non-orthonormal factorization is π = π€1π€1
π + π€2π€2 π + π€3π€3 π where π€1 π = [1
1 1], π€2
π = [0
1 2], π€3
π = [0
1],
In-class exercise answers
This gives us that π¦2 + 2π§2 + 6π¨2 + 2π¦π§ + 2π¦π¨ + 6π§π¨ = π¦ + π§ + π¨ 2 + π§ + 2π¨ 2 + π¨2
In-class exercise answers
- 3. Prove that if we have the constraint π¦2 + π§2 = 1
then ΰ·¨ πΉ π¦ + π§ β€ 2 Answer: 2 β π¦ β π§ =
π¦2+π§2 2
β π¦ β π§ +
1 2 = π¦βπ§ 2 2 2 + π¦+π§ 2 2 2 β π¦ β π§ + 1 2 = π¦βπ§ 2 2 2 + 1 2 2 π¦ + π§ β
2
2 β₯ 0
In-class exercise answers
- 4. Prove that if ΰ·¨
πΉ π¦2 = 0 then for any function π
- f degree at most
π 2 β 1, ΰ·¨
πΉ π¦π = 0. Answer: Observe that for any constant π·, ΰ·¨ πΉ π β π·π¦ 2 = ΰ·¨ πΉ π2 β 2π· ΰ·¨ πΉ π¦π + ΰ·¨ πΉ π¦2 = ΰ·¨ πΉ π2 β 2π· ΰ·¨ πΉ π¦π β₯ 0 The only way this can be true for all π· us if ΰ·¨ πΉ π¦π = 0.
Part II: Motzkin Polynomial
Non-negative vs. SOS polynomials
- Unfortunately, not all non-negative polynomials
are SOS.
- Are equivalent in the special cases where π = 1
(single-variable polynomials), π = 2 (quadratic polynomials), or π = 2, π = 4 (quartic polynomials with two variables)
- Hilbert [Hil1888]: In all other cases, there are
non-negative polynomials which are not sums of squares of polynomials.
- Motzkin [Mot67] found the first explicit example.
Motzkin Polynomial
- Motzkin Polynomial:
π π¦, π§ = π¦4π§2 + π¦2π§4 β 3π¦2π§2 + 1
- Question 1: Why is it non-negative?
- Question 2: How can we show it is not a sum of
squares of polynomials?
AM-GM inequality
- Arithmetic mean/Geometric mean Inequality:
π Οπ=1
π
π¦π β€ 1
π Οπ=1 π
π¦π if βπ, π¦π β₯ 0 with equality if and only if all of the π¦π are equal.
- Proof: Minimize
1 π Οπ=1 π
π¦π β
π Οπ=1
π
π¦π
- Derivative with respect to π¦π is
1 π
1 β
π Οπβ π π¦π π π¦π πβ1
- Setting this to 0 for all π, βπ, π¦π =
π Οπ=1
π
π¦π
Motzkin Polynomial Non-negativity
- Motzkin Polynomial:
π π¦, π§ = π¦4π§2 + π¦2π§4 β 3π¦2π§2 + 1
- Applying AM-GM with π¦4π§2, π§2π¦4, 1,
π¦2π§2 =
3
π¦4π§2 β π§2π¦4 β 1 β€ π¦4π§2+ π§2π¦4+ 1 3
- Multiplying this by 3, π π¦, π§ β₯ 0
Newton Polytope
- Given a polynomial, assign a point to each
monomial based on the degree of each variable. Examples:
1. π¦2π§ is assigned the point (2,1) 2. π§5 is assigned the point (0,5) 3. π¦π§2π¨3 is assigned the point (1,2,3)
- The Newton polytope of a polynomial is the
convex hull of the points assigned to each monomial.
Newton Polytope Example
- Example: Newton Polytope for the polynomial
π π¦ = 3π¦2π§4 β π¦4π§3 β 2π¦3π§ + 4
- Note that the coefficients in front of the
monomials donβt change the polytope.
0 1 2 3 4 5 6 1 2 3 4 5 6
Newton Polytope of a Sum of Squares
- Let π be a sum of squares, i.e. π = Οπ ππ
2
- Claim: The Newton polytope of π is 2π where π
is the convex hull of all the points corresponding to some monomial in some ππ
- Proposition: If π, π are monomials with
corresponding points π, π then ππ corresponds to the point π + π
- One direction: Let π
π be the Newton polytope of
ππ. The Newton polytope of ππ
2 β 2π π β 2π.
Thus, the Newton polytope of π β 2π.
Newton Polytope of a Sum of Squares
- Other direction: If π, π, π are monomials where
ππ = π2 and π, π, π are the corresponding points, π + π = 2π
- Corollary: If π is a vertex of π corresponding to a
monomial π then if
1. π, π are monomials appearing in some ππ (and thus their corresponding points π, π are in π) 2. ππ = π2
then π = π = π.
Newton Polytope of a Sum of Squares
- Corollary: If π is a vertex of π corresponding
to a monomial π then π2 appears with positive coefficient in π = Οπ ππ
2.
- This implies that 2π β the Newton polytope
- f π
- Putting everthing together, the Newton
polytope of πis 2π.
Motzkin Polynomial Newton Polytope
- Motzkin polynomial:
π π¦ = π¦4π§2 + π¦2π§4 β 3π¦2π§2 + 1
0 1 2 3 4 5 6 1 2 3 4 5 6
Motzkin Polynomial Newton Polytope
- If π(π¦) were a sum of squares of polynomials,
their corresponding points would have to be inside the following polytope.
0 1 2 3 4 5 6 1 2 3 4 5 6
Motzkin is not a Sum of Squares
- If π π¦ = π¦4π§2 + π¦2π§4 β 3π¦2π§2 + 1 were a
sum of squares of polynomials, it would have to be a sum of terms of the form ππ¦2π§ + ππ¦π§2 + ππ¦π§ + π
2
- However, no such term has a negative coefficient
- f π¦2π§2. Contradiction.
Showing Polynomials are not SOS
- Is there a more general way to show a
polynomial is not a sum of squares?
- Observation: By definition, if π = Οπ ππ
2 then
for any valid pseudo-expectation values, ΰ·¨ πΉ π = Οπ ΰ·¨ πΉ ππ
2 β₯ 0
- Thus, if we can find pseudo-expectation values
such that ΰ·¨ πΉ π < 0, then π is not a sum of squares of polynomials.
Motzkin is a Rational Function of Sums of Squares
- π π¦ = π¦4π§2 + π¦2π§4 β 3π¦2π§2 + 1
- π¦2 + π§2 + 1 π π¦ = π¦6π§2 + 2π§4π¦4 + π¦2π§6 β
2π¦4π§2 β 2π¦2π§4 β 3π¦2π§2 + π¦2 + π§2 + 1
- This is a sum of squares. The components are:
1. 2
1 2 π¦3π§ + 1 2 π¦π§3 β π¦π§ 2
=
1 2 π¦6π§2 + 2π§4π¦4 + π¦2π§6 β 2π¦4π§2 β 2π¦2π§4 +
2π¦2π§2 2. π¦2π§ β π§
2 = π¦4π§2 β 2π¦2π§2 + π§2
3. π¦π§2 β π¦
2 = π¦2π§4 β 2π¦2π§2 + π¦2
4.
1 2 π¦3π§ β π¦π§ 2 = 1 2 π¦6π§2 β π¦4π§2 + 1 2 π¦2π§2
5.
1 2 π¦π§3 β π¦π§ 2 = 1 2 π¦2π§6 β π¦2π§4 + 1 2 π¦2π§2
6. π¦2π§2 β 1
2 = π¦4π§4 β 2π¦2π§2 + 1
Can SOS use Rational Functions?
- π π¦ = π¦4π§2 + π¦2π§4 β 3π¦2π§2 + 1
- π π¦ =
Οπ ππ
2
π¦2+π§2+1 β₯ 0
- Can the SOS hierarchy use such reasoning?
- Yes and no⦠(see problem set)
References
- [Hil1888] D. Hilbert. Uber die darstellung definiter formen als summe von formen-
- quadraten. Annals of Mathematics 32:342β350, 1888.
- [Mot67] T. Motzkin. The arithmetic-geometric inequality. In Proc. Symposium on
Inequalities p. 205β224, 1967.