[PPT] - Using noncommutative polynomial optimization for matrix PowerPoint Presentation

SLIDE 1

Using noncommutative polynomial optimization for matrix factorization ranks

Sander Gribling (CWI/QuSoft) David de Laat (CWI/QuSoft) Monique Laurent (CWI/Tilburg/QuSoft) SIAM Conference on Optimization, 25 May 2017, Vancouver

SLIDE 2

Symmetric matrix factorization ranks

SLIDE 3

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

SLIDE 4

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d;

SLIDE 5

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute;

SLIDE 6

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n

SLIDE 7

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

SLIDE 8

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d;

SLIDE 9

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute;

SLIDE 10

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

SLIDE 11

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj)

SLIDE 12

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d;

SLIDE 13

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

SLIDE 14

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017]

SLIDE 15

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017]

SLIDE 16

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017] CP matrices ⊆ CPSD matrices ⊆ PSD matrices

SLIDE 17

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

+ 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017] CP matrices ⊆ CPSD matrices ⊆ PSD matrices Goal: Find lower bounds for matrix factorization ranks

SLIDE 18

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

SLIDE 19

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

SLIDE 20

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t)

SLIDE 21

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t) ◮ If p is a “synchronous quantum correlation”, then Ap is CPSD

SLIDE 22

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t) ◮ If p is a “synchronous quantum correlation”, then Ap is CPSD ◮ The smallest dimension to realize it is cpsd-rank(Ap)

SLIDE 23

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t) ◮ If p is a “synchronous quantum correlation”, then Ap is CPSD ◮ The smallest dimension to realize it is cpsd-rank(Ap) ◮ Combine proofs from above refs and

[Paulsen–Severini–Stahlke–Todorov–Winter 2016]

SLIDE 24

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

SLIDE 25

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn]

SLIDE 26

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S

SLIDE 27

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

SLIDE 28

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

SLIDE 29

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

SLIDE 30

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn

SLIDE 31

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn ◮ We can evaluate a noncommutative polynomial at a tuple

X = (X1, . . . , Xn) of matrices

SLIDE 32

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn ◮ We can evaluate a noncommutative polynomial at a tuple

X = (X1, . . . , Xn) of matrices

◮ inf{tr(f (X)) : d ∈ N, X1, . . . , Xn ∈ Hd, g(X) 0 for g ∈ S}

SLIDE 33

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn ◮ We can evaluate a noncommutative polynomial at a tuple

X = (X1, . . . , Xn) of matrices

◮ inf{tr(f (X)) : d ∈ N, X1, . . . , Xn ∈ Hd, g(X) 0 for g ∈ S}

Commutative polynomial optimization is used by Nie for testing membership in the CP cone and computing tensor nuclear norms

SLIDE 34

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A)

SLIDE 35

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj)

SLIDE 36

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars

SLIDE 37

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn)))

SLIDE 38

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A)

SLIDE 39

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX

SLIDE 40

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp)

SLIDE 41

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp) Positive: LX(p∗p) ≥ 0

SLIDE 42

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp) Positive: LX(p∗p) ≥ 0 Linear conditions: LX(xixj) = Aij

SLIDE 43

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp) Positive: LX(p∗p) ≥ 0 Linear conditions: LX(xixj) = Aij Localizing conditions: LX(p∗(√Aiixi − x2

i )p) ≥ 0

SLIDE 44

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t

SLIDE 45

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn

SLIDE 46

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx}

SLIDE 47

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

SLIDE 48

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

ξcpsd

t

(A) = min

L(1) : L ∈ Rx1, . . . , xn∗

2t tracial and symmetric,

(L(xixj)) = A, L ≥ 0

n

M2t

Aiixi − x2

i : i ∈ [n]

SLIDE 49

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

ξcpsd

t

(A) = min

L(1) : L ∈ Rx1, . . . , xn∗

2t tracial and symmetric,

(L(xixj)) = A, L ≥ 0

n

M2t

Aiixi − x2

i : i ∈ [n]

ξcpsd

1

(A) ≤ . . . ≤ ξcpsd

∞

(A) ≤ ξcpsd

∗

(A) ≤ cpsd-rank(A)

SLIDE 50

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

ξcpsd

t

(A) = min

L(1) : L ∈ Rx1, . . . , xn∗

2t tracial and symmetric,

(L(xixj)) = A, L ≥ 0

n

M2t

Aiixi − x2

i : i ∈ [n]

ξcpsd

1

(A) ≤ . . . ≤ ξcpsd

∞

(A) ≤ ξcpsd

∗

(A) ≤ cpsd-rank(A) ξcpsd

∗

(A) is ξcpsd

∞

(A) with the extra constraint rank(M(L)) < ∞

SLIDE 51

ξcpsd

∞ (A) and ξcpsd ∗

(A)

◮ We have ξcpsd t

(A) → ξcpsd

∞

(A), and if ξcpsd

t

(A) admits a flat

ptimal solution, then ξcpsd

t

(A) = ξcpsd

∗

(A)

SLIDE 52

ξcpsd

∞ (A) and ξcpsd ∗

(A)

◮ We have ξcpsd t

(A) → ξcpsd

∞

(A), and if ξcpsd

t

(A) admits a flat

ptimal solution, then ξcpsd

t

(A) = ξcpsd

∗

(A)

◮ ξcpsd ∗

(A) is the minimum of L(1) over all conic combinations L

f trace evaluations at elements of the matrix positivity

domain of {√Aiixi − x2

i : i ∈ [n]} such that A = (L(xixj))

SLIDE 53

ξcpsd

∞ (A) and ξcpsd ∗

(A)

◮ We have ξcpsd t

(A) → ξcpsd

∞

(A), and if ξcpsd

t

(A) admits a flat

ptimal solution, then ξcpsd

t

(A) = ξcpsd

∗

(A)

◮ ξcpsd ∗

(A) is the minimum of L(1) over all conic combinations L

f trace evaluations at elements of the matrix positivity

domain of {√Aiixi − x2

i : i ∈ [n]} such that A = (L(xixj))

ξcpsd

∗

(A) = inf

M
m=1

dm · max

i∈[n]

X m

i 2

Aii : M ∈ N, d1, . . . , dM ∈ N, X m

i

∈ Hdm

+ for i ∈ [n], m ∈ [M],

A = Gram

M
m=1

X m

1 , . . . , M

m=1

X m

n

.

SLIDE 54

Lower bound [Prakash–Sikora–Varvitsiotis–Wei 2016]: n

i=1

√Aii 2 n

i,j=1 Aij

≤ cpsd-rank(A)

SLIDE 55

Lower bound [Prakash–Sikora–Varvitsiotis–Wei 2016]: n

i=1

√Aii 2 n

i,j=1 Aij

≤ cpsd-rank(A) We have ξcpsd

1

(A) ≥ n

i=1

√Aii 2 n

i,j=1 Aij

SLIDE 56

Lower bound [Prakash–Sikora–Varvitsiotis–Wei 2016]: n

i=1

√Aii 2 n

i,j=1 Aij

≤ cpsd-rank(A) We have ξcpsd

1

(A) ≥ n

i=1

√Aii 2 n

i,j=1 Aij

Sharp for the matrix A ∈ R5×5 given by Aij = cos

4π/5(i − j)

2

SLIDE 57

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj)

SLIDE 58

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

n
i=1

viXi 2

SLIDE 59

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

n
i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module

SLIDE 60

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

n
i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

SLIDE 61

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

n
i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

Example: A =       1 1/2 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1/2 1       ξcpsd

1

(A) = ξcpsd

∗

(A) = 5 2

SLIDE 62

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

n
i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

Example: A =       1 1/2 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1/2 1       ξcpsd

1

(A) = ξcpsd

∗

(A) = 5 2, V = ei + ej √ 2 : i, j ∈ [5]

SLIDE 63

Extra constraints to go beyond ξcpsd

∗

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

n
i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

Example: A =       1 1/2 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1/2 1       ξcpsd

1

(A) = ξcpsd

∗

(A) = 5 2, V = ei + ej √ 2 : i, j ∈ [5]

, ξcpsd

2,V (A) = 10

3

SLIDE 64

The completely positive rank (cp-rank)

Fawzi and Parrilo (2014) give this SDP to lower bound cp-rank(A):

τ sos

cp (A) = inf

α : α ∈ R, X ∈ Rn2×n2,
α

vec(A)T vec(A) X

0,

X(i,j),(i,j) ≤ A2

ij

for 1 ≤ i, j ≤ n, X(i,j),(k,l) = X(i,l),(k,j) for 1 ≤ i < k ≤ n, 1 ≤ j < l ≤ n, X A ⊗ A

.

SLIDE 65

The completely positive rank (cp-rank)

Fawzi and Parrilo (2014) give this SDP to lower bound cp-rank(A):

τ sos

cp (A) = inf

α : α ∈ R, X ∈ Rn2×n2,
α

vec(A)T vec(A) X

0,

X(i,j),(i,j) ≤ A2

ij

for 1 ≤ i, j ≤ n, X(i,j),(k,l) = X(i,l),(k,j) for 1 ≤ i < k ≤ n, 1 ≤ j < l ≤ n, X A ⊗ A

.

They derive τ sos

cp (A) as an SDP relaxation of

τcp(A) = min

α : α > 0, 1

αA ∈ conv

R ∈ Sn : 0 ≤ R ≤ A, R A, rank(R) ≤ 1

SLIDE 66

The completely positive rank (cp-rank)

Fawzi and Parrilo (2014) give this SDP to lower bound cp-rank(A):

τ sos

cp (A) = inf

α : α ∈ R, X ∈ Rn2×n2,
α

vec(A)T vec(A) X

0,

X(i,j),(i,j) ≤ A2

ij

for 1 ≤ i, j ≤ n, X(i,j),(k,l) = X(i,l),(k,j) for 1 ≤ i < k ≤ n, 1 ≤ j < l ≤ n, X A ⊗ A

.

They derive τ sos

cp (A) as an SDP relaxation of

τcp(A) = min

α : α > 0, 1

αA ∈ conv

R ∈ Sn : 0 ≤ R ≤ A, R A, rank(R) ≤ 1
τcp(A) is at least the rank of A and the fractional edge-clique

cover number of the support graph of A

SLIDE 67

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

SLIDE 68

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi)

SLIDE 69

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank

SLIDE 70

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t}

SLIDE 71

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t} S = {√Aiixi − x2

i } ∪ {Aij − xixj : 1 ≤ i < j ≤ n}

SLIDE 72

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t} S = {√Aiixi − x2

i } ∪ {Aij − xixj : 1 ≤ i < j ≤ n}

ξcp

t (A) = min

L(1) : L ∈ R[x1, . . . , xn]∗

2t,

(L(xixj)) = A, L ≥ 0

n

M2t(S)

SLIDE 73

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t} S = {√Aiixi − x2

i } ∪ {Aij − xixj : 1 ≤ i < j ≤ n}

ξcp

t (A) = min

L(1) : L ∈ R[x1, . . . , xn]∗

2t,

(L(xixj)) = A, L ≥ 0

n

M2t(S)

ξcp

1 (A) ≤ . . . ≤ ξcp ∞(A) = ξcp ∗ (A) ≤ cp-rank(A)

SLIDE 74

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

SLIDE 75

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

We have ξcp

∗,Sn−1(A) = τcp(A)

SLIDE 76

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

We have ξcp

∗,Sn−1(A) = τcp(A)

Let V1 ⊆ V2 ⊆ . . . ⊆ Sn−1 be finite subsets such that

k Vk is

dense in Sn−1 We have ξcp

∗,Vk(A) → ξcp ∗,Sn−1(A) as k → ∞

SLIDE 77

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

We have ξcp

∗,Sn−1(A) = τcp(A)

Let V1 ⊆ V2 ⊆ . . . ⊆ Sn−1 be finite subsets such that

k Vk is

dense in Sn−1 We have ξcp

∗,Vk(A) → ξcp ∗,Sn−1(A) as k → ∞

This gives a (doubly indexed) sequence of finite semidefinite programs converging asymptotically to τcp(A)

SLIDE 78

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A):

SLIDE 79

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints

SLIDE 80

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints ◮ Add the tensor constraint X A ⊗ A from τ sos cp (A):

(L(ww′))w,w′∈x=l A⊗l for 2 ≤ l ≤ t

SLIDE 81

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints ◮ Add the tensor constraint X A ⊗ A from τ sos cp (A):

(L(ww′))w,w′∈x=l A⊗l for 2 ≤ l ≤ t

◮ Implement this constraint more efficiently by exploiting

symmetry: (L(mm′))m,m′∈[x]=l QlA⊗lQT

l

for 2 ≤ l ≤ t

SLIDE 82

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints ◮ Add the tensor constraint X A ⊗ A from τ sos cp (A):

(L(ww′))w,w′∈x=l A⊗l for 2 ≤ l ≤ t

◮ Implement this constraint more efficiently by exploiting

symmetry: (L(mm′))m,m′∈[x]=l QlA⊗lQT

l

for 2 ≤ l ≤ t Then ξcp

2,+(A) is a more efficient strengthening of τ sos cp (A)

SLIDE 83

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

SLIDE 84

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs

SLIDE 85

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

SLIDE 86

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

For A ∈ Rm×n

+

there are positive semidefinite diagonal matrices X1, . . . , Xm+n with Aij = Tr(XiXm+j) and λmax(Xi)2 ≤ maxi,j Aij

SLIDE 87

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

For A ∈ Rm×n

+

there are positive semidefinite diagonal matrices X1, . . . , Xm+n with Aij = Tr(XiXm+j) and λmax(Xi)2 ≤ maxi,j Aij We can use this to adapt the above techniques to give a hiearchy ξ+

1 (A) ≤ . . . ≤ ξ+ ∞(A) = ξ+ ∗ (A) = τ+(A) ≤ rank+(A).

SLIDE 88

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

For A ∈ Rm×n

+

there are positive semidefinite diagonal matrices X1, . . . , Xm+n with Aij = Tr(XiXm+j) and λmax(Xi)2 ≤ maxi,j Aij We can use this to adapt the above techniques to give a hiearchy ξ+

1 (A) ≤ . . . ≤ ξ+ ∞(A) = ξ+ ∗ (A) = τ+(A) ≤ rank+(A).

Going back to tracial optimization we can adapt this to the psd-rank – still work in progress

SLIDE 89

Nested rectangle problem [Fawzi–Parrilo, 2016]:

−a a −b b

−1 −1 1 1

SLIDE 90

Nested rectangle problem [Fawzi–Parrilo, 2016]:

−a a −b b

−1 −1 1 1

Such a triangle exists if and only if rank+



   1 − a 1 + a 1 + a 1 − a 1 + a 1 − a 1 − a 1 + a 1 − b 1 − b 1 + b 1 + b 1 + b 1 + b 1 − b 1 − b    

≤ 3

SLIDE 91

Nested rectangle problem [Fawzi–Parrilo, 2016]:

−a a −b b

−1 −1 1 1

Such a triangle exists if and only if rank+



   1 − a 1 + a 1 + a 1 − a 1 + a 1 − a 1 − a 1 + a 1 − b 1 − b 1 + b 1 + b 1 + b 1 + b 1 − b 1 − b    

≤ 3

In fact, such a triangle exists if and only if (1 + a)(1 + b) ≤ 2

SLIDE 92

Nested rectangle problem [Fawzi–Parrilo, 2016]:

a b

ξ+

2,+ = τ sos +

ξ+

3,+

0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1

SLIDE 93