Using noncommutative polynomial optimization for matrix - - PowerPoint PPT Presentation

using noncommutative polynomial optimization for matrix
SMART_READER_LITE
LIVE PREVIEW

Using noncommutative polynomial optimization for matrix - - PowerPoint PPT Presentation

Using noncommutative polynomial optimization for matrix factorization ranks Sander Gribling (CWI/QuSoft) David de Laat (CWI/QuSoft) Monique Laurent (CWI/Tilburg/QuSoft) SIAM Conference on Optimization, 25 May 2017, Vancouver Symmetric matrix


slide-1
SLIDE 1

Using noncommutative polynomial optimization for matrix factorization ranks

Sander Gribling (CWI/QuSoft) David de Laat (CWI/QuSoft) Monique Laurent (CWI/Tilburg/QuSoft) SIAM Conference on Optimization, 25 May 2017, Vancouver

slide-2
SLIDE 2

Symmetric matrix factorization ranks

slide-3
SLIDE 3

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

slide-4
SLIDE 4

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d;

slide-5
SLIDE 5

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute;

slide-6
SLIDE 6

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n

slide-7
SLIDE 7

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

slide-8
SLIDE 8

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d;

slide-9
SLIDE 9

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute;

slide-10
SLIDE 10

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1
slide-11
SLIDE 11

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj)

slide-12
SLIDE 12

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d;

slide-13
SLIDE 13

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

slide-14
SLIDE 14

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017]

slide-15
SLIDE 15

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017]

slide-16
SLIDE 16

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017] CP matrices ⊆ CPSD matrices ⊆ PSD matrices

slide-17
SLIDE 17

Symmetric matrix factorization ranks

PSD matrices A ∈ Rn×n is PSD if there are a1, . . . , an ∈ Rd with Aij = aT

i aj

rank(A) = smallest possible d; Easy to compute; d ≤ n CP matrices A ∈ Rn×n is CP if there are a1, . . . , an ∈ Rd

+ with Aij = aT i aj

cp-rank(A) = smallest possible d; Hard to compute; If A is CP, then d ≤ n+1

2

  • + 1

CPSD matrices A ∈ Rn×n is CPSD if there are are Hermitian PSD matrices X1, . . . , Xn ∈ Cd×d with Aij = Tr(XiXj) cpsd-rank(A) = smallest possible d; Hard to compute;

There is no upper bound on d depending only on n [Slofstra, 2017] CP matrices ⊆ CPSD matrices ⊆ PSD matrices Goal: Find lower bounds for matrix factorization ranks

slide-18
SLIDE 18

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

slide-19
SLIDE 19

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

slide-20
SLIDE 20

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t)

slide-21
SLIDE 21

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t) ◮ If p is a “synchronous quantum correlation”, then Ap is CPSD

slide-22
SLIDE 22

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t) ◮ If p is a “synchronous quantum correlation”, then Ap is CPSD ◮ The smallest dimension to realize it is cpsd-rank(Ap)

slide-23
SLIDE 23

Connection to quantum information theory

◮ CPSD cone was studied by Piovesan and Laurent in relation

to quantum graph parameters

◮ Connections to entanglement dimensions of bipartite quantum

correlations p(a, b|s, t) [Sikora–Varvitsiotis 2015], [Manˇ cinska–Roberson 2014]

◮ Corresponding matrix (Ap)(s,a),(t,b) = p(a, b|s, t) ◮ If p is a “synchronous quantum correlation”, then Ap is CPSD ◮ The smallest dimension to realize it is cpsd-rank(Ap) ◮ Combine proofs from above refs and

[Paulsen–Severini–Stahlke–Todorov–Winter 2016]

slide-24
SLIDE 24

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

slide-25
SLIDE 25

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn]

slide-26
SLIDE 26

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
slide-27
SLIDE 27

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

slide-28
SLIDE 28

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

slide-29
SLIDE 29

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

  • ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):
slide-30
SLIDE 30

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

  • ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn

slide-31
SLIDE 31

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

  • ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn ◮ We can evaluate a noncommutative polynomial at a tuple

X = (X1, . . . , Xn) of matrices

slide-32
SLIDE 32

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

  • ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn ◮ We can evaluate a noncommutative polynomial at a tuple

X = (X1, . . . , Xn) of matrices

◮ inf{tr(f (X)) : d ∈ N, X1, . . . , Xn ∈ Hd, g(X) 0 for g ∈ S}

slide-33
SLIDE 33

Polynomial optimization

Commutative polynomial optimization (Lasserre, Parrilo, ...):

◮ Let S ∪ {f } ⊆ R[x1, . . . , xn] ◮ inf

  • f (x) : x ∈ Rn, g(x) ≥ 0 for g ∈ S
  • ◮ Hierarchy of semidefinite programming lower bounds based on

moments (primal) and sums of squares (dual)

◮ Asymptotic convergence under technical condition

Eigenvalue optimization (Ac´ ın, Navascues, Pironio, ...) and tracial

  • ptimization (Burgdorf, Cafuta, Klep, Povh, Schweighofer, ...):

◮ Let S ∪ {f } ⊆ Rx1, . . . , xn ◮ We can evaluate a noncommutative polynomial at a tuple

X = (X1, . . . , Xn) of matrices

◮ inf{tr(f (X)) : d ∈ N, X1, . . . , Xn ∈ Hd, g(X) 0 for g ∈ S}

Commutative polynomial optimization is used by Nie for testing membership in the CP cone and computing tensor nuclear norms

slide-34
SLIDE 34

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A)

slide-35
SLIDE 35

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj)

slide-36
SLIDE 36

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars

slide-37
SLIDE 37

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn)))

slide-38
SLIDE 38

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A)

slide-39
SLIDE 39

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX

slide-40
SLIDE 40

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp)

slide-41
SLIDE 41

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp) Positive: LX(p∗p) ≥ 0

slide-42
SLIDE 42

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp) Positive: LX(p∗p) ≥ 0 Linear conditions: LX(xixj) = Aij

slide-43
SLIDE 43

Lower bounding the cpsd-rank using tracial optimization

Let A ∈ Rn×n be a CPSD matrix and set d = cpsd-rank(A) X1, . . . , Xn ∈ Cd×d Hermitian PSD matrices with Aij = Tr(XiXj) Rx1, . . . , xn: ∗-algebra of noncommutative polynomials in n vars Define a linear form LX ∈ Rx1, . . . , xn∗ by LX(p) = Re(Tr(p(X1, . . . , Xn))) We have LX(1) = Re(Tr(Id)) = d = cpsd-rank(A) We obtain a relaxation by minimizing L(1) over all linear forms L that satisfy some computationally tractable properties of LX Symmetric and tracial: LX(p∗) = LX(p) and LX(pq) = LX(qp) Positive: LX(p∗p) ≥ 0 Linear conditions: LX(xixj) = Aij Localizing conditions: LX(p∗(√Aiixi − x2

i )p) ≥ 0

slide-44
SLIDE 44

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t

slide-45
SLIDE 45

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn

slide-46
SLIDE 46

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx}

slide-47
SLIDE 47

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

slide-48
SLIDE 48

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

ξcpsd

t

(A) = min

  • L(1) : L ∈ Rx1, . . . , xn∗

2t tracial and symmetric,

(L(xixj)) = A, L ≥ 0

  • n

M2t

  • Aiixi − x2

i : i ∈ [n]

slide-49
SLIDE 49

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

ξcpsd

t

(A) = min

  • L(1) : L ∈ Rx1, . . . , xn∗

2t tracial and symmetric,

(L(xixj)) = A, L ≥ 0

  • n

M2t

  • Aiixi − x2

i : i ∈ [n]

  • ξcpsd

1

(A) ≤ . . . ≤ ξcpsd

(A) ≤ ξcpsd

(A) ≤ cpsd-rank(A)

slide-50
SLIDE 50

Truncate to obtain a semidefinite programming hierarchy

Rx1, . . . , xn2t noncommututative polynomials with deg ≤ 2t Let S ⊆ Rx = Rx1, . . . , xn Quadratic module: M(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx} Truncated quadratic module: M2t(S) = cone{p∗gp : g ∈ S ∪ {1}, p ∈ Rx, deg(p∗gp) ≤ 2t}

ξcpsd

t

(A) = min

  • L(1) : L ∈ Rx1, . . . , xn∗

2t tracial and symmetric,

(L(xixj)) = A, L ≥ 0

  • n

M2t

  • Aiixi − x2

i : i ∈ [n]

  • ξcpsd

1

(A) ≤ . . . ≤ ξcpsd

(A) ≤ ξcpsd

(A) ≤ cpsd-rank(A) ξcpsd

(A) is ξcpsd

(A) with the extra constraint rank(M(L)) < ∞

slide-51
SLIDE 51

ξcpsd

∞ (A) and ξcpsd ∗

(A)

◮ We have ξcpsd t

(A) → ξcpsd

(A), and if ξcpsd

t

(A) admits a flat

  • ptimal solution, then ξcpsd

t

(A) = ξcpsd

(A)

slide-52
SLIDE 52

ξcpsd

∞ (A) and ξcpsd ∗

(A)

◮ We have ξcpsd t

(A) → ξcpsd

(A), and if ξcpsd

t

(A) admits a flat

  • ptimal solution, then ξcpsd

t

(A) = ξcpsd

(A)

◮ ξcpsd ∗

(A) is the minimum of L(1) over all conic combinations L

  • f trace evaluations at elements of the matrix positivity

domain of {√Aiixi − x2

i : i ∈ [n]} such that A = (L(xixj))

slide-53
SLIDE 53

ξcpsd

∞ (A) and ξcpsd ∗

(A)

◮ We have ξcpsd t

(A) → ξcpsd

(A), and if ξcpsd

t

(A) admits a flat

  • ptimal solution, then ξcpsd

t

(A) = ξcpsd

(A)

◮ ξcpsd ∗

(A) is the minimum of L(1) over all conic combinations L

  • f trace evaluations at elements of the matrix positivity

domain of {√Aiixi − x2

i : i ∈ [n]} such that A = (L(xixj))

ξcpsd

(A) = inf

  • M
  • m=1

dm · max

i∈[n]

X m

i 2

Aii : M ∈ N, d1, . . . , dM ∈ N, X m

i

∈ Hdm

+ for i ∈ [n], m ∈ [M],

A = Gram

  • M
  • m=1

X m

1 , . . . , M

  • m=1

X m

n

  • .
slide-54
SLIDE 54

Lower bound [Prakash–Sikora–Varvitsiotis–Wei 2016]: n

i=1

√Aii 2 n

i,j=1 Aij

≤ cpsd-rank(A)

slide-55
SLIDE 55

Lower bound [Prakash–Sikora–Varvitsiotis–Wei 2016]: n

i=1

√Aii 2 n

i,j=1 Aij

≤ cpsd-rank(A) We have ξcpsd

1

(A) ≥ n

i=1

√Aii 2 n

i,j=1 Aij

slide-56
SLIDE 56

Lower bound [Prakash–Sikora–Varvitsiotis–Wei 2016]: n

i=1

√Aii 2 n

i,j=1 Aij

≤ cpsd-rank(A) We have ξcpsd

1

(A) ≥ n

i=1

√Aii 2 n

i,j=1 Aij

Sharp for the matrix A ∈ R5×5 given by Aij = cos

  • 4π/5(i − j)

2

slide-57
SLIDE 57

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj)

slide-58
SLIDE 58

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

  • n
  • i=1

viXi 2

slide-59
SLIDE 59

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

  • n
  • i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module

slide-60
SLIDE 60

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

  • n
  • i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

slide-61
SLIDE 61

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

  • n
  • i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

Example: A =       1 1/2 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1/2 1       ξcpsd

1

(A) = ξcpsd

(A) = 5 2

slide-62
SLIDE 62

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

  • n
  • i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

Example: A =       1 1/2 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1/2 1       ξcpsd

1

(A) = ξcpsd

(A) = 5 2, V = ei + ej √ 2 : i, j ∈ [5]

slide-63
SLIDE 63

Extra constraints to go beyond ξcpsd

(A)

Let X1, . . . , Xn be Hermitian PSD matrices s.t. Aij = Tr(XiXj) For each v ∈ Rn, the following matrix is psd: vTAvI −

  • n
  • i=1

viXi 2 We can use this to add additional constraints to ξcpsd

t

(A) by extending the quadratic module For a subset V ⊆ Sn−1 we have the stronger bound ξcpsd

t,V (A)

Example: A =       1 1/2 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1 1/2 1/2 1/2 1       ξcpsd

1

(A) = ξcpsd

(A) = 5 2, V = ei + ej √ 2 : i, j ∈ [5]

  • , ξcpsd

2,V (A) = 10

3

slide-64
SLIDE 64

The completely positive rank (cp-rank)

Fawzi and Parrilo (2014) give this SDP to lower bound cp-rank(A):

τ sos

cp (A) = inf

  • α : α ∈ R, X ∈ Rn2×n2,
  • α

vec(A)T vec(A) X

  • 0,

X(i,j),(i,j) ≤ A2

ij

for 1 ≤ i, j ≤ n, X(i,j),(k,l) = X(i,l),(k,j) for 1 ≤ i < k ≤ n, 1 ≤ j < l ≤ n, X A ⊗ A

  • .
slide-65
SLIDE 65

The completely positive rank (cp-rank)

Fawzi and Parrilo (2014) give this SDP to lower bound cp-rank(A):

τ sos

cp (A) = inf

  • α : α ∈ R, X ∈ Rn2×n2,
  • α

vec(A)T vec(A) X

  • 0,

X(i,j),(i,j) ≤ A2

ij

for 1 ≤ i, j ≤ n, X(i,j),(k,l) = X(i,l),(k,j) for 1 ≤ i < k ≤ n, 1 ≤ j < l ≤ n, X A ⊗ A

  • .

They derive τ sos

cp (A) as an SDP relaxation of

τcp(A) = min

  • α : α > 0, 1

αA ∈ conv

  • R ∈ Sn : 0 ≤ R ≤ A, R A, rank(R) ≤ 1
slide-66
SLIDE 66

The completely positive rank (cp-rank)

Fawzi and Parrilo (2014) give this SDP to lower bound cp-rank(A):

τ sos

cp (A) = inf

  • α : α ∈ R, X ∈ Rn2×n2,
  • α

vec(A)T vec(A) X

  • 0,

X(i,j),(i,j) ≤ A2

ij

for 1 ≤ i, j ≤ n, X(i,j),(k,l) = X(i,l),(k,j) for 1 ≤ i < k ≤ n, 1 ≤ j < l ≤ n, X A ⊗ A

  • .

They derive τ sos

cp (A) as an SDP relaxation of

τcp(A) = min

  • α : α > 0, 1

αA ∈ conv

  • R ∈ Sn : 0 ≤ R ≤ A, R A, rank(R) ≤ 1
  • τcp(A) is at least the rank of A and the fractional edge-clique

cover number of the support graph of A

slide-67
SLIDE 67

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

slide-68
SLIDE 68

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi)

slide-69
SLIDE 69

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank

slide-70
SLIDE 70

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t}

slide-71
SLIDE 71

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t} S = {√Aiixi − x2

i } ∪ {Aij − xixj : 1 ≤ i < j ≤ n}

slide-72
SLIDE 72

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t} S = {√Aiixi − x2

i } ∪ {Aij − xixj : 1 ≤ i < j ≤ n}

ξcp

t (A) = min

  • L(1) : L ∈ R[x1, . . . , xn]∗

2t,

(L(xixj)) = A, L ≥ 0

  • n

M2t(S)

slide-73
SLIDE 73

Adapting our hierarchy for the cp-rank

Suppose Aij = vT

i vj for v1, . . . , vn ∈ Rd +

Then, Aij = Tr(XiXj) for diagonal PSD matrices Xi = Diag(vi) Use ideas for cpsd-rank to derive a hierarchy for cp-rank M2t(S) = cone{gp2 : g ∈ S ∪ {1}, p ∈ R[x], deg(gp2) ≤ 2t} S = {√Aiixi − x2

i } ∪ {Aij − xixj : 1 ≤ i < j ≤ n}

ξcp

t (A) = min

  • L(1) : L ∈ R[x1, . . . , xn]∗

2t,

(L(xixj)) = A, L ≥ 0

  • n

M2t(S)

  • ξcp

1 (A) ≤ . . . ≤ ξcp ∞(A) = ξcp ∗ (A) ≤ cp-rank(A)

slide-74
SLIDE 74

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

slide-75
SLIDE 75

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

We have ξcp

∗,Sn−1(A) = τcp(A)

slide-76
SLIDE 76

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

We have ξcp

∗,Sn−1(A) = τcp(A)

Let V1 ⊆ V2 ⊆ . . . ⊆ Sn−1 be finite subsets such that

k Vk is

dense in Sn−1 We have ξcp

∗,Vk(A) → ξcp ∗,Sn−1(A) as k → ∞

slide-77
SLIDE 77

Extra constraints for the cp-rank

As in the cpsd-rank case we can add extra constraints for a set V ⊆ Sn−1 giving the stronger bound ξcp

t,V (A)

We have ξcp

∗,Sn−1(A) = τcp(A)

Let V1 ⊆ V2 ⊆ . . . ⊆ Sn−1 be finite subsets such that

k Vk is

dense in Sn−1 We have ξcp

∗,Vk(A) → ξcp ∗,Sn−1(A) as k → ∞

This gives a (doubly indexed) sequence of finite semidefinite programs converging asymptotically to τcp(A)

slide-78
SLIDE 78

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A):

slide-79
SLIDE 79

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints

slide-80
SLIDE 80

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints ◮ Add the tensor constraint X A ⊗ A from τ sos cp (A):

(L(ww′))w,w′∈x=l A⊗l for 2 ≤ l ≤ t

slide-81
SLIDE 81

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints ◮ Add the tensor constraint X A ⊗ A from τ sos cp (A):

(L(ww′))w,w′∈x=l A⊗l for 2 ≤ l ≤ t

◮ Implement this constraint more efficiently by exploiting

symmetry: (L(mm′))m,m′∈[x]=l QlA⊗lQT

l

for 2 ≤ l ≤ t

slide-82
SLIDE 82

More efficient tensor constraints

Let ξcp

t,+(A) be the following strengthening of ξcp t (A): ◮ Add entrywise nonnegativity constraints ◮ Add the tensor constraint X A ⊗ A from τ sos cp (A):

(L(ww′))w,w′∈x=l A⊗l for 2 ≤ l ≤ t

◮ Implement this constraint more efficiently by exploiting

symmetry: (L(mm′))m,m′∈[x]=l QlA⊗lQT

l

for 2 ≤ l ≤ t Then ξcp

2,+(A) is a more efficient strengthening of τ sos cp (A)

slide-83
SLIDE 83

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

slide-84
SLIDE 84

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs

slide-85
SLIDE 85

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

slide-86
SLIDE 86

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

For A ∈ Rm×n

+

there are positive semidefinite diagonal matrices X1, . . . , Xm+n with Aij = Tr(XiXm+j) and λmax(Xi)2 ≤ maxi,j Aij

slide-87
SLIDE 87

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

For A ∈ Rm×n

+

there are positive semidefinite diagonal matrices X1, . . . , Xm+n with Aij = Tr(XiXm+j) and λmax(Xi)2 ≤ maxi,j Aij We can use this to adapt the above techniques to give a hiearchy ξ+

1 (A) ≤ . . . ≤ ξ+ ∞(A) = ξ+ ∗ (A) = τ+(A) ≤ rank+(A).

slide-88
SLIDE 88

The nonnegative rank

The nonnegative rank rank+(A) is the smallest d for which there are vectors u1, . . . , un, v1, . . . , vn ∈ Rd

+ such that Aij = uT i vj

Relevant for the extension complexity of linear programs Fawzi and Parrilo (2014) define relaxations τ sos

+ (A) ≤ τ+(A) ≤ rank+(A)

For A ∈ Rm×n

+

there are positive semidefinite diagonal matrices X1, . . . , Xm+n with Aij = Tr(XiXm+j) and λmax(Xi)2 ≤ maxi,j Aij We can use this to adapt the above techniques to give a hiearchy ξ+

1 (A) ≤ . . . ≤ ξ+ ∞(A) = ξ+ ∗ (A) = τ+(A) ≤ rank+(A).

Going back to tracial optimization we can adapt this to the psd-rank – still work in progress

slide-89
SLIDE 89

Nested rectangle problem [Fawzi–Parrilo, 2016]:

−a a −b b

−1 −1 1 1

slide-90
SLIDE 90

Nested rectangle problem [Fawzi–Parrilo, 2016]:

−a a −b b

−1 −1 1 1

Such a triangle exists if and only if rank+

   1 − a 1 + a 1 + a 1 − a 1 + a 1 − a 1 − a 1 + a 1 − b 1 − b 1 + b 1 + b 1 + b 1 + b 1 − b 1 − b    

  • ≤ 3
slide-91
SLIDE 91

Nested rectangle problem [Fawzi–Parrilo, 2016]:

−a a −b b

−1 −1 1 1

Such a triangle exists if and only if rank+

   1 − a 1 + a 1 + a 1 − a 1 + a 1 − a 1 − a 1 + a 1 − b 1 − b 1 + b 1 + b 1 + b 1 + b 1 − b 1 − b    

  • ≤ 3

In fact, such a triangle exists if and only if (1 + a)(1 + b) ≤ 2

slide-92
SLIDE 92

Nested rectangle problem [Fawzi–Parrilo, 2016]:

a b

ξ+

2,+ = τ sos +

ξ+

3,+

0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1

slide-93
SLIDE 93

Thank you!