Singular Values of Tensors Harm Derksen University of Michigan - - PowerPoint PPT Presentation

singular values of tensors
SMART_READER_LITE
LIVE PREVIEW

Singular Values of Tensors Harm Derksen University of Michigan - - PowerPoint PPT Presentation

Singular Values of Tensors Harm Derksen University of Michigan CUNY/NYU February 15, 2019 Harm Derksen Singular Values of Tensors Tensor Rank F a field, F = R or F = C V ( i ) = F n i for i = 1 , 2 , . . . , d V = V (1) V (2)


slide-1
SLIDE 1

Singular Values of Tensors

Harm Derksen

University of Michigan

CUNY/NYU February 15, 2019

Harm Derksen Singular Values of Tensors

slide-2
SLIDE 2

Tensor Rank

F a field, F = R or F = C V (i) ∼ = Fni for i = 1, 2, . . . , d V = V (1) ⊗ V (2) ⊗ · · · ⊗ V (d) ∼ = Fn1×···×nd tensor product space

Definition

A simple tensor is a tensor of the form v(1) ⊗ v(2) ⊗ · · · ⊗ v(d) (v(i) ∈ V (i)).

Harm Derksen Singular Values of Tensors

slide-3
SLIDE 3

Tensor Rank

F a field, F = R or F = C V (i) ∼ = Fni for i = 1, 2, . . . , d V = V (1) ⊗ V (2) ⊗ · · · ⊗ V (d) ∼ = Fn1×···×nd tensor product space

Definition

A simple tensor is a tensor of the form v(1) ⊗ v(2) ⊗ · · · ⊗ v(d) (v(i) ∈ V (i)).

Definition (tensor rank)

The rank of T is the smallest positive integer r such that T is the sum of r simple tensors.

Harm Derksen Singular Values of Tensors

slide-4
SLIDE 4

Matrix Multiplication Tensor

Md =

d

  • i=1

d

  • j=1

d

  • k=1

ei,j ⊗ ej,k ⊗ ek,i ∈ Fn×n ⊗ Fn×n ⊗ Fn×n clearly rank(Md) ≤ d3

Harm Derksen Singular Values of Tensors

slide-5
SLIDE 5

Matrix Multiplication Tensor

Md =

d

  • i=1

d

  • j=1

d

  • k=1

ei,j ⊗ ej,k ⊗ ek,i ∈ Fn×n ⊗ Fn×n ⊗ Fn×n clearly rank(Md) ≤ d3

Theorem (Strassen)

if rank(Md) = s then complexity of n × n matrix multiplication is O(nlogd(s)) (the standard algorithm is O(n3))

Harm Derksen Singular Values of Tensors

slide-6
SLIDE 6

Matrix Multiplication Tensor

Md =

d

  • i=1

d

  • j=1

d

  • k=1

ei,j ⊗ ej,k ⊗ ek,i ∈ Fn×n ⊗ Fn×n ⊗ Fn×n clearly rank(Md) ≤ d3

Theorem (Strassen)

if rank(Md) = s then complexity of n × n matrix multiplication is O(nlogd(s)) (the standard algorithm is O(n3))

Theorem (Strassen)

rank(M2) ≤ 7, so complexity of n × n matrix multiplication is O(nlog2(7)) = O(n2.81) Current record: O(n2.373) (Le Gall)

Harm Derksen Singular Values of Tensors

slide-7
SLIDE 7

The Canonical Polyadic (CP) Model

aka PARAFAC, CANDECOMP

Problem

Given a tensor T ∈ V = V (1) ⊗ · · · ⊗ V (d), write T = v1 + v2 + · · · + vr where v1, v2, . . . , vr are simple tensors and r is minimal.

Harm Derksen Singular Values of Tensors

slide-8
SLIDE 8

The Canonical Polyadic (CP) Model

aka PARAFAC, CANDECOMP

Problem

Given a tensor T ∈ V = V (1) ⊗ · · · ⊗ V (d), write T = v1 + v2 + · · · + vr where v1, v2, . . . , vr are simple tensors and r is minimal. ◮ the CP-decomposition is sometimes unique, not always ◮ not numerically stable: there is no upper bound for max{v1, . . . , vr} as a function of T. ◮ difficult to compute: for many tensors of interest the rank is unknown Many numerical applications in psychometrics, signal processing, numerical linear algebra, computer vision, numerical analysis, data mining, graph analysis, neuroscience, chemometrics, etc.

Harm Derksen Singular Values of Tensors

slide-9
SLIDE 9

The Canonical Polyadic (CP) Model

To make it more numerically stable, we could allow an error term.

Problem

Given a tensor T ∈ V and fixed r, write T = v1 + v2 + · · · + vr + E where v1, v2, . . . , vr are simple tensors and E is minimal.

Harm Derksen Singular Values of Tensors

slide-10
SLIDE 10

The Canonical Polyadic (CP) Model

To make it more numerically stable, we could allow an error term.

Problem

Given a tensor T ∈ V and fixed r, write T = v1 + v2 + · · · + vr + E where v1, v2, . . . , vr are simple tensors and E is minimal. This optimization problem might be ill-posed. Define T = e2 ⊗ e1 ⊗ e1 + e1 ⊗ e2 ⊗ e1 + e1 ⊗ e1 ⊗ e2, v1(t) = t−1(e1+te2)⊗(e1+te2)⊗(e1+te2), v2(t) = −t−1e1⊗e1⊗e1 Then T = v1(t) + v2(t) + E(t) with E(t) = t√3 + t. So we can write T = v1 + v2 + E with E = ε for every ε > 0, but we cannot write T = v1 + v2 + E with E = 0.

Harm Derksen Singular Values of Tensors

slide-11
SLIDE 11

Motivation: Compressed Sensing and Convex Relaxation

For x ∈ Rn, its sparsity is measured by x0 = #{i | xi = 0}.

Problem

Given A ∈ Rm×n, b ∈ Rm, find a solution x ∈ Rn for Ax = b with x0 minimal (a sparsest solution). But, · 0 is not convex and this optimization problem is difficult,

Harm Derksen Singular Values of Tensors

slide-12
SLIDE 12

Motivation: Compressed Sensing and Convex Relaxation

For x ∈ Rn, its sparsity is measured by x0 = #{i | xi = 0}.

Problem

Given A ∈ Rm×n, b ∈ Rm, find a solution x ∈ Rn for Ax = b with x0 minimal (a sparsest solution). But, · 0 is not convex and this optimization problem is difficult, so instead we consider:

Problem (Basis Pursuit)

Given A ∈ Rm×n, b ∈ Rm, find a solution x ∈ Rn for Ax = b with x1 minimal. Basis Pursuit can be solved by linear programming and is generally

  • fast. Under reasonable assumptions, Basis Pursuit also gives the

sparsest solutions (Cand` es-Tao, Donoho).

Harm Derksen Singular Values of Tensors

slide-13
SLIDE 13

The Nuclear and Spectral Norms

Low rank tensors are sparse in some sense, and by convex relaxation:

Definition

The nuclear norm T⋆ is the smallest value of r

i=1 vi where

T = r

i=1 vi and v1, . . . , vr are simple tensors. (well-defined)

Harm Derksen Singular Values of Tensors

slide-14
SLIDE 14

The Nuclear and Spectral Norms

Low rank tensors are sparse in some sense, and by convex relaxation:

Definition

The nuclear norm T⋆ is the smallest value of r

i=1 vi where

T = r

i=1 vi and v1, . . . , vr are simple tensors. (well-defined)

V (1), . . . , V (d), V a spaces with a positive definite bilinear/hermitian form ·, ·

Harm Derksen Singular Values of Tensors

slide-15
SLIDE 15

The Nuclear and Spectral Norms

Low rank tensors are sparse in some sense, and by convex relaxation:

Definition

The nuclear norm T⋆ is the smallest value of r

i=1 vi where

T = r

i=1 vi and v1, . . . , vr are simple tensors. (well-defined)

V (1), . . . , V (d), V a spaces with a positive definite bilinear/hermitian form ·, ·

Definition

The spectral norm is defined by Tσ = max{|T, v| | v simple tensor with v = 1}. The spectral norm is dual to the nuclear norm, in particular |T, S| ≤ T⋆Sσ for all tensors S, T.

Harm Derksen Singular Values of Tensors

slide-16
SLIDE 16

The Nuclear and Spectral Norms

if A ∈ Rn×m = Rn ⊗ Rm is an n × m matrix then the tensor rank

  • f A coincides with the matrix rank of A

Harm Derksen Singular Values of Tensors

slide-17
SLIDE 17

The Nuclear and Spectral Norms

if A ∈ Rn×m = Rn ⊗ Rm is an n × m matrix then the tensor rank

  • f A coincides with the matrix rank of A

If λ1 ≥ λ2 ≥ · · · ≥ λr are the singular values of A, then A⋆ = λ1 + · · · + λr, Aσ = λ1 (spectral/operator norm) and A = A2 = AF =

  • λ2

1 + · · · + λ2 r (Euclidean/Frobenius

norm)

Harm Derksen Singular Values of Tensors

slide-18
SLIDE 18

Example: Determinant Tensor

Dn =

σ∈Sn sgn(σ)eσ(1) ⊗ eσ(2) ⊗ · · · ⊗ eσ(n) ∈ Cn ⊗ · · · ⊗ Cn.

clearly Dn⋆ ≤ n! and rank(Dn) ≤ n!

Harm Derksen Singular Values of Tensors

slide-19
SLIDE 19

Example: Determinant Tensor

Dn =

σ∈Sn sgn(σ)eσ(1) ⊗ eσ(2) ⊗ · · · ⊗ eσ(n) ∈ Cn ⊗ · · · ⊗ Cn.

clearly Dn⋆ ≤ n! and rank(Dn) ≤ n! Dnσ = max{| det(v1v2 · · · vn)| | v1 = · · · = vn = 1} = 1 Dn⋆ = Dn⋆Dnσ ≥ Dn, Dn = n!. so Dn⋆ ≥ n!

Theorem (D.)

Dn⋆ = n!

Harm Derksen Singular Values of Tensors

slide-20
SLIDE 20

Example: Permanent Tensor

Pn =

σ∈Sn eσ(1) ⊗ eσ(2) ⊗ · · · ⊗ eσ(n) ∈ Cn ⊗ · · · ⊗ Cn.

Pnσ = max{| perm(v1v2 · · · vn)| | v1 = · · · = vn = 1}

Harm Derksen Singular Values of Tensors

slide-21
SLIDE 21

Example: Permanent Tensor

Pn =

σ∈Sn eσ(1) ⊗ eσ(2) ⊗ · · · ⊗ eσ(n) ∈ Cn ⊗ · · · ⊗ Cn.

Pnσ = max{| perm(v1v2 · · · vn)| | v1 = · · · = vn = 1}

Theorem (Carlen, Lieb and Moss, 2006)

max{perm(v1v2 · · · vn) | v1 = · · · = vn = 1} = n!/nn/2 n! nn/2 Pn⋆ = Pn⋆Pnσ ≥ Pn, Pn = n!. so Pn⋆ ≥ nn/2

Harm Derksen Singular Values of Tensors

slide-22
SLIDE 22

Example: Permanent Tensor

Theorem (Glynn 2010)

Pn = 1 2n−1

  • δ

n

i=1 δi

  • (n

i=1 δiei) ⊗ · · · ⊗ (n i=1 δiei)

where δ runs over {1} × {−1, 1}n−1.

Harm Derksen Singular Values of Tensors

slide-23
SLIDE 23

Example: Permanent Tensor

Theorem (Glynn 2010)

Pn = 1 2n−1

  • δ

n

i=1 δi

  • (n

i=1 δiei) ⊗ · · · ⊗ (n i=1 δiei)

where δ runs over {1} × {−1, 1}n−1. In particular,

  • n

⌊n/2⌋

  • ≤ rank(Pn) ≤ 2n−1 and Pn⋆ ≤ nn/2, so

Theorem (D.)

Pn⋆ = nn/2

Harm Derksen Singular Values of Tensors

slide-24
SLIDE 24

The Convex Decomposition (CoDe) Model

Problem

Given a tensor T ∈ V , write T = v1 + v2 + · · · + vr where v1, v2, . . . , vr are simple tensors and v1 + v2 + · · · + vr is minimal. Well-defined but the decomposition may not be unique.

Harm Derksen Singular Values of Tensors

slide-25
SLIDE 25

The Convex Decomposition (CoDe) Model

Problem

Given a tensor T ∈ V , write T = v1 + v2 + · · · + vr where v1, v2, . . . , vr are simple tensors and v1 + v2 + · · · + vr is minimal. Well-defined but the decomposition may not be unique. We can also allow an error term . . .

Harm Derksen Singular Values of Tensors

slide-26
SLIDE 26

The Convex Decomposition (CoDe) Model

Problem

Given a tensor T ∈ V and fixed x ≥ 0, write T = v1 + v2 + · · · + vr + E where r is a nonnegative integer, v1, v2, . . . , vr are simple tensors, v1 + v2 + · · · + vr ≤ x and E is minimal. The decomposition is not unique. If A = v1 + v2 + · · · + vr then A is the unique tensor with A⋆ ≤ x for which T − A is minimal.

Harm Derksen Singular Values of Tensors

slide-27
SLIDE 27

The Convex Decomposition (CoDe) Model

Problem

Given a tensor T ∈ V and fixed x ≥ 0, write T = v1 + v2 + · · · + vr + E where r is a nonnegative integer, v1, v2, . . . , vr are simple tensors, v1 + v2 + · · · + vr ≤ x and E is minimal. The decomposition is not unique. If A = v1 + v2 + · · · + vr then A is the unique tensor with A⋆ ≤ x for which T − A is minimal.

Problem

Given a tensor T ∈ V and fixed y ≥ 0, write T = v1 + v2 + · · · + vr + E where r is a nonnegative integer, v1, v2, . . . , vr are simple tensors, E ≤ y and v1 + v2 + · · · + vr is minimal.

Harm Derksen Singular Values of Tensors

slide-28
SLIDE 28

t-Orthogonality

Definition

Simple unit tensors v1, v2, . . . , vr are t-orthogonal if

r

  • i=1

|vi, w|2/t ≤ 1 for every simple tensor w with w2 = 1.

Harm Derksen Singular Values of Tensors

slide-29
SLIDE 29

t-Orthogonality

Definition

Simple unit tensors v1, v2, . . . , vr are t-orthogonal if

r

  • i=1

|vi, w|2/t ≤ 1 for every simple tensor w with w2 = 1. 1-orthogonal ⇔ orthogonal If t > s then t-orthogonal ⇒ s-orthogonal.

Harm Derksen Singular Values of Tensors

slide-30
SLIDE 30

t-Orthogonality

Definition

Simple unit tensors v1, v2, . . . , vr are t-orthogonal if

r

  • i=1

|vi, w|2/t ≤ 1 for every simple tensor w with w2 = 1. 1-orthogonal ⇔ orthogonal If t > s then t-orthogonal ⇒ s-orthogonal.

Theorem (D.)

If v1, . . . , vr ∈ V are t-orthogonal, then r ≤ dim(V )1/t.

Harm Derksen Singular Values of Tensors

slide-31
SLIDE 31

Horizontal and Vertical Tensor Product

Theorem (“horizontal tensor product”, D.)

If v1, . . . , vr are t-orthogonal, and w1, . . . , wr are s-orthogonal, then v1 ⊗ w1, . . . , vr ⊗ wr are (s + t)-orthogonal.

Harm Derksen Singular Values of Tensors

slide-32
SLIDE 32

Horizontal and Vertical Tensor Product

Theorem (“horizontal tensor product”, D.)

If v1, . . . , vr are t-orthogonal, and w1, . . . , wr are s-orthogonal, then v1 ⊗ w1, . . . , vr ⊗ wr are (s + t)-orthogonal. If V = V (1) ⊗ · · · ⊗ V (d) and W = W (1) ⊗ · · · ⊗ W (d), then V ⊠ W := (V (1) ⊗ W (1)) ⊗ · · · ⊗ (V (d) ⊗ W (d)). (v1⊗· · ·⊗vd)⊠(w1⊗· · ·⊗wd) = (v1⊗w1)⊗(v2⊗w2)⊗· · ·⊗(vd⊗wd)

Theorem (“vertical tensor product”, D.)

If v1, v2, . . . , vr ∈ V and w1, . . . , ws ∈ W are t-orthogonal, then {vi ⊠ wj | 1 ≤ i ≤ r, 1 ≤ j ≤ s} are t-orthogonal.

Harm Derksen Singular Values of Tensors

slide-33
SLIDE 33

The Diagonal Singular Value Decomposition

Definition

Suppose that (⋆) : T = r

i=1 λivi such that λ1 ≥ · · · ≥ λr > 0

and v1, . . . , vr are 2-orthogonal simple tensors of unit length, then (⋆) is called a diagonal singular value decomposition of T (DSVD). If d = 2 (tensor product of 2 spaces) then the DSVD is the usual singular value decomposition. For d > 2, the DSVD is different from the Higher Order Singular Value Decomposition defined by De Lathauer, De Moor, and Vandewalle. Not every tensor has a DSVD.

Harm Derksen Singular Values of Tensors

slide-34
SLIDE 34

The Diagonal Singular Value Decomposition

Theorem (D.)

If T has a DSVD then T⋆ =

  • i

λi, T2 =

  • i

λ2

i ,

Tσ = λ1

Harm Derksen Singular Values of Tensors

slide-35
SLIDE 35

The Diagonal Singular Value Decomposition

Theorem (D.)

If T has a DSVD then T⋆ =

  • i

λi, T2 =

  • i

λ2

i ,

Tσ = λ1

Theorem (D.)

If λ1 > λ2 > · · · > λr then the DSVD is unique.

Theorem (D.)

If v1, . . . , vr are t-orthogonal with t > 2, then the DSVD is unique.

Harm Derksen Singular Values of Tensors

slide-36
SLIDE 36

Example: Matrix Multiplication Tensor

e1, . . . , en ∈ Cn are orthogonal e1 ⊗ e1, . . . , en ⊗ en ∈ Cn ⊗ Cn are 2-orthogonal

Harm Derksen Singular Values of Tensors

slide-37
SLIDE 37

Example: Matrix Multiplication Tensor

e1, . . . , en ∈ Cn are orthogonal e1 ⊗ e1, . . . , en ⊗ en ∈ Cn ⊗ Cn are 2-orthogonal e1 ⊗ e1 ⊗ 1, . . . , en ⊗ en ⊗ e1 ∈ Cn ⊗ Cn ⊗ C are 2-orthogonal e1 ⊗ 1 ⊗ e1, . . . , en ⊗ 1 ⊗ en ∈ Cn ⊗ C ⊗ Cn are 2-orthogonal 1 ⊗ e1 ⊗ e1, . . . , 1 ⊗ en ⊗ en ∈ C ⊗ Cn ⊗ Cn are 2-orthogonal

Harm Derksen Singular Values of Tensors

slide-38
SLIDE 38

Example: Matrix Multiplication Tensor

e1, . . . , en ∈ Cn are orthogonal e1 ⊗ e1, . . . , en ⊗ en ∈ Cn ⊗ Cn are 2-orthogonal e1 ⊗ e1 ⊗ 1, . . . , en ⊗ en ⊗ e1 ∈ Cn ⊗ Cn ⊗ C are 2-orthogonal e1 ⊗ 1 ⊗ e1, . . . , en ⊗ 1 ⊗ en ∈ Cn ⊗ C ⊗ Cn are 2-orthogonal 1 ⊗ e1 ⊗ e1, . . . , 1 ⊗ en ⊗ en ∈ C ⊗ Cn ⊗ Cn are 2-orthogonal Using vertical tensor product, we get {(ei ⊗ ej) ⊗ (ej ⊗ ek) ⊗ (ek ⊗ ei) | 1 ≤ i, j, k ≤ n} are 2-orthogonal.

Harm Derksen Singular Values of Tensors

slide-39
SLIDE 39

Example: Matrix Multiplication Tensor

Theorem (D.)

The matrix multiplication tensor Tn =

n

  • i,j,k=1

ei,j ⊗ ej,k ⊗ ek,i is a DSVD. The singular values of Tn are 1, 1, . . . , 1

  • n3

In particular, Tn⋆ =

n3

  • i=1

1 = n3.

Harm Derksen Singular Values of Tensors

slide-40
SLIDE 40

Example: Group Algebra Multiplication Tensor

G is a group with n elements and CG ∼ = Cn is the group algebra TG =

  • g,h∈G

g ⊗ h ⊗ h−1g−1. DFT case corresponds to G = Z/nZ.

Harm Derksen Singular Values of Tensors

slide-41
SLIDE 41

Example: Group Algebra Multiplication Tensor

G is a group with n elements and CG ∼ = Cn is the group algebra TG =

  • g,h∈G

g ⊗ h ⊗ h−1g−1. DFT case corresponds to G = Z/nZ.

Theorem (D.)

TG has a DSVD and its singular values are

  • n

d1 , . . . ,

  • n

d1

  • d3

1

, . . . ,

  • n

ds , . . . ,

  • n

ds

  • d3

s

where d1, d2, . . . , ds are the dimension of the irreducible representations of G.

Harm Derksen Singular Values of Tensors

slide-42
SLIDE 42

Pn no DSVD

Suppose that Pn has a DSVD with singular values λ1, . . . , λr. Pn⋆ = nn/2 = r

i=1 λi

Pnσ =

n! nn/2 = λ1

Pn2 = n! = r

i=1 λ2 i

λ1

r

  • i=1

λi = n! =

r

  • i=1

λ2

i

so λ1 = · · · = λr, and r = rλ1

λ1 = Dn⋆ Dnσ = nn n! .

If n > 2 then r is not an integer!

Harm Derksen Singular Values of Tensors

slide-43
SLIDE 43

Noisy Signal Model

Suppose that V is a finite dimensional R-vector space of signals. If c ∈ V is a noisy signal, then there is a decomposition c = a + b where a ∈ V is a sparse original signal, and b ∈ V is additive noise. Using convex relaxation, sparsity is measured by some ℓ1-type norm · X. The noise is measured by some ℓ2-type norm · Y .

Harm Derksen Singular Values of Tensors

slide-44
SLIDE 44

Noisy Signal Model

Suppose that V is a finite dimensional R-vector space of signals. If c ∈ V is a noisy signal, then there is a decomposition c = a + b where a ∈ V is a sparse original signal, and b ∈ V is additive noise. Using convex relaxation, sparsity is measured by some ℓ1-type norm · X. The noise is measured by some ℓ2-type norm · Y . For example, c is a tensor in V = V (1) ⊗ · · · ⊗ V (d), · X = · ⋆ is the nuclear norm, and · Y = · 2 = · is the usual ℓ2-norm. (see the CoDe Model)

Harm Derksen Singular Values of Tensors

slide-45
SLIDE 45

The Pareto Frontier

V finite dimensional R-vector space · X, · Y norms on V c ∈ V

Definition

A pair (x, y) ∈ R2 is Pareto-efficient if there exists a decomposition c = a + b with a = x, b = y and for every decomposition c = a′ + b′ we have a′X > x, b′Y > y or (a′X, b′Y ) = (x, y). We call c = a + b an XY -decomposition. The Pareto-frontier consists of all Pareto-efficient pairs.

Harm Derksen Singular Values of Tensors

slide-46
SLIDE 46

The Pareto Frontier

V finite dimensional R-vector space · X, · Y norms on V c ∈ V

Definition

A pair (x, y) ∈ R2 is Pareto-efficient if there exists a decomposition c = a + b with a = x, b = y and for every decomposition c = a′ + b′ we have a′X > x, b′Y > y or (a′X, b′Y ) = (x, y). We call c = a + b an XY -decomposition. The Pareto-frontier consists of all Pareto-efficient pairs.

Lemma

The Pareto-frontier is the graph of a strictly decreasing, convex, homeomorphism f c

YX : [0, cX] → [0, cY ].

Harm Derksen Singular Values of Tensors

slide-47
SLIDE 47

The Pareto Sub-Frontier

·, · positive definite bilinear form v2 := v =

  • v, v ℓ2-norm

· X, · Y dual to each others

Harm Derksen Singular Values of Tensors

slide-48
SLIDE 48

The Pareto Sub-Frontier

·, · positive definite bilinear form v2 := v =

  • v, v ℓ2-norm

· X, · Y dual to each others

Theorem

Equivalent:

  • 1. c = a + b is an X2-decomposition
  • 2. c = a + b is an 2Y -decomposition
  • 3. a, b = aXbY .

Harm Derksen Singular Values of Tensors

slide-49
SLIDE 49

The Pareto Sub-Frontier

·, · positive definite bilinear form v2 := v =

  • v, v ℓ2-norm

· X, · Y dual to each others

Theorem

Equivalent:

  • 1. c = a + b is an X2-decomposition
  • 2. c = a + b is an 2Y -decomposition
  • 3. a, b = aXbY .

Definition

The Pareto sub-frontier consists of all pairs (x, y) such that there exists an X2-decomposition c = a + b with aX = x and bY = y.

Harm Derksen Singular Values of Tensors

slide-50
SLIDE 50

The Pareto Sub-Frontier

Lemma

The Pareto sub-frontier is the graph of a strictly decreasing homeomorphism hc

YX : [0, cX] → [0, cY ].

Clearly, f c

YX ≤ hc YX.

Definition

A vector c ∈ V is called tight if f c

YX = hc

  • YX. If all vectors in V are

tight then the norms · X and · Y are called tight.

Harm Derksen Singular Values of Tensors

slide-51
SLIDE 51

(Sub)-Pareto Example

c = (3 3)t ∈ V = R2

  • z1

z2

  • X

=

  • 1

2z2 1 + 2z2 2,

  • z1

z2

  • Y

=

  • 2z2

1 + 1 2z2 2.

dual norms

1 2 3 4 5 1 2 3 4 5 fYX,hYX

So c is not tight.

Harm Derksen Singular Values of Tensors

slide-52
SLIDE 52

The Area Below the Pareto Sub-Frontier

Suppose c = a + b is an X2-decomposition with aX = x0 and bY = y0. The area under the Pareto sub-frontier is

1 2c2 = 1 2a2 + a, b + 1 2b2.

X Y hYX ||c||X ||c||Y x0 y0 (1/2)||a||2

2

(||a||X,||b||Y) <a,b> (1/2)||b||2

2 Harm Derksen Singular Values of Tensors

slide-53
SLIDE 53

The Area Below the Pareto Sub-Frontier

The sub-Pareto frontier applies to: ◮ denoising a sparse 1D signal: · 1 vs. · ∞ (tight!) ◮ denoising of a matrix by soft tresholding: · ⋆ vs. · σ (tight!) ◮ denoising tensors: · ⋆ vs. · σ ◮ compressive sensing (Basis Pursuit Denoising, LASSO, Dantzig Selector) ◮ Fatemi-Osher-Rudin image denoising: total variation norm vs. its dual ◮ 1D total variation denoising: Taut String Method

Harm Derksen Singular Values of Tensors

slide-54
SLIDE 54

Example: Singular Value Decomposition

Rn×m = Rn ⊗ Rm, r = min{n, m} A ∈ Rn×m with singular values λ1 ≥ λ2 ≥ · · · ≥ λr ≥ 0 AX = A⋆ = λ1 + λ2 + · · · + λr AY = Aσ = λ1 A =

  • λ2

1 + λ2 2 + · · · + λ2 r

Harm Derksen Singular Values of Tensors

slide-55
SLIDE 55

Example: Singular Value Decomposition

Rn×m = Rn ⊗ Rm, r = min{n, m} A ∈ Rn×m with singular values λ1 ≥ λ2 ≥ · · · ≥ λr ≥ 0 AX = A⋆ = λ1 + λ2 + · · · + λr AY = Aσ = λ1 A =

  • λ2

1 + λ2 2 + · · · + λ2 r

Lemma

The norms · ⋆ and · σ are tight. XY -decompositions are related to soft tresholding

Harm Derksen Singular Values of Tensors

slide-56
SLIDE 56

Example: Singular Value Decomposition

(note that the Pareto and sub-Pareto frontiers are the same here)

Lemma

If A has singular values λ1 ≥ λ2 ≥ · · · ≥ λr ≥ 0 then the Pareto (sub-)frontier is the piecewise linear function through the points: (λ1 + λ2 + · · · + λk − kλk+1, λk+1), k = 0, 1, 2, . . . , r (with the convention λr+1 = 0).

Harm Derksen Singular Values of Tensors

slide-57
SLIDE 57

Example: Singular Value Decomposition

(note that the Pareto and sub-Pareto frontiers are the same here)

Lemma

If A has singular values λ1 ≥ λ2 ≥ · · · ≥ λr ≥ 0 then the Pareto (sub-)frontier is the piecewise linear function through the points: (λ1 + λ2 + · · · + λk − kλk+1, λk+1), k = 0, 1, 2, . . . , r (with the convention λr+1 = 0).

Corollary

The singular values are uniquely determined by the (sub-)Pareto curve, and vice versa.

Harm Derksen Singular Values of Tensors

slide-58
SLIDE 58

Slope decomposition

Definition

c = c1 + c2 + · · · + cs is called a slope decomposition if ci, cj = ciXcjY for all i ≤ j and c1X c1Y < c2X c2Y < · · · < crX crY .

Harm Derksen Singular Values of Tensors

slide-59
SLIDE 59

Slope decomposition

Definition

c = c1 + c2 + · · · + cs is called a slope decomposition if ci, cj = ciXcjY for all i ≤ j and c1X c1Y < c2X c2Y < · · · < crX crY .

Theorem

c ∈ V has a slope decomposition if and only if c is tight.

Harm Derksen Singular Values of Tensors

slide-60
SLIDE 60

Slope decomposition

Definition

c = c1 + c2 + · · · + cs is called a slope decomposition if ci, cj = ciXcjY for all i ≤ j and c1X c1Y < c2X c2Y < · · · < crX crY .

Theorem

c ∈ V has a slope decomposition if and only if c is tight.

Theorem

The slope decomposition is unique when it exists.

Harm Derksen Singular Values of Tensors

slide-61
SLIDE 61

Singular values for tight vectors

Suppose c ∈ V is tight, with slope decomposition c = c1 + c2 + · · · + cs.

Definition

Let µi = ciY , and λi = µi + µi+1 + · · · + µs for all i. Then c has singular value λi with multiplicity ciX

ciY − ci−1X ci−1Y .

Multiplicities and singular values are positive, but not necessarily integers.

Harm Derksen Singular Values of Tensors

slide-62
SLIDE 62

Singular values for tight vectors

Suppose c ∈ V is tight, with slope decomposition c = c1 + c2 + · · · + cs.

Definition

Let µi = ciY , and λi = µi + µi+1 + · · · + µs for all i. Then c has singular value λi with multiplicity ciX

ciY − ci−1X ci−1Y .

Multiplicities and singular values are positive, but not necessarily integers.

Theorem

If c has singular Values λ1, . . . , λs with multiplicities m1, . . . , ms respectively, then cX =

i miλi, cY = λ1 and

T =

  • i miλ2

i .

Harm Derksen Singular Values of Tensors

slide-63
SLIDE 63

Singular Values for Tensors

Norms · X = · ⋆, · Y = · σ

Theorem

If T = r

i=1 λivi is a diagonal singular value decomposition, then

the singular values of T are λ1, . . . , λr.

Harm Derksen Singular Values of Tensors

slide-64
SLIDE 64

Singular Values for Tensors

Norms · X = · ⋆, · Y = · σ

Theorem

If T = r

i=1 λivi is a diagonal singular value decomposition, then

the singular values of T are λ1, . . . , λr. The permanent tensor Pn is tight, but has no DSVD. It has singular value

n! nn/2 with multiplicity nn n! .

Harm Derksen Singular Values of Tensors

slide-65
SLIDE 65

Singular Value Region of a Tensor

There is an easy procedure to go from the Pareto subfrontier to the singular values and their multiplicities for tight vectors. Even if a vector are not tight we can use the same procedure to define a singular spectrum. Some singular valules may have infinitesemal multiplicity.

Harm Derksen Singular Values of Tensors

slide-66
SLIDE 66

Example: Singular Value Region of a Tensor

· X = · ⋆ and · Y = · σ are dual norms on Rp×q×r C = e2 ⊗ e1 ⊗ e1 + e1 ⊗ e2 ⊗ e1 + e1 ⊗ e1 ⊗ e2 ∈ R2 ⊗ R2 ⊗ R2 has continuous singular spectrum

1 2 3 4 0.5 1 1.5

singular value

2 √ 3 with multiplicity 27 13, singular value 1 4 with

multiplicity 4

3 and singular values between 1 4 and 2 √ 3 with

infinitesemal multiplicity Cσ =

2 √ 3 (largest singular value), area is C⋆ = 3, and integral

  • f 2y over region is C2

2 = 3

Harm Derksen Singular Values of Tensors

slide-67
SLIDE 67

Thanks

Thank You!

Harm Derksen Singular Values of Tensors