A randomized block sampling approach to the canonical polyadic decomposition of large-scale tensors
Nico Vervliet
Joint work with Lieven De Lathauwer
A randomized block sampling approach to the canonical polyadic - - PowerPoint PPT Presentation
A randomized block sampling approach to the canonical polyadic decomposition of large-scale tensors Nico Vervliet Joint work with Lieven De Lathauwer SIAM AN17, July 13, 2017 Classification of hazardous gasses using e-noses Sensor Classify
Joint work with Lieven De Lathauwer
2
3
◮ Sum of R rank-1 terms
4
◮ Sum of R rank-1 terms
◮ Mathematically, for a general Nth order tensor T
R
r
⊗ a(2)
r
⊗ · · · ⊗ a(N)
r
4
◮ Optimization problem:
A(1),A(2),...,A(N)
F 5
◮ Optimization problem:
A(1),A(2),...,A(N)
F ◮ Algorithms
◮ Alternating least squares ◮ CPOPT
◮ (Damped) Gauss–Newton
◮ (Inexact) nonlinear least squares
5
◮ Suppose Nth order T ∈ CI×I×···×I, then ◮ number of entries: I N ◮ memory and time complexity: O
6
◮ Suppose Nth order T ∈ CI×I×···×I, then ◮ number of entries: I N ◮ memory and time complexity: O
◮ number of variables: NIR 6
◮ Suppose Nth order T ∈ CI×I×···×I, then ◮ number of entries: I N ◮ memory and time complexity: O
◮ number of variables: NIR
◮ number of entries: 1018 ◮ number of variables: 4500 6
◮ Use incomplete tensors
◮ Exploit sparsity
◮ Compress the tensor
◮ Decompose subtensors and combine results
◮ Parallel
7
8
9
9
9
9
9
s , n = 1, . . . , N
s
s , n = 1, ..., N and
s , n = 1, ..., N
10
s , n = 1, . . . , N
s
s , n = 1, ..., N and
s , n = 1, ..., N
10
11
11
11
◮ ALS variant
k+1 = (1 − α)A(n) k
−1
12
◮ ALS variant
k+1 = (1 − α)A(n) k
−1
◮ NLS variant
pk
12
13
13
◮ Function evaluation fval = 0.5
14
◮ Function evaluation fval = 0.5
◮ Step size 14
◮ Uncertainty of an estimate
15
◮ Uncertainty of an estimate
◮ CRB ≤ σ2 15
◮ Uncertainty of an estimate
◮ CRB ≤ σ2 ◮ C = τ 2(JHJ)−1 15
◮ Experimental bound
◮ Use estimates A(n)
k
◮ Use fval to estimate noise τ
16
◮ Experimental bound
◮ Use estimates A(n)
k
◮ Use fval to estimate noise τ
◮ Stopping criterion:
n In N
In
R
k (i, r) − A(n) k−KCRB(i, r)
16
◮ Experimental bound
◮ Use estimates A(n)
k
◮ Use fval to estimate noise τ
◮ Stopping criterion:
n In N
In
R
k (i, r) − A(n) k−KCRB(i, r)
16
◮ Unrestricted phase (1 + 2): converge to a neighborhood of an
◮ Restricted phase (3): pull iterates towards optimum 17
◮ Unrestricted phase (1 + 2): converge to a neighborhood of an
◮ Restricted phase (3): pull iterates towards optimum 17
◮ Unrestricted phase (1 + 2): converge to a neighborhood of an
◮ Restricted phase (3): pull iterates towards optimum
◮ CPD of rank R exists ◮ SNR is high enough ◮ Most block dimensions > R 17
18
◮ Experiments
◮ Comparison ALS vs NLS (see paper) ◮ Influence of block size ◮ Influence of step size (see paper)
19
◮ Experiments
◮ Comparison ALS vs NLS (see paper) ◮ Influence of block size ◮ Influence of step size (see paper)
◮ Performance
◮ 50 Monte Carlo experiments ◮ CPD error
n
res
◮ Experiments
◮ Comparison ALS vs NLS (see paper) ◮ Influence of block size ◮ Influence of step size (see paper)
◮ Performance
◮ 50 Monte Carlo experiments ◮ CPD error
n
res
19
20
21
22
23
23
24
25
◮ Resulting factor matrices
26
◮ Resulting factor matrices
◮ Performance after clustering
26
◮ The randomized block sampling CPD algorithm enables the
◮ Block size controls accuracy, data accesses and time ◮ Step size restriction improves accuracy ◮ Cram´
27
28
Joint work with Lieven De Lathauwer
2
3
4
5
6