Approximating a tensor as a sum of rank-one components


SLIDE 1

Approximating a tensor as a sum of rank-one components

Petros Drineas
Rensselaer Polytechnic Institute, Computer Science Department

To access my web page: drineas

SLIDE 2

Research interests in my group

  • Algorithmic tools (randomized, approximate)
  • Matrix/tensor algorithms and, in particular, matrix/tensor decompositions

Goal: Learn a model for the underlying "physical" system generating the data.

SLIDE 3

Matrices/tensors in data mining

Data are represented by matrices: numerous modern datasets are in matrix form. We are given m objects and n features describing the objects; Aij shows the "importance" of feature j for object i.

Data are also represented by tensors. Linear algebra and numerical analysis provide the fundamental mathematical and algorithmic tools to deal with matrix and tensor computations.

[Figure: a data matrix A with m objects as rows and n features as columns]

SLIDE 4

The TensorCUR algorithm

Mahoney, Maggioni & Drineas, KDD '06, SIMAX '08; Drineas & Mahoney, LAA '07

  • Definition of Tensor-CUR decompositions
  • Theory behind Tensor-CUR decompositions
  • Applications of Tensor-CUR decompositions: recommendation systems, hyperspectral image analysis

[Figure: an m customers × n products × n products tensor, from which 2 sample slabs are drawn along one mode]

Theorem: relates the reconstruction obtained by unfolding R along the α dimension and pre-multiplying by CU to the best rank-kα approximation to A[α].

SLIDE 5

Overview

  • Preliminaries, notation, etc.
  • Negative results
  • Positive results
      • Existential result (full proof)
      • Algorithmic result (sketch of the algorithm)
  • Open problems
SLIDES 6-10

Approximating a tensor

Fundamental Question: Given a tensor A and an integer k, find k rank-one tensors such that their sum is as "close" to A as possible.

Notation (A is an order-r tensor, i.e., a tensor with r modes):

  • A rank-one component is an outer product of r vectors. It has the same dimensions as A, and its entries are
    $(x^{(1)} \otimes \cdots \otimes x^{(r)})_{i_1 \cdots i_r} = x^{(1)}_{i_1} \cdots x^{(r)}_{i_r}$.
  • Frobenius norm: $\|A\|_F = \left( \sum_{i_1, \ldots, i_r} A_{i_1 \cdots i_r}^2 \right)^{1/2}$.
  • Spectral norm: $\|A\|_2 = \max_{\|x^{(1)}\| = \cdots = \|x^{(r)}\| = 1} A(x^{(1)}, \ldots, x^{(r)})$, where
    $A(x^{(1)}, \ldots, x^{(r)}) = \sum_{i_1, \ldots, i_r} A_{i_1 \cdots i_r} \, x^{(1)}_{i_1} \cdots x^{(r)}_{i_r}$.
  • We will measure the error by the spectral norm of the residual, $\| A - \sum_{i=1}^{k} (\text{rank-one components}) \|_2$.
  • Both norms are equivalent to the corresponding matrix norms for r = 2.
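To make the notation concrete, here is a minimal NumPy sketch (the tensor, its shape, and the vectors are arbitrary stand-ins, not from the talk) that evaluates the Frobenius norm and the multilinear form A(x, y, z) whose maximum over unit triples is the spectral norm:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 5, 6))   # an order-3 tensor

def frobenius_norm(A):
    # ||A||_F: square root of the sum of all squared entries (any order).
    return np.sqrt((A ** 2).sum())

def multilinear_form(A, x, y, z):
    # A(x, y, z) = sum_{j1,j2,j3} A[j1,j2,j3] * x[j1] * y[j2] * z[j3];
    # ||A||_2 is the maximum of this value over unit vectors x, y, z.
    return np.einsum('ijk,i,j,k->', A, x, y, z)

# Evaluate the form at one random unit triple (not the maximizer).
x, y, z = (rng.standard_normal(n) for n in A.shape)
x, y, z = x / np.linalg.norm(x), y / np.linalg.norm(y), z / np.linalg.norm(z)
print(frobenius_norm(A), multilinear_form(A, x, y, z))
```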

SLIDES 11-13

Approximating a tensor: negative results

Fundamental Question: Given a tensor A and an integer k, find k rank-one tensors such that their sum is as "close" to A as possible.

Negative results (A is an order-r tensor):

1. For r=3, computing the minimal k such that A is exactly equal to the sum of k rank-one components is NP-hard [Håstad '89, '90].

2. For r=3, identifying k rank-one components such that the Frobenius norm error of the approximation is minimized might not even have a solution (L.-H. Lim '04).

3. For r=3, identifying k rank-one components such that the Frobenius norm error of the approximation is minimized (assuming such components exist) is NP-hard.

SLIDES 14-15

Approximating a tensor: positive results!

Fundamental Question: Given a tensor A and an integer k, find k rank-one tensors such that their sum is as "close" to A as possible.

Positive results! Both are from a paper of Kannan et al. in STOC '05 (A is an order-r tensor):

1. (Existence) For any tensor A and any ε > 0, there exist at most k = 1/ε² rank-one tensors whose sum B satisfies
   $\| A - B \|_2 \le \varepsilon \, \|A\|_F$.

2. (Algorithmic) For any tensor A and any ε > 0, we can find at most k = 4/ε² rank-one tensors whose sum B satisfies, with probability at least .75,
   $\| A - B \|_2 \le \varepsilon \, \|A\|_F$.
   Time: polynomial in the size of A for constant ε, but exponential in poly(1/ε) (see the outline below).

SLIDES 16-18

The matrix case…

Matrix result: For any matrix A and any ε > 0, we can find at most k = 1/ε² rank-one matrices whose sum B satisfies
   $\| A - B \|_2 \le \varepsilon \, \|A\|_F$.

To prove this, simply recall that the best rank-k approximation to a matrix A is given by A_k (as computed by the SVD). But, by setting k = 1/ε²,
   $\| A - A_k \|_2 = \sigma_{k+1}(A) \le \Big( \tfrac{1}{k} \sum_{i=1}^{k} \sigma_i^2(A) \Big)^{1/2} \le \frac{\|A\|_F}{\sqrt{k}} = \varepsilon \, \|A\|_F$.

From an existential perspective, the result is the same for matrices and higher-order tensors. From an algorithmic perspective, in the matrix case the algorithm is (i) more efficient, (ii) returns fewer rank-one components, and (iii) has no failure probability.
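A minimal NumPy check of this matrix bound (the random matrix and the choice ε = 0.25 are illustrative):

```python
import numpy as np

# Truncated SVD gives k = 1/eps^2 rank-one matrices whose sum A_k
# satisfies ||A - A_k||_2 <= eps * ||A||_F.
rng = np.random.default_rng(1)
A = rng.standard_normal((200, 150))
eps = 0.25
k = int(np.ceil(1 / eps**2))          # k = 16 rank-one components

U, s, Vt = np.linalg.svd(A, full_matrices=False)
A_k = U[:, :k] * s[:k] @ Vt[:k, :]    # sum of the top k rank-one terms

spectral_err = np.linalg.norm(A - A_k, 2)     # = sigma_{k+1}(A)
bound = eps * np.linalg.norm(A, 'fro')
assert spectral_err <= bound
print(f"||A - A_k||_2 = {spectral_err:.3f} <= {bound:.3f} = eps*||A||_F")
```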

SLIDES 19-22

Existential result: the proof

1. (Existence) For any tensor A and any ε > 0, there exist at most k = 1/ε² rank-one tensors whose sum B satisfies $\| A - B \|_2 \le \varepsilon \, \|A\|_F$.

Proof: If $\|A\|_2 \le \varepsilon \|A\|_F$, then we are done. Otherwise, by the definition of the spectral norm of the tensor, there exist vectors $x^{(1)}, \ldots, x^{(r)}$ (w.l.o.g. of unit norm) such that
   $A(x^{(1)}, \ldots, x^{(r)}) > \varepsilon \, \|A\|_F$.

Consider the tensor
   $B = A - \alpha \, x^{(1)} \otimes \cdots \otimes x^{(r)}$, where the scalar $\alpha = A(x^{(1)}, \ldots, x^{(r)})$.
We can prove (easily) that
   $\|B\|_F^2 = \|A\|_F^2 - \alpha^2$.

Now combine the two displays:
   $\|B\|_F^2 = \|A\|_F^2 - \alpha^2 < \|A\|_F^2 - \varepsilon^2 \|A\|_F^2$.

We now iterate this process using B instead of A. Each step peels off one rank-one component and reduces the squared Frobenius norm by at least $\varepsilon^2 \|A\|_F^2$ (with the threshold always measured against the original A), so the process terminates after at most k = 1/ε² steps, leaving a residual with spectral norm at most $\varepsilon \|A\|_F$.
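The proof is constructive whenever the maximizer of the multilinear form can be computed; for r = 2 that maximizer is the top singular vector pair, so the deflation can be run verbatim. A hypothetical NumPy sketch (random matrix and ε = 0.3 are illustrative):

```python
import numpy as np

# Greedy deflation for r = 2: peel off the top singular component until
# the residual's spectral norm drops below eps * ||A||_F. For general
# tensors this maximization is the hard step (see the outline below).
rng = np.random.default_rng(2)
A = rng.standard_normal((60, 40))
eps = 0.3
fro_A = np.linalg.norm(A, 'fro')

B, components = A.copy(), []
while np.linalg.norm(B, 2) > eps * fro_A:
    U, s, Vt = np.linalg.svd(B, full_matrices=False)
    x, y, alpha = U[:, 0], Vt[0, :], s[0]     # alpha = B(x, y) = sigma_1(B)
    components.append(alpha * np.outer(x, y)) # one rank-one component
    B -= components[-1]                       # ||B||_F^2 drops by alpha^2

print(len(components), "<= 1/eps^2 =", 1 / eps**2)
```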

SLIDE 23

Algorithmic result: outline

2. (Algorithmic) For any tensor A and any ε > 0, we can find at most k = 4/ε² rank-one tensors whose sum B satisfies, with probability at least .75, $\| A - B \|_2 \le \varepsilon \|A\|_F$.

Ideas: For simplicity, focus on order-3 tensors. The only part of the existential proof that is not constructive is how to identify unit vectors x, y, and z such that
   $A(x, y, z) = \sum_{j_1, j_2, j_3} A_{j_1 j_2 j_3} \, x_{j_1} y_{j_2} z_{j_3}$
is maximized.

SLIDE 24

Algorithmic result: outline (cont'd)

Good news! If x and y are known, then in order to maximize A(x, y, z) over all unit vectors x, y, and z, we can set z to be the (normalized) vector whose j3-th entry is
   $z_{j_3} \propto \sum_{j_1, j_2} A_{j_1 j_2 j_3} \, x_{j_1} y_{j_2}$ for all j3,
as in the sketch below.
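A minimal NumPy sketch of this contraction (the function name and shapes are illustrative):

```python
import numpy as np

def best_z(A, x, y):
    """Given unit vectors x and y, return the unit vector z maximizing
    A(x, y, z); by Cauchy-Schwarz it is the normalized contraction of A
    with x and y along the first two modes."""
    z = np.einsum('ijk,i,j->k', A, x, y)  # z[j3] = sum A[j1,j2,j3] x[j1] y[j2]
    return z / np.linalg.norm(z)
```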

SLIDE 25

Algorithmic result: outline (cont'd)

Approximating z… Instead of computing the entries of z exactly, we approximate them by sub-sampling: we draw a set S of random tuples (j1, j2) – we need roughly 1/ε² such tuples – and approximate the entries of z using the tuples in S only.

SLIDE 26

Algorithmic result: outline (cont'd)

Weighted sampling… Weighted sampling is used in order to pick the tuples (j1, j2). More specifically, each tuple (j1, j2) is picked with probability proportional to the squared norm of the corresponding tube of A:
   $\Pr[(j_1, j_2)] = \sum_{j_3} A_{j_1 j_2 j_3}^2 \, / \, \|A\|_F^2$.
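A hedged sketch of the resulting estimator (the length-squared distribution above and the 1/p reweighting are assumptions standing in for the slide's missing formula; the function name is illustrative):

```python
import numpy as np

def sampled_z(A, x, y, num_samples, rng):
    """Approximate the contraction z[j3] = sum A[j1,j2,j3] x[j1] y[j2]
    from a weighted sample S of tuples (j1, j2)."""
    n1, n2, n3 = A.shape
    p = (A ** 2).sum(axis=2).ravel()      # tube weight for each (j1, j2)
    p /= p.sum()
    idx = rng.choice(n1 * n2, size=num_samples, p=p)
    z = np.zeros(n3)
    for t in idx:
        j1, j2 = divmod(t, n2)
        # Reweighting by 1/(num_samples * p) makes the estimate unbiased.
        z += A[j1, j2, :] * x[j1] * y[j2] / (num_samples * p[t])
    zn = np.linalg.norm(z)
    return z / zn if zn > 0 else z
```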

SLIDE 27

Algorithmic result: outline (cont'd)

Exhaustive search in a discretized interval… We only need the values of x_{j1} and y_{j2} for the tuples in S. We exhaustively try "all" possible values by placing a fine grid on the interval [-1, 1]. This leads to a number of trials that is exponential in |S|.
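An illustrative sketch of the discretization (the grid spacing delta is an assumption; the point is only that the number of assignments grows exponentially with |S|):

```python
import itertools
import numpy as np

def candidate_assignments(S, delta=0.5):
    """Enumerate all joint assignments of grid values in [-1, 1] to the
    coordinates x[j1], y[j2] for each (j1, j2) in S.
    There are len(grid) ** (2 * len(S)) trials: exponential in |S|."""
    grid = np.arange(-1.0, 1.0 + 1e-9, delta)
    return itertools.product(grid, repeat=2 * len(S))

# Example: |S| = 2 tuples and a coarse 5-value grid -> 5**4 = 625 trials.
S = [(0, 3), (2, 1)]
print(sum(1 for _ in candidate_assignments(S)))  # -> 625
```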

SLIDE 28

Algorithmic result: outline (cont'd)

Recursively figure out x and y… Each possible assignment of values to x_{j1} and y_{j2}, for (j1, j2) in S, leads to a candidate vector z. We treat that vector as the true z, and we try to figure out x and y recursively! With z fixed, this is a smaller problem.

SLIDE 29

Algorithmic result: outline (cont'd)

Done! Return the best x, y, and z. The running time is dominated by the exhaustive search over the sample S, which is not too bad assuming that ε is a constant… The algorithm can also be generalized to higher-order tensors.

SLIDES 30-31

Approximating Max-r-CSP problems

Max-r-CSP (= Max-SNP): The goal of the Kannan et al. paper was to design PTAS (polynomial-time approximation schemes) for a large class of Max-r-CSP problems. Max-r-CSP problems are constraint satisfaction problems with n Boolean variables and m constraints: each constraint is the logical OR of exactly r variables, and the goal is to maximize the number of satisfied constraints. Max-r-CSP models a large number of problems, including Max-Cut, Bisection, Max-k-SAT, 3-coloring, Dense-k-subgraph, etc. Interestingly, tensors may be used to model Max-r-CSP as an optimization problem, and tensor decompositions help reduce its "dimensionality".

See also: Arora, Karger & Karpinski '95; Frieze & Kannan '96; Goldreich, Goldwasser & Ron '96; Alon, Vega, Kannan & Karpinski '02, '03; Drineas, Kannan & Mahoney '05, '07.
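As a toy instance of this tensor modeling (for r = 2 the "tensor" is just a matrix), a sketch of how Max-Cut's objective is a quadratic form; the 4-cycle graph is a hypothetical example:

```python
import numpy as np

# Max-Cut (a Max-2-CSP): for adjacency matrix W and a cut encoded by
# s in {-1, +1}^n, the number of cut edges is
#   (1/4) * sum_{i,j} W[i,j] * (1 - s[i]*s[j]).
def cut_value(W, s):
    return 0.25 * (W.sum() - s @ W @ s)

# Tiny example: a 4-cycle; the cut {0, 2} vs {1, 3} cuts all 4 edges.
W = np.array([[0, 1, 0, 1],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [1, 0, 1, 0]], float)
print(cut_value(W, np.array([1, -1, 1, -1])))  # -> 4.0
```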

SLIDE 32

Open problems

Similar error bounds for other norm combinations?
  • What about Frobenius on both sides, or spectral on both sides? Existential and/or algorithmic results are interesting.
  • Is it possible to get constant (or any) factor approximations in the case where the optimal solution exists?

Improved algorithmic results
  • The exponential dependency on ε is totally impractical. Practical algorithms with provable guarantees would be preferable…