Towards Faster Nonnegative Tensor Factorization: A New Active-Set - PowerPoint PPT Presentation

Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons Haesun Park hpark@cc.gatech.edu College of Computing Georgia Institute of Technology Atlanta, GA 30332, USA Joint work with Krishnakumar Balasubramanian, Hyunsoo Kim, Jingu Kim and Lars Elden NSF Tensor Workshop, Feb. 20-21, 2009

Outline New algorithms for NMF (Nonnegative Matrix Factorization) and NTF(Nonnegative PARAFAC) Algorithms for NMF Block principal pivoting algorithm Comparison results (NMF) Extension to NTF(Nonnegative PARAFAC) results (NTF) Summary Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.1/23

Alternating Nonnegative Least Squares for NMF Given A ∈ R m × n and a desired rank k , find W ∈ R m × k and H ∈ R k × n such + + + ⇒ min W ≥ 0 ,H ≥ 0 � A − WH � 2 that A ≈ WH = F 1. Initialize W ≥ 0 (or H ≥ 0 ) 2. Iterate the following ANLS until a stopping criteria is satisfied: (a) Solve min H ≥ 0 � W H − A � 2 F 2 � H T W T − A T � � (b) Solve min W ≥ 0 � � � F 3. The columns of W are normalized to unit L 2 -norm Convergence : Any limit point of the sequence is a stationary point [Grippo and Sciandrone ’00] Alternating Nonnegative Least Squares (ANLS) [Lin ’07, Kim et al ’07, H. Kim and Park ’08] Alternating Least Squares(ALS) [Berry et al ’06]: convergence is difficult to analyse, but can solve each sub-problem fast. Multiplicative Updating Rules [Lee and Seung ’01]: Simple to implement, but residual non-increasing property may not imply convergence to a stationary point. Other algorithms and variants [Li et al ’01, Hoyer ’04, Pauca et al ’04, Gao and Church ’05, Chu and Lin ’08] Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.2/23

NMF/ANLS Algorithms Sub-problem : min X ≥ 0 � CX − B � 2 F Active Set [H. Kim and Park,SIMAX ’08] Classical algorithm for NNLS with single right hand side � � min x ≥ 0 � Cx − b � 2 [Lawson and Hansen ’95] Faster algorithms for multiple right hand side problems [Bro and de Jong, 1997], and [Van Benthem and Keenan ’04]. Projected Gradient [Lin ’07] x k +1 ← P + ( x k − α k ∇ f ( x k )) Improved selection of step constant α k Projected Quasi-Newton [Kim ’07]   � y k − α ¯ � � � D k ∇ f ( y k ) P + y x k +1 ← =   z k 0 Gradient scaling only for inactive variables Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.3/23

Structure of Sub-problems in NMF Recognizing the structure is important for developing a fast algorithm for NMF : k ≪ min( m, n ) min H ≥ 0 � WH − A � 2 F W ∈ R m × k is long and thin and A ∈ R m × n has n right hand sides. + + � H T W T − A T � � 2 � min W ≥ 0 F H T ∈ R n × k is long and thin and A T ∈ R n × m has m right hand sides. + + Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.4/23

Block Principal Pivoting Algorithm Consider single right-hand side problem [Portugal et al ’94] : for x ∈ R q x ≥ 0 � Cx − b � 2 min 2 KKT conditions: C T Cx − C T b = y (1a) y ≥ 0 (1b) x ≥ 0 (1c) = 0 , i = 1 , · · · , q x i y i (1d) Need to find x and y that satisfy KKT conditions. Guess two index sets F and G that partition { 1 , · · · , q } Repeat: Set x G = 0 . Solve x F = arg min x F � C F x F − b � 2 2 Set y F = 0 Set y G = C T G ( C F x F − b ) . If x F ≥ 0 and y G ≥ 0 , solution found. Otherwise, update F and G . Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.5/23

How block principal pivoting works Update by C T F C F x F = C T F b and y G = C T G C F x F − C T G b . Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.6/23

Active-Set type Algorithms Active-Set Algorithm: One column is replaced most of the time Residual is guaranteed to monotonically decrease Careful exchange rule requires many iterations Can be faster when the solution is sparse Block Principal Pivoting Algorithm: Multiple columns are replaced each time Residual is not guaranteed to decrease Backup exchange rule guarantees BPP to find the solution in a finite number of iterations Can be faster when the solution vector is dense or long Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.10/23

NNLS with Multiple right-hand side for NMF min X ≥ 0 � CX − B � 2 F Block principal pivoting [Kim and Park ’08] Exploit long and thin structure Precompute C T C and C T B : updates of x F and y G is given by C T C T = F C F x F F b C T G C F x F − C T y G = G b. All coefficients can be directly retrieved from C T C and C T B C T C and C T B is small. → Storage is not a problem. Exploiting common F and G sets. → X is flat and wide. → More common cases of F and G sets. Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.11/23

Extension to Sparse NMF and Regularized NMF Sparse NMF [H. Kim and Park, Bioinformatics ’07]   n    � A − W H � 2 F + η � W � 2 � � H (: , j ) � 2 min F + β (2) 1 W,H  j =1 subject to W, H ≥ 0 . ANLS reformulation: alternate the following 2 � �� W A � � min H − � √ βe 1 × k � � � H ≥ 0 0 1 × n � � F 2 � � � � �� A T H W T − � � min � � √ ηI k � � 0 k × m W ≥ 0 � � F Similar reformulation for regularized NMF: [Pauca ’06] � � � A − W H � 2 F + α � W � 2 F + β � H � 2 min (3) F W,H subject to W, H ≥ 0 . Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.12/23

Comparison results (NMF) Stopping criterion: normalized KKT optimality condition δ ∆ ≤ ǫ ∆ 0 , where ∆ = δ W + δ H Data sets: Synthetic: 300 × 200 , create sparse W and H and produce A = WH with noise Text: Topic Detection and Tracking 2, randomly select 20 topics, 12617 × 1491 Image: Olivetti Research Laboratory face image, 10304 × 400 Compared algorithms ( mult ) Lee and Seung’s multiplicative updating algorithm[’01] ( als ) Berry et al.’s alternating least squares algorithm [’07] ( lsqnonneg ) ANLS with Lawson and Hanson’s active set algorithm for single right hand side [’95] ( projnewton ) ANLS with Kim et al.’s projected quasi-Newton algorithm [’07] ( projgrad ) ANLS with Lin’s projected gradient algorithm [’07] ( activeset ) ANLS with Kim and Park’s active set algorithm for multiple right hand sides [’07 Bioinformatics, ’08 SIMAX] ( blockpivot ) Kim and Park’s ANLS with block principal pivoting algorithm [’08 ICDM] Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.13/23

Stopping Criterion KKT condition: W ≥ 0 H ≥ 0 ∂f ( W, H ) /∂W ≥ 0 ∂f ( W, H ) /∂H ≥ 0 W. ∗ ( ∂f ( W, H ) /∂W ) = 0 H. ∗ ( ∂f ( W, H ) /∂H ) = 0 These conditions can be simplified as min ( W, ∂f ( W, H ) /∂W ) = 0 (4a) min ( H, ∂f ( W, H ) /∂H ) = 0 (4b) where the minimum is taken componentwise. Normalized KKT residual: δ ∆ = (5) δ W + δ H where m k � � � � δ = � min( W iq , ( ∂f ( W, H ) /∂W ) iq � � � q =1 i =1 k n � � � � + � min( H qj , ( ∂f ( W, H ) /∂H ) qj � � (6) � q =1 j =1 δ W =# (min( W, ( ∂f ( W, H ) /∂W ) � = 0) (7) δ H =# (min( H, ( ∂f ( W, H ) /∂H ) � = 0) . (8) Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.14/23

Synthetic data set multi als lsqnonneg projnewton projgrad activeset blockpivot k 0.252 time (sec) 5 35.336 36.697 23.188 5.756 0.976 0.262 0.786 10 47.132 52.325 82.619 13.43 4.157 0.848 4.004 20 72.888 83.232 45.007 9.32 4.41 14.384 30 127.33 62.317 17.252 16.132 40 81.445 22.246 21.368 60 128.76 37.376 30.055 80 276.29 65.566 iterations 5 9784.2 10000 25.6 25.8 30 26.4 26.4 10 10000 10000 34.8 35.2 45 35.2 35.2 20 10000 10000 70.8 104 69.8 69.8 30 166 205.2 166.6 166.6 40 234.8 118 117.8 60 157.8 84.2 84.2 80 131.8 67.2 67.2 residual 5 0.04035 0.04043 0.04035 0.04035 0.04035 0.04035 0.04035 10 0.04345 0.04379 0.04343 0.04343 0.04344 0.04343 0.04343 20 0.04603 0.04556 0.04412 0.04414 0.04412 0.04412 30 0.04313 0.04316 0.04327 0.04327 40 0.04944 0.04943 0.04944 Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons – p.15/23

Towards Faster Nonnegative Tensor Factorization: A New Active-Set - PowerPoint PPT Presentation

Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons Haesun Park hpark@cc.gatech.edu College of Computing Georgia Institute of Technology Atlanta, GA 30332, USA Joint work with Krishnakumar

Tensor Factorization via Matrix Factorization Volodymyr Kuleshov Arun Tejasvi Chaganty Percy

8. Tensor Field Visualization Tensor: extension of concept of scalar and vector Tensor data

Nonnegative Tensor Factorization using a proximal algorithm: application to 3D fluorescence

Nonnegative Tensor Factorization for Source Separation of Loops in Audio Jordan B. L. Smith

Nonnegative matrix factorization and applications in audio signal processing C edric F

Fast Newton-type Methods for Nonnegative Matrix and Tensor Approximation Inderjit S. Dhillon

FASTER TRANSFORMER Bo Yang Hsueh, 2019/12/18 AGENDA What is Faster Transformer Introduce the

A Medium-Grained Algorithm for Distributed Sparse Tensor Factorization Shaden Smith George

Sparse Tensor Factorization: Algorithms, Data Structures, and Challenges Shaden Smith &

Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals

Some Applications of Nonnegative Tensor Factorizations (NTF) to Mining Hype rspectral &

(Some) Challenges in (Some) Challenges in Tensor Mining Tensor Mining Evrim Acar Sandia

Tensor Field Techniques Lecture 11 March 5, 2020 Outline Basics of tensor algebra Tensor

TENSOR ALGEBRA Continuum Mechanics Course (MMC) - ETSECCPB - UPC Introduction to Tensors Tensor

Tensor-Matrix Products with a Compressed Sparse Tensor Shaden Smith George Karypis University

Tensor Field Visualization 9-1 Ronald Peikert SciVis 2007 - Tensor Fields Tensors

Locked-on: verifying controls for aircraft tracking Chun Kai Ling, Daniel Wong 15-824 Logical

esm-tools A tool for Earth-System-Modellers Nadine Wieters, Dirk Barbi, Luisa Cristini, et al.

F acebook @DWAFCEA T witter afcea @DWAFCEA L inked I n D ayton W r ight C hapter of AFCEA I nter

Interior-point methods 10-725 Optimization Geoff Gordon Ryan Tibshirani Review SVM duality

Campaign externalities, programmatic spending and voting preferences in rural Mexico The case of

Focus Session on Stellar Populations and Gravitational Wave Observations Center for

Computational Complexity of Bayesian Networks Johan Kwisthout and Cassio P. de Campos Radboud

ARMAX: One Step Prediction IIT Bombay Consider 2' nd order ARMAX model with d 1 = y (

Towards Faster Nonnegative Tensor Factorization: A New Active-Set - PowerPoint PPT Presentation

Towards Faster Nonnegative Tensor Factorization: A New Active-Set type Algorithm and Comparisons Haesun Park hpark@cc.gatech.edu College of Computing Georgia Institute of Technology Atlanta, GA 30332, USA Joint work with Krishnakumar

Tensor Factorization via Matrix Factorization Volodymyr Kuleshov Arun Tejasvi Chaganty Percy

8. Tensor Field Visualization Tensor: extension of concept of scalar and vector Tensor data

Nonnegative Tensor Factorization using a proximal algorithm: application to 3D fluorescence

Nonnegative Tensor Factorization for Source Separation of Loops in Audio Jordan B. L. Smith

Nonnegative matrix factorization and applications in audio signal processing C edric F

Fast Newton-type Methods for Nonnegative Matrix and Tensor Approximation Inderjit S. Dhillon

FASTER TRANSFORMER Bo Yang Hsueh, 2019/12/18 AGENDA What is Faster Transformer Introduce the

A Medium-Grained Algorithm for Distributed Sparse Tensor Factorization Shaden Smith George

Sparse Tensor Factorization: Algorithms, Data Structures, and Challenges Shaden Smith &amp;

Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals

Some Applications of Nonnegative Tensor Factorizations (NTF) to Mining Hype rspectral &amp;

(Some) Challenges in (Some) Challenges in Tensor Mining Tensor Mining Evrim Acar Sandia

Tensor Field Techniques Lecture 11 March 5, 2020 Outline Basics of tensor algebra Tensor

TENSOR ALGEBRA Continuum Mechanics Course (MMC) - ETSECCPB - UPC Introduction to Tensors Tensor

Tensor-Matrix Products with a Compressed Sparse Tensor Shaden Smith George Karypis University

Tensor Field Visualization 9-1 Ronald Peikert SciVis 2007 - Tensor Fields Tensors

Locked-on: verifying controls for aircraft tracking Chun Kai Ling, Daniel Wong 15-824 Logical

esm-tools A tool for Earth-System-Modellers Nadine Wieters, Dirk Barbi, Luisa Cristini, et al.

F acebook @DWAFCEA T witter afcea @DWAFCEA L inked I n D ayton W r ight C hapter of AFCEA I nter

Interior-point methods 10-725 Optimization Geoff Gordon Ryan Tibshirani Review SVM duality

Campaign externalities, programmatic spending and voting preferences in rural Mexico The case of

Focus Session on Stellar Populations and Gravitational Wave Observations Center for

Computational Complexity of Bayesian Networks Johan Kwisthout and Cassio P. de Campos Radboud

ARMAX: One Step Prediction IIT Bombay Consider 2' nd order ARMAX model with d 1 = y (

Sparse Tensor Factorization: Algorithms, Data Structures, and Challenges Shaden Smith &

Some Applications of Nonnegative Tensor Factorizations (NTF) to Mining Hype rspectral &