Neural Nonnegative Matrix Factorization for Hierarchical Multilayer Topic Modeling

Jamie Haddock
CAMSAP 2019, December 16, 2019
Computational and Applied Mathematics, UCLA

joint with Mengdi Gao, Denali Molitor, Deanna Needell, Eli Sadovnik, Tyler Will, Runyu Zhang

Nonnegative Matrix Factorization (NMF)

[Diagram: $X \approx A \cdot S$, with $X$ of size $N \times M$, $A$ of size $N \times k$, $S$ of size $k \times M$; in topic modeling, $N$: words, $M$: documents, $k$: topics.]

$$\min_{A \in \mathbb{R}^{N \times k}_{\geq 0},\; S \in \mathbb{R}^{k \times M}_{\geq 0}} \|X - AS\|_F^2$$

Problem Setup:
⊲ $X \in \mathbb{R}^{N \times M}_{\geq 0}$: data matrix
⊲ $A \in \mathbb{R}^{N \times k}_{\geq 0}$: features matrix
⊲ $S \in \mathbb{R}^{k \times M}_{\geq 0}$: coefficients matrix
⊲ $k$: user-chosen parameter

Problem Challenges:
⊲ nonconvex in $A$ and $S$, NP-hard [Vavasis '08]
⊲ interpretability of the factors depends on $k$
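For concreteness, a minimal sketch of computing such a factorization with scikit-learn's off-the-shelf NMF solver (an illustration only, not the method of this talk; the data here is random):

    import numpy as np
    from sklearn.decomposition import NMF

    X = np.abs(np.random.rand(100, 50))       # nonnegative data matrix, N=100, M=50
    model = NMF(n_components=10, init='nndsvda', max_iter=500)   # k = 10 topics
    A = model.fit_transform(X)                # features matrix, N x k
    S = model.components_                     # coefficients matrix, k x M
    err = np.linalg.norm(X - A @ S, 'fro')    # Frobenius reconstruction error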

NMF

Applications:
⊲ low-rank approximation
⊲ clustering
⊲ topic modeling
⊲ feature extraction

Methods:
⊲ multiplicative updates (sketched below)
⊲ alternating nonnegative least squares
⊲ many others
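As a sketch of the multiplicative updates just listed, here are the classical Lee-Seung rules [3] in NumPy; the function name, iteration count, and eps safeguard are our choices:

    import numpy as np

    def nmf_multiplicative(X, k, n_iter=200, eps=1e-10, seed=0):
        """Lee-Seung multiplicative updates for min ||X - AS||_F^2 s.t. A, S >= 0."""
        rng = np.random.default_rng(seed)
        N, M = X.shape
        A, S = rng.random((N, k)), rng.random((k, M))
        for _ in range(n_iter):
            # Each factor is multiplied by a nonnegative ratio, so nonnegativity
            # is preserved automatically and the objective is nonincreasing.
            S *= (A.T @ X) / (A.T @ A @ S + eps)
            A *= (X @ S.T) / (A @ S @ S.T + eps)
        return A, S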

(Semi)supervised NMF

Goal: Incorporate known label information into the problem.

[Diagram: label matrix $Y$ of size $P \times M$; $P$: classes, $M$: documents.]

$$\min_{A \in \mathbb{R}^{N \times k}_{\geq 0},\; S \in \mathbb{R}^{k \times M}_{\geq 0},\; B \in \mathbb{R}^{P \times k}_{\geq 0}} \|W \odot (X - AS)\|_F^2 + \lambda \|L \odot (Y - BS)\|_F^2$$

Problem Setup:
⊲ $Y \in \{0, 1\}^{P \times M}$: label matrix
⊲ $P$: number of classes
⊲ $W \in \{0, 1\}^{N \times M}$: data indicator
⊲ $L \in \{0, 1\}^{P \times M}$: label indicator
⊲ $\lambda$: user-defined hyperparameter

Problem Advantages:
⊲ use of label information
⊲ the multiplicative updates method extends to SSNMF [4] (a masked variant is sketched below)
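A hedged sketch of such masked multiplicative updates: splitting each gradient of the objective above into its positive and negative parts yields Lee-Seung-style ratios. These may differ in detail from the update rules of [4]; the function name and eps safeguard are ours.

    import numpy as np

    def ssnmf_mu_step(X, Y, W, L, A, S, B, lam, eps=1e-10):
        """One multiplicative-updates pass for
        min ||W * (X - AS)||_F^2 + lam ||L * (Y - BS)||_F^2   (* = entrywise)."""
        A *= ((W * X) @ S.T) / ((W * (A @ S)) @ S.T + eps)
        B *= ((L * Y) @ S.T) / ((L * (B @ S)) @ S.T + eps)
        S *= (A.T @ (W * X) + lam * (B.T @ (L * Y))) / (
              A.T @ (W * (A @ S)) + lam * (B.T @ (L * (B @ S))) + eps)
        return A, S, B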

Hierarchical NMF

Goal: Discover hierarchical topic structure within $X$.

[Diagram: $X \approx A^{(0)} S^{(0)} \approx A^{(0)} A^{(1)} S^{(1)}$, with $X$ of size $N \times M$, $A^{(0)}$ of size $N \times k^{(0)}$, $S^{(0)}$ of size $k^{(0)} \times M$, $A^{(1)}$ of size $k^{(0)} \times k^{(1)}$, $S^{(1)}$ of size $k^{(1)} \times M$.]

$$X \approx A^{(0)} S^{(0)}, \qquad X \approx A^{(0)} A^{(1)} S^{(1)}, \qquad \ldots, \qquad X \approx A^{(0)} A^{(1)} \cdots A^{(L)} S^{(L)}$$

Problem Setup:
⊲ $k^{(0)}, k^{(1)}, \ldots, k^{(L)}$: user-defined parameters
⊲ $k^{(\ell)}$: supertopics collecting the $k^{(\ell-1)}$ subtopics

Problem Challenges:
⊲ the $\{k^{(i)}\}$ must be chosen
⊲ error propagates through the layers (see the sketch below)
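To make the sequential construction and its error propagation concrete, a small sketch reusing nmf_multiplicative from above; each layer factors only the previous layer's coefficient matrix, never $X$ itself, so approximation error compounds:

    def hierarchical_nmf(X, ranks, n_iter=200):
        """Sequential hNMF: X ~ A0 S0, then S0 ~ A1 S1, and so on.
        ranks = [k0, k1, ..., kL]; returns [A0, ..., AL] and the final S."""
        As, S = [], X
        for k in ranks:
            A, S = nmf_multiplicative(S, k, n_iter=n_iter)
            As.append(A)   # later layers never revisit X, so errors accumulate
        return As, S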


Deep NMF

Goal: Exploit similarities between neural networks and hierarchical NMF.

⊲ [Flenner, Hunter '18]
  • introduces a nonlinear pooling operator after each layer
  • introduces a multiplicative updates method meant to backpropagate
⊲ [Trigeorgis, Bousmalis, Zafeiriou, Schuller '16]
  • relaxes some of the nonnegativity constraints in hNMF
⊲ [Le Roux, Hershey, Weninger '15]
  • introduces an NMF backpropagation algorithm with "unfolding" (no hierarchy)
⊲ [Sun, Nasrabadi, Tran '17]
  • a similar method lacking nonnegativity constraints

Our method: Neural NMF

Goal: Develop a true backpropagation algorithm for the hNMF model.

⊲ Regard the $A$ matrices as independent variables; determine the $S$ matrices from the $A$ matrices.
⊲ Define $q(X, A) := \operatorname{argmin}_{S \geq 0} \|X - AS\|_F^2$ (a solver sketch follows below).

[Diagram: $X \to S^{(0)} = q(X, A^{(0)}) \to S^{(1)} = q(S^{(0)}, A^{(1)}) \to \cdots \to S^{(\mathcal{L})} \to$ classification layers $\to$ outputs.]

⊲ Pin the values of $S$ to those of $A$ by recursively setting $S^{(\ell)} := q(S^{(\ell-1)}, A^{(\ell)})$.
⊲ Can we compute derivatives and backpropagate?
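Since $\|X - AS\|_F^2$ separates over the columns of $S$, $q$ can be evaluated column by column with any nonnegative least squares solver; a minimal sketch using scipy.optimize.nnls (our choice of solver, not necessarily the talk's):

    import numpy as np
    from scipy.optimize import nnls

    def q(X, A):
        """q(X, A) = argmin_{S >= 0} ||X - AS||_F^2, one NNLS problem per column."""
        return np.column_stack([nnls(A, X[:, j])[0] for j in range(X.shape[1])])

Forward propagation is then $S^{(0)} = q(X, A^{(0)})$, $S^{(1)} = q(S^{(0)}, A^{(1)})$, and so on.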

Neural NMF Backpropagation

⊲ Differentiate the $q$ function and apply the chain rule.
⊲ Flexible to the choice of cost function (e.g., supervision).
⊲ Backpropagate and update all $A$ matrices simultaneously via GD or SGD.
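One standard route to the derivative of $q$ (our reading of how such argmin maps are typically differentiated; the talk's exact derivation may differ): for each column $x$ of the input, let $\Lambda = \{i : s_i > 0\}$ be the support of the NNLS solution $s$. On that support the nonnegativity constraint is inactive, so the KKT stationarity condition reduces to restricted normal equations with the closed form

$$s_\Lambda = (A_\Lambda^\top A_\Lambda)^{-1} A_\Lambda^\top x,$$

where $A_\Lambda$ keeps the columns of $A$ indexed by $\Lambda$; differentiating this expression with respect to $A$ and $x$ (with the support held fixed) supplies the Jacobians that the chain rule needs.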

Neural NMF

Method 1: Neural NMF
Require: data matrix $X \in \mathbb{R}^{N \times M}$, number of layers $L$, step size $\gamma$, cost function $C$, initial matrices $A^{(i)}$ for $i = 0, \ldots, L$

procedure ForwardPropagation($A^{(0)}, \ldots, A^{(L)}$)
    for $i := 0 \ldots L$ do
        $S^{(i)} \leftarrow q(S^{(i-1)}, A^{(i)})$ ⊲ with $S^{(-1)} := X$

ForwardPropagation($A^{(0)}, \ldots, A^{(L)}$)
while not converged do
    for $i := 0 \ldots L$ do
        $A^{(i)} \leftarrow A^{(i)} - \gamma \, \partial C / \partial A^{(i)}$ ⊲ gradient descent
        $A^{(i)} \leftarrow [A^{(i)}]_+$ ⊲ project onto the nonnegative orthant
    ForwardPropagation($A^{(0)}, \ldots, A^{(L)}$)
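An end-to-end sketch of Method 1 in PyTorch, with one substitution: rather than differentiating the exact NNLS map $q$ as the talk proposes, $q$ is approximated here by a few unrolled projected-gradient steps so that autograd can backpropagate through it. All names (q_unrolled, neural_nmf, ranks) are ours.

    import torch

    def q_unrolled(X, A, n_steps=30):
        """Differentiable surrogate for q(X, A): unrolled projected gradient
        descent on (1/2)||X - AS||_F^2 with a safe 1/L step size."""
        S = torch.clamp(A.T @ X, min=0.0)          # cheap nonnegative initialization
        step = 1.0 / (torch.linalg.matrix_norm(A, ord=2) ** 2 + 1e-8)
        for _ in range(n_steps):
            S = torch.clamp(S - step * (A.T @ (A @ S - X)), min=0.0)
        return S

    def neural_nmf(X, ranks, lr=1e-2, n_epochs=500):
        """Unsupervised Neural NMF sketch: the A matrices are the free variables;
        the S matrices are pinned by forward propagation."""
        dims = [X.shape[0]] + list(ranks)
        As = [torch.rand(dims[i], dims[i + 1], requires_grad=True)
              for i in range(len(ranks))]
        opt = torch.optim.SGD(As, lr=lr)
        for _ in range(n_epochs):
            S = X
            for A in As:                           # forward propagation
                S = q_unrolled(S, A)
            recon = S
            for A in reversed(As):                 # X ~ A(0) A(1) ... A(L) S(L)
                recon = A @ recon
            loss = torch.norm(X - recon) ** 2      # cost C; add label terms if desired
            opt.zero_grad(); loss.backward(); opt.step()
            with torch.no_grad():
                for A in As:
                    A.clamp_(min=0.0)              # project onto the nonnegative orthant
        return As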

Experimental Results

[Figures: reconstructions across layers.]
⊲ unsupervised reconstruction with two-layer structure ($k^{(0)} = 9$, $k^{(1)} = 4$)
⊲ semisupervised reconstruction (40% labels) with three-layer structure ($k^{(0)} = 9$, $k^{(1)} = 4$, $k^{(2)} = 2$)

Note that although the reconstruction error increases as the layers increase (since the final rank decreases), the topic structure can still be resolved from the intermediate factorizations.

Experimental Results

Table 1: Reconstruction error / classification accuracy

                 Layers   Hier. NMF       Deep NMF        Neural NMF
    Unsuper.       1      0.053           0.031           0.029
                   2      0.399           0.414           0.310
                   3      0.860           0.838           0.492
    Semisuper.     1      0.049 / 0.933   0.031 / 0.947   0.042 / 1
                   2      0.374 / 0.926   0.394 / 0.911   0.305 / 1
                   3      0.676 / 0.930   0.733 / 0.930   0.496 / 0.990
    Supervised     1      0.052 / 0.960   0.042 / 0.962   0.042 / 1
                   2      0.311 / 0.984   0.310 / 0.984   0.307 / 1
                   3      0.495 / 1       0.494 / 1       0.498 / 1

Conclusions and Future Work

⊲ presented a novel method for multilayer NMF that incorporates the backpropagation technique from deep learning to minimize error accumulation
⊲ exhibited preliminary tests on toy datasets showing that the proposed method outperforms existing multilayer NMF algorithms
⊲ future work: compare our method and others on various datasets to find the precise regimes in which we offer improvement
⊲ future work: extend the method to hierarchical nonnegative tensor factorization


Thanks for listening!

Questions?

[1] Jennifer Flenner and Blake Hunter. A deep non-negative matrix factorization neural network. 2018. Unpublished.
[2] Jonathan Le Roux, John R. Hershey, and Felix Weninger. Deep NMF for speech separation. In Int. Conf. Acoust. Spee., pages 66-70. IEEE, 2015.
[3] Daniel D. Lee and H. Sebastian Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401:788-791, 1999.
[4] H. Lee, J. Yoo, and S. Choi. Semi-supervised nonnegative matrix factorization. IEEE Signal Proc. Let., 17(1):4-7, Jan 2010.
[5] Xiaoxia Sun, Nasser M. Nasrabadi, and Trac D. Tran. Supervised multilayer sparse coding networks for image classification. CoRR, abs/1701.08349, 2017.
[6] George Trigeorgis, Konstantinos Bousmalis, Stefanos Zafeiriou, and Björn W. Schuller. A deep matrix factorization method for learning attribute representations. IEEE T. Pattern Anal., 39(3):417-429, 2016.