tensor decompositions for ensor decompositions for big
play

Tensor Decompositions for ensor Decompositions for Big Multi-aspect - PowerPoint PPT Presentation

Tensor Decompositions for ensor Decompositions for Big Multi-aspect Data Big Multi-aspect Data Analytics Analytics Evangelos Evangelos (V (Vagelis) agelis) Papalexakis Papalexakis UC Riverside Second W Second Workshop of Mission-Critical


  1. Tensor Decompositions for ensor Decompositions for Big Multi-aspect Data Big Multi-aspect Data Analytics Analytics Evangelos Evangelos (V (Vagelis) agelis) Papalexakis Papalexakis UC Riverside Second W Second Workshop of Mission-Critical Big Data Analytics (MCBDA 2017) orkshop of Mission-Critical Big Data Analytics (MCBDA 2017)

  2. Roadmap Roadmap 2 E. Papalexakis @ MCBDA 17

  3. Multi-Aspect Data?? 3 E. Papalexakis @ MCBDA 17

  4. Multi-V Multi-View Social iew Social Networks Networks 4 E. Papalexakis @ MCBDA 17

  5. Social Network Matrix Social Network Matrix 5 E. Papalexakis @ MCBDA 17

  6. Outer Pr Outer Product oduct 1x3 b T 3 3 4 4 5 5 By definition: 3 3 4 4 5 5 1 1 = = Rank-one matrix Rank-one matrix 6 8 8 10 6 10 2 2 10 a a 2x1 2x3 Matrix Rank Matrix Rank = Min # of rank-one matrices that add up to that matrix 6 E. Papalexakis @ MCBDA 17

  7. Matrix Decomposition Matrix Decomposition Decomposition into rank-1 = matrices/components 7 E. Papalexakis @ MCBDA 17

  8. Singular Value Decomposition Singular V alue Decomposition v 1 v k T T T σ 1 σ k 1 k = X X + … + u 1 u k 1 • If k = rank(X) then we have equality • If k < rank(X) we have the best rank k approximation that minimizes the squared error (Eckart Young Theorem) 8 E. Papalexakis @ MCBDA 17

  9. What if we have mor What if we have more than 1 view e than 1 view of the network? of the network? If we aggregate, we ignore important structure!! 9 E. Papalexakis @ MCBDA 17

  10. Tensors ensors • Multi-dimensional matrices • Long list of applications: Chemometrics, Psychometrics, Signal Processing, Data Mining X X 10 E. Papalexakis @ MCBDA 17

  11. What ar What are we looking for? e we looking for? Blocks Blocks within the data Subsets / co-clusters of: 1) Users (“senders”) 2) Users (“receivers”) X X 3) Means of communication 11 E. Papalexakis @ MCBDA 17

  12. Thr Three-way Outer Pr ee-way Outer Product oduct c 2x3 2x1 3 3 1x3 9 12 12 15 9 15 2 2 b 18 21 21 30 18 30 3 4 4 5 3 5 6 8 6 8 10 10 1 1 = = 2x3x2 rank-1 12 16 12 16 20 20 2 2 10 a a tensor 2x1 2x3 Blocks ar Blocks are rank-1 tensors e rank-1 tensors 12 E. Papalexakis @ MCBDA 17

  13. CP/P CP/PARAF ARAFAC Decomposition AC Decomposition c 1 c K b 1 b K ≈ X X + … + a 1 a K a k � b k � c k k 2 X A , B , C k X � min F k outer product 13 E. Papalexakis @ MCBDA 17

  14. G RAPH RAPH F USE USE PARAFAC with c 1 c K Sparse Latent Factors Step 1 Step 1 [Papalexakis et al. [Papalexakis et al. b 1 b K ≈ X + … IEEE ICASSP 2011, IEEE IEEE ICASSP 2011, IEEE + TSP 2013 ] TSP 2013 ] a 1 a K Assign users to communities Step 2 Step 2 (max component membership) For users with no assignment Step 3 Step 3 create K+1 th (umbrella) community Output: 1. Assignment of users to communities 2. Influence of a view to each community [Papalexakis et al. Fusion 2013 Fusion 2013] 14 E. Papalexakis @ MCBDA 17

  15. DBLP Multi-V DBLP Multi-View Graph iew Graph (a) citation (b) co-auth. (c) co-term • Assignment of authors to research communities • Measure NMI (Normalized Mutual Information) • Baselines ² Spectral clustering on sum of matrices / views ² Linked Matrix Factorization [Tang et al. ICDM 2009] • G RAPH F USE outperforms “2D” baselines [Papalexakis et al. Fusion 2013 Fusion 2013] 15 E. Papalexakis @ MCBDA 17

  16. Time-Evolving Graphs as T ime-Evolving Graphs as Tensors ensors …! …! …! t K ! t 1 ! t 2 ! t 3 ! Detect anomalies / real-life events 16 E. Papalexakis @ MCBDA 17

  17. Time-Evolving Graphs as T ime-Evolving Graphs as Tensors ensors • Tensor Decomposition will give us ² Communities in the Graph & ² Their evolution over time Change Detection 1 ICDE, SIGMOD, VLDB Point of CIKM,ECIR,ICDE,ICDM,IJCAI,JCDL,KDD,SIGIR, WWW 0.8 change 0.6 0.4 0.2 0 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004 2006 Year Fig. 3. In this Figure we demonstrate how T ENSOR S PLAT is able to perform change detection. In particular, we observe two components in which a well- known professor appears as an author; the first component mainly contains Databases conferences, whereas the second contains Data Mining conferences. The dashed red line indicates the point of change in research direction. Koutra, Papalexakis, Faloutsos, “TENSORSPLAT: Spotting Latent Anomalies in Time” 17 E. Papalexakis @ MCBDA 17

  18. Neur Neurosemantics osemantics … � � � � �� � � � � � � � � � � � � �� �� �� “Does it fly?” (y/n) “Does it fly?” (y/n) “Does it bite?” (y/n) “Does it bite?” (y/n) fMRI MEG Real and predicted MEG brain activity Real and predicted MEG brain activity 0.4 0.3 0.3 ed: 0.25 0.25 0.3 0.2 0.2 0.2 0.15 0.15 0.1 0.1 0.1 0.05 0 0.05 0 20 40 0 20 40 0 20 40 18 E. Papalexakis @ MCBDA 17 � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � �

  19. Neur Neurosemantics osemantics • Semantically coherent brain regions? • Similarities/differences between subjects? • How is language processed in the brain? … � � � � �� � � � � � � � � � � � � �� �� �� “Does it fly?” (y/n) “Does it fly?” (y/n) “Does it bite?” (y/n) “Does it bite?” (y/n) Real and predicted MEG brain activity Real and predicted MEG brain activity 0.4 0.3 0.3 ed: 0.25 0.25 0.3 0.2 0.2 0.2 0.15 0.15 0.1 0.1 0.1 0.05 0 0.05 0 20 40 0 20 40 0 20 40 19 E. Papalexakis @ MCBDA 17 � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � �

  20. Airplane Airplane Puppy Puppy Dog Dog Combining measurements for Combining measur ed: ed: multiple subjects multiple subjects E. Papalexakis @ MCBDA 17 ed: ed: ements for ed: ed: ed: ed: 20

  21. Modeling Brain Data as Tensor Modeling Brain Data as T Airplane Airplane Puppy Puppy Dog Dog E. Papalexakis @ MCBDA 17 fMRI voxels fMRI voxels ed: ed: ed: ed: ed: ed: ed: ed: ensor 21

  22. Airplane Airplane Puppy Puppy Dog Dog CP/P CP/PARAF ed: ed: a r , b r , c r k X � ed: ed: ARAFAC Decomposition min ed: ed: X X ed: ed: AC Decomposition E. Papalexakis @ MCBDA 17 = = r =1 X R a 1 1 1 1 1 a r � b r � c r k 2 c 1 b 1 ed: F + + a 2 1 1 c 2 b 2 22

  23. � � � � �� Semantic Information Semantic Infor mation ✔ Is it alive? ✔ Does it bite? Does it fly? ✔ Can you buy it? Is it smaller than a golf ball? … • Human readable description of the noun • Useful information to guide the analysis • Can have different semantic features (corpus statistics, knowledge base features) 23 E. Papalexakis @ MCBDA 17 � � � � � � � � � � � �

  24. Y Tensor W ✔ ensor With Side Infor ✔ ✔ [Papalexakis Papalexakis et al. SDM 2014 ✔ ✔ ith Side Information E. Papalexakis @ MCBDA 17 ✔ ✔ ✔ et al. SDM 2014] Airplane Airplane Puppy Puppy Dog Dog ed: ed: mation ed: ed: ed: ed: X X ed: ed: 24

  25. Y a 1 1 1 1 1 ✔ Proposed Modeling: Coupled Pr ✔ ✔ = = ✔ d 1 Matrix-T Matrix-Tensor Factorization oposed Modeling: Coupled ✔ ✔ ✔ + + ✔ a 2 ✔ ✔ ✔ 1 1 Airplane Airplane Puppy Puppy ✔ Dog Dog d 2 ensor Factorization E. Papalexakis @ MCBDA 17 ✔ ed: ed: ed: ed: ed: ed: X X ed: ed: = = 1 1 1 1 a 1 c 1 ed: b 1 + + a 2 1 1 c 2 b 2 25

  26. Y a 1 1 1 1 1 ✔ Pr Proposed Modeling: Coupled ✔ ✔ = = ✔ d 1 Matrix-Tensor Factorization Matrix-T oposed Modeling: Coupled ✔ ✔ ✔ + + ✔ a 2 ✔ ✔ ✔ a r , b r , c r , d r k X � 1 1 Airplane Airplane min Puppy Puppy ✔ Dog Dog d 2 ensor Factorization E. Papalexakis @ MCBDA 17 ✔ ed: ed: ed: ed: Tensor part r =1 X ed: ed: R X X ed: ed: a r � b r � c r k 2 = = 1 1 1 1 a 1 c 1 F + k Y � ed: b 1 + + Matrix part r =1 X a 2 1 1 R c 2 a r d T b 2 r k 2 26 F

  27. Pr Pre-motor Cortex e-motor Cortex Nouns Q beetle c glass tomato bell ✔ Unsupervised Nou Questions Nou ✔ Agrees with Neuroscience can you pick it up? can you hold it in one hand? is it smaller than a golfball?’ 6 0.05 0.05 50 50 0.04 100 0.04 100 0.03 150 0.03 150 200 0.02 200 0.02 250 Pre$motor(cortex( 0.01 250 0.01 Premotor Cortex 300 0 300 50 100 150 200 250 0 50 100 150 200 250 27 E. Papalexakis @ MCBDA 17

  28. Multi-Aspect Data Everywher Multi-Aspect Data Everywhere! e! Social Networks Social Networks & Urban Comp. & Urban Comp. � � � � �� � � � � � � � � �� �� � � � � �� Neur Neurosemantics osemantics … W Web eb is president Knowledge Knowledge … � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � 28 � � E. Papalexakis @ MCBDA 17 � � � � � �

  29. Roadmap Roadmap 29 E. Papalexakis @ MCBDA 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend