big data management analytics
play

Big Data Management & Analytics EXERCISE 9 SVD, CUR 11th of - PowerPoint PPT Presentation

Big Data Management & Analytics EXERCISE 9 SVD, CUR 11th of January, 2016 Sabrina Friedl LMU Munich 1 PCA REVISION 2 PCA Summary 3 Goals of PCA Detect hidden correla3ons Remove redundant and noisy features


  1. Big Data Management & Analytics EXERCISE 9 – SVD, CUR 11th of January, 2016 Sabrina Friedl LMU Munich 1

  2. PCA REVISION 2

  3. PCA – Summary 3

  4. Goals of PCA ◦ Detect hidden correla3ons ◦ Remove redundant and noisy features ◦ Interpreta3on and visualiza3on ◦ Easier storage and processing of dat -> Most helpful when there is a linear rela3onship between observed and hidden variables d=2 d=3 4

  5. Problems with PCA When applying PCA to a dataset of unknown structure 1. Unnormalized data can skew the result -> before PCA, norm the data! 2. Relevant structures might get lost 5

  6. Problems with PCA 3. Outliers can skew the PCA result 6

  7. Single Value Decomposition (SVD) REVISION AND EXERCISE 7

  8. SVD Any matrix X can be wriSen as (singular value decomposi3on) ◦ X Data matrix (n x d) ◦ V Right singular vectors: eigenvectors of ◦ U LeX-singular vectors of X: eigenvectors of ◦ Σ Singular Values: square roots of eigenvalues (elements on diagonal) hSps://de.wikipedia.org/wiki/Singul%C3%A4rwertzerlegung Usage example: Image compression 8

  9. SVD n x d n x n n x d d x d 9

  10. SVD- How to find matrices? Remember the Eigenwertproblem: v = eigenvector or λ = eigenvalue T = eigenvector matrix Λ diagonal eigenvalue matrix For ◦ Find V: ◦ Find U: or use: 10

  11. SVD - Example Given Matrix M Eigenvalues: Eigenpairs Eigenvectors : 11

  12. SVD - Example Eigenvalue decomposi3on Now we already know: 12

  13. SVD - Example Note: At this point we could write the SVD as follows: u 1 , u 2 and u 3 must build an How to find u 3 ? orthonormal basis! 13

  14. SVD - Example One-dimensional approxima3on of matrix M Recommended further reading: hSp://www.ams.org/samplings/feature-column/fcarc-svd 14

  15. CUR REVISION AND EXERCISE 15

  16. CUR Alterna3ve to SVD, which beSer respects the structure of the data 16

  17. Example Find CUR-decomposi3on of the given matrix with two rows and two columns! Sample size r = 2 Steps 1. Create sample matrices C and R 2. Construct U from C and R 17

  18. 1a. Create sample matrix C 18

  19. 1a. Create sample matrix C = 3 * 51 + 2*45 19

  20. 1a. Create sample matrix C 20

  21. 1b. Create sample matrix R 21

  22. 1b. Create sample matrix C Row 5 * Row 6 * 22

  23. 2. Construct U from C and R a) Create r x r matrix W as intersec3on of C and R b) Apply SVD on c) Compute as the pseudoinverse of d) Compute 23

  24. 2. Construct U from C and R a) Create matrix W: b) Apply SVD on W: c) Pseudo-Inverse of : d) Calculate 24

  25. Result of CUR decomposition 25

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend