Big Data Management & Analytics
EXERCISE 9 – SVD, CUR
11th of January, 2016
Sabrina Friedl LMU Munich
1
Big Data Management & Analytics EXERCISE 9 SVD, CUR 11th of - - PowerPoint PPT Presentation
Big Data Management & Analytics EXERCISE 9 SVD, CUR 11th of January, 2016 Sabrina Friedl LMU Munich 1 PCA REVISION 2 PCA Summary 3 Goals of PCA Detect hidden correla3ons Remove redundant and noisy features
EXERCISE 9 – SVD, CUR
11th of January, 2016
Sabrina Friedl LMU Munich
1
REVISION
2
3
4
between observed and hidden variables
d=2 d=3
When applying PCA to a dataset of unknown structure
5
6
REVISION AND EXERCISE
7
Any matrix X can be wriSen as (singular value decomposi3on)
(elements on diagonal)
Usage example: Image compression
8
hSps://de.wikipedia.org/wiki/Singul%C3%A4rwertzerlegung
9
n x n n x d d x d n x d
Remember the Eigenwertproblem: For
10
v = eigenvector λ = eigenvalue T = eigenvector matrix Λ diagonal eigenvalue matrix
Given Matrix M Eigenvalues: Eigenvectors:
11
Eigenpairs
Eigenvalue decomposi3on Now we already know:
12
13
Note: At this point we could write the SVD as follows: How to find u3?
u1, u2 and u3 must build an
One-dimensional approxima3on of matrix M
14
hSp://www.ams.org/samplings/feature-column/fcarc-svd Recommended further reading:
REVISION AND EXERCISE
15
Alterna3ve to SVD, which beSer respects the structure of the data
16
17
Find CUR-decomposi3on of the given matrix with two rows and two columns! Sample size r = 2 Steps
18
19
= 3 * 51 + 2*45
20
21
22
Row 5 * Row 6 *
a) Create r x r matrix W as intersec3on of C and R b) Apply SVD on c) Compute as the pseudoinverse of d) Compute
23
24
a) Create matrix W: b) Apply SVD on W: c) Pseudo-Inverse of : d) Calculate
25