diffusion maps and coarse graining
play

Diffusion Maps and Coarse-Graining: A unified framework for - PowerPoint PPT Presentation

Diffusion Maps and Coarse-Graining: A unified framework for dimensionality reduction, graph partitioning and data set parameterization Authors: Stephane Lafon and Ann B. Lee (PAMI, Sept. 2006) Presented by Shihao Ji Duke University Machine


  1. Diffusion Maps and Coarse-Graining: A unified framework for dimensionality reduction, graph partitioning and data set parameterization Authors: Stephane Lafon and Ann B. Lee (PAMI, Sept. 2006) Presented by Shihao Ji Duke University Machine Learning Group Sept. 28, 2006

  2. Outline • Diffusion distances and Maps • Graph partitioning and subsampling • Numerical examples

  3. Diffusion distances • Let be a finite graph with n nodes, and weight matrix W satisfies the following conditions: – symmetry: – positivity: i.e. Gaussian kernel • Markov random walk • t step random walk

  4. Diffusion distances (cont’d) • Definition where is the unique stationary distribution of P.

  5. Diffusion Maps • The transition matrix P is adjoint to a symmetric matrix thus, P and P s share the same eigenvalues. • Since P s is a symmetric matrix eigenvalues: eigenvectors: form the orthonormal basis.

  6. Diffusion Maps (cont’d) • The left and right eigenvectors of P : • Biorthogonal spectral decomposition • Diffusion distances

  7. Diffusion Maps (cont’d) • Diffusion Maps • Diffusion distances

  8. Graph partitioning • Consider an arbitrary partition

  9. Graph partitioning (cont’d) • Definition ( geometric centroid ): • Theorem : for , we have where and

  10. Graph partitioning (Cont’d) • This theorem tells us 1. If then and are approximate left and right eigenvectors of with approximate eigenvalue . 2. In order to maximize the quality of approximation, we need to minimize the following distortion in diffusion space: • This provides a rigorous justification for k-means clustering in diffusion space.

  11. Numerical examples • Diffusion distance vs. Euclidean distance The Swiss roll, and its quantization by k-means ( k =4)

  12. Numerical examples (cont’d) • Robustness of the diffusion distance Diffusion distance Dijkstra’s algorithm

  13. Numerical examples (cont’d) Averaged on 1000 instances

  14. Messages • Diffusion maps provide a unified framework for dimensionality reduction, graph partitioning and data set parameterization. • Coarse-graining gives a rigorous justification of k- means clustering in diffusion space. • Diffusion distance is robust to noise and small perturbations of the data.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend