Application of Spectral Clustering Algorithm Danielle Middlebrooks - PowerPoint PPT Presentation

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Application of Spectral Clustering Algorithm Danielle Middlebrooks dmiddle1@math.umd.edu Advisor: Kasso Okoudjou kasso@umd.edu Department of Mathematics University of Maryland- College Park Advance Scientific Computing II May 11, 2016 1/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Outline Project Overview 1 Results from MNIST Database 2 Adding New Datapoint 3 Results from Face Database 4 Project Schedule 5 References 6 2/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Background Information Spectral Clustering is technique that makes use of the spectrum of the similarity matrix derived from the data set in order to cluster the data set into different clusters. Implement an algorithm that groups same digits from the MNIST Handwritten digits database in the same cluster. In practice this algorithm and my code will work for any database that wants to group together similar objects. 3/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Motivation Motivated by the N cut problem. k W ( A i , ¯ min NCut( A 1 , ..., A k ) := min 1 A i ) � 2 vol ( A i ) i =1 where A is a subset of the vertices V the compliment ¯ A = V \ A W ( A i , A j ) = � i ∈ A i , j ∈ A j w ij vol ( A ) = � i ∈ A d i The idea is that the eigenvectors serve as indicator functions in order to easily cluster the database in a reduced dimension. 4/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Implementation Personal Laptop: Macbook Pro. Matlab R2016b 4GB Memory Desktop provided by Norbert Wiener Center Matlab R2015b 128GB Memory 5/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Normalized Laplacian Matrix −|| Xi − Xj || 2 Guassian Similarity Function: s ( X i , X j ) = e where σ 2 σ 2 is a parameter. � 1 , if s ( X i , X j ) > ǫ W - Adjacency matrix w ij = 0 , otherwise D - Degree matrix Unnormalized Laplacian Matrix: L = D − W Normalized Laplacian Matrix: L sym = D − 1 / 2 LD − 1 / 2 = I − D − 1 / 2 WD − 1 / 2 6/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Normalized Laplacian Matrix As validation we know the smallest eigenvalue of the Normalized Laplacian will be zero with eigenvector D 1 / 2 1 To choose the best parameters, we implement the entire algorithm a number of times, changing epsilon each time until we reach some tolerance for the total error σ = 2000 ǫ = 0 . 3575 7/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Modified B Matrix Normalized Laplacian Matrix: L sym = D − 1 / 2 LD − 1 / 2 = I − D − 1 / 2 WD − 1 / 2 = I − B Computing the first p eigenvalues of B using the power method give us the largest eigenvalues in magnitude. Let B mod = B + µ I where µ = max(sum(B,2)) 8/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Computing first p Eigenvectors Using the Power Method with Deflation on B mod we compute the first p eigenvalues. 9/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Computing first p Eigenvectors By changing convergence criterion and increasing max iterations we obtain λ 1 λ 2 λ 3 λ 4 r 6.90E-15 1.18E-14 2.44E-10 2.84E-09 r =norm( B λ v − B λ ∗ v ∗ ,2) ( λ, v ) came from power method ( λ ∗ , v ∗ ) came from eigs function 10/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Row Normalization Let T ∈ R nxk be the eigenvector matrix with norm 1. Set v i , j t i , j = p v 2 i , p ) 1 / 2 ( �     v 11 v 12 v 13 v 1 p t 11 t 12 t 13 t 1 p . . . . . . . . . . . . . . ... ... . . . . . . . .     . . . . . . . .         v i 1 v i 2 v i 3 . . . v ip ⇒ t i 1 t i 2 t i 3 . . . t ip     . . . . . . . .  ...   ...  . . . . . . . .  . . . .   . . . .      v n 1 v n 2 v n 3 v np t n 1 t n 2 t n 3 t np . . . . . . 11/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References K -means Clustering Let y i be the i th row of T Randomly select k cluster centroids, z j . Calculate the distance between each y i and z j . Assign the data point to the closest centroid. Recalculate centroids and distances from data points to new centroids. If no data point was reassigned then stop, else reassign data points and repeat. 12/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References K -means Clustering Assign the original point X i to cluster j if and only if row i of the matrix T was assigned to cluster j . 13/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Cluster Classification Next we classify each cluster as a particular digit. Digit 0 1 2 3 4 5 6 7 8 9 Cluster Class 6 5 2 3 7 9 8 4 1 10 Run time: 23mins 14/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Results Below is a table of error for each cluster on 2000 Error= Number of incorrect digits in cluster Total number of digits in cluster 1 2 3 4 5 6 7 8 9 10 78% 82% 48% 65% 39% 13% 69% 58% 65% 72% Overall Error= Total number of incorrect digits = 59% Total number of digits Overall Error on 1000 images=64% Overall Error on 10000 images=49% 15/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Results Cluster 6 Cluster 4 Cluster 3 16/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Addition of New Datapoint- Standard Method Proposition (Nystrom Method) Method for out-of-sample extension Goal: Use a similarity kernel function K ( x , y ) in order to embed the new data point x in the reduced dimension. Benjio, Y, et al. Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering 17/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Addition of New Datapoint- Another Method? We can determine which cluster a single new datapoint belongs to without re running the entire code. Create a similarity vector, denoted as X sim of 0’s and 1’s Normalize the similarity vector by multiplying it by D 1 / 2 Compute the projection of the similarity vector onto the eigenvectors of the Normalized Laplacian matrix and normalize. Denoted as C sim that lives in R p . Find the centroid that is closest to C sim 18/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Results Implementation on a random subset of 100 digits. Error Runtime Averaged over 100 digits 61% 12.6sec 19/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Yale Face Database Contains 165 grayscale images of 15 individuals. 11 images per subject, one per different facial expression or configuration. Each image is 32x32 pixels 20/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Results Using 10 subjects and 5 images per subject with σ = 2000 and ǫ = 0 . 465 Image 1 2 3 4 5 6 7 8 9 10 Cluster Class 5 6 8 4 2 7 9 10 3 1 Below is a table of error for each cluster classification Error= Number of incorrect faces in cluster Total number of faces in cluster 1 2 3 4 5 6 7 8 9 10 71% 33% 60% 83% 0% 66% 44% 40% 60% 66% Overall Error= Total number of incorrect faces = 54% Total number of faces 21/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Results Cluster 5 Cluster 4 Cluster 2 22/27

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Project Schedule End of October/ Early November: Construct Similarity Graph and Normalized Laplacian matrix. � End of November/ Early December: Compute first k eigenvectors validate this. � February: Normalize the rows of matrix of eigenvectors and perform dimension reduction. � March/April: Cluster the points using k-means and validate this step. � End of Spring semester: Implement entire algorithm, optimize and obtain final results. � 23/27

Application of Spectral Clustering Algorithm Danielle Middlebrooks - PowerPoint PPT Presentation

Project Overview Results from MNIST Database Adding New Datapoint Results from Face Database Project Schedule References Application of Spectral Clustering Algorithm Danielle Middlebrooks dmiddle1@math.umd.edu Advisor: Kasso Okoudjou

Spectral Clustering Spectral Clustering? Spectral methods Methods using eigenvectors of

Lecture 23: Spectral clustering Hierarchical clustering What is a good clustering?

Poster #190 1 Spectral Clustering of Signed Graphs Poster #190 Our Goal: Extend Spectral

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

LECTURE 7 Clustering The k-means algorithm Hierarchical Clustering The DBSCAN algorithm

Spectral Clustering Lecture 16 David Sontag New York

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

Avoiding artifacts in spectral white matter fiber clustering and embedding Demian Wassermann

Guarantees for Spectral Clustering with Fairness Constraints Matthus Kleindessner, Samira Samadi

K-MEANS++ OPTIMAL INITIALIZATION ALGORITHM An Improved K-means Clustering Method OVERVIEW

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

A Large Scale Study of the Small Sample Performance of Random Coe ffi cient Models of Demand

Dynamics and Statistics of Dynamics and Statistics of Quantum Turbulence in Quantum Quantum

Outline Outline 4 Definition of Turbulence 4 Definition of Turbulence 4 Features of Turbulence

CASCADE S NBF Qubec Confrence Qubec Confrence June 5, 2012 DISCLAIMER Certain

COMPUTERS TAKING A QUANTUM LEAP Quantum computers will harness the power of atoms and molecules

CDW Corporation Webcast Conference Call February 7, 2017 CDW .com | 8 0 0 .8 0 0 .4 2 3 9

Neutralize Data Breaches Using data-centric security on NonStop Prashanth Kamath U Sr. Product

Recognizing Handwritten Characters with Local Descriptors and Bags of Visual Words Presentation