Shai Ben-David
University of Waterloo, Canada
NIPS Workshop December 2005
Attempts to Axiomatize Clustering Shai Ben-David University of - - PowerPoint PPT Presentation
Attempts to Axiomatize Clustering Shai Ben-David University of Waterloo, Canada NIPS Workshop December 2005 Workshop Goals Assuming we agree that theory is needed, We wish to create a basis for a research community: Define/detect
NIPS Workshop December 2005
Mixtures of Gaussians [Dasgupta ‘99], [Vempala,, ’03], [Kannan et al ‘04], [Achlitopas, McSherry ‘05].
– Information Bottleneck approach [Tishby, Pereira, Bialek ‘99]
– K means – Correlation Clustering [Blum, Bansal Chawla] – Normalized Cuts [Meila and Shi]
– Bregman Divergences [Banerjee, Dhilon, Gosh, Merugu] – Rate-distortion [Slonim, Atwal, Tkacik, Bialek] – Description length [Cilibrasi-Vitanyi, Myllymaki]
– Mixture of Gaussians – SuperParaMagnetic Clustering [Blatt, Weiseman, Domany] – Density Traversal Clustering [Storkey and Griffith]
– Agglomerative techniques (e.g., single linkage) [Hartigan, Stuetzle] – Projections based clustering (random/spectral) [Ng, Jordan, Weiss] – Spectral-based representations – [Belkin, Niyogi] – Unsupervised SVM’s [Xu and Schuurmans]
+ such that
Scaling up Consistency
framework, so that different clustering approaches could be classified by the different subsets of axioms they satisfy.
framework, so that different clustering approaches could be classified by the different subsets of axioms they satisfy.
Rate Distortion
MDL
Spectral
Center Based
Linkage Full Consistency Local Consistency Richness Scale Invariance
“Axioms” “Properties”
1, …
k be the clusters of F(d
0 ≥
1, ..
k ≤
id(a,b
i
0d(a,b)
1, P
2 of {1,
1 refines P
2 if
1 is contained in some cluster of P
2.
i}
2 1 1 , 2 i C x k i i k i C y x
i i
∈ = = ∈
i the center of
i)
d={