SLIDE 1

Hierarchical clustering

David M. Blei

COS424, Princeton University

February 28, 2008

D. Blei, Clustering 02, 1 / 21

SLIDE 4

Hierarchical clustering

  • Hierarchical clustering is a widely used data analysis tool.
  • The idea is to build a binary tree of the data that successively merges similar groups of points.
  • Visualizing this tree provides a useful summary of the data.

SLIDE 9

Hierarchical clustering vs. k-means

  • Recall that k-means or k-medoids requires
  • A number of clusters k
  • An initial assignment of data to clusters
  • A distance measure between data d(x_n, x_m)
  • Hierarchical clustering only requires a measure of similarity between groups of data points.

SLIDE 14

Agglomerative clustering

  • We will talk about agglomerative clustering.
  • Algorithm:

1 Place each data point into its own singleton group
2 Repeat: iteratively merge the two closest groups
3 Until: all the data are merged into a single cluster
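The three steps above can be sketched as a short, from-scratch implementation. This is a minimal illustration, not code from the lecture; Euclidean distance and single linkage as the group-distance are assumptions, and any group-similarity function can be plugged in.

```python
import math

def single_linkage(G, H):
    """Distance between groups: the closest pair across them."""
    return min(math.dist(x, y) for x in G for y in H)

def agglomerative(points, group_distance):
    """Return the merge sequence as (group, group) pairs of point tuples."""
    # 1. place each data point into its own singleton group
    groups = [[p] for p in points]
    merges = []
    # 3. until all the data are merged into a single cluster
    while len(groups) > 1:
        # 2. find and merge the two closest groups
        best = None
        for i in range(len(groups)):
            for j in range(i + 1, len(groups)):
                d = group_distance(groups[i], groups[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        merges.append((tuple(groups[i]), tuple(groups[j])))
        groups[i] += groups[j]   # merge group j into group i
        del groups[j]
    return merges

pts = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0)]
merges = agglomerative(pts, single_linkage)   # n - 1 = 3 merges for 4 points
```

For n points the loop performs exactly n − 1 merges, so the output records the whole binary tree; the naive nested search makes this O(n^3), which is why library implementations cache inter-group distances.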

SLIDE 39

Example

[Figure: scatterplot of the example data, axes V1 and V2 (roughly −20 to 80 on each); successive slides animate iterations 001–024 of the agglomerative merging.]

SLIDE 42

Agglomerative clustering

  • Each level of the resulting tree is a segmentation of the data.
  • The algorithm results in a sequence of groupings.
  • It is up to the user to choose a "natural" clustering from this sequence.
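Choosing one grouping from the sequence is usually done by cutting the tree at some level. A sketch with SciPy, assuming SciPy is installed; the two-blob data is made up for illustration:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# synthetic data: two well-separated blobs
rng = np.random.default_rng(0)
x = np.vstack([rng.normal(0.0, 0.5, size=(10, 2)),
               rng.normal(10.0, 0.5, size=(10, 2))])

# Z encodes the whole merge sequence, one row per merge
Z = linkage(x, method="complete")

# cut the tree where it yields two groups: this is the user's choice
# of a "natural" clustering from the sequence
labels = fcluster(Z, t=2, criterion="maxclust")
```

Every choice of cut height gives a different segmentation from the same tree, which is what "each level is a segmentation of the data" means in practice.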

SLIDE 47

Dendrogram

  • Agglomerative clustering is monotonic.
  • The similarity between merged clusters is monotone decreasing with the level of the merge.
  • Dendrogram: Plot each merge at the (negative) similarity between the two merged groups.
  • Provides an interpretable visualization of the algorithm and data.
  • Useful summarization tool, part of why hierarchical clustering is popular.
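The monotonicity claim can be checked numerically. A sketch with SciPy; the random data and the choice of complete linkage are assumptions for illustration:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import pdist

rng = np.random.default_rng(1)
x = rng.normal(size=(25, 2))

# one row of Z per merge; column 2 holds the distance at which it happened
Z = linkage(pdist(x), method="complete")
heights = Z[:, 2]

# merge distances never decrease as we move up the tree, which is what
# makes the dendrogram heights well defined
assert np.all(np.diff(heights) >= 0)
```

This monotone sequence of heights is exactly what the dendrogram plots on its vertical axis.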

SLIDE 48

Dendrogram of example data

[Figure: cluster dendrogram of the example data, produced by hclust(*, "complete") on dist(x); heights range roughly 20–120, leaves labeled by observation index.]

Groups that merge at high values relative to the merger values of their subgroups are candidates for natural clusters. (Tibshirani et al., 2001)

SLIDE 53

Group similarity

  • Given a distance measure between points, the user has many choices for how to define intergroup similarity.
  • Three most popular choices
  • Single linkage: the similarity of the closest pair

d_SL(G, H) = min_{i ∈ G, j ∈ H} d_{i,j}

  • Complete linkage: the similarity of the furthest pair

d_CL(G, H) = max_{i ∈ G, j ∈ H} d_{i,j}

  • Group average: the average similarity between groups

d_GA(G, H) = (1 / (N_G N_H)) Σ_{i ∈ G} Σ_{j ∈ H} d_{i,j}
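The three definitions translate directly into code. A minimal sketch, not from the lecture; Euclidean point distance is an assumption, and any pointwise d_{i,j} works:

```python
import math

def d(i, j):
    # pointwise distance d_ij; Euclidean is just one concrete choice
    return math.dist(i, j)

def d_SL(G, H):
    """Single linkage: distance of the closest pair."""
    return min(d(i, j) for i in G for j in H)

def d_CL(G, H):
    """Complete linkage: distance of the furthest pair."""
    return max(d(i, j) for i in G for j in H)

def d_GA(G, H):
    """Group average: mean over all N_G * N_H pairwise distances."""
    return sum(d(i, j) for i in G for j in H) / (len(G) * len(H))

G = [(0.0, 0.0), (0.0, 2.0)]
H = [(3.0, 0.0), (3.0, 2.0)]
```

On these two example groups, d_SL is 3.0, d_CL is √13 ≈ 3.61, and d_GA sits between them, which matches the intuition that group average is a compromise.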

SLIDE 56

Properties of intergroup similarity

  • Single linkage can produce "chaining," where a sequence of close observations in different groups causes early merges of those groups.
  • Complete linkage has the opposite problem. It might not merge close groups because of outlier members that are far apart.
  • Group average represents a natural compromise, but depends on the scale of the similarities. Applying a monotone transformation to the similarities can change the results.
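The chaining behavior is easy to reproduce. A sketch with SciPy; the two tight groups and the thin bridge of points between them are made-up data:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# two tight groups with a thin bridge of points between them
left = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.5]]
bridge = [[2.5, 0.5], [4.5, 0.5], [7.0, 0.5]]
right = [[9.0, 0.5], [10.0, 0.0], [10.0, 1.0]]
x = np.array(left + bridge + right)

# single linkage chains along the bridge: each bridge point is absorbed
# early, because only its nearest neighbor in a group matters
single = fcluster(linkage(x, method="single"), t=2, criterion="maxclust")

# complete linkage keys on the furthest pair instead, so the bridge
# cannot pull the groups together one step at a time
complete = fcluster(linkage(x, method="complete"), t=2, criterion="maxclust")
```

Cut to two flat clusters, the single-linkage result attaches the bridge points to the core groups via the chain, illustrating why a few in-between observations can force early merges.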

SLIDE 59

Caveats

  • Hierarchical clustering should be treated with caution.
  • Different decisions about group similarities can lead to vastly different dendrograms.
  • The algorithm imposes a hierarchical structure on the data, even data for which such structure is not appropriate.

SLIDE 62

Examples

  • "Repeated Observation of Breast Tumor Subtypes in Independent Gene Expression Data Sets" (Sorlie et al., 2003)
  • Hierarchical clustering of gene expression data led to new theories.
  • Later, the theories were tested in the lab.


SLIDE 67

Examples

  • "The Balance of Roger de Piles" (Studdert-Kennedy and Davenport, 1974)
  • Roger de Piles rated 57 paintings along different dimensions.
  • These authors cluster them using different methods, including hierarchical clustering.
  • They discuss the different clusters. (They are art critics.)

SLIDE 68

Good: They are cautious. "The value of this analysis...will depend on any interesting speculation it may provoke."

SLIDE 75

Examples

  • "Similarity Grouping of Australian Universities" (Stanley and Reynolds, 1994)
  • Use hierarchical clustering on Australian universities.
  • Use features such as
  • # of staff in different departments
  • entry scores
  • funding
  • evaluations


SLIDE 78

  • Split values: They notice that there is no kink and conclude that there is no cluster structure in Australian universities.
  • Good: Cautious interpretation of clustering, and analysis of clusterings based on multiple subsets of the features.
  • Bad: Their conclusion (that we can't cluster Australian universities) ignores all the algorithmic choices that were made.

SLIDE 82

Examples

  • "Comovement of International Equity Markets: A Taxonomic Approach" (Panton et al., 1976)
  • Data: weekly rates of return for stocks in twelve countries
  • Run agglomerative clustering year by year
  • Interpret the structure and examine stability over different time periods

SLIDE 83

Examples

Good: Cautious. "This study is only descriptive...A logical subsequent research area is to explain observed structural properties and the causes of structural change."