Clustering
CSE 6242 / CX 4242 Duen Horng (Polo) Chau Georgia Tech
Partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos, Le Song
Clustering in Google Image Search
How would you build this?
http://googlesystem.blogspot.com/2011/05/google-image-search-clustering.html
Video: http://youtu.be/WosBs0382SE
The most common type of unsupervised learning
(e.g., here are some pictures of dogs; group them by breed)
detection)
Summary
closest to (so, we need a similarity function)
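The assignment rule above (each point joins the cluster whose centroid it is closest to) can be sketched in a few lines. This is a minimal illustration, not the course's reference implementation: squared Euclidean distance stands in for the similarity function, and the names `kmeans`, `squared_distance`, and `initial_centroids` are assumptions made for this example.

```python
def squared_distance(p, q):
    """Squared Euclidean distance; smaller means more similar."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, initial_centroids, max_iterations=100):
    """Lloyd-style k-means sketch; k is the number of initial centroids."""
    centroids = [tuple(c) for c in initial_centroids]
    k = len(centroids)
    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point goes to its closest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # no point changed cluster, so the result is stable
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(xs) / len(members) for xs in zip(*members))
    return centroids, labels


# Hypothetical toy data: two well-separated groups of 2-D points.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
centroids, labels = kmeans(points, initial_centroids=[(1.0, 1.0), (8.0, 8.0)])
print(labels)  # [0, 0, 0, 1, 1, 1]
```

The caller supplies the starting centroids here, which makes the choice of k explicit and keeps the run deterministic; random initialization is the more common choice in practice.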
Demo: http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/AppletKM.html
Need to decide k ourselves.
Only locally optimal (vs global)
slowly
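The "only locally optimal" point can be seen on a tiny example. The sketch below runs a compact k-means from two hand-picked starting configurations and compares the total squared error of each result. The data, the starting centroids, and the helper names `sq_dist` and `run_kmeans` are all assumptions chosen for illustration.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def run_kmeans(points, centroids, max_iterations=100):
    """Compact k-means; returns final centroids, labels, and total squared error."""
    centroids = [tuple(c) for c in centroids]
    labels = None
    for _ in range(max_iterations):
        # Assign every point to its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # converged: assignments no longer change
        labels = new_labels
        # Recompute each centroid as the mean of its assigned points.
        for c in range(len(centroids)):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = tuple(sum(xs) / len(members) for xs in zip(*members))
    sse = sum(sq_dist(p, centroids[label]) for p, label in zip(points, labels))
    return centroids, labels, sse


# Corners of a wide rectangle: the natural split is left pair vs. right pair.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]

starts = [
    [(2.0, 0.0), (2.0, 1.0)],  # poor start: converges to bottom vs. top
    [(0.0, 0.5), (4.0, 0.5)],  # good start: converges to left vs. right
]
results = [run_kmeans(points, start) for start in starts]
for centroids, labels, sse in results:
    print(labels, sse)

# Keep the run with the lowest total squared error.
best = min(results, key=lambda r: r[2])
```

Both runs converge, but the first stops at a total squared error of 16.0 while the second reaches 1.0. Running from several initializations and keeping the lowest-error result is a common workaround, though it still does not guarantee the global optimum.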
http://nlp.stanford.edu/IR-book/html/htmledition/evaluation-of-clustering-1.html
http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/AppletH.html
Single linkage
the clusters’ most similar members
Complete linkage
the clusters’ most dissimilar members
Average linkage
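To make the linkage criteria concrete, the sketch below scores one pair of clusters under each of them. It assumes Euclidean distance as the (dis)similarity measure, so "most similar members" becomes the smallest pairwise distance and "most dissimilar members" the largest. Average linkage is taken here to be the mean over all cross-cluster pairs, which is the usual definition but is not spelled out in the text above. The function name `linkage_distances` and the sample points are hypothetical.

```python
from itertools import product
from math import dist  # Euclidean distance, available in Python 3.8+


def linkage_distances(cluster_a, cluster_b):
    """Distance between two clusters under three linkage criteria."""
    # Every distance between a member of A and a member of B.
    pairwise = [dist(p, q) for p, q in product(cluster_a, cluster_b)]
    return {
        "single": min(pairwise),     # closest (most similar) pair
        "complete": max(pairwise),   # farthest (most dissimilar) pair
        "average": sum(pairwise) / len(pairwise),  # mean over all pairs
    }


cluster_a = [(0.0, 0.0), (1.0, 0.0)]
cluster_b = [(3.0, 0.0), (5.0, 0.0)]
scores = linkage_distances(cluster_a, cluster_b)
print(scores)  # {'single': 2.0, 'complete': 5.0, 'average': 3.5}
```

In agglomerative clustering, the pair of clusters with the smallest score under the chosen criterion is merged at each step, so the choice of linkage changes which clusters merge first.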
https://github.com/mbostock/d3/wiki/Hierarchy-Layout
http://www.cc.gatech.edu/~dchau/papers/11-chi-apolo.pdf