Lecture 22:
− Clustering − Distance measures − K-Means
Aykut Erdem
May 2016 Hacettepe University
Lecture 22: Clustering Distance measures K-Means Aykut Erdem May - - PowerPoint PPT Presentation
Lecture 22: Clustering Distance measures K-Means Aykut Erdem May 2016 Hacettepe University Last time Boosting Idea: given a weak learner, run it multiple times on (reweighted) training data, then let the learned classifiers vote
− Clustering − Distance measures − K-Means
Aykut Erdem
May 2016 Hacettepe University
training data, then let the learned classifiers vote
weighted by their strength
2
slide by Aarti Singh & Barnabas Poczos
3
slide by Jiri Matas and Jan Šochman
4
5
6
m 4 3 2 1
1
2
3
m
4
j i
slide by Julia Hockenmeier
a more pragmatic approach.
to think in terms of a distance (rather than similarity) between vectors.
7
Hard to define! But we know it when we see it
8
0.23 3 342.7
slide by Andrew Moore
d(x,y) (xi yi)2
i
s(x,y) (xi x)(yi y)
i
like A
apart
like C at all”
10
slide by Alan Fern
hattan (L1) ity (Sup) Distance L
i=1 d
i=1 d
11
ity (Sup) Distance L∞
slide by Julia Hockenmeier
12
slide by Julia Hockenmeier
13 9
∞
1/2 2 2 2
∞
1/2 2 2 2
2 2
∞ ∞
slide by Julia Hockenmeier
✓ Shift and scale invariance
14
slide by Julia Hockenmeier
15
i=1 m
i=1 m
slide by Julia Hockenmeier
16
slide by Julia Hockenmeier
and then evaluate them by some criterion
criterion
17
18
slide by Andrew Moore
19
to closest mean
the average of its assigned points
20
slide by David Sontag
to closest mean
the average of its assigned points
21
slide by David Sontag
22
slide by David Sontag
23
slide by David Sontag
24
slide by David Sontag
25
slide by David Sontag
26
slide by David Sontag
27
slide by David Sontag
28
slide by David Sontag
– Take partial derivative of μi and set to zero, we have
29
K-Means takes an alternating optimization approach, each step is guaranteed to decrease the objective – thus guaranteed to converge
slide by Alan Fern
30
31
K = 2
K=2
Original image
Original
K = 3
K=3
K = 10
K=10
slide by David Sontag
32
K = 2
K=2
Original image
Original
K = 3
K=3
K = 10
K=10
slide by David Sontag
33
K = 2
K=2
Original image
Original
K = 3
K=3
K = 10
K=10
slide by David Sontag
34
FIGURE 14.9. Sir Ronald A. Fisher (1890 − 1962) was one of the founders
many other fundamental concepts. The image on the left is a 1024×1024 grayscale image at 8 bits per pixel. The center image is the result of 2 × 2 block VQ, using 200 code vectors, with a compression rate of 1.9 bits/pixel. The right image uses
[Figure from Hastie et al. book]
slide by David Sontag
35
aardvark 0 about 2 all 2 Africa 1 apple anxious ... gas 1 ...
1 … Zaire
slide by Carlos Guestrin
36
slide by Fei Fei Li
37
slide by Fei Fei Li
38
Normalize patch
Detect patches
[Mikojaczyk and Schmid ’02] [Matas et al. ’02] [Sivic et al. ’03]
Compute SIFT descriptor
[Lowe’99]
slide by Josef Sivic
39
slide by Josef Sivic
40
slide by Josef Sivic
41
Vector quantization
slide by Josef Sivic
42
slide by Fei Fei Li
43
Visual Polysemy. Single visual word occurring on different (but locally similar) parts on different object categories. Visual Synonyms. Two different visual words representing a similar part of an object (wheel of a motorbike).
slide by Andrew Zisserman
44
frequency
codewords
slide by Fei Fei Li
45
slide by Kristen Grauman