Lecture 22:
−Clustering −Distance measures −K-Means
Aykut Erdem
December 2016 Hacettepe University
Lecture 22: Clustering Distance measures K-Means Aykut Erdem - - PowerPoint PPT Presentation
Lecture 22: Clustering Distance measures K-Means Aykut Erdem December 2016 Hacettepe University Last time Boosting Idea: given a weak learner, run it multiple times on (reweighted) training data, then let the learned classifiers
−Clustering −Distance measures −K-Means
Aykut Erdem
December 2016 Hacettepe University
training data, then let the learned classifiers vote
weighted by their strength
2
slide by Aarti Singh & Barnabas Poczos
3
slide by Jiri Matas and Jan Šochman
4
5
6
m 4 3 2 1
1
2
3
m
4
j i
slide by Julia Hockenmeier
a more pragmatic approach.
to think in terms of a distance (rather than similarity) between vectors.
7
Hard to define! But we know it when we see it
8
0.23 3 342.7
slide by Andrew Moore
d(x,y) (xi yi)2
i
s(x,y) (xi x)(yi y)
i
like A
apart
like C at all”
10
slide by Alan Fern
hattan (L1) ity (Sup) Distance L
i=1 d
i=1 d
11
ity (Sup) Distance L∞
slide by Julia Hockenmeier
12
slide by Julia Hockenmeier
13 9
∞
1/2 2 2 2
∞
1/2 2 2 2
2 2
∞ ∞
slide by Julia Hockenmeier
✓ Shift and scale invariance
14
slide by Julia Hockenmeier
15
i=1 m
i=1 m
slide by Julia Hockenmeier
16
slide by Julia Hockenmeier
and then evaluate them by some criterion
criterion
17
18
slide by Andrew Moore
19
to closest mean
the average of its assigned points
20
slide by David Sontag
to closest mean
the average of its assigned points
21
slide by David Sontag
22
slide by David Sontag
23
slide by David Sontag
24
slide by David Sontag
25
slide by David Sontag
26
slide by David Sontag
27
slide by David Sontag
28
slide by David Sontag
– Take partial derivative of μi and set to zero, we have
29
K-Means takes an alternating optimization approach, each step is guaranteed to decrease the objective – thus guaranteed to converge
slide by Alan Fern
30
31
32
K = 2
K=2
Original image
Original
K = 3
K=3
K = 10
K=10
slide by David Sontag
33
K = 2
K=2
Original image
Original
K = 3
K=3
K = 10
K=10
slide by David Sontag
34
K = 2
K=2
Original image
Original
K = 3
K=3
K = 10
K=10
slide by David Sontag
35
FIGURE 14.9. Sir Ronald A. Fisher (1890 − 1962) was one of the founders
many other fundamental concepts. The image on the left is a 1024×1024 grayscale image at 8 bits per pixel. The center image is the result of 2 × 2 block VQ, using 200 code vectors, with a compression rate of 1.9 bits/pixel. The right image uses
[Figure from Hastie et al. book]
slide by David Sontag
36
. Fua, and S. Susstrunk SLIC Superpixels Compared to State-of-the-art Superpixel Methods, IEEE T-PAMI, 2012
λ: spatial regularization parameter
37
aardvark 0 about 2 all 2 Africa 1 apple anxious ... gas 1 ...
1 … Zaire
slide by Carlos Guestrin
38
slide by Fei Fei Li
39
slide by Fei Fei Li
40
Normalize patch
Detect patches
[Mikojaczyk and Schmid ’02] [Matas et al. ’02] [Sivic et al. ’03]
Compute SIFT descriptor
[Lowe’99]
slide by Josef Sivic
41
slide by Josef Sivic
42
slide by Josef Sivic
43
Vector quantization
slide by Josef Sivic
44
slide by Fei Fei Li
45
Visual Polysemy. Single visual word occurring on different (but locally similar) parts on different object categories. Visual Synonyms. Two different visual words representing a similar part of an object (wheel of a motorbike).
slide by Andrew Zisserman
46
frequency
codewords
slide by Fei Fei Li
47
slide by Kristen Grauman