Support vector machines
CS 446
Support vector machines CS 446 Part 1: linear support vector - - PowerPoint PPT Presentation
Support vector machines CS 446 Part 1: linear support vector machines 1.0 1.0 1.0 0.8 0.8 0.8 0 0 0 . 8 - 0.6 0.6 0.6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 . 0 0 0 0 0
CS 446
2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0
2 .
4 .
6 .
. . 8 . 1 6 .
2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0
. 2
8
4 . . 4 . 8 1 . 2
2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0
.
.
.
. . 2 . 4 .
1 / 39
1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00 1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00
6 .
. . 8.000 8.000 16.000
1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00 1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00
.
. 5 . 0.000 2 . 5
1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00 1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00
.
.
. . 0.000 1.000 2.000 3.000
1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00 1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00
. 5
.
. . 0.500 0.500 1 . 1 . 5
2 / 39
(x,y)∈S yx
Tw⋆ > 0. 3 / 39
(x,y)∈S yx
Tw⋆ > 0.
3 / 39
(x,y)∈S yx
Tw⋆ > 0.
3 / 39
4 / 39
4 / 39
Best linear classifier on population
5 / 39
Best linear classifier on population Arbitrary linear separator on training data S
5 / 39
Best linear classifier on population Arbitrary linear separator on training data S
5 / 39
Best linear classifier on population Arbitrary linear separator on training data S Maximum margin solution on training data S
5 / 39
Best linear classifier on population Arbitrary linear separator on training data S Maximum margin solution on training data S
5 / 39
Best linear classifier on population Arbitrary linear separator on training data S Maximum margin solution on training data S
5 / 39
Best linear classifier on population Arbitrary linear separator on training data S Maximum margin solution on training data S
5 / 39
(x,y)∈S yx
Tw > 0. 6 / 39
(x,y)∈S yx
Tw > 0.
6 / 39
(x,y)∈S yx
Tw > 0.
6 / 39
(x,y)∈S yx
Tw > 0.
˜ y˜ xTw w2
6 / 39
(x,y)∈S yx
Tw > 0.
˜ y˜ xTw w2
6 / 39
(x,y)∈S yx
Tw > 0.
˜ y˜ xTw w2
6 / 39
(x,y)∈S yx
Tw > 0.
˜ y˜ xTw w2
1/w
(x,y)∈S yx
Tw = 1.
6 / 39
w∈Rd
2
Tw ≥ 1
7 / 39
w∈Rd
2
Tw ≥ 1
7 / 39
w∈Rd
2
Tw ≥ 1
7 / 39
w∈Rd
2
Tw ≥ 1
7 / 39
w∈Rd
2
Tw ≥ 1
7 / 39
w∈Rd
2
T
i w ≥ 1
8 / 39
w∈Rd
2
T
i w ≥ 1
α1,α2,...,αn≥0 n
n
T
i xj.
8 / 39
w∈Rd
2
T
i w ≥ 1
α1,α2,...,αn≥0 n
n
T
i xj.
8 / 39
w∈Rd
2
T
i w ≥ 1
α1,α2,...,αn≥0 n
n
T
i xj.
8 / 39
w∈Rd
2
T
i w ≤ 0
9 / 39
w∈Rd
2
T
i w ≤ 0
i w ≤ 0, associate a
9 / 39
w∈Rd
2
T
i w ≤ 0
i w ≤ 0, associate a
i=1 αi(1 − yixT i w) and
9 / 39
w∈Rd
2
T
i w ≤ 0
i w ≤ 0, associate a
i=1 αi(1 − yixT i w) and
w∈Rd
α≥0
2 + n
T
i w)
9 / 39
w∈Rd
2
T
i w ≤ 0
i w ≤ 0, associate a
i=1 αi(1 − yixT i w) and
w∈Rd
α≥0
2 + n
T
i w)
i w > 0), then the
α≥0” will set αi → ∞ to make objective → ∞. Such w cannot be minimizer!
9 / 39
2 + n
T
i w).
α≥0 L(w, α) = max α≥0
2 + n
T
i w)
10 / 39
2 + n
T
i w).
α≥0 L(w, α) = max α≥0
2 + n
T
i w)
i=1 αiyixi,
w∈Rd L(w, α) = L
n
n
2
n
n
T
i xj.
10 / 39
2 + n
T
i w).
α≥0 L(w, α) = max α≥0
2 + n
T
i w)
i=1 αiyixi,
w∈Rd L(w, α) = L
n
n
2
n
n
T
i xj.
10 / 39
2 + n
T
i w),
α≥0 L(w, α)
w L(w, α)
11 / 39
2 + n
T
i w),
α≥0 L(w, α)
w L(w, α)
α′≥0 L(w, α′) ≥ L(w, α) ≥ min w′ L(w′, α) = D(α).
11 / 39
2 + n
T
i w),
α≥0 L(w, α)
w L(w, α)
α′≥0 L(w, α′) ≥ L(w, α) ≥ min w′ L(w′, α) = D(α).
n
w
11 / 39
n
αi>0
i ˆ
12 / 39
n
αi>0
i ˆ
12 / 39
n
αi>0
i ˆ
12 / 39
n
αi>0
i ˆ
12 / 39
w∈Rd L(w, ˆ
13 / 39
w∈Rd L(w, ˆ
13 / 39
w∈Rd L(w, ˆ
2 + n
T
i ˆ
13 / 39
w∈Rd L(w, ˆ
2 + n
T
i ˆ
2
13 / 39
w∈Rd L(w, ˆ
2 + n
T
i ˆ
2
n
T
i ˆ
T
i ˆ
13 / 39
w∈Rd L(w, ˆ
2 + n
T
i ˆ
2
n
T
i ˆ
T
i ˆ
i ˆ
13 / 39
2 + n
T
i w).
α≥0 L(w, α) = max α≥0
2 + n
T
i w)
w∈Rd L(w, α) = L
n
n
2
n
n
T
i xj.
i=1 αiyixi;
i ˆ
14 / 39