Support vector machines
Course of Machine Learning Master Degree in Computer Science University of Rome “Tor Vergata” Giorgio Gambosi a.a. 2018-2019
1
Support vector machines Course of Machine Learning Master Degree in - - PowerPoint PPT Presentation
Support vector machines Course of Machine Learning Master Degree in Computer Science University of Rome Tor Vergata Giorgio Gambosi a.a. 2018-2019 1 Idea The binary classification problem is approached in a direct way, that is: We try
1
2
3
4
i
5
A B
6
7
i
8
x x x x x x x x x x x
9
w,w0 γ
w,w0 γ
10
i
γ , which corresponds to
11
12
w,w0 γ = ||w||−1
w,w0
13
14
x∈Ω f(x)
k
i=1
k′
j=1
x
λ,µ θ(λ, µ)
15
i ≥ 0
i gi(x∗) = 0
i can be
16
λ
w,w0 L(w, w0, λ) = max λ
w,b
n
i=1
λ
w,w0
n
i=1
n
i=1
17
n
i=1
n
i=1
18
λ
λ
i=1
n
i=1 n
j=1
n
i=1
19
λ
λ
i=1
n
i=1 n
j=1
n
i=1
20
21
i = n
j=1
jtjφi(xj)
0 can be obtained by observing that, for any support vector
j=1
jtjφ(xj)T φ(xk) + w∗
j=1
jtjκ(xj, xk) + w∗
j∈S
jtjκ(xj, xk) + w∗
22
j∈S
jtjκ(xj, xk) + w∗
0 = tk −
j∈S
jtjκ(xj, xk)
0 = 1
i∈S
j∈S
jtjκ(xj, xi)
23
m
i=1
i φi(x) + w∗ 0 = n
j=1
jtjκ(xj, x) + w∗
i = 0. Thus, the
j∈S
jtjκ(xj, x) + w∗
24
25
26
w,w0,ξ
n
i=1
n
i=1
n
i=1
n
i=1
n
i=1
i + n
i=1
n
i=1
m
j=1
n
i=1
i + n
i=1
n
i=1 m
j=1
n
i=1
n
i=1
27
28
n
j=1
n
i=1
29
λ
λ
i=1
n
i=1 n
j=1
n
i=1
30
31
32
x x x x x x x x x x x x x x
λi > 0, ξi = 0 λi = 0, ξi = 0 λi = C, 0 < ξi < 1 λi = C, ξi > 1 λi = C, ξi = 1
33
m
i=1
i φi(x) + b∗
i∈S
jtjκ(xi, xj) + b∗ 34
35
w,w0
w,w0 n
i=1
36
37
38
w,w0,ξ
n
i=1
2wT w, at the same time minimizing the
n
i=1
39
xi∈L
xi∈L
40
41
42
x1
x2
−∞
43
11x2 21 + x2 12x2 22 + 2x11x12x21x22
11, x2 12, x11x12, x11x12) · (x2 21, x2 22, x21x22, x21x22)
1, x2 2, x1x2, x1x2)T . 44
1, . . . , x2 d, x1x2, . . . , x1xd, x2x1, . . . , xdxd−1)T
45
n
i=1 n
j=1
n
i=1
1, . . . , x2 d, x1x2, . . . , x1xd, x2x1, . . . , xdxd−1,
46
t
i=0
47
48
1 Ax2, with A positive definite
49
1 x2 is a kernel function corresponding to the base functions
50
2σ2
1 x1 + xT 2 x2 − 2xT 1 x2, it results
xT 1 x1 2σ2 e− xT 2 x2 2σ2 e xT 1 x2 σ2
1 x2 is a kernel function (see above)
1 x2
xT 1 x2 σ2
xT 1 x1 σ2 e− xT 1 x1 2σ2 e xT 1 x2 σ2
xT 1 x1 2σ2
xT 2 x2 2σ2
51
1 x2. 52