CS 6316 Machine Learning
Support Vector Machines and Kernel Meth-
- ds
Yangfeng Ji
Department of Computer Science University of Virginia
CS 6316 Machine Learning Support Vector Machines and Kernel Meth- - - PowerPoint PPT Presentation
CS 6316 Machine Learning Support Vector Machines and Kernel Meth- ods Yangfeng Ji Department of Computer Science University of Virginia About Online Lectures Course Information Update Record the lectures and upload the videos on Collab
Department of Computer Science University of Virginia
◮ Unmuate yourself and chime in anytime ◮ Use the raise hand feature ◮ Send me a private message
2
◮ Unmuate yourself and chime in anytime ◮ Use the raise hand feature ◮ Send me a private message
◮ send out instant messages if my network connection
is unreliable
◮ online discussion
2
◮ Subject to change
3
◮ Subject to change
◮ Send out my feedback later this week ◮ Continue your collaboration with your teammates ◮ Presentation: record a presentation video and share it
with me
3
◮ Subject to change
◮ Send out my feedback later this week ◮ Continue your collaboration with your teammates ◮ Presentation: record a presentation video and share it
with me
◮ Wednesday 11 AM: I will be on Zoom ◮ You can also send me an email or Slack message
anytime
3
5
x′∈T ρh(x′)
6
◮ Existence of equation 3 ◮ All halfspace predictors that satisfy the condition in
equation 3 are ERM hypotheses
7
8
8
(w,b) min i∈[m]
9
(w,b) min i∈[m]
9
(w,b) min i∈[m]
9
(w,b) min i∈[m]
10
(w,b) min i∈[m]
11
(w,b) min i∈[m]
(w,b) min i∈[m]
11
(w,b) min i∈[m]
(w,b) min i∈[m]
(w,b): mini∈[m] yi(w,xi+b1
(w,b): yi(w,xi+b≥1
11
(w,b): yi(w,xi+b≥1
(w,b)
2
12
(w,b): yi(w,xi+b≥1
(w,b)
2
12
2 − m
13
x∈X
15
x∈X
15
+ as
m
+
16
i gi(x′) 0
17
2 − m
m
m
18
2 − m
m
m
m
m
18
2 − m
m
m
m
m
18
19
19
m
19
(w,b)
2
21
(w,b)
2
21
22
(w,b)
2 + C m
i
i1 ≥ 0 are known as slack
23
2 + C m
m
m
24
2 + C m
m
m
24
◮ xi may lie on the marginal hyper-planes (as in the
separable case)
25
◮ xi may lie on the marginal hyper-planes (as in the
separable case)
◮ xi is an outlier
25
26
2 − m
2 − m
m
m
28
2 − m
2 − m
m
m
m
28
m
2 − m
m
m
m
29
m
2 − m
m
m
m
i1 αi yixi2 2 m i1
j1 αiαj yi yjxi, xj, we
m
m
m
29
α m
m
m
30
α m
m
m
30
α m
m
m
30
(w,b)
2
α m
m
m
31
m
32
m
m
32
m
m
m
32
m
m
m
i1 αi yixi, x for any xi with
32
m
m
34
m
m
34
35
35
K(x, x′)
(49)
1 + x2x′ 2 + c)2
(50)
1x′2 1 + x1x2x′ 1x′ 2 + cx1x′ 1 + x1x2x′ 1x′ 2
+x2
2x′2 2 + cx2x′ 2 + cx1x′ 1 + cx2x′ 2 + c2
(51)
36
K(x, x′)
(49)
1 + x2x′ 2 + c)2
(50)
1x′2 1 + x1x2x′ 1x′ 2 + cx1x′ 1 + x1x2x′ 1x′ 2
+x2
2x′2 2 + cx2x′ 2 + cx1x′ 1 + cx2x′ 2 + c2
(51)
1x′2 1 + x2 2x′2 2 + 2x1x′1x2x′2
(52) +2cx1x′1 + 2cx2x′2 + c2 (53)
36
K(x, x′)
(49)
1 + x2x′ 2 + c)2
(50)
1x′2 1 + x1x2x′ 1x′ 2 + cx1x′ 1 + x1x2x′ 1x′ 2
+x2
2x′2 2 + cx2x′ 2 + cx1x′ 1 + cx2x′ 2 + c2
(51)
1x′2 1 + x2 2x′2 2 + 2x1x′1x2x′2
(52) +2cx1x′1 + 2cx2x′2 + c2 (53)
1, x2 2,
√ 2x1x2, √ 2cx1, √ 2cx2, c]
x′2
1
x′2
2
√ 2x′1x′2 √ 2cx′1 √ 2cx′2 c
36
1, x2 2,
37
1, x2 2,
37
1, x2 2,
37
2
38
2
38
α m
m
m
39
α m
m
m
j1 αj yjK(xj, xi) for any xi with αi > 0
39
40
is symmetric positive semi-definite K [K(xi, xj)]i,j ∈ Rm×m (58)
40
is symmetric positive semi-definite K [K(xi, xj)]i,j ∈ Rm×m (58)
40
Boyd, S. and Vandenberghe, L. (2004). Convex optimization. Cambridge university press. Mohri, M., Rostamizadeh, A., and Talwalkar, A. (2018). Foundations of machine learning. MIT press. Shalev-Shwartz, S. and Ben-David, S. (2014). Understanding machine learning: From theory to algorithms. Cambridge university press.
41