Linear, Binary SVM Classifiers
COMPSCI 371D — Machine Learning
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 1 / 17
Linear, Binary SVM Classifiers COMPSCI 371D Machine Learning - - PowerPoint PPT Presentation
Linear, Binary SVM Classifiers COMPSCI 371D Machine Learning COMPSCI 371D Machine Learning Linear, Binary SVM Classifiers 1 / 17 Outline 1 What Linear, Binary SVM Classifiers Do 2 Margin 3 Loss and Regularized Risk 4 Training an SVM is
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 1 / 17
1 What Linear, Binary SVM Classifiers Do 2 Margin 3 Loss and Regularized Risk 4 Training an SVM is a Quadratic Program 5 The KKT Conditions and the Support Vectors
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 2 / 17
What Linear, Binary SVM Classifiers Do
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 3 / 17
What Linear, Binary SVM Classifiers Do
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 4 / 17
What Linear, Binary SVM Classifiers Do
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 5 / 17
Margin
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 6 / 17
Margin
def
def
n
separating hyperplane y ^ = 1 = −1 y ^
1
µ (x, 1)
v
1
µ (x, -1)
v
1
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 7 / 17
Loss and Regularized Risk
1 µ∗ max{0, µ∗ − µv(x, y)}
n
reference margin separating hyperplane y ^ = 1 = −1 y ^
1
µ (x, 1)
v
µ*
1
µ* µ (x, -1)
v
reference margin
l (x, -1)
v
l (x, 1)
v
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 8 / 17
Loss and Regularized Risk
N
n=1 ℓv(xn, yn)
1 µ∗
def
1 2w2 + C N
n=1 ℓ(w,b)(xn, yn)
1 µ∗ max{0, µ∗ − µ(w,b)(x, y)}
1 µ∗ max{0, µ∗ − y (nTx + c)} = max{0, 1 − y(wTx + b)}
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 9 / 17
Loss and Regularized Risk
def
1 2w2 + C N
n=1 ℓ(w,b)(xn, yn)
def
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 10 / 17
Training an SVM is a Quadratic Program
1 2w2 + C N
n=1 ℓn
def
l (ν)
(w, b)
1 1 ν ξ
ln
νn
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 11 / 17
Training an SVM is a Quadratic Program
1 2w2 + C N
n=1 ℓn(νn)
1 2w2 + C N
n=1 ξn
1 1 µ
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 12 / 17
Training an SVM is a Quadratic Program
1 2w2 + C N
n=1 ℓ(w,b)(xn, yn)
def
2w2 + γ N n=1 ξn
def
C N
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 13 / 17
The KKT Conditions and the Support Vectors
SVM Quadratic Program min
w,b,ξ f(w, ξ)
where f(w, ξ) = 1 2 w2 + γ
N
ξn subject to the constraints yn(wT xn + b) − 1 + ξn ≥ ξn ≥ KKT Conditions (u = (w, b, ξ)) ∇f(u∗) =
α∗
i ∇ci(u∗) with α∗ i ≥ 0 COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 14 / 17
The KKT Conditions and the Support Vectors
2w2 + γ N n=1 ξn
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 15 / 17
The KKT Conditions and the Support Vectors
nynxn
nyn
n + β∗ n for n = 1, . . . , N
j , β∗ k
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 16 / 17
The KKT Conditions and the Support Vectors
n∈A(u∗) α∗ nynxn
COMPSCI 371D — Machine Learning Linear, Binary SVM Classifiers 17 / 17