Machine Learning
Support Vector Machines
Rui Xia Text Mining Group Nanjing University of Science & Technology rxia@njust.edu.cn
Machine Learning Support Vector Machines Rui Xia T ext M ining - - PowerPoint PPT Presentation
Machine Learning Support Vector Machines Rui Xia T ext M ining Group N anjing U niversity of S cience & T echnology rxia@njust.edu.cn Outline Maximum Margin Linear Classifier Duality Optimization Soft-margin SVM Kernel
Rui Xia Text Mining Group Nanjing University of Science & Technology rxia@njust.edu.cn
Machine Learning Course, NJUST 2
Machine Learning Course, NJUST 3
Which linear hyper-plane is better? Which learning criterion to choose?
Machine Learning Course, NJUST 4
Machine Learning Course, NJUST 5
Machine Learning Course, NJUST 6
Machine Learning Course, NJUST 7
Machine Learning Course, NJUST 8
Machine Learning Course, NJUST 9
Machine Learning Course, NJUST 10
Machine Learning Course, NJUST 11
Machine Learning Course, NJUST 12
Machine Learning Course, NJUST 13
Machine Learning Course, NJUST 14
Machine Learning Course, NJUST 15
active inactive
Machine Learning Course, NJUST 16
Machine Learning Course, NJUST 17
Stationary Primal feasibility Dual feasibility Complementary condition Karush–Kuhn–Tucker (KKT) Conditions
Machine Learning Course, NJUST 18
Machine Learning Course, NJUST 19
Machine Learning Course, NJUST 20
When does the equality hold?
Machine Learning Course, NJUST 21
– f and the gi’s are convex, and the hi’s are affine; – gi are (strictly) feasible: this means that there exists some w so that gi(w)<0.
Primal Problem Dual Problem
Machine Learning Course, NJUST 22
Stationary Primal feasibility Dual feasibility Complementary condition
Primal feasibility
Machine Learning Course, NJUST 23
Sufficient and Necessary Condition
Machine Learning Course, NJUST 24
Machine Learning Course, NJUST 25
Guarantee that the KKT conditions are satisfied.
Machine Learning Course, NJUST 26
Machine Learning Course, NJUST 27
is a positive support vector is a negative support vector
Machine Learning Course, NJUST 28
How to compute alpha? How to solve the dual
Machine Learning Course, NJUST 29
Machine Learning Course, NJUST 30
Linearly separable Linearly non-separable
Machine Learning Course, NJUST 31
Machine Learning Course, NJUST 32
Machine Learning Course, NJUST 33
Primal Problem Dual Problem
Machine Learning Course, NJUST 34
Machine Learning Course, NJUST 35
Machine Learning Course, NJUST 36
Machine Learning Course, NJUST 37
Machine Learning Course, NJUST 38
Machine Learning Course, NJUST 39
Machine Learning Course, NJUST 40
Higher-dimensional-separable
Machine Learning Course, NJUST 41
Machine Learning Course, NJUST 42
Machine Learning Course, NJUST 43
– Sometimes it’s hard to know the exact projection function, but relatively easy to know the Kernel function – In SVM, all of the calculations of feature vectors are in the form of product – Therefore, we only need to know the Kernel function used in SVM, but without the need to know the exact projection function.
Machine Learning Course, NJUST 44
– For any finite set of points – Element of kernel matrix
– Symmetric – Positive semi-definite
Machine Learning Course, NJUST 45
Machine Learning Course, NJUST 46
Machine Learning Course, NJUST 47
Machine Learning Course, NJUST 48
Machine Learning Course, NJUST 49
Machine Learning Course, NJUST 50
Machine Learning Course, NJUST 51
Machine Learning Course, NJUST 52
Machine Learning Course, NJUST 53
where
Machine Learning Course, NJUST 54
Machine Learning Course, NJUST 55
where
Machine Learning Course, NJUST 56
Machine Learning Course, NJUST 57
Machine Learning Course, NJUST 58
Machine Learning Course, NJUST 59
Machine Learning Course, NJUST 60
multipliers satisfy the KKT conditions (within a user-defined tolerance).
Machine Learning Course, NJUST 61
Machine Learning Course, NJUST