Learning Structural SVMs with Latent Variables Presented By- - - PowerPoint PPT Presentation
Learning Structural SVMs with Latent Variables Presented By- - - PowerPoint PPT Presentation
Learning Structural SVMs with Latent Variables Presented By- Subhabrata Debnath(Roll- 13111063) Anjan Banerjee(Roll-13111008) Basics Machine Learning Blood Sugar Hyper Pressure Tension 10 15 No 5 5 No 25 25 Yes 30 36
Basics Machine Learning
Blood Pressure Sugar Hyper Tension 10 15 No 5 5 No 25 25 Yes 30 36 Yes
5 10 15 20 25 30 35 5 10 15 20 25 30 35 40
Y-Values
Basics Machine Learning
w Support Vectors
Y1 Y2
Objective Function of SVM
s.t.
Objective Function of SVM
s.t.
Score
w
Higher the value of WTx,higher the chance of belonging to this class
Latent Structured SVM
Final Objective Function:
Non-Convex Objective Function Can be solved by CCCP
Soft-Margin SVM
Soft-Margin SVM
s. t.
Soft-Margin SVM
Multiclass SVM
wY1 w
Y 2
wY3
Predicted Class:
Y1 Y3 Y2
Multi-Class SVM
s.t.
here,
Multi-class SVM
- What if we don’t want the same
amount of margin for all the classes?
- E.g.: Given age, sex of an user and the movie
genre, predict the rating(1-5) that the user will give.
- Highly Incorrect Class and Lesser Incorrect Class
Actual Rating Predicted Rating Loss 5 4 Less 5 1 High
Multi-Class SVM
s.t.
Multi-Class SVM
s.t.
Structured SVM
Structured SVM
Structured SVM
Structured SVM
Structured SVM
Structured SVM
Could have been solved using any convex solver The only problem is the number of classes, hence the number of constraints are exponentially large. e.g. Number of possible parse trees for a given sentence is exponential in the number of words.
Cutting Plane Method
However, this method gives a solution of the given convex
- ptimization problem with
precision ε. Our Convex Objective Function W*, ξi Cutting Plane Method
Latent Information
Hidden Information present in the training set that can improve our learning Let us denote these hidden/latent information as hi. xi yi hi (given/observed)
(hidden/unobserved)
Latent Information
Noun Phrase Coreference Problem:
Input x: Noun Phrases with edge features Labels y: Clusters Of Noun Phrases Latent Variable h: ‘Strong’ links as trees
Latent Information
Noun Phrase Coreference Problem:
Latent Structured SVM
Objective function:
Latent Structured SVM
Objective function:
Latent Structured SVM
Objective function:
Latent Structured SVM
Objective function:
Latent Structured SVM
Final Objective Function:
Non-Convex Objective Function Can’t be solved using Cutting plane
Property Of Convex Function
x1 x2 f x f(x1) f(x2) f ’ ( x 1 )
f ( x2 ) >= f ( x1 ) + ( x2 - x1 ) * f’ ( x1 )
x1 x2 f(x1) f(x2)
f ( x1 ) <= f ( x2 ) + ( x1 – x2 ) * f’ ( x2 )
f ’ ( x 2 )
Property Of Concave Function
Concave-Convex Procedure
Concave-Convex Procedure
Concave-Convex Procedure
Concave-Convex Procedure
Minimize the resulting sum
Cutting plane Algorithm
Wt+1 Iterate till desired precision
Overview of the CCCP
- Initialize w0