Liang Huang Kai Zhao Lemao Liu The City University of New York (CUNY)
Scalable Large-Margin Structured Learning: Theory and Algorithms
the man bit the dog DT NN VBD DT NN
x y x y=-1 y=+1 x
the man hit the dog 那 人 咬 了 狗
slides at: http://acl.cs.qc.edu/~lhuang/
What is Structured Prediction?
- binary classification: output is binary
- multiclass classification: output is a (small) number
- structured classification: output is a structure (seq., tree, graph)
- part-of-speech tagging, parsing, summarization, translation
- exponentially many classes: search (inference) efficiency is crucial! 2
x y=-1 y=+1 x
the man bit the dog DT NN VBD DT NN
x y
the man bit the dog
x y
S NP DT the NN man VP VB bit NP DT the NN dog
the man bit the dog
x
那 人 咬 了 狗
y
NLP is all about structured prediction!
Examples of Bad Structured Prediction
3
Learning: Unstructured vs. Structured
4
binary/multiclass structured learning
perceptron structured perceptron SVM structured SVM
Online+ Viterbi max margin max margin Online+ Viterbi
naive bayes
HMMs CRFs
logistic regression (maxent) Conditional Conditional
generative discriminative
(count & divide) (expectations) (argmax) (loss-augmented argmax)