Sequence Learning from Data with Multiple Labels
Mark Dredze (Johns Hopkins Univ., USA)
Partha Pratim Talukdar (Univ. of Penn., USA)
Koby Crammer (Technion, Israel)
Motivation
- Labeled data is expensive
- Multiple cheap but noisy annotations may be available (e.g. Amazon Mechanical Turk)
The problem: Adjudication!
- Can we learn from multiple labels without adjudication?
Learning Setting
- Input:
  - Feature sequence (sentence)
  - Set of initial priors over labels at each position
[Example: "John Blitzer studies at the University of Pennsylvania ." with per-position label priors, e.g. PER/0.7 on "John" and "Blitzer", ORG/1.0 on "University" and "of", a split ORG/0.3 vs. LOC/0.7 on "Pennsylvania", and O/1.0 on unambiguous tokens; encoded in the sketch below]
- Output: Trained sequence labeler (e.g. CRF)
  - Takes label priors into account during training
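As a concrete picture of this input, here is a minimal sketch in Python (my own naming, not the paper's code) of a token sequence paired with per-position label priors; the values follow the slide's example, with illustrative priors where the slide is ambiguous:

```python
from dataclasses import dataclass

@dataclass
class LabeledToken:
    """One position: the token plus a prior distribution over labels."""
    token: str
    label_priors: dict  # label -> prior probability (sums to 1)

# Priors follow the slide's example; values for "John"/"Blitzer" are
# illustrative where the slide is ambiguous.
sentence = [
    LabeledToken("John", {"PER": 0.7, "O": 0.1, "LOC": 0.1, "ORG": 0.1}),
    LabeledToken("Blitzer", {"PER": 0.7, "O": 0.1, "LOC": 0.1, "ORG": 0.1}),
    LabeledToken("studies", {"O": 1.0}),
    LabeledToken("at", {"O": 1.0}),
    LabeledToken("the", {"O": 1.0}),
    LabeledToken("University", {"ORG": 1.0}),
    LabeledToken("of", {"ORG": 1.0}),
    LabeledToken("Pennsylvania", {"ORG": 0.3, "LOC": 0.7}),
    LabeledToken(".", {"O": 1.0}),
]
```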
Why Multiple Labels?
- Easy to encode guesses as to the correct label
- Users provide labels
- Allows multiple conflicting labels
- Don’t need to resolve conflicts (saves time)
Comparison with Canonical Multi-Label Learning
Canonical Multi-Label
- 1. Multiple labels per instance during training
- 2. Each instance can have multiple valid labels
This Paper
- 1. Same, but only one of the labels is correct
- 2. Only one valid label per instance
Previous Work
- Jin and Ghahramani, NIPS 2003
  - Classification setting (simple output)
- This paper
  - Structured prediction (complex output)
Generality of the Learning Setting
- The multi-label setting encodes standard learning settings (see the sketch after this list):
- Unsupervised: uniform prior over labels
- Supervised: per-position prior of 1.0 on the gold label
- Semi-supervised: combination of the above
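A minimal sketch of these reductions, assuming a hypothetical helper API (LABELS and the function names are mine, not the paper's):

```python
# How the three standard settings reduce to per-position label priors.
LABELS = ["PER", "LOC", "ORG", "O"]

def unsupervised_prior():
    # No label information: uniform over all labels.
    return {y: 1.0 / len(LABELS) for y in LABELS}

def supervised_prior(gold_label):
    # Full supervision: all probability mass on the gold label.
    return {y: (1.0 if y == gold_label else 0.0) for y in LABELS}

def semi_supervised_priors(tokens, gold_by_position):
    # Mix: gold prior where a label is known, uniform elsewhere.
    return [supervised_prior(gold_by_position[i])
            if i in gold_by_position else unsupervised_prior()
            for i in range(len(tokens))]
```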
Learning with Multiple Labels
- Two learning goals:
  - Find a model that best describes the data
  - Respect the per-position input prior over labels, as much as possible
- Balance these two goals in a single objective function
Multi-CRF Objective
[Slide figure: the Multi-CRF objective, combining the standard CRF term with the initial and estimated label priors; a schematic form is given below]
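The slide names only the objective's ingredients; one schematic form consistent with them (my notation and trade-off term, not necessarily the paper's exact formulation) is:

```latex
% Schematic only -- not necessarily the paper's exact formulation.
% q_i: estimated prior at position i; \tilde{q}_i: initial input prior;
% p_\theta: CRF; \alpha: trade-off between data fit and input priors.
\max_{\theta,\, q} \;\;
  \sum_i \mathbb{E}_{y \sim q_i}\big[\log p_\theta(y \mid x)\big]
  \;-\; \alpha \sum_i \mathrm{KL}\big(q_i \,\big\|\, \tilde{q}_i\big)
```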
Multi-EM Algorithm (loop sketched below)
- M-step:
  - Learn a Multi-CRF that models all given labels at each position
  - Weigh possible labels by estimated label priors
- E-step:
  - Re-estimate label priors based on the model and the initial prior
  - Balances the CRF's label estimates against the input priors
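A minimal sketch of this loop, with hypothetical stand-ins train_weighted_crf and crf_posteriors for the actual training and inference routines, and a simple convex-mix E-step as an assumption:

```python
def combine(model_post, initial_prior, alpha):
    # One plausible E-step (assumption): mix the CRF's per-position
    # posteriors with the initial priors, then renormalize.
    mixed = [{y: alpha * p0.get(y, 0.0) + (1 - alpha) * pm.get(y, 0.0)
              for y in set(p0) | set(pm)}
             for pm, p0 in zip(model_post, initial_prior)]
    return [{y: v / sum(d.values()) for y, v in d.items()} for d in mixed]

def multi_em(sentences, init_priors, n_iters=10, alpha=0.5):
    q = init_priors  # estimated priors start at the input priors
    model = None
    for _ in range(n_iters):
        # M-step: train a Multi-CRF with candidate labels weighted by q.
        model = train_weighted_crf(sentences, q)  # hypothetical helper
        # E-step: balance CRF posteriors against the initial priors.
        q = [combine(crf_posteriors(model, s), p0, alpha)  # hypothetical
             for s, p0 in zip(sentences, init_priors)]
    return model
```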
Experimental Setup
- Dataset:
  - CoNLL-2003: named entity dataset with PER, LOC, and ORG tags; 3454 test instances
- Each instance has two different label sequences:
  - Gold labels
  - Labels generated by an HMM
- Noise level: probability of the incorrect sequence getting the higher prior (higher is noisier; sketched below)
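A hedged sketch of how the noise level could be applied (the 0.7/0.3 prior split is illustrative; the paper's exact prior values are not given here):

```python
import random

def assign_priors(gold_seq, hmm_seq, noise, hi=0.7, lo=0.3):
    # With probability `noise`, the incorrect (HMM) sequence gets the
    # higher prior; positions where the two sequences agree get 1.0.
    favored, other = ((hmm_seq, gold_seq) if random.random() < noise
                      else (gold_seq, hmm_seq))
    return [{f: hi, o: lo} if f != o else {f: 1.0}
            for f, o in zip(favored, other)]
```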
Variants
- MAX
  Standard CRF trained on the max-prior label at each position (sketched below)
- MAX-EM
  EM with MAX in the M-step
- Multi
  Multi-CRF
- Multi-EM
  EM with Multi-CRF in the M-step
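A small sketch of the MAX reduction (hypothetical helper; the CRF training itself is omitted):

```python
def max_labels(prior_seq):
    # Collapse each position's prior distribution to its single
    # highest-probability label, yielding ordinary one-label data.
    return [max(priors, key=priors.get) for priors in prior_seq]

# Example: max_labels([{"ORG": 0.3, "LOC": 0.7}, {"O": 1.0}]) -> ["LOC", "O"]
```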
Results on CoNLL Data
Multi-EM most effective on noisier data, especially when less supervision is available.
When is Learning Successful?
- Effective over single-label learning with:
  - A small amount of training data (low quantity)
  - Lots of noise (low quality)
- An additional label may add information in this setting.
Conclusion
- Presented novel models for learning structured predictors from multi-labeled data, in the presence of noise
- Experimental results on real-world data
- Analyzed when learning in such a setting is successful