Learning Joint Semantic Parsers from Disjoint Data
Hao Peng1, Sam Thomson2, Swabha Swayamdipta2Noah A. Smith1
1University of Washington 2Carnegie Mellon University
@NAACL June 4, 2018
Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam - - PowerPoint PPT Presentation
Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam Thomson 2 , Swabha Swayamdipta 2 Noah A. Smith 1 1 University of Washington 2 Carnegie Mellon University @NAACL June 4, 2018 Motivations almost Larger data Better
Learning Joint Semantic Parsers from Disjoint Data
Hao Peng1, Sam Thomson2, Swabha Swayamdipta2Noah A. Smith1
1University of Washington 2Carnegie Mellon University
@NAACL June 4, 2018
Motivations
❖ Larger data
Better performance
almost
❖ Overlaps among different theories
Learning Joint Semantic Parsers from Disjoint Data
FrameNet vs. semantic dependencies Different structures; no parallel annotations
Overview
Learning Joint Semantic Parsers from Disjoint Data
FrameNet vs. semantic dependencies Different structures; no parallel annotations Joint decoding Latent variables
Overview
❖ Parsing semantic spans and dependencies ❖ Joint parsing ❖ Learning with latent variables ❖ Empirical results
Outline
Input:
A few books fell in the room .
fall.v
Target: token span Lexical unit: lemma.pos
Parsing FrameNet Structures
Baker et al., (1998)
Input:
A few books fell in the room .
fall.v
Target: token span Lexical unit: lemma.pos
Output:
Parsing FrameNet Structures
A few books fell in the room .
fall.v Motion Directional Theme Place
Arguments: span + semantic roles Frame
who what when where …
Baker et al., (1998)
Input:
A few books fell in the room .
fall.v
A few books fell in the room .
fall.v Motion Directional Theme Place
Theme Place Motion Directional
fframe
Parsing FrameNet Structures
Input:
A few books fell in the room .
fall.v
A few books fell in the room .
fall.v Motion Directional Theme Place
Theme Place Motion Directional
fframe
Parsing FrameNet Structures
BiLSTM+MLPs
max
A few books fell in the room .
fall.v
frame? arg1?
arg3?
frame, args
Dynamic program
Kong et al., (2016); Swayamdipta et al., (2017)
Parsing FrameNet Structures
s.t. Decoding:
Input: Output:
MRS-derived dependencies (DM)
Parsing Semantic Dependencies
arg2
A few books fell in the room .
arg1 mwe arg1 arg1 BV top
head modifier
role label
who what when where …
Oepen et al., (2015)
A few books fell in the room .
Input:
A few books fell in the room .
Score:
Parsing Semantic Dependencies
arg2
A few books fell in the room .
arg1 mwe arg1 arg1 BV top
head mod
role
labeled arcs
Parsing Semantic Dependencies
Decoding: Linear program
AD3; Martins et al., (2011)
few books
compound
few books
arg1
fell room
arg2
? ? ? …
labeled arcs
s.t.
❖ Parsing semantic spans and dependencies ❖ Joint parsing ❖ Learning with latent variables ❖ Empirical results
Outline
Joint Parsing
Sharing parameters:
Swayamdipta et al., (2016); Hershcovich et al., (2018) A few books fell in the room .
fall.v Motion Directional Theme Place
A few books fell in the room .
Joint Parsing
A few books fell in the room .
fall.v Motion Directional Theme Place
A few books fell in the room .
This work, joint decoding:
Theme
A few books fell in the room .
fall.v Motion Directional Place
arg2 arg1 mwe arg1 arg1 BV topSharing parameters:
Swayamdipta et al., (2016); Hershcovich et al., (2018)
Joint Parsing
A few books fell in the room .
fall.v Motion Directional Theme Place
A few books fell in the room .
This work, joint decoding:
Theme
A few books fell in the room .
fall.v Motion Directional Place
arg2 arg1 mwe arg1 arg1 BV topSharing parameters:
Swayamdipta et al., (2016); Hershcovich et al., (2018)
Orthogonal
Joint Parsing
Input:
A few books fell in the room .
fall.v
Score:
Theme
A few books fell in the room .
fall.v Motion Directional Place
arg2 arg1 mwe arg1 arg1 BV topJoint Parsing
Input:
A few books fell in the room .
fall.v
=
A few books fell in the room .
fall.v Motion Directional Theme Place
arg2 arg1 mwe arg1 arg1 BV topA few books fell in the room .
DM Score
Theme
A few books fell in the room .
fall.v Motion Directional Place
arg2 arg1 mwe arg1 arg1 BV topScore:
Joint Parsing
Input:
A few books fell in the room .
fall.v
=
A few books fell in the room .
fall.v Motion Directional Theme Place
arg2 arg1 mwe arg1 arg1 BV topA few books fell in the room .
FrameNet Score DM Score Affinities between them
Theme
A few books fell in the room .
fall.v Motion Directional Place
arg2 arg1 mwe arg1 arg1 BV topScore:
Span vs. Dependencies
If both were spans
Finkel and Manning, (2009)
?
hjoint
role1 role2
hjoint
role1
hjoint
Lluís et al., (2013); Peng et al., (2017)
Span vs. Dependencies
If both were spans
Finkel and Manning, (2009)
Structural divergence
A few books fell
fall.v Motion Directional Theme
arg1 mwe arg1
?
hjoint
role1 role2
hjoint
role1
hjoint
Lluís et al., (2013); Peng et al., (2017)
Span vs. Dependencies
Structural divergence
A few books fell
fall.v Motion Directional Theme
arg1 mwe arg1
Designate a head for each span
PropBank dependencies; Surdeanu et al., (2008)
A few books fell
fall.v Theme
Span vs. Dependencies
Structural divergence
A few books fell
fall.v Motion Directional Theme
arg1 mwe arg1
Designate a head for each span
PropBank dependencies; Surdeanu et al., (2008)
A few books fell
fall.v Theme
Head selected by syntax
Collins, (2003)
Span vs. Dependencies
Structural divergence
A few books fell
fall.v Motion Directional Theme
arg1 mwe arg1
A few books fell
fall.v Theme
arg1
Designate a head for each span
PropBank dependencies; Surdeanu et al., (2008)
Span vs. Dependencies
Structural divergence
A few books fell
fall.v Motion Directional Theme
arg1 mwe arg1
This work
A few books fell
fall.v Theme
A few books fell
fall.v Theme
A few books fell
fall.v Theme
Span vs. Dependencies
Score:
A few books fell
fall.vTheme
arg1Motion Directional
+hjoint ⇣ ⌘
=
A few books fell in the room .
fall.v Motion Directional Theme Place
arg2 arg1 mwe arg1 arg1 BV topA few books fell in the room .
DM Score Affinities between them Multilinear mapping
Theme
A few books fell in the room .
fall.v Motion Directional Place
arg2 arg1 mwe arg1 arg1 BV topSpan vs. Dependencies
Decoding:
arg1 ? arg2 ?
A few books fell in the room .
fall.v
frame? arg1? arg2? arg3?
frame, args labeled arcs joint parts
Linear program Speed up by promoting sparsity
BV ?
❖ Parsing semantic spans and dependencies ❖ Joint parsing ❖ Learning with latent variables ❖ Empirical results
Outline
Learning with Latent Variables
FrameNet data DM data
Supervision
Theme
head mod
role
A few books fell
fall.v Theme
Learning with Latent Variables
Supervision
Theme
head mod
role
A few books fell
fall.v Theme
FrameNet data DM data
Learning with Latent Variables
Latent structured hinge
Yu and Joachims, (2009) frame, args labeled arcs joint parts
L = − max H ⇣ ⌘ + max H ⇣ ⌘ + δ
arg1 ? arg2 ?
A few books fell in the room .
fall.v frame? arg1? arg2? arg3? BV ? arg1 ? arg2 ?
A few books fell in the room .
fall.v BV ? Motion Directional Theme Place
labeled arcs joint parts
FrameNet data
Learning with Latent Variables
Latent structured hinge
Yu and Joachims, (2009) frame, args labeled arcs joint parts
L = − max H ⇣ ⌘ + max H ⇣ ⌘ + δ
arg1 ? arg2 ?
A few books fell in the room .
fall.v frame? arg1? arg2? arg3? BV ? arg1 ? arg2 ?
A few books fell in the room .
fall.v BV ? Motion Directional Theme Place
labeled arcs joint parts
cost
FrameNet data
Prediction
Learning with Latent Variables
Latent structured hinge
Yu and Joachims, (2009) frame, args labeled arcs joint parts
Gold FN output
L = − max H ⇣ ⌘ + max H ⇣ ⌘ + δ
arg1 ? arg2 ?
A few books fell in the room .
fall.v frame? arg1? arg2? arg3? BV ? arg1 ? arg2 ?
A few books fell in the room .
fall.v BV ? Motion Directional Theme Place
labeled arcs joint parts
FrameNet data
Learning with Latent Variables
Latent structured hinge
Yu and Joachims, (2009) frame, args labeled arcs joint parts
L = − max H ⇣ ⌘ + max H ⇣ ⌘ + δ
arg1 ? arg2 ?
A few books fell in the room .
fall.v frame? arg1? arg2? arg3? BV ? arg1 ? arg2 ?
A few books fell in the room .
fall.v BV ? Motion Directional Theme Place
labeled arcs joint parts
FrameNet data
❖ Parsing semantic spans and dependencies ❖ Joint parsing ❖ Learning with latent variables ❖ Empirical results
Outline
FrameNet Results
Compared models:
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
Open-SESAME: Swayamdipta et al., (2017)
FrameNet Results
Compared models:
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
Pipeline Predict both frames and arguments
Open-SESAME: Swayamdipta et al., (2017)
FrameNet Results
Compared models:
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
FrameNet & PropBank FrameNet & Syntax FrameNet & DM
Open-SESAME: Swayamdipta et al., (2017)
FrameNet Results
Compared models:
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
Share LSTMs & embeddings Joint decoding FrameNet only
Open-SESAME: Swayamdipta et al., (2017)
FrameNet Results
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
F1 60 64 68 72 76 80
single ensemble
Frame and argument F1, FrameNet 1.5 test set Open-SESAME: Swayamdipta et al., (2017)
FrameNet Results
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
F1 60 64 68 72 76 80
single ensemble
2× 2× 5× 10×
Frame and argument F1, FrameNet 1.5 test set Open-SESAME: Swayamdipta et al., (2017)
FrameNet Results
FitzGerald et al. (2015) Open-SESAME Yang & Mitchell (2017) This work Basic w/o joint decoding Frame & Arg. ID. Multitask Learning Joint Decoding
F1 60 64 68 72 76 80
single ensemble
Frame and argument F1, FrameNet 1.5 test set Open-SESAME: Swayamdipta et al., (2017)
2× 5× 10× 2×
Accuracy 82 84 86 88 90 92
Hartmann Y & M Hermman This work
Frame ID. accuracy, FrameNet 1.5 test set
Accuracy 82 84 86 88 90 92
Hartmann Y & M Hermman This work
Frame ID. accuracy, FrameNet 1.5 test set
This work NeurboParser
Labeled F1 82 84 86 88 90 92
in-domain
DM labeled F1 SemEval ’15 test set NeurboParser: Peng et al., (2017)
Conclusion
Problem
Conclusion
Problem Method
Conclusion
Problem Method Results
Addressee thank.v Judgement Direct Address
arg2 Communicator
arg1