Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam - PowerPoint PPT Presentation

Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam Thomson 2 , Swabha Swayamdipta 2 � Noah A. Smith 1 1 University of Washington 2 Carnegie Mellon University @NAACL June 4, 2018

Motivations almost ❖ Larger data Better performance ❖ Overlaps among di ff erent theories

Overview Learning Joint Semantic Parsers from Disjoint Data FrameNet vs. semantic dependencies Di ff erent structures; no parallel annotations

Overview Joint decoding Latent variables Learning Joint Semantic Parsers from Disjoint Data FrameNet vs. semantic dependencies Di ff erent structures; no parallel annotations

Outline ❖ Parsing semantic spans and dependencies ❖ Joint parsing ❖ Learning with latent variables ❖ Empirical results

Parsing FrameNet Structures Input: Target: token span A few books fell in the room . fall.v Lexical unit: lemma.pos Baker et al., (1998)

Parsing FrameNet Structures Input: Target: token span A few books fell in the room . fall.v Lexical unit: lemma.pos Output: Frame who what A few books fell in the room . when fall.v where Motion Theme Place … Directional Arguments: span + semantic roles Baker et al., (1998)

Parsing FrameNet Structures Input: A few books fell in the room . fall.v Score: � � A few books fell in the room . F fall.v Motion Theme Place Directional = � � � � � � f frame + f arg + f arg Motion Directional Theme Place

Parsing FrameNet Structures Input: A few books fell in the room . fall.v Score: � � A few books fell in the room . F fall.v Motion Theme Place Directional = � � � � � � f frame + f arg + f arg Motion Directional Theme Place BiLSTM+MLPs

Parsing FrameNet Structures Decoding: Dynamic program Kong et al., (2016); Swayamdipta et al., (2017) � � A few books fell in the room . F max fall.v arg1? arg2? arg3? frame? frame, args • non-overlapping s.t. • consistency • …

Parsing Semantic Dependencies Input: A few books fell in the room . Output: MRS-derived dependencies (DM) top arg2 mwe arg1 arg1 arg1 BV A few books fell in the room . who what role label when where head modifier … Oepen et al., (2015)

Parsing Semantic Dependencies Input: A few books fell in the room . Score: top � � arg2 G mwe arg1 arg1 arg1 BV A few books fell in the room . = X role � � BiLSTM+MLPs g head mod labeled arcs

Parsing Semantic Dependencies Decoding: Linear program AD 3 ; Martins et al., (2011) � compound � arg2 ? ? G few books fell room max arg1 ? … few books labeled arcs • consistency s.t. • determinism • …

Joint Parsing Sharing parameters: Swayamdipta et al., (2016); Hershcovich et al., (2018) top arg2 � � � � G F arg1 mwe arg1 arg1 BV A few books fell in the room . fall.v A few books fell in the room . Motion Place Theme Directional Shared LSTMs

Joint Parsing Sharing parameters: Swayamdipta et al., (2016); Hershcovich et al., (2018) top arg2 � � � � G F arg1 mwe arg1 arg1 BV A few books fell in the room . fall.v A few books fell in the room . Motion Place Theme Directional Shared LSTMs This work, joint decoding: top ⇣ ⌘ arg2 arg1 mwe arg1 arg1 BV H A few books fell in the room . fall.v Motion Place Theme Directional

Joint Parsing Sharing parameters: Swayamdipta et al., (2016); Hershcovich et al., (2018) top arg2 � � � � G F arg1 mwe arg1 arg1 BV A few books fell in the room . fall.v A few books fell in the room . Motion Place Theme Directional Shared LSTMs This work, joint decoding: Orthogonal top ⇣ ⌘ arg2 arg1 mwe arg1 arg1 BV H A few books fell in the room . fall.v Motion Place Theme Directional

Joint Parsing Input: A few books fell in the room . fall.v Score: top ⇣ arg2 ⌘ arg1 mwe arg1 arg1 BV H A few books fell in the room . fall.v Motion Place Theme Directional

Joint Parsing Input: A few books fell in the room . fall.v Score: top ⇣ arg2 ⌘ arg1 mwe arg1 arg1 BV H A few books fell in the room . fall.v Motion Place Theme Directional = top arg2 � � � � F + G A few books fell in the room . arg1 mwe arg1 arg1 BV fall.v A few books fell in the room . Motion Place Theme Directional FrameNet Score DM Score

Joint Parsing Input: A few books fell in the room . fall.v Score: top ⇣ arg2 ⌘ arg1 mwe arg1 arg1 BV H A few books fell in the room . fall.v Motion Place Theme Directional = top arg2 � � � � F + G A few books fell in the room . arg1 mwe arg1 arg1 BV fall.v A few books fell in the room . Motion Place Theme Directional � � + h joint ? FrameNet Score DM Score A ffi nities between them

Span vs. Dependencies � � ? h joint If both were dependencies Lluís et al., (2013); Peng et al., (2017) role1 � � h joint head mod role2 If both were spans Finkel and Manning, (2009) � � role1 h joint role2

Span vs. Dependencies � � ? h joint If both were dependencies Lluís et al., (2013); Peng et al., (2017) role1 � � h joint head mod role2 If both were spans Finkel and Manning, (2009) � � role1 h joint role2 Structural divergence mwe arg1 arg1 A few books fell fall.v Motion Theme Directional

Span vs. Dependencies Structural divergence mwe arg1 arg1 A few books fell fall.v Motion Theme Directional Designate a head for each span PropBank dependencies; Surdeanu et al., (2008) A few books fell fall.v Theme

Span vs. Dependencies Structural divergence mwe arg1 arg1 A few books fell fall.v Motion Theme Directional Designate a head for each span PropBank dependencies; Surdeanu et al., (2008) Head selected by syntax Collins, (2003) A few books fell fall.v Theme

Span vs. Dependencies Structural divergence mwe arg1 arg1 A few books fell fall.v Motion Theme Directional Designate a head for each span PropBank dependencies; Surdeanu et al., (2008) arg1 A few books fell fall.v Theme

Span vs. Dependencies Structural divergence mwe arg1 arg1 A few books fell fall.v Motion Theme Directional This work A few books fell A few books fell A few books fell fall.v fall.v fall.v Theme Theme Theme

Span vs. Dependencies Score: top ⇣ ⌘ arg2 arg1 mwe arg1 arg1 BV H A few books fell in the room . fall.v Motion Place Theme Directional = top arg2 � � � � F + G A few books fell in the room . arg1 mwe arg1 arg1 BV fall.v A few books fell in the room . Motion Place Theme Directional ⇣ ⌘ arg1 A few books fell + h joint fall.v Motion Theme Directional FrameNet Score DM Score A ffi nities between them Multilinear mapping

Span vs. Dependencies Decoding: ⇣ ⌘ arg1 ? arg2 ? BV ? max H A few books fell in the room . fall.v arg1? frame? arg2? arg3? frame, args labeled arcs joint parts Linear program Speed up by promoting sparsity

Learning with Latent Variables FrameNet data DM data

Learning with Latent Variables FrameNet data DM data Supervision Supervision Theme Theme role role head mod head mod A few books fell A few books fell fall.v fall.v Theme Theme

Learning with Latent Variables Latent structured hinge Yu and Joachims, (2009) arg1 ? arg2 ? BV ? ⇣ ⌘ L = − max H A few books fell in the room . fall.v Theme Motion Place labeled arcs Directional joint parts arg1 ? arg2 ? BV ? ⇣ ⌘ + δ + max H A few books fell in the room . fall.v frame, args arg1? frame? arg2? arg3? labeled arcs joint parts FrameNet data

Learning with Latent Variables Latent structured hinge Yu and Joachims, (2009) arg1 ? arg2 ? BV ? ⇣ ⌘ L = − max H A few books fell in the room . fall.v Theme Motion Place labeled arcs Directional joint parts arg1 ? arg2 ? BV ? ⇣ ⌘ + δ + max H A few books fell in the room . fall.v frame, args arg1? frame? arg2? arg3? labeled arcs joint parts cost Prediction FrameNet data

Learning with Latent Variables Latent structured hinge Yu and Joachims, (2009) Gold FN output arg1 ? arg2 ? BV ? ⇣ ⌘ L = − max H A few books fell in the room . fall.v Theme Motion Place labeled arcs Directional joint parts arg1 ? arg2 ? BV ? ⇣ ⌘ + δ + max H A few books fell in the room . fall.v frame, args arg1? frame? arg2? arg3? labeled arcs joint parts FrameNet data

Learning with Latent Variables Latent structured hinge Yu and Joachims, (2009) arg1 ? arg2 ? BV ? ⇣ ⌘ L = − max H A few books fell in the room . fall.v Theme Motion Place labeled arcs Directional joint parts arg1 ? arg2 ? BV ? ⇣ ⌘ + δ + max H A few books fell in the room . fall.v frame, args arg1? frame? arg2? arg3? labeled arcs joint parts FrameNet data

Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam - PowerPoint PPT Presentation

Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam Thomson 2 , Swabha Swayamdipta 2 Noah A. Smith 1 1 University of Washington 2 Carnegie Mellon University @NAACL June 4, 2018 Motivations almost Larger data Better

Slide 16 1. Disjoint 2. Not disjoint 3. Disjoint 4. Not disjoint 5. Disjoint Slide 18 Slide 25

Data Structures for Disjoint Set Union-Find Data Structure Disjoint Set Data Structure Disjoint

S 3 identified by a rep. identified by a rep. n n = # of = # of Make Make- -Set

Disjoint Sets and Disjoint sets The UNION-FIND ADT for disjoint sets the UNION-FIND

Scanners and parsers COMP 520 Fall 2010 Scanners and Parsers (2) A scanner or lexer transforms a

CSE 326: Data Structures Maintain a set of pairwise disjoint sets. Disjoint Sets

LR Parsing Compiler Design CSE 504 Shift-Reduce Parsing 1 LR Parsers 2 SLR and LR(1) Parsers

Objectives Combinator Parsing Show how to build complex parsers by composing simpler parsers.

XML Parsers Asst. Prof. Dr. Kanda Runapongsa Saikaew (krunapon@kku.ac.th) Dept. of Computer

CS406: Compilers Spring 2020 Week 5: Parsers, AST, and Semantic Routines 1 Recap 2 3

Data Structures for representative member. Disjoint Sets ! Operations: Make-Set(x): create a

Disjoint sets March 20, 2020 Cinda Heeren / Andy Roth / Geoffrey Tien 1 A data structure for

Scaling Semantic Parsers with On-the-Fly Ontology Matching Tom Kwiakowski, Eunsol Choi, Yoav

Dynamics of Disjoint Hypercyclic Operators: Hypercyclicity vs. Disjoint Hypercyclicity Rebecca

Disjoint Sets - Part 2 Todays announcements: PA3 out, due 29 March 11:59p Todays Plan

A disjoint union theorem for trees Konstantinos Tyros University of Warwick Mathematics

Kerberos Created in MIT Athena project 1988. Has been in wide use in the USA. It has been

Project AutoMate SESAME: Dynamic Context Aware Access Control G. Zhang, The AutoMate Group The

Sesame: A Secure and Convenient Mobile Solution for Passwords Dr. Mehrdad Aliasgari , Nick Sabol,

SESAME: una collaborazione senza frontiere Attilio Milanese 7 ottobre 2013 CERN & SESAME:

C OMET M C N AUGHT 2007 H ALLEY S C OMET 1986 H ALLEY S C OMET 1986 16 km x 8 km C OMET

Click to edit Master title style April 12 th 2017 Click to edit Master subtitle style Roisin

5G ESSENCE Embedded Network Services for 5G Experiences IEEE 5G Summit, Thessaloniki, July 11 th

Scalable SPARQL Querying of Large RDF Graphs Jiewen Huang, Daniel J. Abadi and Kun Ren Yale

Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam - PowerPoint PPT Presentation

Learning Joint Semantic Parsers from Disjoint Data Hao Peng 1 , Sam Thomson 2 , Swabha Swayamdipta 2 Noah A. Smith 1 1 University of Washington 2 Carnegie Mellon University @NAACL June 4, 2018 Motivations almost Larger data Better

Slide 16 1. Disjoint 2. Not disjoint 3. Disjoint 4. Not disjoint 5. Disjoint Slide 18 Slide 25

Data Structures for Disjoint Set Union-Find Data Structure Disjoint Set Data Structure Disjoint

S 3 identified by a rep. identified by a rep. n n = # of = # of Make Make- -Set

Disjoint Sets and Disjoint sets The UNION-FIND ADT for disjoint sets the UNION-FIND

Scanners and parsers COMP 520 Fall 2010 Scanners and Parsers (2) A scanner or lexer transforms a

CSE 326: Data Structures Maintain a set of pairwise disjoint sets. Disjoint Sets

LR Parsing Compiler Design CSE 504 Shift-Reduce Parsing 1 LR Parsers 2 SLR and LR(1) Parsers

Objectives Combinator Parsing Show how to build complex parsers by composing simpler parsers.

XML Parsers Asst. Prof. Dr. Kanda Runapongsa Saikaew (krunapon@kku.ac.th) Dept. of Computer

CS406: Compilers Spring 2020 Week 5: Parsers, AST, and Semantic Routines 1 Recap 2 3

Data Structures for representative member. Disjoint Sets ! Operations: Make-Set(x): create a

Disjoint sets March 20, 2020 Cinda Heeren / Andy Roth / Geoffrey Tien 1 A data structure for

Scaling Semantic Parsers with On-the-Fly Ontology Matching Tom Kwiakowski, Eunsol Choi, Yoav

Dynamics of Disjoint Hypercyclic Operators: Hypercyclicity vs. Disjoint Hypercyclicity Rebecca

Disjoint Sets - Part 2 Todays announcements: PA3 out, due 29 March 11:59p Todays Plan

A disjoint union theorem for trees Konstantinos Tyros University of Warwick Mathematics

Kerberos Created in MIT Athena project 1988. Has been in wide use in the USA. It has been

Project AutoMate SESAME: Dynamic Context Aware Access Control G. Zhang, The AutoMate Group The

Sesame: A Secure and Convenient Mobile Solution for Passwords Dr. Mehrdad Aliasgari , Nick Sabol,

SESAME: una collaborazione senza frontiere Attilio Milanese 7 ottobre 2013 CERN &amp; SESAME:

C OMET M C N AUGHT 2007 H ALLEY S C OMET 1986 H ALLEY S C OMET 1986 16 km x 8 km C OMET

Click to edit Master title style April 12 th 2017 Click to edit Master subtitle style Roisin

5G ESSENCE Embedded Network Services for 5G Experiences IEEE 5G Summit, Thessaloniki, July 11 th

Scalable SPARQL Querying of Large RDF Graphs Jiewen Huang, Daniel J. Abadi and Kun Ren Yale

SESAME: una collaborazione senza frontiere Attilio Milanese 7 ottobre 2013 CERN & SESAME: