Tree-Adjoining Grammar Parsing and Vector Representations of Supertags
Jungo Kasai, Yale University
December 14, 2017
Outline
1. Background and Motivations
2. Supertagging Models
3. Parsing Models
4. Vector Representations of Supertags
5. Ongoing TAG Parsing Work
6. Applications of TAG
7. Future Work
Syntactic Parsing
[Parse tree of "John really likes Mary": S dominates NP (John) and VP; the VP contains AdvP (really) and V (likes) with object NP (Mary)]
Why do we need parsing? Does John love Mary, or does Mary love John? Understanding a sentence depends on its structure.
Context Free Grammars
[Parse tree of "John really likes Mary"]
S → NP VP
VP → AdvP VP
VP → V NP
AdvP → really
NP → Mary
NP → they
NP → John
V → like
V → likes
These production rules generate sentences.
Context Free Grammars
[Parse tree of "John really likes Mary", with the same production rules]
Fundamental problem: constraints are distributed over separate rules. How do we choose between V → like and V → likes?
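The problem can be made concrete with a toy random generator over the rules above (a hypothetical sketch; the function and variable names are ours, not from the slides). Because each rule fires independently, nothing blocks agreement violations such as "they likes Mary":

```python
import random

# Toy CFG from the slide; symbols absent from GRAMMAR are terminals.
GRAMMAR = {
    "S":    [["NP", "VP"]],
    "VP":   [["AdvP", "VP"], ["V", "NP"]],
    "AdvP": [["really"]],
    "NP":   [["Mary"], ["they"], ["John"]],
    "V":    [["like"], ["likes"]],
}

def generate(symbol="S", depth=0, max_depth=6):
    """Expand a nonterminal by picking a random production."""
    if symbol not in GRAMMAR:              # terminal: emit the word itself
        return [symbol]
    rules = GRAMMAR[symbol]
    if depth >= max_depth:                 # cap the recursion of VP -> AdvP VP
        rules = [r for r in rules if symbol not in r] or rules
    rule = random.choice(rules)
    return [w for s in rule for w in generate(s, depth + 1, max_depth)]

# Each rule is chosen independently, so "they" can co-occur with "likes".
print(" ".join(generate()))
```

This is exactly the localization failure TAG addresses: the subject NP and the verb's inflection are chosen by separate, independent rules.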
Tree-Adjoining Grammar
Tree-Adjoining Grammar (TAG) localizes grammatical constraints:
A finite set of lexicalized elementary trees.
A finite set of operations (substitution and adjunction) combines elementary trees.
[Elementary trees: transitive "likes", intransitive "sleep", NP "John", and the VP auxiliary tree for "really"]
Tree-Adjoining Grammar
Substitution
Tree-Adjoining Grammar
Adjunction
Tree-Adjoining Grammar
Adjunction allows for unbounded recursion while still enforcing agreement. John smartly occasionally really only likes Mary...
Derivation Tree
The derivation tree records the operations and forms a dependency tree: each token has exactly one parent.
[Derivation tree for "John really likes Mary": ROOT → likes; likes → John (Subst0), really (ADJ), Mary (Subst1)]
Two Steps in TAG Parsing
Now the reverse process:
Supertagging: assign an elementary tree (supertag) to each token, similar to POS tagging.
Parsing: predict the operations on the elementary trees.
[Two candidate supertags for "left": an intransitive clause tree and a subject-relative auxiliary tree]
Supertagging is a bottleneck
Supertagger     Parser        Stag Acc  UAS    LAS
Gold            Chart (MICA)  100.00    97.60  97.30
Maxent (MICA)   Chart (MICA)  88.52     87.60  85.80
Supertagging is "almost parsing": there are about 5,000 supertags in the grammar, and about half of them occur only once in the training data (PTB WSJ Sections 1-22).
BiLSTM Supertagging
Figure: BiLSTM Supertagger Architecture.
Supertagging is still a bottleneck
Supertagger     Parser        Stag Acc  UAS    LAS
Gold            Chart (MICA)  100.00    97.60  97.30
Maxent (MICA)   Chart (MICA)  88.52     87.60  85.80
BiLSTM          Chart (MICA)  89.32     90.05  88.32
We can compensate for supertagging errors by exploiting structural similarities across elementary trees. Such similarities are not utilized by the chart parser, so we use two alternative families of parsing algorithms.
Parsing Models
Prior work: unlexicalized chart parser (MICA) [Bangalore et al., 2009].
Unlexicalized transition-based parser [Kasai et al., 2017, Friedman et al., 2017].
Graph-based parser (work in progress).
Transition-based Parsing
Arc-Eager System (MALT) [Nivre et al., 2006]
[Target derivation tree for "John really likes Mary": ROOT → likes; likes → John (Subst0), really (ADJ), Mary (Subst1)]
Transition-based TAG Parsing
How do we learn? Represent the configuration by the top k elements of the stack and buffer, {s_i, b_i}_{i=1}^{k} [Chen and Manning, 2014]. Represent s_i (b_i) by its TAG elementary tree and the substitution operations already performed into s_i, and encode the elementary trees and substitution operations with dense vectors.
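A minimal sketch of the arc-eager transition system on the running example. The action sequence below is a hand-written oracle for illustration; the actual parser predicts each action from the dense configuration features described above:

```python
def arc_eager(words, actions):
    """Run a fixed action sequence through the arc-eager system
    and collect the resulting (head, dependent, label) arcs."""
    stack, buffer, arcs = ["ROOT"], list(words), []
    for action, label in actions:
        if action == "SHIFT":
            stack.append(buffer.pop(0))
        elif action == "LEFT":    # arc from buffer front to stack top; pop stack
            arcs.append((buffer[0], stack.pop(), label))
        elif action == "RIGHT":   # arc from stack top to buffer front; shift
            arcs.append((stack[-1], buffer[0], label))
            stack.append(buffer.pop(0))
        elif action == "REDUCE":
            stack.pop()
    return arcs

# "John really likes Mary": likes heads everything, as in the derivation tree.
gold = arc_eager(
    ["John", "really", "likes", "Mary"],
    [("SHIFT", None), ("SHIFT", None),
     ("LEFT", "ADJ"), ("LEFT", "Subst0"),   # likes <- really, likes <- John
     ("RIGHT", "ROOT"),                     # ROOT -> likes
     ("RIGHT", "Subst1")],                  # likes -> Mary
)
print(gold)
```

Each configuration (stack tops, buffer fronts) is what the neural model sees when scoring the next action.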
NN Transition-based Parsing Model
Figure: Transition-based Parser Neural Network Architecture.
Example
"John really likes Mary"

Stack       Buffer  Relations                                   Action
ROOT likes  Mary    {(ROOT, likes, ROOT), (likes, John, 0), …}  RIGHT:1
Parsing Results
                  Gold Stags      Predicted Stags
Parsing Model     UAS     LAS     UAS     LAS
MICA Chart        97.60   97.30   90.05   88.32
Transition-based  97.67   97.45   90.23   88.77

Table: Results on Section 00. Beam size 16.
Predicted supertags are from our BiLSTM supertagger.
Embeddings for Elementary Trees
Induced Embeddings
The input is a one-hot vector for each supertag, and randomly initialized embedding weights are trained. The embeddings thus have no a priori knowledge of the syntactic properties of the elementary trees.
PCA Plots of Vector Representations
Figure: Declarative/subject-relative alignment (atomic embeddings). Ex: "the man sneezed" vs. "the man who sneezed".
PCA Plots of Vector Representations
Figure: Transitive/intransitive alignment (atomic embeddings). Ex: "the man who devoured the pizza" vs. "the man who sneezed".
Analogy tests
Semantic analogies have been used to test word embeddings (e.g. [Mikolov et al., 2013]):
king : man :: queen : woman ⇒ vec(king) − vec(man) + vec(woman) ≈ vec(queen)
Analogy tests
We use syntactic analogies to test supertag embeddings:
trans. : intrans. :: subj.rel.trans. : subj.rel.intrans. ⇒ vec(trans.) − vec(intrans.) + vec(subj.rel.intrans.) ≈ vec(subj.rel.trans.)
[The four supertags in the analogy: transitive, intransitive, subject-relative transitive, and subject-relative intransitive elementary trees]
Formulation of Tests
Syntactic transformations used to construct analogies: subject relativization, object relativization, subject wh-movement, object wh-movement, transitivization, passivization with a by-phrase, passivization without a by-phrase, infinitivization, and dative shift.
Analogy Test Results
n     # equations  % correct  Avg. position
300   246          50.40      7.98
4724  57220        4.62       289.48

Table: Analogy task results. (n restricts which supertags are considered: for a given n, only equations whose supertags all fall among the n most common supertags are included.)
% correct: the percentage of equations for which the left-hand side's closest cosine neighbor is the right-hand side.
Avg. position: the rank of the correct right-hand side in the list of supertag embeddings ordered by cosine distance from the left-hand side.
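The rank metric can be sketched as a cosine-neighbor test. The embeddings below are tiny hypothetical 2-d vectors chosen so the transitivity offset is shared, not the learned supertag embeddings:

```python
from math import sqrt

def cos(u, v):
    """Cosine similarity of two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v)))

def analogy_rank(emb, a, b, c, d):
    """Rank of supertag d among answers to 'a - b + c', by cosine similarity.
    Rank 1 means the analogy equation is counted as correct; the three
    query terms are excluded from the candidate list, as in word2vec."""
    query = [x - y + z for x, y, z in zip(emb[a], emb[b], emb[c])]
    ranked = sorted((t for t in emb if t not in (a, b, c)),
                    key=lambda t: cos(query, emb[t]), reverse=True)
    return ranked.index(d) + 1

# Hypothetical embeddings: the transitivity offset [1, 0] is shared across
# the declarative and subject-relative supertag pairs.
emb = {
    "trans":            [1.0, 0.0],
    "intrans":          [0.0, 0.0],
    "subj_rel_trans":   [1.0, 1.0],
    "subj_rel_intrans": [0.0, 1.0],
    "obj_rel_trans":    [2.0, 1.0],   # distractor
}
print(analogy_rank(emb, "trans", "intrans", "subj_rel_intrans", "subj_rel_trans"))  # -> 1
```

Averaging `rank == 1` over all equations gives "% correct"; averaging the rank itself gives "Avg. position".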
Graph-based Parsing
[McDonald et al., 2005]: score n² directed edges (n potential parents for each of the n tokens in a sentence) using features, then find the maximum spanning tree (greedy + cycle fix).
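The decoding step can be sketched as follows. This is a simplification of Chu-Liu/Edmonds that literally does "greedy + cycle fix": pick each token's best head, then break any cycle with the cheapest re-attachment; it ignores the single-root constraint and the full contraction step, and the score matrix is hypothetical rather than learned:

```python
def greedy_mst(scores):
    """'Greedy + cycle fix' MST decoding (simplified Chu-Liu/Edmonds).
    scores[h][d] is the score of the arc h -> d; token 0 is ROOT.
    Returns head[d] for every token d (head[0] is unused)."""
    n = len(scores)
    head = [0] * n
    for d in range(1, n):  # greedy step: best-scoring head per token
        head[d] = max((h for h in range(n) if h != d), key=lambda h: scores[h][d])

    def find_cycle():
        for start in range(1, n):
            seen, node = [], start
            while node != 0 and node not in seen:
                seen.append(node)
                node = head[node]
            if node != 0:  # the walk revisited a node: tail of `seen` is a cycle
                return seen[seen.index(node):]
        return None

    cycle = find_cycle()
    while cycle:
        # Cycle fix: re-attach one cycle node to a head outside the cycle,
        # choosing the swap that loses the least score.
        loss, d, h = min((scores[head[d]][d] - scores[h][d], d, h)
                         for d in cycle for h in range(n)
                         if h != d and h not in cycle)
        head[d] = h
        cycle = find_cycle()
    return head

# Hypothetical 4-node instance (ROOT + 3 tokens): the greedy step creates
# the cycle 1 <-> 2, which the fix re-attaches to ROOT.
S = [[0, 5, 4, 8],
     [0, 0, 10, 2],
     [0, 10, 0, 2],
     [0, 1, 1, 0]]
print(greedy_mst(S))  # -> [0, 0, 1, 0]
```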
Graph-based TAG Parsing
Comparison between transition-based and graph-based parsing
Rich feature representations with parse history vs. global training/inference:
Transition-based parsers have parse history that naturally relates supertags differing only by a single operation (e.g. transitive/intransitive), but they suffer from error propagation.
Graph-based parsers assign scores independently.
Our graph-based parser uses BiLSTM feature representations (still no parse history).
New Results
Parser                          UAS    LAS
MICA Chart                      86.66  84.90
Transition-based Parsing        90.97  89.68
Joint Graph Parsing (POS+Stag)  93.26  91.89

Table: Parsing results on the test set.
Syntactically-oriented Textual Entailment
[Xu et al., 2017]. E.g.:
"The guy who left the room saw a squirrel"
⇒ "The guy left the room"
⇒ "The guy saw a squirrel"
⇏ "The room saw a squirrel"
⇏ "The guy saw an animal" (not purely syntactic)
Method: parse the original sentence and the hypothesis; transform the parses using properties of supertags; if the original sentence's parse subsumes the hypothesis's, answer YES.
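The subsumption step can be sketched as an arc-containment test over derivation-tree arcs. The arcs below are hand-built for the running example (the supertag-based transformations are omitted, and the labels follow the Subst/ADJ notation used earlier):

```python
def entails(premise_arcs, hypothesis_arcs):
    """Crude subsumption test: the premise entails the hypothesis if every
    labeled derivation-tree arc of the hypothesis appears in the premise."""
    return set(hypothesis_arcs) <= set(premise_arcs)

# (head, dependent, operation) arcs for "the guy who left the room saw a squirrel".
premise = {("saw", "guy", "Subst0"), ("saw", "squirrel", "Subst1"),
           ("guy", "the", "ADJ"), ("squirrel", "a", "ADJ"),
           ("guy", "left", "ADJ"),            # relative clause adjoins to "guy"
           ("left", "room", "Subst1"), ("room", "the", "ADJ")}

hyp_yes = {("saw", "guy", "Subst0"), ("saw", "squirrel", "Subst1"),
           ("guy", "the", "ADJ"), ("squirrel", "a", "ADJ")}
hyp_no = {("saw", "room", "Subst0")}          # "The room saw a squirrel"

print(entails(premise, hyp_yes), entails(premise, hyp_no))  # -> True False
```

"The room saw a squirrel" fails because its subject arc never occurs in the premise parse, even though all of its words do.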
Syntactically-oriented Textual Entailment
[Derivation trees: the parse of "the guy who left the room saw a squirrel" subsumes the parse of "the guy saw a squirrel"]
Syntactically-oriented Textual Entailment
[Derivation trees: the parse of "the guy who left the room saw a squirrel" subsumes the parse of "the guy left the room"]
Syntactically-oriented Textual Entailment
System                        %A    %P    %R    F1
[Rimell and Clark, 2010]      72.4  79.6  62.8  70.2
[Ng et al., 2010]             70.4  68.3  80.1  73.7
[Lien, 2014]                  70.7  88.6  50.0  63.9
Transition-based TAG Parsing  72.4  85.4  56.4  68.0
Graph-based Method            78.1  86.3  68.6  76.4

Table: PETE test results. Precision (P), recall (R), and F1 are calculated for "entails."
Supertag-based Semantic Role Labeling
Who did what to whom?
"Peter [WHO] hit Mary [WHOM] with a ball [WHAT] yesterday [WHEN]"
Supertag-based Semantic Role Labeling
Syntactic parsing and SRL are closely related. Can supertags be used instead of full parses? (Work in progress.)
Supertag-based Semantic Role Labeling
Non-ensemble System             P     R     F1
[FitzGerald et al., 2015]       –     –     87.3
[Roth and Lapata, 2016]         90.0  85.5  87.7
[Marcheggiani et al., 2017]     88.7  86.8  87.7
[Marcheggiani and Titov, 2017]  89.1  86.8  88.0
Supertag-based                  89.0  88.2  88.6
Table: Non-ensemble system results on the CoNLL-2009 in-domain test set for English.
Future Work
Further improvement of parsing: add a priori syntactic knowledge from supertags? Relax the conditional independence assumption in our graph-based parser.
More applications: semantic parsing.
Acknowledgement
Thank you!
Robert Frank, Dan Friedman, Tom McCoy, William Merrill, Alexis Nasr, Dragomir Radev, Owen Rambow, and Pauli Xu
Computational Linguistics at Yale (CLAY) lab
Bangalore, S., Boullier, P., Nasr, A., Rambow, O., and Sagot, B. (2009). MICA: A probabilistic dependency parser based on Tree Insertion Grammars. In NAACL HLT 2009 (Short Papers).

Chen, D. and Manning, C. (2014). A fast and accurate dependency parser using neural networks. In Proceedings of EMNLP 2014, pages 740–750, Doha, Qatar. Association for Computational Linguistics.

FitzGerald, N., Täckström, O., Ganchev, K., and Das, D. (2015). Semantic role labeling with neural network factors. In Proceedings of EMNLP 2015, pages 960–970, Lisbon, Portugal. Association for Computational Linguistics.

Friedman, D., Kasai, J., McCoy, R. T., Frank, R., Davis, F., and Rambow, O. (2017). Linguistically rich vector representations of supertags for TAG parsing. In Proceedings of the 13th International Workshop on Tree Adjoining Grammars and Related Formalisms, pages 122–131, Umeå, Sweden. Association for Computational Linguistics.

Kasai, J., Frank, R., McCoy, R. T., Rambow, O., and Nasr, A. (2017). TAG parsing with neural networks and vector representations of supertags. In Proceedings of EMNLP.

Lien, E. (2014). Using minimal recursion semantics for entailment recognition. In Proceedings of the Student Research Workshop at EACL 2014, Gothenburg, Sweden.

Marcheggiani, D., Frolov, A., and Titov, I. (2017). A simple and accurate syntax-agnostic neural model for dependency-based semantic role labeling. In Proceedings of CoNLL 2017, pages 411–420, Vancouver, Canada. Association for Computational Linguistics.

Marcheggiani, D. and Titov, I. (2017). Encoding sentences with graph convolutional networks for semantic role labeling. In Proceedings of EMNLP 2017, pages 1507–1516, Copenhagen, Denmark. Association for Computational Linguistics.

McDonald, R., Pereira, F., Ribarov, K., and Hajic, J. (2005). Non-projective dependency parsing using spanning tree algorithms. In Proceedings of HLT/EMNLP 2005, pages 523–530, Vancouver, British Columbia, Canada. Association for Computational Linguistics.

Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems 26, pages 3111–3119. Curran Associates, Inc.

Ng, D., Constable, J. W., Honnibal, M., and Curran, J. R. (2010). SCHWA: PETE using CCG dependencies with the C&C parser. In Proceedings of the 5th International Workshop on Semantic Evaluation.

Nivre, J., Hall, J., and Nilsson, J. (2006). MaltParser: A data-driven parser-generator for dependency parsing. In LREC.

Rimell, L. and Clark, S. (2010). Cambridge: Parser evaluation using textual entailment by grammatical relation comparison.