SLIDE 1

Straight to the Tree: Constituency Parsing with Neural Syntactic Distance

Yikang Shen*, Zhouhan Lin*, Athul Paul Jacob, Alessandro Sordoni, Aaron Courville, Yoshua Bengio
University of Montreal, Microsoft Research, University of Waterloo

SLIDE 2

Overview

  • Motivation
  • Syntactic Distance based Parsing Framework
  • Model
  • Experimental Results
SLIDE 3

Overview

  • Motivation
  • Syntactic Distance based Parsing Framework
  • Model
  • Experimental Results
SLIDE 4

ICLR 2018: Neural Language Modeling by Jointly Learning Syntax and Lexicon

Syntactic distance + structured self-attention yields a single model that is both:
  • an LSTM language model (61 ppl), and
  • an unsupervised constituency parser (68 UF1).

Supervised Constituency Parsing with Syntactic Distance?

[Shen et al. 2018]

SLIDE 5

Chart Neural Parsers
  1. High computational cost: the complexity of CYK is O(n^3).
  2. Complicated loss function.

Transition-Based Neural Parsers
  1. Greedy decoding: may produce an incomplete tree (the shift and reduce steps may not match).
  2. Exposure bias: the model is never exposed to its own mistakes during training.

[Stern et al., 2017; Cross and Huang, 2016]

SLIDE 6

Overview

  • Motivation
  • Syntactic Distance based Parsing Framework
  • Model
  • Experimental Results
SLIDE 7

Intuitions

Only the order of the splits (or combinations) matters for reconstructing the tree. Can we model that order directly?

SLIDE 8

Syntactic distance

For the split points, the syntactic distances should share the same ordering as the heights of the corresponding nodes (in the slide's figure, split points S1, S2 correspond to nodes N1, N2).

SLIDE 9

Convert to binary tree

[Stern et al., 2017]

SLIDE 10

Tree to Distance

The height of each non-terminal node is the maximum height of its children, plus 1.
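To make this concrete, here is a minimal Python sketch of the tree-to-distance conversion, assuming a toy binary-tree encoding (a leaf is a string, an internal node a pair); the encoding and function name are illustrative, not the paper's implementation:

```python
def tree_to_distance(tree):
    """Return (height, distances): the subtree's height and one syntactic
    distance per split point, in left-to-right order."""
    if isinstance(tree, str):              # leaf: height 0, no split points
        return 0, []
    left_h, left_d = tree_to_distance(tree[0])
    right_h, right_d = tree_to_distance(tree[1])
    height = max(left_h, right_h) + 1      # max height of children, plus 1
    # the split between the two children is assigned this node's height
    return height, left_d + [height] + right_d

# Example with a small binarized tree for "she enjoys playing tennis .":
tree = ("she", (("enjoys", ("playing", "tennis")), "."))
print(tree_to_distance(tree))              # -> (4, [4, 2, 1, 3])
```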

SLIDE 11

Tree to Distance

[Figure: the example tree annotated with internal-node labels S, VP, the collapsed unary label S-VP, and ∅; leaf labels ∅, NP, ∅, ∅, NP, ∅]

SLIDE 12

Distance to Tree

The split point for each bracket is the one with the maximum syntactic distance.
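The inverse conversion is just as small; a sketch mirroring the hypothetical tree_to_distance above, recursively splitting each span at its largest distance:

```python
def distance_to_tree(words, distances):
    """Rebuild the binary tree by recursively splitting each span at the
    split point with the maximum syntactic distance."""
    if len(words) == 1:
        return words[0]
    i = max(range(len(distances)), key=distances.__getitem__)
    left = distance_to_tree(words[:i + 1], distances[:i])
    right = distance_to_tree(words[i + 1:], distances[i + 1:])
    return (left, right)

# Round-trips the earlier example:
print(distance_to_tree(["she", "enjoys", "playing", "tennis", "."],
                       [4, 2, 1, 3]))
# -> ('she', (('enjoys', ('playing', 'tennis')), '.'))
```

Note that only the relative order of the distances matters here, not their values; this is why a ranking loss over the distances (rather than a regression loss) is enough for training.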

SLIDE 13

Distance to Tree

SLIDE 14

Overview

  • Motivation
  • Syntactic Distance based Parsing Framework
  • Model
  • Experimental Results
SLIDE 15

Framework for inferring the distances and labels

  • Distances
  • Labels for leaf nodes
  • Labels for non-leaf nodes

SLIDE 16

Inferring the distances

Distances

SLIDE 17

Inferring the distances
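The slide's architecture figure is not reproduced in this transcript. Below is a hedged PyTorch sketch in the spirit of the model: a bi-LSTM over the words, then a width-2 convolution over adjacent positions so that each of the n-1 split points receives a scalar distance. The dimensions are illustrative, and the paper's actual network differs in details (e.g., its exact stack of layers and its use of tag embeddings):

```python
import torch
import torch.nn as nn

class DistanceModel(nn.Module):
    def __init__(self, vocab_size, emb_dim=100, hidden_dim=200):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.word_lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True,
                                 bidirectional=True)
        # kernel size 2: one feature vector per split point between words
        self.split_conv = nn.Conv1d(2 * hidden_dim, hidden_dim, kernel_size=2)
        self.split_lstm = nn.LSTM(hidden_dim, hidden_dim, batch_first=True,
                                  bidirectional=True)
        self.out = nn.Linear(2 * hidden_dim, 1)   # scalar distance per split

    def forward(self, word_ids):
        x = self.embed(word_ids)                   # (B, n, emb)
        h, _ = self.word_lstm(x)                   # (B, n, 2*hidden)
        s = self.split_conv(h.transpose(1, 2))     # (B, hidden, n-1)
        s = torch.relu(s).transpose(1, 2)          # (B, n-1, hidden)
        s, _ = self.split_lstm(s)                  # (B, n-1, 2*hidden)
        return self.out(s).squeeze(-1)             # (B, n-1) distances
```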

SLIDE 18

Pairwise learning-to-rank loss for distances

A variant of the hinge loss, applied pairwise to the predicted distances.

SLIDE 19

Pairwise learning-to-rank loss for distances

The loss is a pairwise hinge over the split points, comparing the predicted distances $\hat{d}$ against the gold distances $d$:

$$\mathcal{L}^{rank} = \sum_{i,j}\begin{cases}\max(0,\ 1 - (\hat{d}_i - \hat{d}_j)) & \text{while } d_i > d_j\\[2pt]\max(0,\ 1 - (\hat{d}_j - \hat{d}_i)) & \text{while } d_i < d_j\end{cases}$$
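A minimal vectorized PyTorch sketch of this loss (the function is illustrative, not the released code):

```python
import torch

def rank_loss(d_hat, d_gold):
    """Pairwise hinge ranking loss over the (n-1,) split-point distances:
    every pair with strictly ordered gold distances contributes
    max(0, 1 - sign(d_i - d_j) * (d_hat_i - d_hat_j))."""
    pred_diff = d_hat.unsqueeze(1) - d_hat.unsqueeze(0)   # [i, j] = d_hat_i - d_hat_j
    gold_sign = torch.sign(d_gold.unsqueeze(1) - d_gold.unsqueeze(0))
    hinge = torch.relu(1.0 - gold_sign * pred_diff)
    # count each unordered pair once and skip gold ties
    mask = torch.triu(torch.ones_like(hinge), diagonal=1) * (gold_sign != 0)
    return (hinge * mask).sum()
```

Because the loss only constrains pairwise orderings, the model may place the predicted distances anywhere on the real line as long as their ranking matches the gold tree.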

SLIDE 20

Framework for inferring the distances and labels

  • Distances
  • Labels for leaf nodes
  • Labels for non-leaf nodes

SLIDE 21

Framework for inferring the distances and labels

  • Labels for leaf nodes
  • Labels for non-leaf nodes

SLIDE 22

Inferring the Labels

SLIDE 23

Inferring the Labels

SLIDE 24

Inferring the Labels
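The figures for these slides are not reproduced in this transcript. In the framework, two label sequences are needed: one label per word for the leaf nodes and one label per split point for the internal nodes (both sets include the empty label ∅). A hedged sketch of such label heads as plain softmax classifiers, with illustrative names and dimensions:

```python
import torch.nn as nn

class LabelHeads(nn.Module):
    """Illustrative classifiers on top of the same hidden features used
    for the distances: one head per word (leaf labels) and one head per
    split point (internal-node labels)."""
    def __init__(self, hidden_dim, n_leaf_labels, n_internal_labels):
        super().__init__()
        self.leaf_head = nn.Linear(hidden_dim, n_leaf_labels)
        self.internal_head = nn.Linear(hidden_dim, n_internal_labels)

    def forward(self, word_feats, split_feats):
        # word_feats: (B, n, hidden); split_feats: (B, n-1, hidden)
        return self.leaf_head(word_feats), self.internal_head(split_feats)
```

The label heads can be trained with ordinary cross-entropy alongside the ranking loss on the distances.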

SLIDE 25

Putting it together

SLIDE 26

Putting it together
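Tying the hypothetical sketches above together, inference is one forward pass followed by the greedy top-down decode, with no search and no sequential transitions; all names below come from the earlier sketches:

```python
import torch

def parse(words, model, vocab):
    """Predict split-point distances for a sentence, then decode greedily."""
    ids = torch.tensor([[vocab[w] for w in words]])
    with torch.no_grad():
        d_hat = model(ids)[0]              # (n-1,) predicted distances
    return distance_to_tree(words, d_hat.tolist())
```

This avoids both the O(n^3) chart of CYK decoding and the step-by-step shift-reduce decisions of transition parsers.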

SLIDE 27

Overview

  • Motivation
  • Syntactic Distance based Parsing Framework
  • Model
  • Experimental Results
SLIDE 28

Experiments: Penn Treebank

SLIDE 29

Experiments: Chinese Treebank

SLIDE 30

Experiments: Detailed statistics in PTB and CTB

SLIDE 31

Experiments: Ablation Test

SLIDE 32

Experiments: Parsing Speed

SLIDE 33

Conclusions and Highlights

  • A novel constituency parsing scheme: predicting the tree structure from a set of real-valued scalars (syntactic distances).
  • Completely free from compounding errors.
  • Strong performance compared to previous models.
  • Significantly more efficient than previous models.
  • Easy deployment: the architecture of the model is no more than a stack of standard recurrent and convolutional layers.
SLIDE 34

One more thing... Why does it work now?

  • Rank losses have been well studied in the learning-to-rank literature since 2005 (Burges et al., 2005).
  • Models good at learning these syntactic distances were not widely known until the rediscovery of LSTMs in 2013 (Graves, 2013).
  • Efficient regularization methods for LSTMs did not mature until 2017 (Merity et al., 2017).

SLIDE 35

Thank you!

Questions?

Yikang Shen, Zhouhan Lin
MILA, Université de Montréal
{yikang.shn, lin.zhouhan}@gmail.com

Paper: Code: