

slide-1
SLIDE 1

Dynamic Programming for Linear-Time Incremental Parsing

Liang Huang

Information Sciences Institute University of Southern California

(Joint work with Kenji Sagae, USC/ICT)

JHU CLSP Seminar September 14, 2010

slide-2
SLIDE 2

Remembering Fred Jelinek (1932-2010)

  • Prof. Jelinek hosted my visit and this talk on what turned out to be his last day.

He was very supportive of this work, which relates to his own work on structured language models; I dedicate this talk to his memory.

slide-3
SLIDE 3

DP for Incremental Parsing

Ambiguity and Incrementality

One morning in Africa, I shot an elephant in my pajamas; how he got into my pajamas I’ll never know.

  • NLP is (almost) all about ambiguity resolution
  • human beings resolve ambiguity incrementally
slide-8
SLIDE 8

CS 562 - Intro

Ambiguities in Translation

Google translate: carefully slide

slide-13
SLIDE 13

CS 562 - Intro

If you are stolen...

Google translate: Once the theft to the police

slide-15
SLIDE 15

CS 562 - Intro

  • or even...

clear evidence that NLP is used in real life!

slide-17
SLIDE 17

DP for Incremental Parsing

Ambiguities in Parsing

I feed cats nearby in the garden ...

  • let’s focus on dependency structures for simplicity
  • ambiguous attachments of nearby and in
  • ambiguity explodes exponentially with sentence length
  • must design efficient (polynomial) search algorithm
  • typically using dynamic programming (DP); e.g. CKY

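The exponential blow-up can be made concrete: the number of binary-branching analyses of a span grows as the Catalan numbers. A quick illustrative sketch (the standard recurrence, not code from the talk):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def catalan(n):
    """C_n = number of binary trees with n internal nodes (Catalan recurrence)."""
    if n <= 1:
        return 1
    return sum(catalan(i) * catalan(n - 1 - i) for i in range(n))

# ambiguity explodes with sentence length
print(catalan(5), catalan(10), catalan(20))   # 42 16796 6564120420
```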
slide-21
SLIDE 21

DP for Incremental Parsing

But full DP is too slow...

I feed cats nearby in the garden ...

  • full DP (like CKY) is too slow (cubic-time)
  • while human parsing is fast & incremental (linear-time)
  • how about incremental parsing then?
  • yes, but only with greedy search (accuracy suffers)
  • explores tiny fraction of trees (even w/ beam search)
  • can we combine the merits of both approaches?
  • a fast, incremental parser with dynamic programming?
  • explores exponentially many trees in linear-time?
slide-24
SLIDE 24

DP for Incremental Parsing

Linear-Time Incremental DP

  • incremental parsing (e.g. shift-reduce): fast (linear-time), but greedy search (Nivre 04; Collins/Roark 04; ...)
  • full DP (e.g. CKY): principled search, but slow (cubic-time) (Eisner 96; Collins 99; ...)
  • this work: fast shift-reduce parsing with dynamic programming ☺ ☺

slide-25
SLIDE 25

DP for Incremental Parsing

Big Picture

  • natural languages parsed by humans: psycholinguistics
  • natural languages parsed by computers: NLP ☹
  • programming languages parsed by computers: compiler theory (LR, LALR, ...)

slide-27
SLIDE 27

DP for Incremental Parsing

Preview of the Results

  • very fast linear-time dynamic programming parser
  • best reported dependency accuracy on PTB/CTB
  • explores exponentially many trees (and outputs forest)

[plots: parsing time (secs) vs. sentence length for Charniak, Berkeley, MST, and this work; number of trees explored (10^0 to 10^10) vs. sentence length: DP exponential, non-DP beam search fixed]

slide-30
SLIDE 30

DP for Incremental Parsing

Outline

  • Motivation
  • Incremental (Shift-Reduce) Parsing
  • Dynamic Programming for Incremental Parsing
  • Experiments

12

slide-31
SLIDE 31

DP for Incremental Parsing

Shift-Reduce Parsing

I feed cats nearby in the garden.

step  action     stack (top at right)    queue
0     -          (empty)                 I feed cats ...
1     shift      I                       feed cats nearby ...
2     shift      I feed                  cats nearby in ...
3     l-reduce   feed(I)                 cats nearby in ...
4     shift      feed(I) cats            nearby in the ...
5a    r-reduce   feed(I, cats)           nearby in the ...
5b    shift      feed(I) cats nearby     in the garden ...

here feed(I) denotes the tree rooted at feed with dependent I; steps 5a and 5b are the two choices at step 5: a shift-reduce conflict

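The walkthrough above can be sketched in code. This is an illustrative toy, not the talk's parser: a tree is a (head, children) pair, and the three actions follow the slides.

```python
# Toy shift / l-reduce / r-reduce on "I feed cats ...". A tree is (head, children);
# all names and data structures here are illustrative, not the talk's implementation.

def shift(stack, queue):
    stack.append((queue.pop(0), []))      # push next word as a one-node tree
    return stack, queue

def l_reduce(stack, queue):
    s0, s1 = stack.pop(), stack.pop()     # s1 becomes a left dependent of s0
    stack.append((s0[0], [s1] + s0[1]))
    return stack, queue

def r_reduce(stack, queue):
    s0, s1 = stack.pop(), stack.pop()     # s0 becomes a right dependent of s1
    stack.append((s1[0], s1[1] + [s0]))
    return stack, queue

stack, queue = [], "I feed cats nearby in the garden .".split()
for action in (shift, shift, l_reduce, shift, r_reduce):   # steps 1-4 and 5a
    stack, queue = action(stack, queue)
print(stack)   # [('feed', [('I', []), ('cats', [])])]
```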
slide-39
SLIDE 39

DP for Incremental Parsing

Choosing Parser Actions

  • score each action using features f and weights w
  • features are drawn from a local window
  • abstraction (or signature) of a state -- this inspires DP!
  • weights trained by structured perceptron (Collins 02)

21

... s2 s1 s0 | q0 q1 ...
  ← stack      queue →

features: (s0.w, s0.rc, q0, ...) = (cats, nearby, in, ...)

[example state: stack ... feed(I) cats(nearby); queue: in the garden ...]

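The scoring rule on this slide is a plain linear model, score(action) = w · f(state, action). A sketch with made-up feature strings and weights (real weights come from perceptron training):

```python
# Linear action scoring: score = sum of the weights of the fired features.
# Feature templates mirror the slide (s0.w, s0.rc, q0); the weights are made up.

def features(state, action):
    s0_w, s0_rc, q0 = state               # stack-top word, its rightmost child, next word
    return [f"s0.w={s0_w}~{action}",
            f"s0.rc={s0_rc}~{action}",
            f"q0={q0}~{action}"]

def score(weights, state, action):
    return sum(weights.get(f, 0.0) for f in features(state, action))

weights = {"s0.w=cats~shift": 0.5, "q0=in~shift": 0.2, "s0.rc=nearby~r-reduce": 0.4}
state = ("cats", "nearby", "in")          # the slide's example: (cats, nearby, in, ...)
best = max(("shift", "l-reduce", "r-reduce"), key=lambda a: score(weights, state, a))
print(best)   # shift
```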
slide-40
SLIDE 40

DP for Incremental Parsing

Greedy Search

  • each state => three new states (shift, l-reduce, r-reduce)
  • the search space is exponential
  • greedy search: always pick the best next state
slide-42
SLIDE 42

DP for Incremental Parsing

Beam Search

24

  • each state => three new states (shift, l-reduce, r-reduce)
  • the search space is exponential
  • beam search: always keep top-b states
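A sketch of the beam variant, with parser states abstracted to (stack size, queue position) and a stand-in scoring function (both illustrative):

```python
# Beam search over parser states: expand each state with the three actions,
# then prune to the top-b successors. States and scores are deliberately toy.

def beam_search(n_words, b, score_fn):
    beam = [(0, 0, 0.0)]                  # (stack size, queue position, score)
    for _ in range(2 * n_words - 1):      # a full parse takes exactly 2n-1 actions
        successors = []
        for stack, pos, s in beam:
            if pos < n_words:                                  # shift
                successors.append((stack + 1, pos + 1, s + score_fn("shift")))
            if stack >= 2:                                     # the two reduces
                successors.append((stack - 1, pos, s + score_fn("l-reduce")))
                successors.append((stack - 1, pos, s + score_fn("r-reduce")))
        beam = sorted(successors, key=lambda x: -x[2])[:b]     # keep top-b
    return beam

final = beam_search(7, b=8, score_fn=lambda a: 1.0 if a == "shift" else 0.5)
print(final[0])   # (1, 7, 10.0): one tree on the stack, all 7 words consumed
```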
slide-43
SLIDE 43

DP for Incremental Parsing

Dynamic Programming

  • each state => three new states (shift, l-reduce, r-reduce)
  • key idea of DP: share common subproblems
  • merge equivalent states => polynomial space

“graph-structured stack” (Tomita, 1988)

[plot: number of trees explored (10^0 to 10^10) vs. sentence length; DP: exponential, non-DP beam search: fixed]

each DP state corresponds to exponentially many non-DP states

slide-48
SLIDE 48

DP for Incremental Parsing

Merging Equivalent States

  • two states are equivalent if they agree on features
  • because same features guarantee same cost
  • the shift-reduce conflict: after "I feed cats", either r-reduce (cats becomes a dependent of feed) or shift "nearby"
  • assume features only look at the root of s0; then two states are equivalent if they agree on the root of s0
  • the two conflicting derivations later reach equivalent states, which are merged: (local) ambiguity-packing!
  • the merged states, linked by sh/re arcs, form a graph-structured stack

[diagrams: the two analyses of "I feed cats nearby in the garden" on the stack/queue (... s2 s1 s0 | q0 q1 ...); their sh/re successor arcs over ... feed, ... cats, ... nearby, ... in converge as equivalent states merge]

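The packing operation itself is ordinary DP: keep one entry per feature signature, with the best score for it. A toy sketch (the two-field signature and the numbers are illustrative):

```python
# Ambiguity-packing sketch: successor states that share a feature signature
# are merged, keeping the best score (the graph-structured-stack idea).
# The signature here, (root of s0, queue position), is a toy stand-in.

def expand_and_merge(packed, successors_fn):
    merged = {}
    for sig, score in packed.items():
        for new_sig, delta in successors_fn(sig):
            # same signature => same future costs, so keep only the best score
            if score + delta > merged.get(new_sig, float("-inf")):
                merged[new_sig] = score + delta
    return merged

# after the shift-reduce conflict, both derivations shift "nearby":
packed = {("cats", 3): 1.0, ("feed", 3): 0.8}       # two competing states
step = expand_and_merge(packed, lambda sig: [(("nearby", 4), 0.5)])
print(step)   # {('nearby', 4): 1.5} : the two states packed into one
```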
slide-61
SLIDE 61

DP for Incremental Parsing

Theory: Polynomial-Time DP

  • this DP is exact and polynomial-time if features are:
  • a) bounded -- for polynomial time
  • features can only look at a local window
  • b) monotonic -- for correctness (optimal substructure)
  • features should draw no more info from trees farther away from the stack top than from trees closer to the top

  • both are intuitive: a) always true; b) almost always true

35

... s2 s1 s0 q0 q1 ...

← stack queue →

slide-62
SLIDE 62

DP for Incremental Parsing

Theory: Monotonic History

  • related: grammar refinement by annotation (Johnson, 1998)
  • annotate vertical context history (e.g., parent)
  • monotonicity: can’t annotate the grandparent without annotating the parent (otherwise DP would fail)
  • our features: left-context history instead of vertical context
  • similarly, can’t annotate s2 without annotating s1
  • but we can always design a “minimum monotonic superset”

[diagram: vertical context (parent, grandparent) vs. left context on the stack (s0, s1, s2)]

slide-63
SLIDE 63

DP for Incremental Parsing

Related Work

  • Graph-Structured Stack (Tomita 88): Generalized LR
  • GSS is just a chart viewed from left to right (e.g. Earley 70)
  • this line of work started w/ Lang (1974); stuck since 1990
  • b/c explicit LR table is impossible with modern grammars
  • Jelinek (2004) independently rediscovered GSS
  • We revived and advanced this line of work in two aspects
  • theoretical: implicit LR table based on features
  • merge and split on-the-fly; no pre-compilation needed
  • monotonic feature functions guarantee correctness (new)
  • practical: achieved linear-time performance with pruning

slide-65
SLIDE 65

DP for Incremental Parsing

Jelinek (2004)

In: M. Johnson, S. Khudanpur, M. Ostendorf, and R. Rosenfeld (eds.): Mathematical Foundations of Speech and Language Processing, 2004

[figure annotations: “graph-structured stack!”; “I don’t know anything about this paper...”]

slide-68
SLIDE 68

DP for Incremental Parsing

Jelinek (2004)

  • structured language model as graph-structured stack
  • p_SLM(a | has, show) vs. p_3gram(a | its, host)

see also (Chelba and Jelinek, 98; 00; Xu, Chelba, Jelinek, 02)

slide-70
SLIDE 70

Experiments

slide-71
SLIDE 71

DP for Incremental Parsing

Speed Comparison

41

  • 5 times faster with the same parsing accuracy

[bar chart: time (hours), non-DP vs. DP]

slide-72
SLIDE 72

DP for Incremental Parsing

Correlation of Search and Parsing

42

[scatter plot: dependency accuracy (92.2 to 93.1) vs. average model score (2365 to 2395), DP vs. non-DP]

  • better search quality <=> better parsing accuracy
slide-73
SLIDE 73

DP for Incremental Parsing

Search Space: Exponential

43

[plot: number of trees explored (10^0 to 10^10) vs. sentence length; DP: exponential, non-DP: fixed (beam-width)]

slide-74
SLIDE 74

DP for Incremental Parsing

N-Best / Forest Oracles

44

[plot: oracle accuracy vs. k; DP forest oracle (98.15), DP k-best in forest, non-DP k-best in beam]

slide-75
SLIDE 75

DP for Incremental Parsing

Better Search => Better Learning

45

  • DP leads to faster and better learning w/ perceptron
slide-76
SLIDE 76

DP for Incremental Parsing

Learning Details: Early Updates

  • greedy search: update at first error
  • beam search: update when gold is pruned (Collins/Roark 04)
  • DP search: also update when gold is “merged” (new!)
  • b/c we know gold can’t make it back to the top

[learning curves: DP vs. non-DP]

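A sketch of perceptron training with early update: decode with a beam, stop the moment the gold action prefix falls off, and update on that prefix. The feature function, action set, and toy example are illustrative stand-ins; the DP variant additionally fires the update when gold is merged below the top.

```python
# Structured perceptron with early update (after Collins & Roark 04):
# stop decoding once the gold prefix leaves the beam, update on the prefix.
# Actions, features, and the toy example below are illustrative stand-ins.

def early_update(weights, gold, actions, feat, b):
    beam = [((), 0.0)]                    # (action prefix, prefix score)
    for t in range(len(gold)):
        successors = [(p + (a,), s + weights.get(feat(p, a), 0.0))
                      for p, s in beam for a in actions]
        beam = sorted(successors, key=lambda x: -x[1])[:b]
        gold_prefix = tuple(gold[:t + 1])
        if all(p != gold_prefix for p, _ in beam):
            best = beam[0][0]             # early update: reward gold, punish best
            for i in range(len(gold_prefix)):
                gf, bf = feat(gold_prefix[:i], gold_prefix[i]), feat(best[:i], best[i])
                weights[gf] = weights.get(gf, 0.0) + 1.0
                weights[bf] = weights.get(bf, 0.0) - 1.0
            return weights
    return weights

w = early_update({("0", "shift"): -1.0}, gold=["shift", "l-reduce"],
                 actions=["shift", "l-reduce", "r-reduce"],
                 feat=lambda p, a: (str(len(p)), a), b=1)
print(w)   # gold's shift feature rises from -1.0 to 0.0; the wrong action gets -1.0
```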
slide-77
SLIDE 77

DP for Incremental Parsing

Parsing Time vs. Sentence Length

  • parsing speed (scatter plot) compared to other parsers

[scatter plot: parsing time (secs, up to 1.4) vs. sentence length (0 to 70); Charniak O(n^2.5), Berkeley O(n^2.4), MST O(n^2), this work O(n)]

slide-80
SLIDE 80

DP for Incremental Parsing

Final Results

  • much faster than major parsers (even with Python!)
  • first linear-time incremental dynamic programming parser
  • best reported dependency accuracy on Penn Treebank

parser                    time    complexity  trees searched
McDonald et al 05 - MST   0.12    O(n^2)      exponential
Koo et al 08 baseline*    -       O(n^4)      exponential
Zhang & Clark 08 single   0.11    O(n)        constant
this work                 0.04    O(n)        exponential
Charniak 00               0.49    O(n^2.5)    exponential
Petrov & Klein 07         0.21    O(n^2.4)    exponential

[bar chart: dependency accuracies (scale 89 to 93): 90.2, 91.4, 92.0, 92.1, 92.4, 92.5 across the six parsers]

*at this ACL: Koo & Collins 10: 93.0 with O(n^4)

slide-83
SLIDE 83

DP for Incremental Parsing

Final Results on Chinese

  • also the best parsing accuracy on Chinese
  • Penn Chinese Treebank (CTB 5)
  • all numbers below use gold-standard POS tags

[bar chart (scale 70 to 85): word, non-root, and root accuracies for Duan et al. 2007, Zhang & Clark 08 (single), and this work; values shown: 85.5 84.7 84.4; 85.2 84.3 83.9; 78.3 76.7 73.7]

slide-84
SLIDE 84

DP for Incremental Parsing

Conclusion

  • incremental parsing (e.g. shift-reduce): fast (linear-time), but greedy search
  • full dynamic programming (e.g. CKY): principled search, but slow (cubic-time)
  • this work: linear-time shift-reduce parsing w/ dynamic programming ☺✓

slide-86
SLIDE 86

DP for Incremental Parsing

Zoom out to Big Picture...

  • natural languages parsed by humans: psycholinguistics
  • natural languages parsed by computers: NLP ☹ → ☺?
  • programming languages parsed by computers: compiler theory
  • still a long way to go...

slide-87
SLIDE 87

DP for Incremental Parsing

Thank You

  • a general theory of DP for shift-reduce parsing
  • as long as features are bounded and monotonic
  • fast, accurate DP parser release coming soon:
  • http://www.isi.edu/~lhuang
  • future work
  • adapt to constituency parsing (straightforward)
  • other grammar formalisms like CCG and TAG
  • integrate POS tagging into the parser
  • integrate semantic interpretation

52

slide-88
SLIDE 88

DP for Incremental Parsing

How I was invited to give this talk

  • Fred attended ACL 2010 in Sweden
  • Mark Johnson mentioned this work to him
  • Fred saw my co-author Kenji Sagae giving the talk
  • but didn’t realize it was Kenji; he thought it was me
  • he emailed me (but mis-spelled my name in the address)
  • not getting a reply, he asked Kevin Knight to “forward it to Liang Haung or his student Sagae.”
  • Fred complained that my paper was very hard to read: “As you can see, I am completely confused!” And he was right.
  • finally he said, “come here to give a talk and explain it.”

53