A Two-Stage Parsing Method for Text-Level Discourse Analysis
Yizhong Wang, Sujian Li, Houfeng Wang
Peking University
ACL, August 2, 2017

Background: Text-Level Discourse Analysis
• Task: identifying the discourse structure of a text.
• Goal: parse a text into a tree with nuclearity and relation labels.
• Rhetorical Structure Theory [Mann and Thompson, 1988]
Example (wsj-0699), segmented into units e1 to e5:
[The European Community’s consumer price index rose a provisional 0.6% in September from August]e1 [and was up 5.3% from September 1988,]e2 [according to Eurostat, the EC's statistical agency.]e3 [The month-to-month rise in the index was the largest since April,]e4 [Eurostat said.]e5
[Figure: RST tree over e1 to e5 with internal spans e1:2, e1:3, e4:5 and root e1:5. The root joins e1:3 (Satellite) and e4:5 (Nucleus). Relation labels shown: Evaluation, Attribution (twice), Comparison.]

Background: Transition-Based Method [Marcu, 1999; Sagae, 2009]
• Initial state: Stack: (empty) | Queue: e1, e2, e3, …
• Shift action: the first unit in the queue (e5) is pushed onto the stack. Stack: e1:3, e4, e5 | Queue: e6, e7, …
• Reduce action: the top two stack items (e4, e5) are merged into one subtree e4:5. Stack: e1:3, e4:5 | Queue: e6, e7, …
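The stack and queue transitions on this slide can be sketched in Python. This is a minimal illustration, assuming the action sequence is already given (in a real parser a classifier would choose each action). The names `Node` and `run_actions` are hypothetical and are not taken from the authors' code.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, List, Optional, Sequence, Tuple


@dataclass
class Node:
    """A span of units e_start..e_end; leaves have no children."""
    start: int
    end: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None

    def label(self) -> str:
        if self.start == self.end:
            return f"e{self.start}"
        return f"e{self.start}:{self.end}"


def run_actions(num_units: int, actions: Sequence[str]) -> Tuple[List[Node], Deque[Node]]:
    """Apply shift/reduce actions and return the final (stack, queue)."""
    stack: List[Node] = []
    queue: Deque[Node] = deque(Node(i, i) for i in range(1, num_units + 1))
    for action in actions:
        if action == "shift":
            if not queue:
                raise ValueError("cannot shift: queue is empty")
            # Move the first queued unit onto the stack.
            stack.append(queue.popleft())
        elif action == "reduce":
            if len(stack) < 2:
                raise ValueError("cannot reduce: fewer than two items on the stack")
            # Merge the top two stack items into one subtree.
            right = stack.pop()
            left = stack.pop()
            stack.append(Node(left.start, right.end, left, right))
        else:
            raise ValueError(f"unknown action: {action}")
    return stack, queue


# Seven units; the prefix builds e1:3 and then shifts e4 and e5.
prefix = ["shift", "shift", "reduce", "shift", "reduce", "shift", "shift"]
stack, queue = run_actions(7, prefix)
print([n.label() for n in stack], [n.label() for n in queue])

# One more reduce merges e4 and e5 into e4:5.
stack, queue = run_actions(7, prefix + ["reduce"])
print([n.label() for n in stack], [n.label() for n in queue])
```

The first call ends in the state after the shift (stack e1:3, e4, e5), the second in the state after the reduce (stack e1:3, e4:5).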

Background: Transition-Based Method [Marcu, 1999; Sagae, 2009]
• The unified framework: 42 reduce actions are designed with 3 different nuclearity types (e.g. NS) and 18 relation labels (e.g. cause).
• Reduce action combined with nuclearity and relation: a classifier chooses among Shift, Reduce-SN-Cause, Reduce-NS-Summary, Reduce-NN-Contrast, Reduce-NS-Temporal, …
Stack: e1:3, e4:5 | Queue: e6, e7, …
[Figure: subtree e4:5 over e4 and e5, marked N and S, with the relation label Attribution.]

Motivation: Naked Tree for Reducing Sparsity
[Figure, left: distribution of the 42 actions in previous transition-based parsing systems; bar labels include Shift and Reduce-NS-Elaboration. Arrow: remove relation. Figure, right: number of the 4 actions that we need to build a naked tree (without relation); bar values 19443, 11702, 4329, 3065.]
[Figure: a complete discourse parse tree (e4:5 over e4 and e5 with N/S marks and the relation label Attribution) next to a naked discourse parse tree (the same subtree with N/S marks but no relation label).]
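Removing the relation from each action label, as this slide proposes, can be sketched as follows. This is only an illustration: the label format `Reduce-<nuclearity>-<relation>` is read off the action names shown in the slides, the function names are hypothetical, and the toy action list is made up for the example (it is not treebank data).

```python
from collections import Counter
from typing import Iterable

# The three nuclearity types that appear in the slide's action names.
NUCLEARITY_TYPES = {"NN", "NS", "SN"}


def strip_relation(action: str) -> str:
    """Map a unified action such as 'Reduce-NS-Elaboration' to 'Reduce-NS'."""
    if action == "Shift":
        return action
    # Split at most twice so hyphenated relations (e.g. Manner-Means) stay whole.
    parts = action.split("-", 2)
    if len(parts) < 2 or parts[0] != "Reduce" or parts[1] not in NUCLEARITY_TYPES:
        raise ValueError(f"unexpected action label: {action!r}")
    return f"Reduce-{parts[1]}"


def naked_action_counts(actions: Iterable[str]) -> Counter:
    """Count actions after collapsing them to the 4 naked-tree actions."""
    return Counter(strip_relation(a) for a in actions)


# Hypothetical action sequence, for illustration only.
toy_actions = [
    "Shift", "Shift", "Reduce-NS-Elaboration",
    "Shift", "Reduce-NS-Summary",
    "Shift", "Shift", "Reduce-SN-Cause",
    "Reduce-NN-Contrast",
]
print(naked_action_counts(toy_actions))
```

Collapsing the labels pools the training examples of many rare actions into four classes, which is the sparsity argument made on the slide.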

Motivation: Level-Specific Relation Labelling
• Discourse relations distribute differently at different linguistic levels.
[Figure: three charts of the top-5 frequent relations, one each for Inner-Sentential, Inter-Sentential and Inter-Paragraph relations. Recovered series, level assignment not preserved:]
Elaboration 32.70%, Joint 23.00%, Explanation 10.90%, Contrast 6.60%, Evaluation 4.30%
Elaboration 43.10%, Joint 13.80%, Same-Unit 7.60%, Contrast 6.40%, Evaluation 5.90%
Elaboration 44.4%, Attribution 12.7%, Explanation 9.2%, Joint 7.6%, Enablement 5.3%

Motivation: Level-Specific Relation Labelling
• Some discourse relations tend to occur at specific linguistic levels.
[Figure: grouped bar chart of counts for Condition, Manner-Means, Textual-Organization and Topic-Change at the three levels (Inner-Sentential, Inter-Sentential, Inter-Paragraph); bar values 231, 174, 177, 102, 41, 33, 33, 17, 15, 8, 5, 0.]

Method: Two-Stage Parsing Algorithm
• Stage 1: A transition-based parsing system with only 4 actions is adopted to construct the naked tree (without labels).
• The naked tree structure could help with relation classification.
• Stage 2: Three dedicated classifiers are trained for labelling relations at three linguistic levels:
a) intra-sentential
b) inter-sentential
c) inter-paragraph
[Figure: the naked tree e1:3 over e1:2 and e3 after stage 1, and the same tree with the relation label Attribution after stage 2.]
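The stage-2 dispatch to a level-specific classifier can be sketched as below. The slides do not say how a span's level is decided, so the rule here is an assumption: a span is intra-sentential if its first and last units share a sentence, inter-sentential if they share a paragraph, and inter-paragraph otherwise. All names are hypothetical, the classifiers are placeholder stubs, and the paragraph ids in the example are assumed.

```python
from typing import Callable, Dict, Iterable, Tuple

Span = Tuple[int, int]  # inclusive, 1-based unit indices (start, end)


def span_level(span: Span, sentence_of: Dict[int, int], paragraph_of: Dict[int, int]) -> str:
    """Assumed rule: compare the sentence and paragraph ids of the span's end points."""
    start, end = span
    if sentence_of[start] == sentence_of[end]:
        return "intra-sentential"
    if paragraph_of[start] == paragraph_of[end]:
        return "inter-sentential"
    return "inter-paragraph"


def label_relations(
    spans: Iterable[Span],
    sentence_of: Dict[int, int],
    paragraph_of: Dict[int, int],
    classifiers: Dict[str, Callable[[Span], str]],
) -> Dict[Span, str]:
    """Send every internal span of the naked tree to the classifier for its level."""
    return {
        span: classifiers[span_level(span, sentence_of, paragraph_of)](span)
        for span in spans
    }


# Units e1..e3 form the first sentence of the example text, e4..e5 the second.
sentence_of = {1: 1, 2: 1, 3: 1, 4: 2, 5: 2}
# Assumption: both sentences belong to the same paragraph.
paragraph_of = {1: 1, 2: 1, 3: 1, 4: 1, 5: 1}

# Placeholder stubs standing in for the three trained classifiers.
classifiers = {
    "intra-sentential": lambda span: "<label from intra-sentential classifier>",
    "inter-sentential": lambda span: "<label from inter-sentential classifier>",
    "inter-paragraph": lambda span: "<label from inter-paragraph classifier>",
}

internal_spans = [(1, 2), (1, 3), (4, 5), (1, 5)]
for span, label in label_relations(internal_spans, sentence_of, paragraph_of, classifiers).items():
    print(span, label)
```

Because stage 1 has already fixed the tree, each stage-2 classifier sees only spans of its own level.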

Method: Features and Classifiers
• We use manually extracted features, including:
a) Parsing status, position features (only for stage 1)
b) N-gram features, dependency features, structural features, nucleus features
c) Tree features (only for stage 2): Height=1, Depth=2, SelfIsNucleus=True, ParentIsNucleus=False
• Four SVM classifiers are trained for the four classification tasks (one action classifier and three relation classifiers).
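The tree features named on this slide can be computed from a naked tree as in the sketch below. The definitions are assumptions, since the slides give only example values: height is the longest path down to a leaf (leaf = 0) and depth is the distance from the root (root = 0). The class and function names are hypothetical, and the nuclearity flags in the toy tree are illustrative values chosen so that one node reproduces the example values on the slide.

```python
from typing import Dict, Optional, Union


class TreeNode:
    """Binary discourse tree node with a back-pointer to its parent."""

    def __init__(self, name: str, is_nucleus: bool,
                 left: Optional["TreeNode"] = None,
                 right: Optional["TreeNode"] = None) -> None:
        self.name = name
        self.is_nucleus = is_nucleus
        self.left = left
        self.right = right
        self.parent: Optional["TreeNode"] = None
        for child in (left, right):
            if child is not None:
                child.parent = self


def height(node: TreeNode) -> int:
    """Longest path from this node down to a leaf (assumed: leaf = 0)."""
    children = [c for c in (node.left, node.right) if c is not None]
    if not children:
        return 0
    return 1 + max(height(c) for c in children)


def depth(node: TreeNode) -> int:
    """Number of edges from the root to this node (assumed: root = 0)."""
    steps = 0
    while node.parent is not None:
        node = node.parent
        steps += 1
    return steps


def tree_features(node: TreeNode) -> Dict[str, Union[int, bool, None]]:
    parent = node.parent
    return {
        "Height": height(node),
        "Depth": depth(node),
        "SelfIsNucleus": node.is_nucleus,
        # The root has no parent, so this feature is undefined there.
        "ParentIsNucleus": parent.is_nucleus if parent is not None else None,
    }


# Toy tree over e1..e5; nuclearity flags are illustrative assumptions.
e1_2 = TreeNode("e1:2", True, TreeNode("e1", True), TreeNode("e2", True))
e1_3 = TreeNode("e1:3", False, e1_2, TreeNode("e3", False))
e4_5 = TreeNode("e4:5", True, TreeNode("e4", True), TreeNode("e5", False))
root = TreeNode("e1:5", True, e1_3, e4_5)

print(tree_features(e1_2))
```

These features are only available once stage 1 has produced the whole tree, which is why the slide restricts them to stage 2.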

Experiments: Performance Comparison
• We evaluate our method on the RST Discourse Treebank and report the (micro-averaged) F-score:

Model                        Span   Nuclearity   Relation
Joty et al. (2013)           82.7   68.4         55.7
Feng and Hirst (2014)        85.7   71.0         58.2
Li et al. (2014)             84.0   70.8         58.6
Li et al. (2016)             85.8   71.1         58.9
Ji and Eisenstein (2014)*    82.1   71.1         61.6
Heilman and Sagae (2015)*    83.5   69.3         57.4
Ours*                        86.0   72.4         59.7
Human                        88.7   77.7         65.8

* Transition-based systems.
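What "micro-averaged" means for such scores can be sketched as follows: matches are pooled over all documents before precision and recall are computed, rather than averaging per-document scores. This is a generic illustration with hypothetical names and made-up toy spans. The exact evaluation protocol used on the treebank (which spans are counted, which label set is used) is not stated in the slides and is not reproduced here.

```python
from typing import List, Set, Tuple

LabeledSpan = Tuple[int, int, str]  # (start unit, end unit, label)


def micro_f1(gold_docs: List[Set[LabeledSpan]], pred_docs: List[Set[LabeledSpan]]) -> float:
    """Pool span counts over all documents, then compute one F-score."""
    if len(gold_docs) != len(pred_docs):
        raise ValueError("gold and predicted document lists differ in length")
    matched = gold_total = pred_total = 0
    for gold, pred in zip(gold_docs, pred_docs):
        matched += len(gold & pred)  # spans with identical boundaries and label
        gold_total += len(gold)
        pred_total += len(pred)
    if matched == 0:
        return 0.0
    precision = matched / pred_total
    recall = matched / gold_total
    return 2 * precision * recall / (precision + recall)


# Made-up toy data: two documents, three gold spans, two of them recovered.
gold_docs = [
    {(1, 2, "Elaboration"), (1, 3, "Attribution")},
    {(1, 2, "Contrast")},
]
pred_docs = [
    {(1, 2, "Elaboration"), (2, 3, "Attribution")},
    {(1, 2, "Contrast")},
]
print(round(micro_f1(gold_docs, pred_docs), 3))
```

Dropping the label from each tuple would give a span-only score, and replacing it with a nuclearity tag a nuclearity score, which corresponds to the three columns of the table.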

Experiments: Incremental Analysis of Our Method

Configuration                Span   Nuclearity   Relation
Simple Unified Framework     84.4   70.7         57.7
Two-Stage Parsing (Basic)    86.0   72.4         58.6
+ Three-Level Relation       86.0   72.4         59.4
+ Tree Features              86.0   72.4         59.7

• Two-Stage Parsing (Basic): Span ↑ 1.6%, Nuclearity ↑ 1.7%, Relation ↑ 0.9%
• + Three-Level Relation: Relation ↑ 0.8%
• + Tree Features: Relation ↑ 0.3%

Conclusions
• Summary:
  • A pipelined two-stage discourse parsing method;
  • Three-level relation classification with tree features;
  • State-of-the-art performance.
• Future work:
  • Update the features and classifiers with latest models;
  • Incorporate data from other sources.

Thank you! Contact: yizhong@pku.edu.cn Code is available: https://github.com/EastonWang/StageDP


