Simple and Effective Text Simplification Using Semantic and Neural Methods
SLIDE 1

Simple and Effective Text Simplification Using Semantic and Neural Methods

Elior Sulem, Omri Abend and Ari Rappoport

The Hebrew University of Jerusalem

ACL 2018

SLIDE 2

Elior Sulem Omri Abend Ari Rappoport

Simple and Effective Text Simplification Using Semantic and Neural Methods

SLIDE 3

Text Simplification

Original sentence: "Last year I read the book John authored."

One or several simpler sentences: "John wrote a book. I read the book."

SLIDE 4

Text Simplification

Original sentence: "Last year I read the book John authored."

One or several simpler sentences: "John wrote a book. I read the book."

Multiple motivations:

  • Preprocessing for Natural Language Processing tasks, e.g., machine translation, relation extraction, parsing
  • Reading aids and language comprehension, e.g., for people with aphasia, dyslexia, second-language learners

SLIDE 5

Text Simplification

Original sentence: "Last year I read the book John authored."

One or several simpler sentences: "John wrote a book. I read the book."

Simplification operations:

  • Structural: sentence splitting, deletion
  • Lexical: word or phrase substitution
  • Multiple operations may be combined

SLIDE 6

In this talk

  • Both structural and lexical simplification.
  • The first simplification system combining structural transformations, based on semantic structures, with neural machine translation.
  • Compares favorably to the state-of-the-art in combined structural and lexical simplification.
  • Alleviates the over-conservatism of MT-based systems.

SLIDE 7

Overview

  • 1. Current approaches and challenges
      1.1 Conservatism in MT-based simplification
      1.2 Sentence splitting in text simplification
  • 2. Direct Semantic Splitting (DSS)
      2.1 The semantic structures
      2.2 The semantic rules
  • 3. Combining DSS with Neural Text Simplification
  • 4. Experiments
  • 5. Results
  • 6. Human Evaluation Benchmark
  • 7. Conclusion

SLIDE 8

Current Approaches and Challenges

MT-Based Simplification

Sentence simplification as monolingual machine translation.

Models:

  • Phrase-based SMT (Specia, 2010; Coster and Kauchak, 2011; Wubben et al., 2012; Štajner et al., 2015)
  • Syntax-based SMT (Xu et al., 2016)
  • Neural machine translation (Nisioi et al., 2017; Zhang et al., 2017; Zhang and Lapata, 2017)

SLIDE 9

Current Approaches and Challenges

MT-Based Simplification

Sentence simplification as monolingual machine translation.

Corpora:

  • English / Simple Wikipedia (Zhu et al., 2010; Coster and Kauchak, 2011; Hwang et al., 2015)
  • Newsela (Xu et al., 2015)
SLIDE 10

Conservatism in MT-Based Simplification

  • In both SMT and NMT text simplification, a large proportion of the input sentences are not modified (Alva-Manchego et al., 2017; on the Newsela corpus).
  • This is confirmed in the present work (experiments on Wikipedia), for the NTS system (Nisioi et al., 2017) / Moses (Koehn et al., 2007):
      • 66% / 80% of the input sentences remain unchanged.
      • None of the references is identical to the source.
      • According to automatic and human evaluation, the references are indeed simpler.

Conservatism in MT-based simplification is therefore excessive.

SLIDE 11

Sentence Splitting in Text Simplification

Splitting in NMT-based simplification:

  • Sentence splitting is not addressed, due to the rareness of splittings in the simplification training corpora (Narayan and Gardent, 2014; Xu et al., 2015).
  • Recently, a corpus focusing on sentence splitting was introduced for the Split-and-Rephrase task (Narayan et al., 2017), where the other operations are not addressed.

SLIDE 12

Sentence Splitting in Text Simplification

Directly modeling sentence splitting

  • 1. Hand-crafted syntactic rules:
      • Compilation and validation can be laborious (Shardlow, 2014).
      • Many rules are often involved (e.g., 111 rules in Siddharthan and Angrosh, 2014, for relative clauses, appositions, subordination and coordination).
      • Usually language specific.
SLIDE 13

Sentence Splitting in Text Simplification

Directly modeling sentence splitting

  • 1. Hand-crafted syntactic rules:

Example: one of the two rules for relative clauses in Siddharthan, 2004.

[Diagram: a syntactic rewrite rule that detaches a relative clause (noun phrase + relative pronoun + relative clause) into a separate sentence.]

SLIDE 14

Sentence Splitting in Text Simplification

Directly modeling sentence splitting

  • 2. Using semantics for determining potential splitting points

Narayan and Gardent (2014) - HYBRID:

  • Discourse Representation Structures (DRS) for splitting and deletion.
  • Depends on the proportion of splittings in the training corpus.

We here take an intermediate path: a simple algorithm that directly decomposes the sentence into its semantic constituents.

SLIDE 15

Direct Semantic Splitting (DSS)

  • A simple algorithm that directly decomposes the sentence into its semantic components, using two splitting rules.
  • The splitting is directed by semantic parsing.
  • The semantic annotation directly captures shared arguments.
  • It can be used as a preprocessing step for other simplification operations.

[Pipeline diagram: input sentence → DSS (sentence splitting) → split sentence → NMT-based simplification (deletions, word substitutions) → output. Reduces conservatism.]

SLIDE 16

The Semantic Structures

Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • Based on typological and cognitive theories (Dixon, 2010, 2012; Langacker, 2008).

[UCCA graph for "He came back home and played piano": two Parallel Scenes (H) joined by the Linker (L) "and"; each Scene contains a Process (P) and Participants (A). Legend: Parallel Scene (H), Linker (L), Participant (A), Process (P).]

SLIDE 17

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • Stable across translations (Sulem, Abend and Rappoport, 2015)


SLIDE 18

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • Used for the evaluation of MT, GEC and Text Simplification

(Birch et al., 2016; Choshen and Abend, 2018; Sulem et al., 2018)


SLIDE 19

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • Explicitly annotates semantic distinctions, abstracting away from syntax (like AMR; Banarescu et al., 2013).
  • Unlike AMR, semantic units are directly anchored in the text.


SLIDE 20

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • UCCA parsing: TUPA parser (Hershcovich et al., 2017, 2018)
  • Shared task at SemEval 2019!


SLIDE 21

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • Scenes evoked by a Main Relation (Process or State).


SLIDE 22

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • A Scene may contain one or several Participants.
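
As a concrete (and purely hypothetical) illustration of these notions, a Scene can be modeled as a small record holding the Main Relation that evokes it and its Participants; the names below are illustrative and do not reflect the actual UCCA/TUPA API.

```python
# Hypothetical sketch of a UCCA Scene as a data record (illustrative only).
from dataclasses import dataclass, field

@dataclass
class Scene:
    main_relation: str        # the Process (P) or State (S) evoking the Scene
    relation_type: str        # "P" or "S"
    participants: list[str] = field(default_factory=list)  # Participant (A) spans

# "He came back home and played piano" yields two Parallel Scenes;
# note the shared argument "He" appears in both:
scenes = [
    Scene("came back", "P", ["He", "home"]),
    Scene("played", "P", ["He", "piano"]),
]
```
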
SLIDE 23

The Semantic Structures

[UCCA graph for "He observed the planet which has 14 satellites": the relative clause "which has 14 satellites" forms an Elaborator Scene (E) around the Center (C) "planet", with "which" as a Relator (R). Legend: Parallel Scene (H), Participant (A), Process (P), State (S), Center (C), Elaborator (E), Relator (R).]

Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • A Scene can provide additional information on an established entity: it is then an Elaborator Scene.

SLIDE 24

The Semantic Structures

[UCCA graph for "His arrival surprised Mary": the Scene "his arrival" fills a Participant (A) slot in the larger Scene. Legend: Participant (A), Process (P).]

Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • A Scene may also be a Participant in another Scene: it is then a Participant Scene.

SLIDE 25

The Semantic Structures


Semantic Annotation: UCCA (Abend and Rappoport, 2013)

  • In the other cases, Scenes are annotated as Parallel Scenes. A Linker may be included.

SLIDE 26

The Semantic Rules

Main idea: placing each Scene in a different sentence.

  • Fits with event-wise simplification (Glavaš and Štajner, 2013); here we only use semantic criteria.
  • The idea was also investigated in the context of text simplification evaluation: the SAMSA measure (Sulem, Abend and Rappoport, NAACL 2018).

SLIDE 27

The Semantic Rules

Rule 1: Parallel Scenes

"He came back home and played piano." → "He came back home. He played piano."

[Diagram: each of the two Parallel Scenes in the UCCA graph becomes a separate sentence.]

SLIDE 28

The Semantic Rules

Rule 1: Parallel Scenes

$S \rightarrow Sc_1 \mid Sc_2 \mid \dots \mid Sc_n$

The input sentence S is rewritten as its Parallel Scenes Sc_1 ... Sc_n, each placed in a separate sentence.
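
To make Rule 1 concrete, here is a minimal sketch, assuming the semantic parser has already extracted each Parallel Scene as a token list with shared arguments copied into every Scene; the interface is illustrative, not the authors' implementation.

```python
# Minimal sketch of Rule 1 (Parallel Scenes). Assumes each Parallel Scene
# arrives as a list of tokens; Linkers (e.g., "and") are simply dropped,
# and no regeneration is attempted (as in DSS).

def split_parallel_scenes(parallel_scenes: list[list[str]]) -> str:
    """S -> Sc_1 | Sc_2 | ... | Sc_n: one sentence per Parallel Scene."""
    sentences = []
    for scene_tokens in parallel_scenes:
        sentences.append(" ".join(scene_tokens).capitalize().rstrip(".") + ".")
    return " ".join(sentences)

# "He came back home and played piano." -> two Parallel Scenes:
print(split_parallel_scenes([["he", "came", "back", "home"],
                             ["he", "played", "piano"]]))
# He came back home. He played piano.
```
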

SLIDE 29

The Semantic Rules

Rule 2: Elaborator Scenes

"He observed the planet which has 14 satellites." → "He observed the planet. Planet has 14 satellites."

[Diagram: the Elaborator Scene "which has 14 satellites" is detached from the graph and realized as its own sentence, headed by the minimal center "planet".]

SLIDE 30

The Semantic Rules

Rule 2: Elaborator Scenes

$S \rightarrow S - \bigcup_i (Sc_i - C_i) \mid Sc_1 \mid \dots \mid Sc_n$

The input sentence S is rewritten as the sentence without its Elaborator Scenes (each Scene Sc_i is removed except for its minimal center C_i), followed by the Elaborator Scenes Sc_1 ... Sc_n as separate sentences.
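
A companion sketch for Rule 2, under the same assumed parse interface; the handling of the minimal center and of the relativizer is deliberately simplified relative to the paper.

```python
# Minimal sketch of Rule 2 (Elaborator Scenes). Assumes the parse marks,
# for each Elaborator Scene, its token indices and the minimal center it
# elaborates; names and interface are illustrative.

def split_elaborator_scenes(tokens, elaborators):
    """S -> S - U(Sc_i - C_i) | Sc_1 | ... | Sc_n."""
    remove, extra_sentences = set(), []
    for scene_indices, center in elaborators:
        # Remove the Scene's tokens from the matrix sentence, keeping the center.
        remove.update(i for i in scene_indices if tokens[i] != center)
        scene = [tokens[i] for i in scene_indices]
        # Crude stand-in: replace the relativizer by the minimal center.
        scene = [center if t in {"which", "who", "that"} else t for t in scene]
        extra_sentences.append(" ".join(scene).capitalize().rstrip(".") + ".")
    main = " ".join(t for i, t in enumerate(tokens) if i not in remove)
    return " ".join([main.rstrip(".") + "."] + extra_sentences)

tokens = "He observed the planet which has 14 satellites".split()
# One Elaborator Scene, "which has 14 satellites", with center "planet":
print(split_elaborator_scenes(tokens, [((4, 5, 6, 7), "planet")]))
# He observed the planet. Planet has 14 satellites.
```
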

SLIDE 31

The Semantic Rules

  • No regeneration module.
  • Grammatical errors resulting from the split are not addressed by the rules, e.g., no article regeneration.
  • The output is directly fed into the NMT component.

Example: "He observed the planet which has 14 satellites" → "He observed the planet. Planet has 14 satellites."

SLIDE 32

The Semantic Rules

  • Participant Scenes are not separated here, to avoid direct splitting in these cases:
      • Nominalizations: "His arrival surprised Mary."
      • Indirect speech: "He said John went to school."
  • More transformations would be required for splitting in these cases.
SLIDE 33

Combining DSS with Neural Text Simplification

  • After DSS, the output is fed to an MT-based simplification system.
  • We use a state-of-the-art NMT-Based TS system, NTS (Nisioi et al., 2017).
  • The combined system is called SENTS (sketched schematically below).
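
Schematically, the combination can be expressed as a two-stage pipeline; the functions below are placeholders for the components described in this talk, not a real API.

```python
# Schematic of SENTS = DSS followed by NTS (placeholder functions only).

def dss(sentence: str) -> str:
    """Direct Semantic Splitting: apply Rules 1 and 2 over the UCCA parse."""
    raise NotImplementedError  # see the rule sketches above

def nts(sentence: str, hypothesis_rank: int = 1) -> str:
    """NMT-based simplification (NTS); rank 1 = h1, rank 4 = h4."""
    raise NotImplementedError

def sents(sentence: str, hypothesis_rank: int = 1) -> str:
    """SENTS: split first, then let NTS delete and substitute words."""
    return nts(dss(sentence), hypothesis_rank)
```
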
SLIDE 34

Combining DSS with Neural Text Simplification

  • NTS was built using the OpenNMT framework (Klein et al., 2017).
  • We use the provided NTS-w2v model, where word2vec embeddings are used for initialization.
  • Beam search is used during decoding. We explore both the highest-ranked hypothesis (h1) and a lower-ranked hypothesis (h4), which is less conservative (see the sketch below).
  • The NTS model was trained on the corpus of Hwang et al., 2015 (~280K sentence pairs).
  • It was tuned on the corpus of Xu et al., 2016 (2,000 sentences with 8 references).
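
As a rough illustration of the h1/h4 selection, here is a minimal sketch of reading an n-best output file such as OpenNMT-py produces when decoding with an n-best list; the command line and file names are assumptions, not the exact setup used for NTS.

```python
# Hedged sketch: picking the h1 and h4 hypotheses from an n-best output
# file. Assumes decoding was run with something like
#   onmt_translate -model nts-w2v.pt -src split_input.txt \
#                  -output nbest.txt -beam_size 5 -n_best 4
# so that the 4 best hypotheses of each source sentence occupy 4
# consecutive lines (model and file names are illustrative).

def read_hypotheses(path: str, n_best: int, rank: int) -> list[str]:
    """Return the rank-th hypothesis (1-indexed) for every source sentence."""
    with open(path, encoding="utf-8") as f:
        lines = [line.rstrip("\n") for line in f]
    assert len(lines) % n_best == 0, "file length must be a multiple of n_best"
    return [lines[i + rank - 1] for i in range(0, len(lines), n_best)]

h1 = read_hypotheses("nbest.txt", n_best=4, rank=1)  # most conservative
h4 = read_hypotheses("nbest.txt", n_best=4, rank=4)  # less conservative
```
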
SLIDE 35

Experiments

Corpus:

Test set of Xu et al., 2016: 359 sentences, each with 8 references

Automatic evaluation:

  • BLEU (Papineni et al., 2002)
  • SARI (Xu et al., 2016)

Conservatism statistics: e.g., percentage of sentences copied from the input (%Same)
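
The %Same statistic is straightforward to compute; a minimal sketch follows (tokenization and normalization details are assumptions, not the authors' exact script).

```python
# Hedged sketch: %Same, the percentage of system outputs copied verbatim
# from the input. Normalization here (strip + lowercase) is an assumption.

def percent_same(sources: list[str], outputs: list[str]) -> float:
    """Percentage of output sentences identical to their source sentence."""
    assert len(sources) == len(outputs)
    same = sum(s.strip().lower() == o.strip().lower()
               for s, o in zip(sources, outputs))
    return 100.0 * same / len(sources)

# BLEU against the 8 references could be computed with, e.g., sacrebleu
# (one possible tool, not necessarily the paper's implementation):
#   import sacrebleu
#   bleu = sacrebleu.corpus_bleu(outputs, references)
```
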

SLIDE 36

Experiments

Human evaluation:

  • First 70 sentences of the corpus
  • 3 annotators – native English speakers
  • 4 questions for each input-output pair

      Qa: Is the output fluent and grammatical?
      Qb: Does the output preserve the meaning of the input?
      Qc: Is the output simpler than the input?
      Qd: Is the output simpler than the input, ignoring the complexity of the words?

  • 4 parameters: Grammaticality (G), Meaning Preservation (M), Simplicity (S), Structural Simplicity (StS) (see the sketch below)
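
To make the scoring concrete, here is a toy sketch of aggregating annotator answers into the four parameters; the mean-over-annotators aggregation is an assumption for illustration, not necessarily the authors' exact protocol.

```python
# Hedged sketch: averaging human ratings per parameter for one system.
# Scales follow the results slides (G, M: 1 to 5; S, StS: -2 to +2).
from statistics import mean

def aggregate(ratings: dict[str, list[float]]) -> dict[str, float]:
    """Map each parameter ("G", "M", "S", "StS") to the mean of all
    scores collected for it (e.g., 3 annotators x 70 sentences)."""
    return {param: round(mean(scores), 2) for param, scores in ratings.items()}

print(aggregate({"G": [5, 4, 5], "M": [5, 5, 4], "S": [1, 0, 1], "StS": [1, 1, 0]}))
# {'G': 4.67, 'M': 4.67, 'S': 0.67, 'StS': 0.67}
```
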

SLIDE 37

Results

Automatic evaluation (BLEU, SARI) and human evaluation (first 70 sentences):

System            BLEU   SARI   G     M     S     StS
Identity          94.93  25.44  4.80  5.00  0.00  0.00
Simple Wikipedia  69.58  39.50  4.60  4.21  0.83  0.38

G – Grammaticality: 1 to 5 scale; M – Meaning Preservation: 1 to 5 scale; S – Simplicity: -2 to +2 scale; StS – Structural Simplicity: -2 to +2 scale

Identity gets the highest BLEU score and the lowest SARI score.

SLIDE 38

Results

Automatic evaluation (BLEU, SARI) and human evaluation (first 70 sentences):

System     BLEU   SARI   G     M     S     StS
HYBRID     52.82  27.40  2.96  2.46  0.43  0.43
SENTS-h1   58.94  30.27  3.98  3.33  0.68  0.63
SENTS-h4   57.71  31.90  3.54  2.98  0.50  0.36

G – Grammaticality: 1 to 5 scale; M – Meaning Preservation: 1 to 5 scale; S – Simplicity: -2 to +2 scale; StS – Structural Simplicity: -2 to +2 scale

The two SENTS systems outperform HYBRID in terms of BLEU, SARI, G, M and S. SENTS-h1 has the best StS score.

SLIDE 39

Results

Automatic evaluation (%Same, SARI) and human evaluation (first 70 sentences):

System     %Same  SARI   G     M     S     StS
NTS-h1     66.02  28.73  4.56  4.48  0.22  0.15
NTS-h4      2.74  36.55  4.29  3.90  0.31  0.19
SENTS-h1    6.69  30.27  3.98  3.33  0.68  0.63
SENTS-h4    0.28  31.90  3.54  2.98  0.50  0.36

G – Grammaticality: 1 to 5 scale; M – Meaning Preservation: 1 to 5 scale; S – Simplicity: -2 to +2 scale; StS – Structural Simplicity: -2 to +2 scale

Compared to NTS, SENTS reduces conservatism and increases simplicity.

SLIDE 40

Results

Automatic evaluation (%Same, SARI) and human evaluation (first 70 sentences):

System     %Same  SARI   G     M     S     StS
DSS         8.64  36.76  3.42  4.15  0.16  0.16
SENTS-h1    6.69  30.27  3.98  3.33  0.68  0.63
SENTS-h4    0.28  31.90  3.54  2.98  0.50  0.36

G – Grammaticality: 1 to 5 scale; M – Meaning Preservation: 1 to 5 scale; S – Simplicity: -2 to +2 scale; StS – Structural Simplicity: -2 to +2 scale

Compared to DSS, SENTS improves grammaticality and increases structural simplicity, since deletions are performed by the NTS component.

SLIDE 41

Results

Replacing NTS by Statistical MT

  • Combination of DSS and Moses: SEMoses.
  • The behavior of SEMoses is similar to that of DSS, confirming the over-conservatism of Moses (Alva-Manchego et al., 2017) for simplification.
  • All the splitting points from the DSS phase are preserved.

Replacing the parser by manual annotation

  • In the case of SEMoses, meaning preservation is improved. Simplicity degrades, possibly due to a larger number of annotated Scenes.
  • In the case of SENTS-h1, high simplicity scores are obtained.
SLIDE 42

Human Evaluation Benchmark

  • 1,960 sentence pairs (70 source sentences × 28 system outputs)
  • 3 annotators
  • 4 parameters

Data: https://github.com/eliorsulem/simplification-acl2018

SLIDE 43

Conclusion (1)

  • We presented the first simplification system combining semantic structures and neural machine translation.
  • Our system compares favorably to the state-of-the-art in combined structural and lexical simplification.
  • This approach addresses the conservatism of MT-based systems.
  • Sentence splitting is performed without relying on a specialized corpus.
SLIDE 44

Conclusion (2)

  • Sentence splitting is treated as the decomposition of the sentence into its Scenes (as in the SAMSA evaluation measure; Sulem, Abend and Rappoport, NAACL 2018).
  • Future work will leverage UCCA's cross-linguistic applicability to support multilingual text simplification and simplification pre-processing for MT.

SLIDE 45

Thank you

eliors@cs.huji.ac.il

Elior Sulem

www.cs.huji.ac.il/~eliors

Data: https://github.com/eliorsulem/simplification-acl2018