Split and Rephrase: Better Evaluation and a Stronger Baseline Roee - PowerPoint PPT Presentation

Split and Rephrase: Better Evaluation and a Stronger Baseline Roee Aharoni and Yoav Goldberg NLP Lab, Bar Ilan University, Israel ACL 2018

Motivation

Motivation • Processing long, complex sentences is hard!

Motivation • Processing long, complex sentences is hard! • Children, people with reading disabilities, L2 learners…

Motivation • Processing long, complex sentences is hard! • Children, people with reading disabilities, L2 learners… • Sentence level NLP systems:

Motivation • Processing long, complex sentences is hard! • Children, people with reading disabilities, L2 learners… • Sentence level NLP systems: • Dependency Parsers McDonald & Nivre, 2011

Motivation • Processing long, complex sentences is hard! • Children, people with reading disabilities, L2 learners… • Sentence level NLP systems: • Dependency Parsers • Neural Machine Translation Koehn & Knowles, 2017

Motivation • Processing long, complex sentences is hard! • Children, people with reading disabilities, L2 learners… • Sentence level NLP systems: • Dependency Parsers • Neural Machine Translation Koehn & Knowles, 2017 • Can we automatically break a complex sentence into several simple ones while preserving its meaning?

The Split and Rephrase Task

The Split and Rephrase Task • Narayan, Gardent, Cohen & Shimorina, EMNLP 2017

The Split and Rephrase Task • Narayan, Gardent, Cohen & Shimorina, EMNLP 2017 • Dataset, evaluation method, baseline models

The Split and Rephrase Task • Narayan, Gardent, Cohen & Shimorina, EMNLP 2017 • Dataset, evaluation method, baseline models • Task definition: complex sentence -> several simple sentences with the same meaning

The Split and Rephrase Task • Narayan, Gardent, Cohen & Shimorina, EMNLP 2017 • Dataset, evaluation method, baseline models • Task definition: complex sentence -> several simple sentences with the same meaning Alan Bean joined NASA in 1963 where he became a member of the Apollo 12 mission along with Alfred Worden as back up pilot and David Scott as commander .

The Split and Rephrase Task • Narayan, Gardent, Cohen & Shimorina, EMNLP 2017 • Dataset, evaluation method, baseline models • Task definition: complex sentence -> several simple sentences with the same meaning Alan Bean joined NASA in 1963 where he became a member of the Apollo 12 mission along with Alfred Worden as back up pilot and David Scott as commander . Alan Bean served as a crew member of Apollo 12 . Alfred Worden was the backup pilot of Apollo 12 . Apollo 12 was commanded by David Scott . Alan Bean was selected by Nasa in 1963 .

The Split and Rephrase Task • Narayan, Gardent, Cohen & Shimorina, EMNLP 2017 • Dataset, evaluation method, baseline models • Task definition: complex sentence -> several simple sentences with the same meaning • Requires (a) identifying independent semantic units (b) rephrasing those units to single sentences Alan Bean joined NASA in 1963 where he became a member of the Apollo 12 mission along with Alfred Worden as back up pilot and David Scott as commander . Alan Bean served as a crew member of Apollo 12 . Alfred Worden was the backup pilot of Apollo 12 . Apollo 12 was commanded by David Scott . Alan Bean was selected by Nasa in 1963 .

This Work

This Work • We show that simple neural models seem to perform very on the original benchmark due to memorization of the training set

This Work • We show that simple neural models seem to perform very on the original benchmark due to memorization of the training set • We propose a more challenging data split for the task to discourage memorization

This Work • We show that simple neural models seem to perform very on the original benchmark due to memorization of the training set • We propose a more challenging data split for the task to discourage memorization • We perform automatic evaluation and error analysis on the new benchmark, showing that the task is still far from being solved

WebSplit Dataset Construction (Narayan et al. 2017)

WebSplit Dataset Construction (Narayan et al. 2017) Simple RDF Triples Simple Sentences (facts from DBpedia) Alan Bean is a US national. Alan Bean is a US national. <Alan_Bean | nationality | United_States> Alan Bean is a US national. Alan Bean was on the crew of Apollo 12. Alan Bean was on the crew of Apollo 12. <Alan_Bean | mission | Apollo_12> Alan Bean was on the crew of Apollo 12. Alan Bean was hired by NASA in 1963. <Alan_Bean | NASA selection | 1963> Alan Bean was hired by NASA in 1963. Alan Bean was hired by NASA in 1963.

WebSplit Dataset Construction (Narayan et al. 2017) Simple RDF Triples Simple Sentences (facts from DBpedia) Alan Bean is a US national. Alan Bean is a US national. <Alan_Bean | nationality | United_States> Alan Bean is a US national. Alan Bean was on the crew of Apollo 12. Alan Bean was on the crew of Apollo 12. <Alan_Bean | mission | Apollo_12> Alan Bean was on the crew of Apollo 12. Alan Bean was hired by NASA in 1963. <Alan_Bean | NASA selection | 1963> Alan Bean was hired by NASA in 1963. Alan Bean was hired by NASA in 1963. Sets of RDF triples <Alan_Bean | nationality | United_States, Alan_Bean | mission | Apollo_12, Alan_Bean | NASA selection | 1963>

WebSplit Dataset Construction (Narayan et al. 2017) Simple RDF Triples Simple Sentences (facts from DBpedia) Alan Bean is a US national. Alan Bean is a US national. <Alan_Bean | nationality | United_States> Alan Bean is a US national. Alan Bean was on the crew of Apollo 12. Alan Bean was on the crew of Apollo 12. <Alan_Bean | mission | Apollo_12> Alan Bean was on the crew of Apollo 12. Alan Bean was hired by NASA in 1963. <Alan_Bean | NASA selection | 1963> Alan Bean was hired by NASA in 1963. Alan Bean was hired by NASA in 1963. Complex Sets of RDF triples Sentences Alan Bean, born in the United States, was selected <Alan_Bean | nationality | United_States, Alan Bean, born in the United States, was selected Alan Bean, born in the United States, was selected by NASA in 1963 and served as a crew member of Alan_Bean | mission | Apollo_12, by NASA in 1963 and served as a crew member of by NASA in 1963 and served as a crew member of Apollo 12. Alan_Bean | NASA selection | 1963> Apollo 12. Apollo 12.

WebSplit Dataset Construction (Narayan et al. 2017) Simple RDF Triples Simple Sentences (facts from DBpedia) Alan Bean is a US national. Alan Bean is a US national. <Alan_Bean | nationality | United_States> Alan Bean is a US national. Alan Bean was on the crew of Apollo 12. Alan Bean was on the crew of Apollo 12. <Alan_Bean | mission | Apollo_12> Alan Bean was on the crew of Apollo 12. Alan Bean was hired by NASA in 1963. <Alan_Bean | NASA selection | 1963> Alan Bean was hired by NASA in 1963. Alan Bean was hired by NASA in 1963. Matching via RDFs Complex Sets of RDF triples Sentences Alan Bean, born in the United States, was selected <Alan_Bean | nationality | United_States, Alan Bean, born in the United States, was selected Alan Bean, born in the United States, was selected by NASA in 1963 and served as a crew member of Alan_Bean | mission | Apollo_12, by NASA in 1963 and served as a crew member of by NASA in 1963 and served as a crew member of Apollo 12. Alan_Bean | NASA selection | 1963> Apollo 12. Apollo 12.

WebSplit Dataset Construction (Narayan et al. 2017) Simple RDF Triples Simple Sentences (facts from DBpedia) Alan Bean is a US national. Alan Bean is a US national. <Alan_Bean | nationality | United_States> Alan Bean is a US national. Alan Bean was on the crew of Apollo 12. Alan Bean was on the crew of Apollo 12. <Alan_Bean | mission | Apollo_12> Alan Bean was on the crew of Apollo 12. Alan Bean was hired by NASA in 1963. <Alan_Bean | NASA selection | 1963> Alan Bean was hired by NASA in 1963. Alan Bean was hired by NASA in 1963. Matching via RDFs ~1M examples Complex Sets of RDF triples Sentences Alan Bean, born in the United States, was selected <Alan_Bean | nationality | United_States, Alan Bean, born in the United States, was selected Alan Bean, born in the United States, was selected by NASA in 1963 and served as a crew member of Alan_Bean | mission | Apollo_12, by NASA in 1963 and served as a crew member of by NASA in 1963 and served as a crew member of Apollo 12. Alan_Bean | NASA selection | 1963> Apollo 12. Apollo 12.

Preliminary Experiments

Preliminary Experiments • ~1M training examples

Preliminary Experiments • ~1M training examples • “Vanilla” LSTM seq2seq with attention 1 sim ple 2 sim ple 3 sim ple comp lex sen ten ce

Split and Rephrase: Better Evaluation and a Stronger Baseline Roee - PowerPoint PPT Presentation

Split and Rephrase: Better Evaluation and a Stronger Baseline Roee Aharoni and Yoav Goldberg NLP Lab, Bar Ilan University, Israel ACL 2018 Motivation Motivation Processing long, complex sentences is hard! Motivation Processing long,

Split and Rephrase John Clancy is a labor politican who leads Birmingham, where architect John

SPL SPLIT IT CA CAST ST Installation of Split Cast Kit Split Cast Kit Rf QM 2000 2 screws

PRODUCT DECOMPOSITION Ante Rozga, University of Split, Faculty of Economics/Split - Cvite

U i U i University of Split University of Split i i f S li f S li Livanjska 5 Livanjska 5

ROCKBOX FABRIQ EDITION ITS TIME FOR FOR BETTER SOUND. BETTER DESIGN. BETTER SPECS.

single-cycle (fjnish) / pipelining 0 1 Changelog 29 September 2020: rephrase questions on stage

Split Packing: An Algorithm for Packing Circles with up to Critical Density Sebastian Morr

Split Rock to Lakefield Junction 345 kV Transmission Line SD PUC Docket No. EL05-023 Split

Better Advice, Better Lives Adults Select Committee 21 st June Usk 1 Better Advice, Better Lives

User Interface Evaluation Empirical evaluation Heuristic evaluation 1 CS 349 - UI evaluation

Chapter 12. Evaluation Research Chapter 12. Evaluation Research evaluation research? evaluation

Give us 4,000 -5,000 lbs and remain split from the Central Coast Meeting Goals: 1)

Architecture Research On Transport Information Services of EXPO 2010 Shanghai China Better City,

Stronger Together: Honoring the Past and Supporting the Future Welcome! Stronger Together:

STAY HEALTHY | RETURN SMARTER | RETURN STRONGER THANK YOU STAY HEALTHY | RETURN SMARTER | RETURN

STRONGER STRONGER Merck, KGaA Darmstadt, Germany FY 2015 results Karl-Ludwig Kley, CEO Marcus

Grassroots Organization (AGENDA) Regional Alliances (Los Angeles Metropolitan Alliance)

Setting the Course through collaboration District & Campus Improvement Plans Presentation

Graphs All The Way Down Building A GraphQL API Backed By A Graph Database William Lyon @lyonwj

Barclays Global Financial Services Conference September 2017 1 A disciplined, opportunistic and

Dr Khalid Dr Andre Mary Hikmat George Baldwin General General General Manager,

Overview The Achievement Gap Among cities that participate in NAEP, the magnitude of racial

Apollo Owuor Kenya Horticultural Exporters Ltd I NTRODUCTION Head of Agriculture and CSR

Investor Presentation Private placement of up to 190,454,000 new common shares 22 May 2019

Split and Rephrase: Better Evaluation and a Stronger Baseline Roee - PowerPoint PPT Presentation

Split and Rephrase: Better Evaluation and a Stronger Baseline Roee Aharoni and Yoav Goldberg NLP Lab, Bar Ilan University, Israel ACL 2018 Motivation Motivation Processing long, complex sentences is hard! Motivation Processing long,

Split and Rephrase John Clancy is a labor politican who leads Birmingham, where architect John

SPL SPLIT IT CA CAST ST Installation of Split Cast Kit Split Cast Kit Rf QM 2000 2 screws

PRODUCT DECOMPOSITION Ante Rozga, University of Split, Faculty of Economics/Split - Cvite

U i U i University of Split University of Split i i f S li f S li Livanjska 5 Livanjska 5

ROCKBOX FABRIQ EDITION ITS TIME FOR FOR BETTER SOUND. BETTER DESIGN. BETTER SPECS.

single-cycle (fjnish) / pipelining 0 1 Changelog 29 September 2020: rephrase questions on stage

Split Packing: An Algorithm for Packing Circles with up to Critical Density Sebastian Morr

Split Rock to Lakefield Junction 345 kV Transmission Line SD PUC Docket No. EL05-023 Split

Better Advice, Better Lives Adults Select Committee 21 st June Usk 1 Better Advice, Better Lives

User Interface Evaluation Empirical evaluation Heuristic evaluation 1 CS 349 - UI evaluation

Chapter 12. Evaluation Research Chapter 12. Evaluation Research evaluation research? evaluation

Give us 4,000 -5,000 lbs and remain split from the Central Coast Meeting Goals: 1)

Architecture Research On Transport Information Services of EXPO 2010 Shanghai China Better City,

Stronger Together: Honoring the Past and Supporting the Future Welcome! Stronger Together:

STAY HEALTHY | RETURN SMARTER | RETURN STRONGER THANK YOU STAY HEALTHY | RETURN SMARTER | RETURN

STRONGER STRONGER Merck, KGaA Darmstadt, Germany FY 2015 results Karl-Ludwig Kley, CEO Marcus

Grassroots Organization (AGENDA) Regional Alliances (Los Angeles Metropolitan Alliance)

Setting the Course through collaboration District &amp; Campus Improvement Plans Presentation

Graphs All The Way Down Building A GraphQL API Backed By A Graph Database William Lyon @lyonwj

Barclays Global Financial Services Conference September 2017 1 A disciplined, opportunistic and

Dr Khalid Dr Andre Mary Hikmat George Baldwin General General General Manager,

Overview The Achievement Gap Among cities that participate in NAEP, the magnitude of racial

Apollo Owuor Kenya Horticultural Exporters Ltd I NTRODUCTION Head of Agriculture and CSR

Investor Presentation Private placement of up to 190,454,000 new common shares 22 May 2019

Setting the Course through collaboration District & Campus Improvement Plans Presentation