Generating Disambiguating Paraphrases for Structurally Ambiguous Sentences
Manjuan Duan, Ethan Hill, Michael White
August 11-12, 2016, LAW-X
The Ohio State University, Department of Linguistics
Joint work with Manjuan Duan & Ethan Hill
2
Introduction
How can we crowd-source data for adapting parsers to new domains?
- To some extent, MTurk workers can perform meaning- and form-oriented tasks such as annotating PP-attachment points, with some training (Snow et al., 2008; Jha et al., 2010)
- Gerdes (2013) and Zeldes (2016) also found that it was possible to obtain fairly high-quality class-sourced annotations, with students receiving only a modest amount of training
- In the current study, rather than annotating syntax, we use natural language clarification questions, simply asking MTurk workers to select the right paraphrase of a structurally ambiguous sentence
3
Big picture: Just ask people what ambiguous sentences mean
[Pipeline figure: Sentence → Parser → Interp1 / Interp2 → Realizer → Para1 / Para2 → AMT: "Closer in meaning?" → Silver Data]
4
Difference from previous studies
- Aiming (ultimately) for all structural ambiguities identifiable
by an automatic parser, not confined to some specific constructions (Jha et al., 2010)
- AMT workers are making choices among paraphrases, not
annotations, and no specific tutorial is needed
5
Methods
Generating disambiguating paraphrases: An illustration
[Figure: dependency graphs and realizations for the input sentence "He stopped Godzilla with the laser"]
Top parse ("with the laser" modifies "stopped"):
- Reversal: ✗ He stopped Godzilla with the laser; ✓ With the laser, he stopped Godzilla
- Rewrite: ✓ Godzilla was stopped by him with the laser
Next parse ("with the laser" modifies "Godzilla"):
- Reversal: ✗ He stopped Godzilla with the laser
- Rewrite: ✓ Godzilla with the laser was stopped by him
6
Obtaining meaningfully distinct parses
- 1. Parse the input sentence with the OpenCCG parser to obtain its top 25 parses
- 2. Find a parse from the n-best parse list which is meaningfully distinct from the top parse (a code sketch follows this slide):
  - Only compare the unlabeled and unordered dependencies from the two parses
  - The symmetric difference cannot be empty, with neither set of dependencies a superset of the other
  - Ambiguities involving only POS, named entity or word sense differences are disregarded
- 3. If successful, this phase yields a top and next parse, the ones reflecting the greatest uncertainty
8
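A minimal Python sketch of the distinctness check in step 2, assuming each parse has already been reduced to a list of (head, label, dependent) triples over words; the helper names are illustrative, not part of the OpenCCG API.

def unlabeled_deps(parse):
    # Reduce a parse (list of (head, label, dependent) triples) to
    # unlabeled, unordered word pairs, so differences in POS, named
    # entities or word sense alone cannot make two parses distinct.
    return {frozenset((head, dep)) for head, _, dep in parse}

def meaningfully_distinct(top_parse, other_parse):
    # Non-empty symmetric difference, with neither dependency set a
    # superset of the other.
    top, other = unlabeled_deps(top_parse), unlabeled_deps(other_parse)
    return bool(top - other) and bool(other - top)

def find_next_parse(nbest_parses):
    # Return the first parse in the n-best list that is meaningfully
    # distinct from the top parse, or None if there is none.
    top = nbest_parses[0]
    for candidate in nbest_parses[1:]:
        if meaningfully_distinct(top, candidate):
            return candidate
    return None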
Two ways to obtain paraphrases
- Paraphrases obtained from reverse realization (reversals)
  - Able to generate paraphrases for ambiguities involving various constructions identifiable by an automatic parser
- Paraphrases obtained from logical form rewriting (rewrites)
  - Triggered by specific syntactic constructions such as PP-attachment ambiguity and modifier scope ambiguity in coordination
9
Validating reverse realizations
Need to ensure paraphrases actually disambiguate the intended meanings
- 1. Realize the top and next parse into an n-best realization list (n=25), using OpenCCG
- 2. Traverse the list to find a qualifying paraphrase (sketched in code below), which has to
  - be different from the original sentence
  - place the words involved in the ambiguity at different relative distances than in the original sentence
- 3. Parse each candidate paraphrase to make sure the most likely interpretation includes the dependencies from which it was generated
10
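A minimal sketch of the qualification test in steps 2 and 3, under the assumptions noted in the comments; reparse_top_deps stands in for a call back into the parser and is hypothetical.

def relative_distances(sentence, ambig_words):
    # Token-index gaps between the ambiguity-relevant words, used as a
    # simple proxy for their relative distance (assumption; only the
    # first occurrence of each word is considered).
    tokens = sentence.split()
    positions = sorted(tokens.index(w) for w in ambig_words if w in tokens)
    return [b - a for a, b in zip(positions, positions[1:])]

def qualifies(candidate, original, ambig_words, target_deps, reparse_top_deps):
    # (1) Must differ from the original sentence as a string.
    if candidate == original:
        return False
    # (2) The words involved in the ambiguity must sit at different
    #     relative distances than in the original.
    if relative_distances(candidate, ambig_words) == \
            relative_distances(original, ambig_words):
        return False
    # (3) Re-parsing the candidate must yield a top parse that still
    #     contains the dependencies it was generated from.
    return target_deps <= reparse_top_deps(candidate)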
Two-sided paraphrases and one-sided paraphrases
- Two-sided paraphrases: Two paraphrases are obtained for
the original sentence, one generated from the top parse, and one from the next
- One-sided paraphrases: Only one paraphrase is obtained
for the original sentence
11
Logical form rewriting
Rewritten logical forms are realized to obtain paraphrases which highlight the ambiguous part
- Passive and cleft rewrites for PP-attachment ambiguities
- Coordination rewrites for ambiguities in the scope of modifiers with coordinated phrases
12
Passive rewrites: An example
I saw the girl with the telescope.
Rewrite
⇒ The girl with the telescope was seen by me.
13
Cleft rewrites: An example
I saw the girl with the telescope.
Rewrite
⇒ The girl with the telescope was what I saw.
14
Coordination rewrites: An example (1)
The old men and women are becoming senile.
Rewrite
⇒ The old women and the old men are becoming senile
15
Coordination rewrites: An example (2)
The old men and women are becoming senile.
Rewrite
⇒ The women and the old men are becoming senile
16
Experiment
Validation experiment
Aim: Examine the quality of the crowd-sourced annotations through disambiguating paraphrases
- Used AMT workers as our naive annotators
- For comparison, we hand-annotated 1,030 sentences as the optimal (‘gold’) annotations to measure the accuracy of the crowd-sourced annotations
17
Data preparation
[Pipeline: 14,114 sentences from Big 10 football and prehistoric reptiles → Parsing and Filtering → 5,063 with top and next parses → Paraphrasing → 3,605 valid paraphrases → Selection → 1,030 items → AMT Surveys]
Working assumption: Unannotated data available in large quantities, so can focus on most informative ambiguities
18
Gold annotations
We selected the correct parse by examining the dependency graphs of each input sentence:
- Annotated ‘top’ if the top parse was correct
- Annotated ‘next’ if the next parse was correct
- Annotated ‘neither’ if neither of them was more correct
than the other one
19
Distribution of test data
20
Collecting human judgments
- 5 judgments for each sentence were collected from AMT workers, and the judgments of identical sentences were collapsed
- “Neither” cases were excluded from analysis
- Comprehension questions were asked to prevent random
choosing
- Agreement levels among the AMT workers (bucketing sketched in code after this slide):
  - Majority: > 50% agreement
  - Strong Majority: > 75%
  - Unanimity: > 90%
21
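A small sketch of the agreement bucketing above, assuming the judgments for an item are collected as a list of 'top' / 'next' labels; the labels and thresholds come from the slide, while the function name is illustrative.

from collections import Counter

def agreement_level(judgments):
    # Classify an item's judgments by the share held by the most
    # frequent label.
    label, count = Counter(judgments).most_common(1)[0]
    share = count / len(judgments)
    if share > 0.90:
        return label, "unanimity"
    if share > 0.75:
        return label, "strong majority"
    if share > 0.50:
        return label, "majority"
    return None, "no majority"

# With five judgments per item: 5/5 counts as unanimity,
# 4/5 as a strong majority, and 3/5 as a (simple) majority.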
Coverage vs. Accuracy: Higher accuracy (but lower coverage) with greater agreement
22
One-sided vs. Two-sided: Two-sided much more reliable
23
Reversals vs. Rewrites: Reversals at least as accurate
24
Potential correction to current parser
25
Manual analysis
Examined 43 sentences where unanimous AMT worker judgments did not agree with the gold annotations, and identified the following reasons for error:
- Incompetent or broken realizations (29/43)
- Bad parses (11/43)
- Lack of context (3/43)
26
Preliminary parser retraining experiment
- Trained OpenCCG Parser with majority AMT worker
annotations (along with original CCGbank data)
- Trained the parser separately in the two domains
- Evaluated the parser with 10-fold cross validation
27
Evaluation of retrained parser: an example
Parses were considered correct if the top and next dependencies occur in the same order as in gold: e.g., for the sentence "I saw the girl with the telescope", if (saw, with) is annotated as the correct dependency, the left column below shows an n-best list that counts as correct and the right column one that counts as incorrect (a code sketch of this check follows this slide):

n-best parses    Correct         Incorrect
1                ...             ...
2                (saw, with)     ...
3                ...             ...
4                ...             (girl, with)
5                (girl, with)    ...
6                ...             ...
...              ...             (saw, with)
25               ...             ...
28
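A sketch of this ranking criterion, assuming each entry in the n-best list has been reduced to its set of unlabeled dependencies; names are illustrative only.

def parser_correct(nbest_deps, gold_dep, wrong_dep):
    # nbest_deps: list of dependency sets, one per parse, best first.
    # The parser is credited when the gold dependency first appears at
    # a better (earlier) rank than the competing incorrect dependency.
    def first_rank(dep):
        for rank, deps in enumerate(nbest_deps):
            if dep in deps:
                return rank
        return len(nbest_deps)  # not found: worst possible rank
    return first_rank(gold_dep) < first_rank(wrong_dep)

# E.g., with ('saw', 'with') first appearing at rank 2 and
# ('girl', 'with') at rank 5, the parser counts as correct; the
# reverse ordering counts as incorrect.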
Parser retraining results
                 Dinosaur   Football
Train size       471        356
Eval size        291        226
Original acc.    0.701      0.668
Retrained acc.   0.749      0.717
Correction rate  0.243      0.32
- McNemar's chi-square test shows a significant improvement in the dinosaur domain (p = 0.02); a sketch of the test follows this slide
- No significant improvement on the football data, due to the smaller data size
- The retrained parsers do not differ significantly from the original parser (p > 0.05 for both) on the CCGbank development set
29
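A sketch of the significance test reported above, applying McNemar's test to paired per-sentence outcomes of the original vs. retrained parser; the statsmodels call exists as shown, but the way the outcomes are paired here is an assumption rather than the study's exact procedure.

from statsmodels.stats.contingency_tables import mcnemar

def mcnemar_pvalue(original_correct, retrained_correct):
    # Both arguments: equal-length lists of booleans, one entry per
    # evaluation sentence. Build the 2x2 table of agreements and
    # disagreements between the two parsers.
    both = orig_only = retr_only = neither = 0
    for o, r in zip(original_correct, retrained_correct):
        if o and r:
            both += 1
        elif o:
            orig_only += 1
        elif r:
            retr_only += 1
        else:
            neither += 1
    table = [[both, orig_only], [retr_only, neither]]
    # exact=False gives the chi-square variant of the test.
    return mcnemar(table, exact=False, correction=True).pvalue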
Conclusions
Conclusions and future work
- It is possible to obtain accurate crowd-sourced judgments from naive annotators with no instruction, pointing the way towards collecting parser training data on a massive scale
- The preliminary parsing experiment already suggests that automatic parsers can be retrained to achieve better parsing accuracy
- In the future, we plan to experiment with parser adaptation with multiple parsers and larger data sets
- We also plan to experiment with generating paraphrases with sentence splitting and simplification (Siddharthan, 2006; Siddharthan, 2011)
30
Acknowledgments
We thank James Curran, Eric Fosler-Lussier, the OSU Clippers Group and the anonymous reviewers for helpful comments and discussion. This work was supported in part by NSF grant 1319318.
31
Thank you!
31
Incompetent realizations
Realization is OK, but fails to reliably capture the meaning difference between the parses; usually this involved just adding or deleting punctuation
32
Incompetent realizations: An example
The teeth were adapted to crush bivalves, gastropods and other animals with a shell or exoskeleton.
(animals, with): Same as the original sentence
(crush, with): The teeth were adapted to crush bivalves, gastropods and other animals, with a shell or exoskeleton.
33
Broken realizations
- Inappropriate heavy NP shift
- Long adverbials moved between verbs and their (other)
complements
- Wrong modifier-modificand word order
- Wrong position of the particle for phrasal verbs
- Wrong preposition-complement position
34
Broken realizations: An example
They are thought to have gone extinct during the Triassic-Jurassic extinction event.
(gone, during): They are thought to have gone during the Triassic-Jurassic extinction event extinct.
(thought, during): They are thought during the Triassic-Jurassic extinction event to have gone extinct.
35
Bad parses
Although one parse is better than the other one for the disputed dependency, the rest of both parses are so broken that the realization cannot reliably capture the meaning difference
- Parsing ‘in’ as a conjunction
- Bad parse in general
36
Bad parses: An example
Coming off a disappointing 2-10 season in 2009 Maryland returns to a bowl game to face East Carolina.
(returns, to): Coming off a disappointing 2-10 season in 2009 returns to a bowl game to face East Carolina Maryland.
(Coming, to): Coming off a disappointing 2-10 season to a bowl game to face East Carolina in 2009 Maryland returns.
37
Bad parses: top parse
Coming off a disappointing 2-10 season in 2009 Maryland returns to a bowl game to face East Carolina.
[Dependency graph of the top parse for "Coming off a disappointing 2-10 season in 2009 Maryland returns to a bowl game to face East Carolina"]
38
Bad parses: next meaningfully distinct
Coming off a disappointing 2-10 season in 2009 Maryland returns to a bowl game to face East Carolina.
[Dependency graph of the next meaningfully distinct parse for the same sentence]
39
Lack of context
Turkers fail to choose the correct parse because of lack of context
40
Lack of context: An example
Michigan’s backup center, Gerald Ford, expressed a desire to attend the fair while in Chicago.
(attend, while): Michigan’s backup center, Gerald Ford, expressed a desire to attend while in Chicago the fair.
(expressed, while): Michigan’s backup center, Gerald Ford, expressed while in Chicago a desire to attend the fair.
41
Regression analysis
A regression analysis to determine the factors affecting AMT workers’ choices:

           One-sided            Two-sided
           Maj      S. Maj      Maj       S. Maj
parse      -0.03    -0.05       0.01      0.01
bleu       3.05*    4.38**      1.68*     3.07**
rlz.glb    0.01     0.01        0.07**    0.103***

AMT workers tend to choose:
- the paraphrases similar to the original sentence
- the paraphrases with higher fluency scores
42
Regression analysis for coverage and accuracy trade-off
[Plot: Accuracy (0.6-1.0) vs. Data Size (100-400); series: Majority.Baseline, Majority.Pred, Strong.Majority.Baseline, Strong.Majority.Pred]
43
Distribution of test data
44
Data preparation
- 1. We collected 6,335 sentences from Prehistoric Reptiles and 7,779 from Big 10 Conference Football
- 2. After parsing the sentences and filtering out those that were too short or too long, 5,063 sentences were found to be ambiguous
- 3. Valid paraphrases were generated for 3,605 sentences
- 4. 515 sentences from each domain were selected for the AMT surveys