Neutralizing Linguistically Problematic Annotations in Unsupervised Dependency Parsing Evaluation


SLIDE 1

Neutralizing Linguistically Problematic Annotations in Unsupervised Dependency Parsing Evaluation

Roy Schwartz1, Omri Abend1, Roi Reichart2 and Ari Rappoport1

1The Hebrew University, 2MIT

In proceedings of ACL 2011

SLIDE 2

Outline

  • Introduction
  • Problematic Gold Standard Annotation
  • Sensitivity to the Annotation of Problematic Structures
  • A Possible Solution – Undirected Evaluation
  • A Novel Evaluation Measure

Neutralizing Linguistically Problematic Annotations in Unsupervised Dependency Parsing Evaluation @ Schwartz et al. 2

SLIDE 3

Introduction

Dependency Parsing

[Diagram: dependency tree for "we want to play", with ROOT]

SLIDE 4

Introduction

Related Work

  • Supervised Dependency Parsing

    – McDonald et al., 2005
    – Nivre et al., 2006
    – Smith and Eisner, 2008
    – Zhang and Clark, 2008
    – Martins et al., 2009
    – Goldberg and Elhadad, 2010
    – inter alia

  • Unsupervised Dependency Parsing (unlabeled)

    – Klein and Manning, 2004
    – Cohen and Smith, 2009
    – Headden et al., 2009
    – Blunsom and Cohn, 2010
    – Spitkovsky et al., 2010
    – inter alia

SLIDE 5

Introduction

Unsupervised Dependency Parsing Evaluation

  • Evaluation performed against a gold standard
  • Standard Measure – Attachment Score

    – Ratio of correct directed edges

  • A single score (no precision/recall)

SLIDE 6

Introduction

Unsupervised Dependency Parsing Evaluation

  • Example
    – Gold std: [dependency tree over PRP VBP TO VB, with ROOT] (we) (want) (to) (play)
    – Induced parse: [tree over the same words; two of the four edges differ from the gold] (we) (want) (to) (play)
    – Score: 2/4
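The score computation above can be sketched in a few lines, assuming a hypothetical head-array encoding (`heads[i]` is the parent index of token `i`, with `-1` for ROOT); the arrays below are illustrative stand-ins for the example trees, not the paper's actual annotation:

```python
def attachment_score(gold_heads, induced_heads):
    """Standard attachment score: the ratio of tokens whose induced
    (directed) head matches the gold head."""
    assert len(gold_heads) == len(induced_heads)
    correct = sum(g == p for g, p in zip(gold_heads, induced_heads))
    return correct / len(gold_heads)

# "we want to play": hypothetical gold tree (we->want, want->ROOT,
# to->want, play->to) vs. an induced tree that gets two heads wrong.
gold = [1, -1, 1, 2]
induced = [1, -1, 3, 1]
print(attachment_score(gold, induced))  # 0.5, i.e., the 2/4 of the example
```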

SLIDE 7

Problematic Gold Standard Annotation

  • The gold standard annotation of some structures is Linguistically Problematic
    – I.e., not under consensus
  • Examples
    – Infinitive verbs: "to play" (Collins, 1999; Bosco and Lombardo, 2004; Johansson and Nugues, 2007)
    – Prepositional phrases: "in Rome" (Johansson and Nugues, 2007; Yamada and Matsumoto, 2003)
SLIDE 8

Problematic Gold Standard Annotation

  • Great majority of the problematic structures are local
    – Confined to 2–3 words only
    – Often, alternative annotations differ in the direction of some edge
    – The controversy only relates to the internal structure

[Diagram: example structures – "to play", "want chess"]

  • These structures are also very frequent
    – 42.9% of the tokens in PTB WSJ participate in at least one problematic structure

SLIDE 9

Problematic Gold Standard Annotation

  • Gold standard in English (and other languages) – converted from constituency parsing using head percolation rules
  • At least three substantially different conversion schemes are currently in use for the same task
  • 1. Collins head rules (Collins, 1999)
    – Used in e.g., (Berg‐Kirkpatrick et al., 2010; Spitkovsky et al., 2010)
  • 2. Conversion rules of (Yamada and Matsumoto, 2003)
    – Used in e.g., (Cohen and Smith, 2009; Gillenwater et al., 2010)
  • 3. Conversion rules of (Johansson and Nugues, 2007)
    – Used in e.g., the CoNLL shared task 2007, (Blunsom and Cohn, 2010)

[Callout: 14.4% difference]

SLIDE 10

Problematic Gold Standard Annotation

[Diagram: the same sentence annotated under the three schemes – (Collins, 1999), (Yamada and Matsumoto, 2003), (Johansson and Nugues, 2007)]

SLIDE 11

[Diagram: Problematic Structures Very Frequent + 3 Substantially Different Gold Standards → Evaluation Problem]

SLIDE 12

Sensitivity to the Annotation of Problematic Structures

[Diagram: a trained parser's induced parameters are evaluated against the gold standard, then minimally modified (< 1% of the parameters, e.g., the annotation of "to play") and evaluated again; repeated for 3 leading parsers.]

SLIDE 13

Sensitivity to the Annotation of bl Problematic Structures

Model    Original    Modified    Modified − Original
km04       34.3        43.6              9.3
cs09       39.7        54.4             14.7
saj10      41.3        54.0             12.7

  • km04 – Klein and Manning, 2004
  • cs09 – Cohen and Smith, 2009


  • saj10 – Spitkovsky et al., 2010

SLIDE 14

Current evaluation does not always reflect parser quality

SLIDE 15

A Possible Solution

Undirected Evaluation

  • Required – a measure indifferent to alternative annotations of problematic structures
  • Recall – most alternative annotations differ only in the direction of some edge
  • A possible solution – a measure indifferent to edge directions
  • How about undirected evaluation?

SLIDE 16

A Possible Solution

Undirected Evaluation

  • Gold standard: [dependency tree over PRP VBP TO VB, with ROOT] (we) (want) (to) (play)
  • Induced parse, with a flipped edge: [the same tree with one edge reversed] (we) (want) (to) (play)

[Diagram: after the flip, one word has no head and another has two heads.]

SLIDE 17

A Possible Solution

Undirected Evaluation

  • Gold standard: [dependency tree over PRP VBP TO VB, with ROOT] (we) (want) (to) (play)
  • Induced parse, with a flipped edge: undirected score 3/4 (75%) – and this is the minimal modification!
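The undirected variant can be sketched with the same hypothetical head-array encoding as before (`heads[i]` is the parent index of token `i`, `-1` for ROOT): an induced edge counts as correct if the gold tree contains it in either direction.

```python
def undirected_score(gold_heads, induced_heads):
    """Undirected evaluation: an induced edge counts as correct if the
    gold tree has the same edge, ignoring direction."""
    gold_edges = {frozenset((i, h)) for i, h in enumerate(gold_heads)}
    correct = sum(frozenset((i, h)) in gold_edges
                  for i, h in enumerate(induced_heads))
    return correct / len(induced_heads)

# Same illustrative trees as before: the induced parse flips the
# (to, play) edge, so only one attachment remains wrong undirectedly.
gold = [1, -1, 1, 2]
induced = [1, -1, 3, 1]
print(undirected_score(gold, induced))  # 0.75, the 3/4 of the example
```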

SLIDE 18

The Neutral Edge Direction (NED) Measure

  • Undirected accuracy is not indifferent to edge flipping
  • We will now present a measure that is – Neutral Edge Direction (NED)
    – A simple extension of the undirected evaluation measure
    – Ignores edge direction flips

SLIDE 19

[Diagram: gold standard dependency tree for "want to play", shown against three induced parses of the same words]

  • Induced parse I (agrees with gold std.) – correct NED attachment
  • Induced parse II (linguistically plausible) – correct NED attachment
  • Induced parse III (linguistically implausible) – NED error

SLIDE 20

The NED Measure

  • Therefore, NED is defined as follows:
    – X is a correct parent of Y if:
      • X is Y's gold parent, or
      • X is Y's gold child, or
      • X is Y's gold grandparent
    – The first condition alone is the attachment score; the first two together give undirected evaluation

[Diagram: gold standard tree for "want to play" next to a linguistically plausible alternative parse]
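The three conditions above translate directly into code; a minimal sketch using the same hypothetical head-array encoding (`heads[i]` is the parent index of token `i`, `-1` for ROOT):

```python
def ned_score(gold_heads, induced_heads):
    """Neutral Edge Direction (NED): token Y's induced parent X counts
    as correct if X is Y's gold parent, a gold child of Y, or Y's gold
    grandparent."""
    n = len(gold_heads)
    correct = 0
    for y in range(n):
        x = induced_heads[y]
        gold_parent = gold_heads[y]
        gold_children = [c for c in range(n) if gold_heads[c] == y]
        # ROOT (-1) has no parent, so a root token has no gold grandparent.
        gold_grandparent = gold_heads[gold_parent] if gold_parent != -1 else None
        if x == gold_parent or x in gold_children or x == gold_grandparent:
            correct += 1
    return correct / n

# Flipping the (to, play) edge is no longer penalized: "play" now
# attaches to its gold grandparent "want", which NED accepts.
gold = [1, -1, 1, 2]
induced = [1, -1, 3, 1]
print(ned_score(gold, induced))  # 1.0
```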

SLIDE 21

NED Experiments

Difference Between Gold Standards

[Bar chart: difference between alternative gold standards under the Attach., Undir. and NED measures]

  • NED substantially reduces the difference between alternative gold standards

SLIDE 22

NED Experiments

Sensitivity to Parameter Modification

[Bar chart: performance change under parameter modification for km04, cs09 and saj10, under the Attach., Undir. and NED measures]

  • NED substantially reduces the difference between parameter sets
  • The sign of the NED difference is predictable (see paper)

SLIDE 23

Discussion

  • Unsupervised parsers train on plain text
    – Choosing the "wrong" (plausible) annotation should not be considered an error – use NED!
  • Supervised parsers train on labeled data
    – They get the correct annotation as training input
  • Nevertheless, NED can be used to better understand the types of errors made by supervised parsers
    – Better suited to this than the undirected evaluation measure

SLIDE 24

Future Work

  • Find a more fine‐grained measure
    – Evaluating Dependency Parsing: Robust and Heuristics‐Free Cross‐Annotation Evaluation (Tsarfaty et al., to appear in EMNLP 2011)

  • Resolve conflicts at the annotation level

SLIDE 25

Summary

  • Problems in the evaluation of unsupervised parsers
    – Gold standards – 3 used (~15% difference between them)
    – Current parsers – very sensitive to alternative (plausible) annotations; minor modifications result in ~9–15% performance "gain"
    – Undirected evaluation – does not solve this problem
  • Neutral Edge Direction (NED) measure
    – Simple and intuitive
    – Reduces difference between different gold standards to ~5%
    – Reduces undesired performance "gain" (~1–4%)
    – Still indicative of quality difference


  • See the paper for more experiments demonstrating NED's validity
SLIDE 26

Take-Home Message

  • We suggest reporting NED results along with the commonly used attachment score

http://www.cs.huji.ac.il/~roys02/software/ned.html

Many thanks to
  • Shay Cohen
  • Valentin I. Spitkovsky
  • Jennifer Gillenwater
  • Taylor Berg‐Kirkpatrick
  • Phil Blunsom