How to Compare Treebanks
Sandra Kübler, Wolfgang Maier, Ines Rehbein & Yannick Versley, LREC, May 2008 (presentation slides)


SLIDE 1

How to Compare Treebanks

Sandra Kübler, Wolfgang Maier, Ines Rehbein & Yannick Versley
LREC, May 2008

SLIDE 2

Standardisation & Interoperability

- Creation of linguistic resources is extremely time-consuming ⇒ standardisation & interoperability
- One aspect of standardisation and interoperability: adaptation of existing syntactic annotation schemes for new language resources (e.g. Chinese Penn Treebank, Arabic Penn Treebank)

But:
- How do we avoid importing flaws and weaknesses that might exist in these schemes?
- Are annotation schemes really universal?

⇒ We need to know more about syntactic annotation schemes and their impact on NLP applications

SLIDE 4

Recent work

Studies on the impact of treebank design on PCFG parsing:

- Kübler (2005), Maier (2006), Kübler et al. (2006): low PCFG parsing results (PARSEVAL) for the German NEGRA treebank imply that TüBa-D/Z is better suited to support PCFG parsing
- Rehbein & van Genabith (2007): the better PARSEVAL results for TüBa-D/Z reflect the higher ratio of non-terminal to terminal nodes in the treebank

⇒ Results are controversial; a more extensive evaluation is needed

SLIDE 6

Extensive evaluation

of three different parsers:
- BitPar (Schmid, 2004)
- LoPar (Schmid, 2000)
- Stanford Parser (Klein & Manning, 2003)

trained on two German treebanks:
- TiGer Release 2 (Brants et al., 2002)
- TüBa-D/Z Release 3 (Telljohann et al., 2005)

evaluated with:
- evalb (an implementation of PARSEVAL)
- Leaf-Ancestor Metric (Sampson & Babarczy, 2003)
- dependency-based evaluation
- human evaluation

SLIDE 7

Outline

1. Data: TiGer & TüBa-D/Z
2. Experimental setup
3. Evaluation results
   - constituent-based evaluation with PARSEVAL and LA
   - dependency-based evaluation
   - human evaluation

SLIDE 8

The Treebanks: TiGer and TüBa-D/Z

Domain: German newspaper text
POS tagset: STTS (Stuttgart-Tübingen Tag Set)

Differences in annotation:
                                   TiGer               TüBa-D/Z
Annotation:                        flat                more hierarchical
LDD (long-distance dependencies):  crossing branches   grammatical functions
Unary nodes:                       no                  yes
Topological fields:                no                  yes

SLIDE 9

TiGer

Example (word-by-word gloss): But without the Tigers will it no peace give
"But without the Tigers there will be no peace."

SLIDE 10

TüBa-D/Z

Example (word-by-word gloss): Namable reinforcements however will it for the next playing season not give
"However, there won't be considerable reinforcements for the next playing time."

SLIDE 11

Experimental Setup

Test sets: 2000 sentences from each treebank
Training sets: 25,005 sentences from each treebank

TiGer preprocessing:
- resolve crossing branches
- insert preterminal nodes for all terminals with governable grammatical functions

Train BitPar, LoPar and the Stanford Parser on the training sets (a toy illustration follows below):
- BitPar and LoPar: unlexicalised
- Stanford: factored model (PCFG + dependencies), hMarkov=1, vMarkov=2
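As a rough illustration of this setup, the sketch below induces a markovised treebank PCFG with NLTK rather than with BitPar, LoPar or the Stanford parser, which the authors actually used; the horzMarkov=1 / vertMarkov=2 values mirror the Stanford settings quoted above, and the one-tree toy treebank stands in for the 25,005 training trees.

```python
# Minimal sketch, assuming NLTK as a stand-in for the parsers actually used.
# The markovisation values mirror the hMarkov=1 / vMarkov=2 setting above;
# the one-tree "treebank" is purely illustrative.
from nltk import Tree, Nonterminal, induce_pcfg
from nltk.parse import ViterbiParser

def train_pcfg(trees):
    """Induce an unlexicalised PCFG from a list of nltk.Tree objects."""
    productions = []
    for tree in trees:
        t = tree.copy(deep=True)
        t.collapse_unary(collapsePOS=False)                 # remove unary chains
        t.chomsky_normal_form(horzMarkov=1, vertMarkov=2)   # binarise + markovise
        productions.extend(t.productions())
    return induce_pcfg(Nonterminal("S"), productions)

# Toy example; a real run would read the 25,005 treebank trees instead.
toy_treebank = [Tree.fromstring("(S (NP (PDS das)) (VP (VVFIN geht)))")]
parser = ViterbiParser(train_pcfg(toy_treebank))
for parse in parser.parse(["das", "geht"]):
    print(parse)
```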

SLIDE 12

Results for Constituent Evaluation

PARSEVAL and LA scores (2000 sentences):

        TiGer                 TüBa-D/Z
        Bit    Lop    Stan    Bit    Lop    Stan
evalb   74.0   75.2   77.3    83.4   84.6   88.5
LA      90.9   91.3   92.4    91.5   91.8   93.6

- evalb and LA: better results for TüBa-D/Z
- both measures show the same parser ranking: BitPar < LoPar < Stanford
- the gap between the treebanks is much smaller under LA than under evalb

SLIDE 13

Discussion: PARSEVAL vs. LA

PARSEVAL (Black et al., 1991)
- divides the number of matching brackets by the overall number of brackets in the trees
- the more hierarchical annotation in TüBa-D/Z results in a higher number of brackets
- one mismatching bracket in TüBa-D/Z is therefore punished less (see the sketch below)
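To make the bracket-counting argument concrete, here is a minimal sketch of PARSEVAL-style labelled bracket F1 (the core of what evalb computes, without its parameter file and equivalence options); the span tuples are illustrative.

```python
# Minimal PARSEVAL sketch: brackets are (label, start, end) spans.
from collections import Counter

def parseval_f1(gold_brackets, test_brackets):
    """Labelled bracket precision/recall/F1 over two multisets of spans."""
    gold, test = Counter(gold_brackets), Counter(test_brackets)
    matched = sum((gold & test).values())        # multiset intersection
    precision = matched / sum(test.values())
    recall = matched / sum(gold.values())
    return 2 * precision * recall / (precision + recall)

# Illustrative spans: with more brackets per sentence (the TüBa-D/Z effect),
# a single mismatching bracket costs proportionally less.
gold = [("S", 0, 5), ("NP", 0, 2), ("VP", 2, 5), ("PP", 3, 5)]
test = [("S", 0, 5), ("NP", 0, 2), ("VP", 2, 5), ("PP", 2, 5)]  # one span error
print(round(parseval_f1(gold, test), 2))  # 0.75
```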

Leaf-Ancestor Metric (Sampson & Babarczy, 2003)
- string-based similarity measure based on Levenshtein distance
- extracts the path from each terminal node to the root node
- computes the cost of transforming the parser-output paths into the gold-tree paths
- the edit cost is computed relative to path length → the same error results in lower costs for TüBa-D/Z (see the sketch below)

⇒ PARSEVAL and LA are biased towards TüBa-D/Z; a dependency-based evaluation should abstract away from the particular encoding schemes
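Correspondingly, a minimal sketch of the leaf-ancestor idea, simplified from Sampson & Babarczy (2003) (e.g. their boundary markers are omitted and the normalisation is reduced to the longer path length); the lineage paths in the toy example are made up.

```python
# Minimal leaf-ancestor sketch: compare gold and parser lineages per terminal.
def levenshtein(a, b):
    """Classic edit distance between two label sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]

def leaf_ancestor_score(gold_paths, test_paths):
    """Average per-terminal similarity; a path is the list of labels from leaf to root."""
    scores = [1 - levenshtein(g, t) / max(len(g), len(t))
              for g, t in zip(gold_paths, test_paths)]
    return sum(scores) / len(scores)

# Illustrative paths: a one-level attachment error costs relatively little
# when paths are long, as in the deeper TüBa-D/Z trees.
gold = [["NN", "NP", "PP", "VP", "S"]]
test = [["NN", "NP", "VP", "S"]]
print(round(leaf_ancestor_score(gold, test), 2))  # 0.8
```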

SLIDE 16

Dependency-Based Evaluation

- original treebanks and parser output converted into dependencies
- 34 different dependency relations (Foth, 2003)
- conversion with Depsy (Daum et al., 2004) and software by Versley (2005); a toy conversion sketch follows the example below

[Dependency graph: Namhafte/ATTR Verstärkungen/OBJA hingegen/ADV wird es/SUBJ für/PP die/DET nächste/ATTR Spielzeit/PN nicht/ADV geben/AUX .]

“However, there won’t be considerable reinforcements for the next playing time”
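The actual conversion was done with Depsy and Versley's software; purely as a sketch of what such a conversion involves, the following head-rule percolation derives unlabelled dependency arcs from a constituent tree. The head table and the example tree are hypothetical stand-ins, not the rules those tools use.

```python
# Minimal sketch of head-rule-based constituency-to-dependency conversion.
# HEAD_RULES is a hypothetical toy table; real conversions use much larger,
# treebank-specific rule sets and also assign relation labels.
from nltk import Tree

HEAD_RULES = {"S": ["VP", "VVFIN"], "VP": ["VVFIN", "VVINF"],
              "NP": ["NN"], "PP": ["APPR"]}

def head_index(tree):
    """Pick the head child of a constituent via the head table."""
    for label in HEAD_RULES.get(tree.label(), []):
        for i, child in enumerate(tree):
            if isinstance(child, Tree) and child.label() == label:
                return i
    return 0  # fallback: leftmost child

def lexical_head(tree, deps, positions):
    """Return the head token position; record (dependent, head) arcs on the way."""
    if not isinstance(tree, Tree):
        return positions.pop(0)
    heads = [lexical_head(child, deps, positions) for child in tree]
    h = heads[head_index(tree)]
    deps.extend((d, h) for d in heads if d != h)
    return h

tree = Tree.fromstring("(S (NP (NN Frieden)) (VP (VVFIN gibt)))")
deps = []
lexical_head(tree, deps, list(range(len(tree.leaves()))))
print(deps)  # [(0, 1)]: 'Frieden' depends on 'gibt'
```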

SLIDE 17

Dependency-Based Evaluation: Results

PARSEVAL and LA scores (2000 sentences):

        TiGer                 TüBa-D/Z
        Bit    Lop    Stan    Bit    Lop    Stan
evalb   74.0   75.2   77.3    83.4   84.6   88.5
LA      90.9   91.3   92.4    91.5   91.8   93.6

Labelled/unlabelled dependency accuracy (2000 sentences):

                      TiGer                 TüBa-D/Z
                      Bit    Lop    Stan    Bit    Lop    Stan
Labelled accuracy     78.8   80.5   81.6    71.3   72.8   75.9
Unlabelled accuracy   83.0   84.5   85.6    81.7   83.4   86.8
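A minimal sketch of how such labelled/unlabelled accuracy figures are computed once gold trees and parser output are both in dependency form; the per-token analyses in the example are made up.

```python
# Minimal attachment-accuracy sketch: each token is scored on (head, relation).
def attachment_scores(gold, test):
    """gold/test: one (head_position, relation) pair per token."""
    las = sum(g == t for g, t in zip(gold, test)) / len(gold)
    uas = sum(g[0] == t[0] for g, t in zip(gold, test)) / len(gold)
    return las, uas

gold = [(2, "ATTR"), (10, "OBJA"), (4, "ADV"), (0, "ROOT")]
test = [(2, "ATTR"), (10, "SUBJ"), (4, "ADV"), (0, "ROOT")]
print(attachment_scores(gold, test))  # (0.75, 1.0): right head, wrong label
```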

SLIDE 18

Dependency-Based Evaluation: Results

- TüBa-D/Z gets slightly better results for unlabelled accuracy
- TiGer does better for labelled accuracy
- the results contradict the constituent-based evaluation

Human evaluation: how do the parsers perform on particular grammatical constructions?
- select sentences from both treebanks covering the same grammatical constructions
- evaluate how the parsers handle these particular constructions

SLIDE 20

TePaCoC - the Testsuite

Testing Parser Performance on Complex Grammatical Constructions

- Extraposed Relative Clauses (ERC)
- Forward Conjunction Reduction (FCR)
- Coordination of Unlike Constituents (CUC)
- Noun PP Attachment (PPN)
- Verb PP Attachment (PPV)
- Subject Gap with Finite/Fronted Verbs (SGF)

200 sentences (100 from each treebank)
The two annotation schemes make different design decisions to encode the same constructions
⇒ criteria are needed to evaluate grammatical constructions across treebanks

SLIDE 21

TePaCoC - Error Classification

How do we ensure inter-annotator agreement and the reliability of the human evaluation?

⇒ Error classification: describe categories for possible parser errors

Example: Extraposed Relative Clauses

Error description                              TiGer                            TüBa-D/Z
(A) Clause not recognized as relative clause   Grammatical function incorrect   SIMPX label instead of R-SIMPX
(B) Head noun incorrect                        Attachment error                 Grammatical function incorrect
(C) Clause not recognized                      Clause not recognized            Clause not recognized
(D) Clause boundaries not correct              Span error                       Span error

SLIDE 22

Results for Human Evaluation

       TiGer               TüBa-D/Z
       Bit   Lop   Stan    Bit   Lop   Stan    Freq.
ERC    20    19    19      -     -     3       41
FCR    26    27    23      11    9     13      40
PPN    9     9     16      15    14    14      60
PPV    15    16    18      14    13    18      62
CUC    6     8     5       6     7     5       39
SGF    18    20    20      7     10    8       40

Table: Correctly parsed grammatical constructions in TiGer and TüBa-D/Z (human evaluation)

SLIDE 23

Conclusions

- Human evaluation correlates with the dependency-based evaluation
- Human evaluation helps to trace error types back to the underlying treebank design decisions

Main findings:
- TiGer benefits from its flat annotation, which makes the trees more transparent for the parser (e.g. for ERC, FCR and SGF)
- TüBa-D/Z suffers from its more hierarchical structure, where relevant clues are embedded too deep in the tree
- the additional layer of topological fields in TüBa-D/Z increases the number of possible attachment positions (and thus of possible errors)
- topological fields reduce the number of rules in the grammar and improve learnability, especially for small training sets

SLIDE 24

Thank You! Questions?

SLIDE 25

Black, E., S. Abney, D. Flickinger, C. Gdaniec, R. Grishman, P. Harrison, D. Hindle, R. Ingria, F. Jelinek, J. Klavans, M. Liberman, M. Marcus, S. Roukos, B. Santorini, and T. Strzalkowski. 1991. A procedure for quantitatively comparing the syntactic coverage of English grammars. In Proceedings of the DARPA Speech and Natural Language Workshop, pages 306-311.

Boyd, Adriane. 2007. Discontinuity Revisited: An Improved Conversion to Context-Free Representations. In Proceedings of the Linguistic Annotation Workshop (LAW 2007), Prague, Czech Republic.

Brants, Sabine, and Silvia Hansen. 2002. Developments in the TiGer Annotation Scheme and their Realization in the Corpus. In Proceedings of the Third Conference on Language Resources and Evaluation (LREC 2002), pages 1643-1649, Las Palmas.

Briscoe, E. J., J. A. Carroll, and A. Copestake. 2002. Relational evaluation schemes. In Proceedings of the Workshop 'Beyond PARSEVAL - Towards Improved Evaluation Measures for Parsing Systems' at the 3rd International Conference on Language Resources and Evaluation, pages 4-38, Las Palmas, Canary Islands.

Carroll, J., E. Briscoe, and A. Sanfilippo. 1998. Parser evaluation: a survey and a new proposal. In Proceedings of the 1st International Conference on Language Resources and Evaluation, pages 447-454, Granada, Spain.

SLIDE 26

Daum, Michael, Kilian Foth, and Wolfgang Menzel. 2004. Automatic transformation of phrase treebanks to dependency trees. In Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal.

Dipper, S., T. Brants, W. Lezius, O. Plaehn, and G. Smith. 2001. The TiGer Treebank. In Proceedings of the Third Workshop on Linguistically Interpreted Corpora (LINC-2001), Leuven, Belgium.

Drach, Erich. 1937. Grundgedanken der Deutschen Satzlehre. Frankfurt/M.

Foth, K. 2003. Eine umfassende Dependenzgrammatik des Deutschen. Technical report, Fachbereich Informatik, Universität Hamburg.

Klein, Dan, and Chris Manning. 2003. Accurate Unlexicalized Parsing. In Proceedings of the Association for Computational Linguistics (ACL), Sapporo, Japan.

Kübler, Sandra. 2005. How Do Treebank Annotation Schemes Influence Parsing Results? Or How Not to Compare Apples and Oranges. In Proceedings of RANLP 2005, Borovets, Bulgaria.

Kübler, Sandra, Erhard Hinrichs, and Wolfgang Maier. 2006. Is it Really that Difficult to Parse German? In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), Sydney, Australia.

SLIDE 27

Levenshtein, V. I. 1966. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10:707-710 (translation of the Russian original published in 1965).

Maier, Wolfgang. 2006. Annotation Schemes and their Influence on Parsing Results. In Proceedings of the COLING/ACL 2006 Student Research Workshop, Sydney, Australia.

Rehbein, Ines, and Josef van Genabith. 2007. Treebank Annotation Schemes and Parser Evaluation for German. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic.

Sampson, Geoffrey, and Anna Babarczy. 2003. A test of the leaf-ancestor metric for parse accuracy. Natural Language Engineering, 9(4):365-380.

Schmid, Helmut. 2000. LoPar: Design and Implementation. Arbeitspapiere des Sonderforschungsbereiches 340, No. 149, IMS Stuttgart.

Schmid, Helmut. 2004. Efficient Parsing of Highly Ambiguous Context-Free Grammars with Bit Vectors. In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland.

SLIDE 28

Skut, Wojciech, Brigitte Krenn, Thorsten Brants, and Hans Uszkoreit. 1997. An annotation scheme for free word order languages. In Proceedings of ANLP 1997, Washington, D.C.

Telljohann, Heike, Erhard W. Hinrichs, Sandra Kübler, and Heike Zinsmeister. 2005. Stylebook for the Tübingen Treebank of Written German (TüBa-D/Z). Seminar für Sprachwissenschaft, Universität Tübingen, Germany.

Schiller, Anne, Simone Teufel, and Christine Thielen. 1995. Guidelines für das Tagging deutscher Textcorpora mit STTS. Technical report, IMS-CL, University of Stuttgart.

Versley, Yannick. 2005. Parser Evaluation across Text Types. In Proceedings of the Fourth Workshop on Treebanks and Linguistic Theories (TLT 2005), Barcelona, Spain.

SLIDE 31

Dependency-Based Evaluation: Results

Dependency F-measure (2000 sentences) for:
- nominal verb arguments (subjects and accusative/dative objects)
- PP attachment
- clause subordination (including infinitive and relative clauses as well as adjunct and argument subordinated clauses and argument full clauses)

             TiGer               TüBa-D/Z
             Bit   Lop   Stan    Bit   Lop   Stan
SUBJ         80.2  81.1  78.7    74.6  75.3  76.1
OBJA         55.6  58.4  59.5    42.4  45.8  52.9
OBJD         11.6  11.5  14.1    12.9  13.3  13.1
PP           71.1  72.2  78.2    68.1  69.1  75.6
clause sub.  57.0  58.2  60.9    45.8  47.5  52.1
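A minimal sketch of the per-relation F-measure behind this table, computed over dependency arcs grouped by label; the arc triples are illustrative.

```python
# Minimal per-relation F-measure sketch over (dependent, head, relation) arcs.
def per_label_f1(gold_arcs, test_arcs, label):
    gold = {(d, h) for d, h, r in gold_arcs if r == label}
    test = {(d, h) for d, h, r in test_arcs if r == label}
    if not gold or not test:
        return 0.0
    p = len(gold & test) / len(test)
    r = len(gold & test) / len(gold)
    return 2 * p * r / (p + r) if p + r else 0.0

gold = [(0, 1, "SUBJ"), (2, 1, "OBJA"), (3, 1, "PP")]
test = [(0, 1, "SUBJ"), (2, 1, "SUBJ"), (3, 1, "PP")]
print(round(per_label_f1(gold, test, "SUBJ"), 2))  # 0.67: one spurious SUBJ arc
```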

SLIDE 32

(1) Das Ziel sei es, "eine legale Organisation zu schaffen", die unter anderem auch für die Medien ansprechbar sein soll.
Gloss: The goal be it, "a legal organisation to create", which amongst others also for the media approachable be ought-to.
"The aim is to create a legal organisation which, amongst others, also ought to be approachable for the media."

SLIDE 33

(2) Warum also soll man homosexuellen Paaren nicht das gönnen, was sie nun mal für ihr Glück wichtig finden?
Gloss: Why so shall one homosexual couples not that grant, which they now for their luck important find?
"So why shouldn't homosexual couples be granted what they think to be important to happiness?"

SLIDE 34

Dependency-Based Evaluation for TePaCoC

Labelled/unlabelled dependency accuracy for the testsuite:

       TiGer               TüBa-D/Z
       Bit   Lop   Stan    Bit   Lop   Stan
LAS
ERC    76.2  76.0  77.4    71.6  71.8  71.1
FCR    79.5  74.4  81.8    78.5  81.0  79.3
PPN    76.8  79.7  87.0    75.5  76.1  76.1
PPV    73.6  80.9  79.2    65.8  67.9  71.5
CUC    65.2  67.0  70.7    57.5  63.0  60.9
SGF    76.1  77.2  79.3    74.0  77.7  75.1
ALL    73.3  73.9  76.8    69.3  72.7  70.3
UAS
ERC    81.1  80.8  82.0    79.1  80.5  79.1
FCR    82.7  77.8  85.6    85.4  88.2  88.7
PPN    84.2  86.4  89.3    84.8  85.3  85.9
PPV    78.1  86.0  86.0    81.3  82.9  88.6
CUC    69.7  71.5  74.7    66.1  72.0  73.6
SGF    81.7  82.5  83.6    82.8  86.2  85.4
ALL    78.1  78.7  81.0    78.3  81.9  81.7
