EASY, Evaluation of Parsers of French:what are the results? P. - PowerPoint PPT Presentation

EASY, Evaluation of Parsers of French:what are the results? P. Paroubek*, I. Robba*, A. Vilnat*, C. Ayache** LREC2008 Marrakech ∗ ∗∗

General presentation EASY: Sytactic Parser Evaluation 1 of the 8 evaluation campaigns of the evalda platform, which itself is part of the technolangue program 5 corpus providers, 12 participants, 15 runs The steps 1 at first: to define the annotation to collect and to annotate the corpora to modify the parsers to fulfill the demands of EASY 2 to define the evaluation measures 3 to evaluate the parser results 4 to combine the results of the parsers

Outline 1 Corpus 2 Annotation of the reference 3 Evaluation measures 4 Performance 5 First ROVER test 6 Conclusion and Perspective

Corpus Different linguistic types newspaper articles from Le Monde (as usual...) literary texts from ATILF databases medical texts, for specialized texts questions, with EQueR , a specific syntactic form manually transcribed parliamentary debates, “controlled” web pages and e-mails, to go further in direction of hybrid forms oral transcriptions Globally : 40,000 sentences 770,000 words

Annotation of the reference Choice made with all the participants small, not embedded constituents dependencies relations 6 kinds of constituents GN for Noun Phrase, as le petit chat , GP for Prepositional Phrase, as de la maison or comme eux , NV for Verb Kernel, including clitics as j’ai , or souffert , PV for Verb Kernel introduced by a Preposition, as de venir , GA for Adjectival Phrase, used for postponed adjectives in French, which are not included in GN, GR for Adverb Phrase as longtemps

Annotation of the reference : the relations 14 kinds of dependencies SUJ V (subject), AUX V (auxiliary), COD V (direct object), CPL V (verb complement) and MOD V (verb modifier) for the different verb complements, COMP (complementor), ATB SO (attribute of the subject or of the object), MOD N, MOD A, MOD R, MOD P (modifier respectively of the noun, the adjective, the adverb or the proposition), COORD (coordination), APP (apposition), JUXT (juxtaposition).

Annotation of the reference: an example from literary corpus coord mod−v cpl−v mod−n cpl−v suj−v suj−v Longtemps j’ai été comme eux et j’ai souffert du meme malaise aux−v aux−v Figure: Tentative translation: For a long time, I have lived as they do, and I suffered from the same unease

Evaluation measures Precision, recall and f-measure for constituents for relations for both of them For each parser for each kind of constituent for each relation for each genre of sub-corpus or globally

Evaluation measures: which comparisons? Different equality measures between two text spans from R (reference) and H (hypothesis) equality : H = R , the less permissive unitary fuzziness | H \ R | ≤ 1 inclusion : H ⊂ R barycenter : 2 ∗| R ∩ H | | R | + | H | > 0 . 25 intersection : R ∩ H � = ∅ , the most lenient

Evaluation measures: which comparisons? Two constituents are considered equal if they have the same type, they have equal text spans. Two relations are considered equal if they have the same type, their respective source and target have equal text spans.

Evaluation measures for constituents: global results CONSTITUENTS 1 0.8 0.6 0.4 0.2 0 P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12 P13 P14 P15 Figure: Results of the 15 parsers for constituents in precision/recall/f-measure (in this order), globally for all sub-corpora and all annotations together.

Evaluation measures for relations: global results RELATIONS 1 0.8 0.6 0.4 0.2 0 P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12 P13 P14 P15 Figure: Results of the 15 parsers for relations in precision/recall/f-measure (in this order), globally for all sub-corpora and all annotations together.

Parser obtaining the best precision 1 0.8 0.6 0.4 0.2 0 ALL SV XV LITTR COD MONDE CV ATB PARLM CMP MN QUEST MV MA WEB MR MAIL MP CRD ORAL AP JXT MED ALL Figure: Results for relations of the parser obtaining the best precision measure

Parser obtaining the best recall 1 0.8 0.6 0.4 0.2 0 ALL SV XV LITTR COD MONDE CV ATB PARLM CMP MN QUEST MV MA WEB MR MAIL MP CRD ORAL AP JXT MED ALL Figure: Results for relations of the parser obtaining the best recall measure

Parser obtaining the best f-measure 1 0.8 0.6 0.4 0.2 0 ALL SV XV LITTR COD MONDE CV ATB PARLM CMP MN QUEST MV MA WEB MR MAIL MP CRD ORAL AP JXT MED ALL Figure: Results for relations of the parser obtaining the best f-measure

First conclusions First results interesting: relations: best systems average f-measure near 0.60, high variability of results for relation annotation but some parsers manage to preserve the same level of performance across text genres. there is still an important part of work to do for analyzing syntactic phenomena which are rarely or never handled by the actual parsers (apposition or juxtaposition relation, or when coordination are combined together or mixed up with ellipses), best performances obtained by different parsers (different performance profiles), so there is a priori a relatively important margin for performance increase which could be obtained by combining the annotations of different parsers

First ROVER test ROVER 1 0.8 0.6 0.4 0.2 0 LITTR MONDE PARLM ALL SV QUEST XV COD CV WEB ATB CMP MN MAIL MV MA ORAL MR MP CRD MED AP JXT ALL Figure: Relative gain of performance in precision against the best precision result

Comparative precision results Relations precision (front view) ROVER P8 P3 P10 1 0.8 0.6 0.4 0.2 0 ALL SV XV LITTR COD MONDE CV ATB PARLM CMP MN QUEST MV MA WEB MR MAIL MP CRD ORAL AP JXT MED ALL Figure: Compared precisions of the ROVER and the three best systems

Conclusion and perspectives From EASY to PASSAGE... first campaign deploying the evaluation paradigm in real size for syntactic parsers of French with a black-box evaluation scheme using objective quantitative measures. create a working group on parsing evaluation the beginning of PASSAGE... in a few minutes!

EASY, Evaluation of Parsers of French:what are the results? P. - PowerPoint PPT Presentation

EASY, Evaluation of Parsers of French:what are the results? P. Paroubek, I. Robba, A. Vilnat*, C. Ayache** LREC2008 Marrakech General presentation EASY: Sytactic Parser Evaluation 1 of the 8 evaluation campaigns of the evalda

Scanners and parsers COMP 520 Fall 2010 Scanners and Parsers (2) A scanner or lexer transforms a

Easy-to-Use Easy-to-Install Easy on the Budget orecx.com Easy-to-Use

LR Parsing Compiler Design CSE 504 Shift-Reduce Parsing 1 LR Parsers 2 SLR and LR(1) Parsers

Objectives Combinator Parsing Show how to build complex parsers by composing simpler parsers.

XML Parsers Asst. Prof. Dr. Kanda Runapongsa Saikaew (krunapon@kku.ac.th) Dept. of Computer

Neatening sketched strokes using piecewise French Curves James McCrae, Karan Singh French Curves

Parser Evaluation and the BNC Standard Parser Evaluation The Parsers Jennifer Foster and Josef

Introduction to French Business Culture 1 IHRM French Business Culture Agenda The

French People - who we are French People - Studio of design, situated in Shanghai former French

Dependency and Phrasal Parsers of the Czech Language: A Comparison ak 1 , Tom s Holan 2 ,

Instruction Parsers Nathan Jay Paradyn Project Scalable Tools Workshop Granlibakken, California

Features of Statistical Parsers Mark Johnson Brown Laboratory for Linguistic Information

Shift-Reduce Parsers for Transition Networks Luca Breveglieri Stefano Crespi Reghizzi Angelo

Training Deterministic Parsers with Non-Deterministic Oracles by Yoav Goldberg and Joakim

CS406: Compilers Spring 2020 Week 5: Parsers, AST, and Semantic Routines 1 Recap 2 3

The French baccalaureate until 2020 1. What is the French Baccalaureate or the Bac? The French

Disclaimer This series represents the personal views of scientists who attend Grace Chapel. Our

r s rs tr ts

But can app ear on arcs, and means the empt y string (i.e., no visible con

A Questionnaire Guide to Web Instrument Accessibility Mark M Pierzchala MMP Survey Services, LLC

The DREAM Printer Department of Mechanical & Aerospace Engineering Jessica Javian Jeremy

Joint Agencies Vehicle-Grid Integration (VGI) Working Group Two KI KICK-OFF W OFF WOR ORKSHOP

Simultaneous Analysis of contingency tables drawn with telephone data registration from the

11/13/17 Crucified Peoples Strange Fruit Southern trees bear a strange fruit Blood on the

Sambuz

Useful Links

Newsletter

Mail Us

EASY, Evaluation of Parsers of French:what are the results? P. - PowerPoint PPT Presentation

EASY, Evaluation of Parsers of French:what are the results? P. Paroubek*, I. Robba*, A. Vilnat*, C. Ayache** LREC2008 Marrakech General presentation EASY: Sytactic Parser Evaluation 1 of the 8 evaluation campaigns of the evalda

Scanners and parsers COMP 520 Fall 2010 Scanners and Parsers (2) A scanner or lexer transforms a

Easy-to-Use Easy-to-Install Easy on the Budget orecx.com Easy-to-Use

LR Parsing Compiler Design CSE 504 Shift-Reduce Parsing 1 LR Parsers 2 SLR and LR(1) Parsers

Objectives Combinator Parsing Show how to build complex parsers by composing simpler parsers.

XML Parsers Asst. Prof. Dr. Kanda Runapongsa Saikaew (krunapon@kku.ac.th) Dept. of Computer

Neatening sketched strokes using piecewise French Curves James McCrae, Karan Singh French Curves

Parser Evaluation and the BNC Standard Parser Evaluation The Parsers Jennifer Foster and Josef

Introduction to French Business Culture 1 IHRM French Business Culture Agenda The

French People - who we are French People - Studio of design, situated in Shanghai former French

Dependency and Phrasal Parsers of the Czech Language: A Comparison ak 1 , Tom s Holan 2 ,

Instruction Parsers Nathan Jay Paradyn Project Scalable Tools Workshop Granlibakken, California

Features of Statistical Parsers Mark Johnson Brown Laboratory for Linguistic Information

Shift-Reduce Parsers for Transition Networks Luca Breveglieri Stefano Crespi Reghizzi Angelo

Training Deterministic Parsers with Non-Deterministic Oracles by Yoav Goldberg and Joakim

CS406: Compilers Spring 2020 Week 5: Parsers, AST, and Semantic Routines 1 Recap 2 3

The French baccalaureate until 2020 1. What is the French Baccalaureate or the Bac? The French

Disclaimer This series represents the personal views of scientists who attend Grace Chapel. Our

r s rs tr ts

But can app ear on arcs, and means the empt y string (i.e., no visible con

A Questionnaire Guide to Web Instrument Accessibility Mark M Pierzchala MMP Survey Services, LLC

The DREAM Printer Department of Mechanical &amp; Aerospace Engineering Jessica Javian Jeremy

Joint Agencies Vehicle-Grid Integration (VGI) Working Group Two KI KICK-OFF W OFF WOR ORKSHOP

Simultaneous Analysis of contingency tables drawn with telephone data registration from the

11/13/17 Crucified Peoples Strange Fruit Southern trees bear a strange fruit Blood on the

Sambuz

Useful Links

Newsletter

Mail Us

EASY, Evaluation of Parsers of French:what are the results? P. Paroubek, I. Robba, A. Vilnat*, C. Ayache** LREC2008 Marrakech General presentation EASY: Sytactic Parser Evaluation 1 of the 8 evaluation campaigns of the evalda

The DREAM Printer Department of Mechanical & Aerospace Engineering Jessica Javian Jeremy