

SLIDE 1

Towards Heterogeneous Automatic MT Error Analysis

(6th LREC)

Jesús Giménez and Lluís Màrquez

TALP Research Center, Technical University of Catalonia

May 29, 2008

SLIDE 2

Outline

1. Introduction
2. Our Proposal
3. Applicability
4. Discussion

SLIDE 3

Outline

1. Introduction
   - The Role of Evaluation Methods
   - Recent Advances in Automatic MT Evaluation
2. Our Proposal
3. Applicability
4. Discussion


SLIDE 5

Development Cycle of MT systems


SLIDE 7

Error Analysis Today

Error analyses are conducted manually:
- low-level analysis, related to the linguistic analysis of translation quality (i.e., what?)
- high-level analysis, involving knowledge about the system architecture (i.e., why?)

Error analyses require intensive human labor. Automatic metrics are used only as quantitative evaluation measures, to identify high/low quality translations.


SLIDE 12

Metrics Based on Lexical Similarity

- Edit Distance: WER, PER, TER
- Precision: BLEU, NIST, WNM
- Recall: ROUGE, CDER
- Precision/Recall: GTM, METEOR, BLANC, SIA
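Most of these lexical families ultimately count shared n-grams between a hypothesis and a reference. As a minimal sketch (not IQMT's implementation, just the quantities the BLEU-like and ROUGE-like metrics build on), clipped n-gram precision and recall can be computed as:

```python
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def clipped_matches(hyp, ref, n):
    """Multiset intersection of n-grams: each hypothesis n-gram counts
    at most as often as it appears in the reference ('clipping')."""
    h, r = Counter(ngrams(hyp, n)), Counter(ngrams(ref, n))
    return sum((h & r).values())

def precision_recall(hyp, ref, n=1):
    """N-gram precision (BLEU-style numerator) and recall (ROUGE-style)."""
    m = clipped_matches(hyp, ref, n)
    return m / max(len(hyp) - n + 1, 1), m / max(len(ref) - n + 1, 1)
```

For hyp = "the cat sat" against ref = "the cat sat down", unigram precision is 1.0 and recall 0.75; the real metrics add length penalties, multiple references, and n-gram weighting on top of these counts.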

SLIDE 13

Outline

1. Introduction
   - The Role of Evaluation Methods
   - Recent Advances in Automatic MT Evaluation
2. Our Proposal
3. Applicability
4. Discussion

SLIDE 14

Extending the Reference Lexicon

Lexical variants:
- morphological variations (i.e., stemming) → ROUGE and METEOR
- synonymy lookup → METEOR (based on WordNet)

Paraphrasing support:
- Zhou et al. [ZLH06]
- Kauchak and Barzilay [KB06]
- Owczarzak et al. [OGGW06]

SLIDE 15

Beyond the Lexical Level

Syntactic Similarity

Shallow Parsing:
- Popovic and Ney [PN07]
- Giménez and Màrquez [GM07]

Constituency Parsing:
- Liu and Gildea [LG05]

Dependency Parsing:
- Liu and Gildea [LG05]
- Amigó et al. [AGGM06]
- Mehay and Brew [MB07]
- Owczarzak et al. [OvGW07]

SLIDE 16

Beyond the Lexical Level

Semantic Similarity

Semantic Roles:
- Giménez and Màrquez [GM07]

Named Entities:
- Reeder et al. [RMDW01]
- Giménez and Màrquez [GM07]

Discourse Representations:
- Giménez and Màrquez [GM08b]

SLIDE 17

Outline

1. Introduction
2. Our Proposal
   - A Smorgasbord of Features
3. Applicability
4. Discussion

SLIDE 18

Rely on Automatic Metrics

Idea: Let automatic metrics do most of the low-level analysis, so system developers may concentrate on high-level analysis.

SLIDE 19

Heterogeneous Error Analysis

- as automatic as possible
- as heterogeneous as possible

Quality aspects: lexical, syntactic, semantic, etc.

Granularity:
- fine aspects → transfer of specific linguistic elements (e.g., what proportion of singular nouns are correctly translated?)
- coarse aspects → overall linguistic structure (e.g., what proportion of the semantic role structure is correctly translated?)


SLIDE 23

Outline

1. Introduction
2. Our Proposal
   - A Smorgasbord of Features
3. Applicability
4. Discussion

SLIDE 24

Linguistic Similarities

More than 500 metric variants operating at different linguistic levels:

- Lexical
- Shallow Syntactic (lemmatization, PoS tagging, and base phrase chunking)
- Syntactic (constituency and dependency parsing)
- Shallow Semantic (semantic roles and named entities)
- Semantic (discourse representations)
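The "smorgasbord" idea is simply to run many similarity measures at once and report them side by side. A toy sketch of such a report, where the two metric functions are illustrative stand-ins (not IQMT's API or metric names):

```python
def lexical_overlap(hyp, ref):
    """Fraction of hypothesis tokens that also appear in the reference."""
    return sum(1 for w in hyp if w in ref) / max(len(hyp), 1)

def length_ratio(hyp, ref):
    """Crude length-based similarity."""
    return min(len(hyp), len(ref)) / max(len(hyp), len(ref), 1)

def error_report(hyp, ref, metrics):
    """Run a dictionary of metric functions over one (hypothesis,
    reference) pair; each entry stands in for one of the 500+ variants."""
    return {name: round(fn(hyp, ref), 3) for name, fn in metrics.items()}

report = error_report("the cat sat".split(), "the cat sat down".split(),
                      {"lex-overlap": lexical_overlap, "len-ratio": length_ratio})
```

In IQMT the entries of such a report span all the linguistic levels listed above, so a developer reads one vector of scores per translation rather than inspecting each aspect by hand.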

SLIDE 25

Shallow Syntactic Level

- SP-Op-⋆: average overlapping between words belonging to the same PoS
- SP-Oc-⋆: average overlapping between words belonging to the same phrase chunk type
- SP-NISTl: NIST score over sequences of lemmas
- SP-NISTp: NIST score over PoS sequences
- SP-NISTiob: NIST score over chunk IOB sequences
- SP-NISTc: NIST score over sequences of chunks
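To make "average overlapping" concrete, here is one plausible reading of an SP-Op-⋆-style score over PoS-tagged tokens; the exact IQMT definition may differ in how groups are pooled and averaged:

```python
from collections import Counter, defaultdict

def pos_overlap(hyp, ref):
    """Average, over PoS tags, of the multiset Jaccard overlap between
    hypothesis and reference words carrying that tag.
    hyp/ref: lists of (word, pos) pairs."""
    h, r = defaultdict(Counter), defaultdict(Counter)
    for word, pos in hyp:
        h[pos][word] += 1
    for word, pos in ref:
        r[pos][word] += 1
    scores = []
    for tag in set(h) | set(r):
        inter = sum((h[tag] & r[tag]).values())
        union = sum((h[tag] | r[tag]).values())
        scores.append(inter / union if union else 0.0)
    return sum(scores) / len(scores) if scores else 0.0
```

SP-Oc-⋆ would be the same computation with phrase chunk types in place of PoS tags.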

SLIDE 26

Syntactic Level (i)

Dependency Overlapping:
- DP-Ol-⋆: average overlapping between words hanging at the same level
- DP-Oc-⋆: average overlapping between words hanging from terminal nodes (i.e., grammatical categories)
- DP-Or-⋆: average overlapping between words ruled by non-terminal nodes (i.e., grammatical relations)

SLIDE 27

Syntactic Level (ii)

Head-word Chain Matching (Liu and Gildea [LG05]):
- DP-HWCw: average head-word chain matching up to length-4 word chains
- DP-HWCc: average head-word chain matching up to length-4 category chains
- DP-HWCr: average head-word chain matching up to length-4 relation chains
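A head-word chain is a path of words following head-to-dependent links in the dependency tree. A simplified, precision-style sketch of a DP-HWCw-like score over toy trees (a dict mapping each head word to its dependents; Liu and Gildea's formulation and IQMT's averaging differ in detail):

```python
from collections import Counter

def chains(tree, n):
    """Multiset of all descending head-word chains of length n.
    tree: dict mapping a head word to its list of dependent words."""
    found = []
    def walk(path, node):
        path = path + [node]
        if len(path) >= n:
            found.append(tuple(path[-n:]))  # the chain ending at this node
        for child in tree.get(node, []):
            walk(path, child)
    dependents = {c for kids in tree.values() for c in kids}
    for root in set(tree) - dependents:
        walk([], root)
    return Counter(found)

def hwc(hyp_tree, ref_tree, max_n=4):
    """Average clipped chain precision for chain lengths 1..max_n."""
    scores = []
    for n in range(1, max_n + 1):
        h, r = chains(hyp_tree, n), chains(ref_tree, n)
        total = sum(h.values())
        scores.append(sum((h & r).values()) / total if total else 0.0)
    return sum(scores) / max_n
```

Note that averaging over fixed lengths 1..4 means even identical short trees score below 1.0 when no length-4 chain exists; real implementations handle this edge case explicitly.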

SLIDE 28

Syntactic Level (iii)

Syntactic Overlapping:
- CP-Op-⋆: average overlapping between words belonging to the same PoS (similar to ‘SP-Op-⋆’)
- CP-Oc-⋆: average overlapping between words belonging to the same phrase type (similar to ‘SP-Oc-⋆’)

Syntactic Tree Matching (Liu and Gildea [LG05]):
- CP-STM: constituent tree matching averaged up to length-9 syntactic subpaths

SLIDE 29

Shallow Semantic Level (i)

Named Entity Overlapping/Matching:
- NE-Oe-⋆: average lexical overlapping between named entities of the same type (excluding type ‘O’, i.e., Not-a-NE)
- NE-Oe-⋆⋆: average lexical overlapping between named entities of the same type (including ‘O’)
- NE-Me-⋆: average lexical matching between named entities of the same type

SLIDE 30

Shallow Semantic Level (ii)

Semantic Role Overlapping/Matching:
- SR-Or-⋆: average lexical overlapping between semantic roles (arguments and adjuncts) of the same type
- SR-Mr-⋆: average lexical matching between semantic roles of the same type
- SR-Or: role overlapping independently of the lexical realization

SLIDE 31

Semantic Level

Discourse Overlapping:
- DR-Or-⋆: average lexical overlapping between DR structures of the same type
- DR-Orp-⋆: average morphosyntactic overlapping between DR structures of the same type

Semantic Tree Matching:
- DR-STM: matching between discourse representations averaged up to length-9 semantic subpaths

SLIDE 32

Linguistic Features at Work

ACL’07 MT Workshop (French/German/Spanish/Czech-to-English)

Metric             Adeq.  Fluen.  Rank  Const.  all
SR-Or-⋆            .774   .839    .803  .741    .789
ParaEval-Recall    .712   .742    .768  .798    .755
METEOR             .701   .719    .745  .669    .709
BLEU               .690   .722    .672  .602    .671
1-TER              .607   .538    .520  .514    .644
Max Adeq. Corr.    .651   .657    .659  .534    .626
Max Fluen. Corr.   .644   .653    .656  .512    .616
GTM                .655   .674    .616  .495    .610

SLIDE 33

Outline

1. Introduction
2. Our Proposal
3. Applicability
   - Settings
   - Document Level Error Analysis
   - Sentence Level Error Analysis
4. Discussion


SLIDE 35

NIST 2005 MT Evaluation Puzzle

Arabic-to-English Translation Exercise [LP05]

[Figure: Adequacy vs. BLEU-4 scatter plot for systems S1–S6 and LinearB]

SLIDE 36

Linguistic Features Solved the Puzzle

Giménez and Màrquez [GM07]

Feature     Metric     Rsys
Lexical     BLEU       0.06
            GTM        0.03
            SP-NISTp   0.42
Syntactic   DP-HWCr    0.88
            CP-STM     0.74
            SR-Or-⋆    0.61
Semantic    SR-Mr-⋆    0.72
            DR-Or-⋆    0.92
            DR-Orp-⋆   0.97


SLIDE 38

Outline

1. Introduction
2. Our Proposal
3. Applicability
   - Settings
   - Document Level Error Analysis
   - Sentence Level Error Analysis
4. Discussion

SLIDE 39

A Note on Meta-Evaluation

Metrics are automatically evaluated on the basis of human likeness, i.e., in terms of their ability to distinguish manual from automatic translations.

- ORANGE, Lin and Och [LO04]
- KING, Amigó et al. [AGPV05]

We use the KING measure:

“A metric should never rank any reference translation lower in quality than any automatic translation.”

KING(x) serves as an estimate of the impact on system performance of the quality aspects captured by metric x.
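Under the quoted criterion, KING can be sketched as the proportion of test cases in which the worst-scoring human reference still scores at least as high as the best-scoring automatic translation. This is a toy reading of the idea (the formal definition is in the QARLA paper [AGPV05]; scoring each reference against the remaining references is an assumption of this sketch):

```python
def king(metric, cases):
    """Proportion of cases where every human reference outscores (or
    ties) every automatic translation under `metric`.
    cases: list of (references, system_outputs) pairs;
    metric(text, gold_refs) -> similarity score."""
    ok = 0
    for refs, outs in cases:
        # each reference is scored against the remaining references
        ref_scores = [metric(r, [x for j, x in enumerate(refs) if j != i])
                      for i, r in enumerate(refs)]
        out_scores = [metric(o, refs) for o in outs]
        if min(ref_scores) >= max(out_scores):
            ok += 1
    return ok / len(cases)
```

Any similarity function with the `metric(text, gold_refs)` shape can be plugged in, which is what lets KING compare metric variants from all the linguistic levels on an equal footing.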

SLIDE 40

Lexical Features

Feature         Metric         KING   LinearB   Best SMT
                1-PER          0.63   0.65      0.70
Edit Distance   1-TER          0.70   0.53      0.58
                1-WER          0.67   0.49      0.54
Precision       BLEU           0.65   0.47      0.51
                NIST           0.69   10.63     11.27
Recall          ROUGEW         0.68   0.31      0.33
                GTM (e = 1)    0.64   0.80      0.85
F-measure       GTM (e = 2)    0.66   0.31      0.32
                METEORexact    0.68   0.60      0.64
                METEORwnsyn    0.68   0.64      0.68

SLIDE 41

Shallow Syntactic Features

Feature       Metric         KING   LinearB   Best SMT
              SP-Op-⋆        0.64   0.52      0.55
PoS           SP-Op-J        0.26   0.53      0.59
Overlapping   SP-Op-N        0.53   0.57      0.63
              SP-Op-V        0.43   0.39      0.41
              SP-Oc-⋆        0.63   0.54      0.57
Chunk         SP-Oc-NP       0.60   0.59      0.63
Overlapping   SP-Oc-PP       0.38   0.63      0.66
              SP-Oc-VP       0.41   0.49      0.51
              SP-NISTl-5     0.69   10.78     11.44
NISTx         SP-NISTp-5     0.71   8.74      9.04
              SP-NISTiob-5   0.65   6.81      6.91
              SP-NISTc-5     0.57   6.13      6.18

SLIDE 42

Syntactic Features (i)

Feature      Metric       KING   LinearB   Best SMT
             DP-HWCw-4    0.59   0.14      0.14
             DP-HWCc-4    0.48   0.42      0.41
             DP-HWCr-4    0.52   0.33      0.31
             DP-Ol-⋆      0.58   0.41      0.43
Dependency   DP-Oc-⋆      0.60   0.50      0.51
Parsing      DP-Oc-a      0.30   0.51      0.57
             DP-Oc-aux    0.14   0.56      0.54
             DP-Oc-det    0.35   0.75      0.73
             DP-Oc-n      0.57   0.57      0.59
             DP-Oc-v      0.37   0.43      0.45

SLIDE 43

Syntactic Features (ii)

Feature        Metric       KING   LinearB   Best SMT
               DP-Or-⋆      0.66   0.36      0.36
               DP-Or-aux    0.14   0.56      0.54
Dependency     DP-Or-det    0.35   0.75      0.73
Parsing        DP-Or-fc     0.21   0.26      0.24
               DP-Or-i      0.60   0.44      0.43
               DP-Or-obj    0.43   0.36      0.35
               DP-Or-s      0.47   0.52      0.45
               CP-Op-⋆      0.64   0.52      0.55
               CP-Oc-⋆      0.63   0.50      0.53
Constituency   CP-Oc-NP     0.61   0.55      0.58
Parsing        CP-Oc-PP     0.51   0.50      0.53
               CP-Oc-SBAR   0.36   0.36      0.38
               CP-STM-9     0.58   0.35      0.35

SLIDE 44

Shallow Semantic Features

Feature    Metric          KING   LinearB   Best SMT
Named      NE-Me-⋆         0.32   0.53      0.56
Entities   NE-Me-ORG       0.11   0.27      0.29
           NE-Me-PER       0.13   0.34      0.34
           SR-Mr-⋆         0.50   0.19      0.18
           SR-Mr-A0        0.33   0.31      0.30
Semantic   SR-Mr-A1        0.28   0.14      0.14
Roles      SR-Or           0.41   0.64      0.63
           SR-Or-⋆         0.53   0.36      0.37
           SR-Or-AM-TMP    0.13   0.39      0.38

SLIDE 45

Semantic Features

Feature           Metric        KING   LinearB   Best SMT
                  DR-Or-⋆       0.59   0.36      0.34
                  DR-Or-card    0.12   0.49      0.45
                  DR-Or-dr      0.56   0.43      0.40
Discourse         DR-Or-eq      0.12   0.17      0.16
Representations   DR-Or-named   0.38   0.48      0.45
                  DR-Or-pred    0.55   0.38      0.36
                  DR-Or-prop    0.39   0.27      0.24
                  DR-Or-rel     0.56   0.38      0.34
                  DR-STM-9      0.40   0.26      0.26

SLIDE 46

Outline

1. Introduction
2. Our Proposal
3. Applicability
   - Settings
   - Document Level Error Analysis
   - Sentence Level Error Analysis
4. Discussion

SLIDE 47

Ex: Thousand Monks

Ref 1: Over 1000 monks and nuns , observers and scientists from over 30 countries and the host country attended the religious summit held for the first time in Myanmar which started today , Thursday .

Ref 2: More than 1000 monks , nuns , observers and scholars from more than 30 countries , including the host country , participated in the religious summit which Myanmar hosted for the first time and which began on Thursday .

Ref 3: The religious summit , staged by Myanmar for the first time and began on Thursday , was attended by over 1,000 monks an nuns , observers and scholars from more than 30 countries and host Myanmar .

Ref 4: More than 1,000 monks , nuns , observers and scholars from more than 30 countries and the host country Myanmar participated in the religious summit , which is hosted by Myanmar for the first time and which began on Thursday .

Ref 5: The religious summit , which started on Thursday and was hosted for the first time by Myanmar , was attended by over 1,000 monks and nuns , observers and scholars from more than 30 countries and the host country Myanmar .

SLIDE 48

Ex: Thousand Monks

Info:
(1) → subject: over/more than 1,000 monks and nuns, observers and scientists/scholars from over/more than 30 countries, and/including the host country; action: attended/participated in
(2) → subject: the religious summit; action: began/started; temporal: on Thursday
(3) → object: the religious summit; action: hosted; subject: by Myanmar; mode: for the first time

LinearB: 1000 monks from more than 30 States and the host State Myanmar attended the Summit , which began on Thursday , hosted by Myanmar for the first time .

Best SMT: Religious participated in the summit , hosted by Myanmar for the first time began on Thursday , as an observer and the world of the 1000 monk nun from more than 30 countries and the host state Myanmar .

SLIDE 49

Ex: Thousand Monks - Lexical Features

Feature         Metric         LinearB   Best SMT
Human           Adequacy       3         2
                Fluency        3.5       2
                1-PER          0.64      0.69
Edit Distance   1-TER          0.53      0.51
                1-WER          0.40      0.48
Precision       BLEU           0.44      0.45
                NIST           9.04      9.96
Recall          ROUGEW         0.22      0.23
F-measure       GTM (e = 2)    0.30      0.32
                METEORwnsyn    0.59      0.64

SLIDE 50

Ex: Thousand Monks - Shallow Syntactic Features

Feature       Metric       LinearB   Best SMT
              SP-Op-⋆      0.52      0.51
PoS           SP-Op-IN     0.71      0.67
Overlapping   SP-Op-NN     0.67      0.38
              SP-Op-NNP    0.60      0.75
              SP-Op-V      0.40      0.75
Chunk         SP-Oc-⋆      0.56      0.60
Overlapping   SP-Oc-NP     0.56      0.60
              SP-Oc-PP     0.80      0.71
              SP-NISTp     6.21      8.36
NISTx         SP-NISTc     6.43      6.25
              SP-NISTiob   5.78      6.41

SLIDE 51

Ex: Thousand Monks - Syntactic Features

Feature        Metric          LinearB   Best SMT
               DP-HWCw-4       0.17      0.16
               DP-Or-⋆         0.46      0.44
Dependency     DP-Or-mod       0.62      0.41
Parsing        DP-Or-obj       0.29      0.00
               DP-Or-pcomp-n   0.71      0.39
               DP-Or-rel       0.33      0.00
               CP-Oc-⋆         0.59      0.48
               CP-Oc-NP        0.59      0.55
Constituency   CP-Oc-PP        0.57      0.54
Parsing        CP-Oc-SB        0.73      0.00
               CP-Oc-VP        0.64      0.42
               CP-STM-9        0.34      0.23

SLIDE 52

Ex: Thousand Monks - Semantic Features

Feature           Metric       LinearB   Best SMT
                  SR-Or        0.84      0.25
Semantic          SR-Or-⋆      0.56      0.18
Roles             SR-Or-A0     0.44      0.10
                  SR-Or-A1     0.57      0.28
                  DR-Or-⋆      0.45      0.34
                  DR-Or-dr     0.57      0.40
Discourse         DR-Or-nam    0.75      0.24
Representations   DR-Or-pred   0.44      0.45
                  DR-Or-rel    0.51      0.32
                  DR-STM-9     0.32      0.29
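Reading these per-metric tables by hand scales poorly, so a small helper can surface the aspects on which two systems diverge most. A convenience sketch over score dictionaries shaped like the tables above (an illustration, not part of IQMT; the sample values are taken from the semantic features table):

```python
def largest_gaps(scores_a, scores_b, k=3):
    """Metrics with the largest absolute score gap between two systems,
    sorted by gap size. Scores are dicts: metric name -> score."""
    shared = set(scores_a) & set(scores_b)
    gaps = {m: round(scores_a[m] - scores_b[m], 3) for m in shared}
    return sorted(gaps.items(), key=lambda kv: -abs(kv[1]))[:k]

# Three rows from the semantic features table above
linearb = {"SR-Or": 0.84, "DR-Or-nam": 0.75, "DR-Or-pred": 0.44}
best_smt = {"SR-Or": 0.25, "DR-Or-nam": 0.24, "DR-Or-pred": 0.45}
top = largest_gaps(linearb, best_smt, k=2)
```

On this example the largest gap is SR-Or, matching the observation that LinearB preserves the semantic role structure far better than the SMT system here.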

SLIDE 53

Outline

1. Introduction
2. Our Proposal
3. Applicability
4. Discussion
   - Conclusions
   - Future Work


SLIDE 55

Heterogeneous Automatic MT Error Analysis

We have presented a valid path towards heterogeneous automatic MT error analysis:

- Our approach allows developers to rapidly obtain detailed qualitative linguistic reports on their system’s capabilities.
- Human efforts may concentrate on high-level analysis.

SLIDE 56

Hey! Linguistic Metrics are Not the Panacea¹

Linguistic metrics rely on:

1. the representativity of the set of human references
   - lexicon
   - grammar
   - style...
2. automatic linguistic processors, which are
   - domain-dependent
   - language-dependent
   - prone to error
   - slow

Sentence level performance must be improved!

¹ Panacea: a remedy for all ills or difficulties (see cure-all).


SLIDE 58

Outline

1. Introduction
2. Our Proposal
3. Applicability
4. Discussion
   - Conclusions
   - Future Work

SLIDE 59

Ongoing Steps...

1. Improving sentence level behavior:
   - backing off to lexical similarity [GM08b]
   - working on metric combinations [GM08a]
2. Porting metrics to languages other than English (e.g., Castilian Spanish, Catalan...)

SLIDE 60

A New Interface

SLIDE 61

Thanks for your Attention

IQMT v2.0 is publicly available at: http://www.lsi.upc.edu/~nlp/IQMT


SLIDE 63

References

[AGGM06] Enrique Amigó, Jesús Giménez, Julio Gonzalo, and Lluís Màrquez. MT Evaluation: Human-Like vs. Human Acceptable. In Proceedings of COLING-ACL 2006, 2006.

[AGPV05] Enrique Amigó, Julio Gonzalo, Anselmo Peñas, and Felisa Verdejo. QARLA: a Framework for the Evaluation of Automatic Summarization. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, 2005.

[GM07] Jesús Giménez and Lluís Màrquez. Linguistic Features for Automatic Evaluation of Heterogeneous MT Systems. In Proceedings of the ACL Workshop on Statistical Machine Translation, 2007.

[GM08a] Jesús Giménez and Lluís Màrquez. Heterogeneous Automatic MT Evaluation Through Non-Parametric Metric Combinations. In Proceedings of IJCNLP, 2008.

[GM08b] Jesús Giménez and Lluís Màrquez. On the Robustness of Linguistic Features for Automatic MT Evaluation. In Proceedings of the ELRA Workshop on Evaluation (Looking into the Future of Evaluation: when automatic metrics meet task-based and performance-based approaches), 2008.

[KB06] David Kauchak and Regina Barzilay. Paraphrasing for Automatic Evaluation. In Proceedings of HLT-NAACL, 2006.

[LG05] Ding Liu and Daniel Gildea. Syntactic Features for Evaluation of Machine Translation. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, 2005.

[LO04] Chin-Yew Lin and Franz Josef Och. ORANGE: a Method for Evaluating Automatic Evaluation Metrics for Machine Translation. In Proceedings of COLING, 2004.

[LP05] Audrey Le and Mark Przybocki. NIST 2005 Machine Translation Evaluation Official Results. Technical report, NIST, August 2005.

[MB07] Dennis Mehay and Chris Brew. BLEUÂTRE: Flattening Syntactic Dependencies for MT Evaluation. In Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI), 2007.

[OGGW06] Karolina Owczarzak, Declan Groves, Josef van Genabith, and Andy Way. Contextual Bitext-Derived Paraphrases in Automatic MT Evaluation. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas (AMTA), 2006.

[OvGW07] Karolina Owczarzak, Josef van Genabith, and Andy Way. Dependency-Based Automatic Evaluation for Machine Translation. In Proceedings of SSST, NAACL-HLT/AMTA Workshop on Syntax and Structure in Statistical Translation, 2007.

[PN07] Maja Popovic and Hermann Ney. Word Error Rates: Decomposition over POS Classes and Applications for Error Analysis. In Proceedings of the Second Workshop on Statistical Machine Translation, pages 48–55, Prague, Czech Republic, June 2007.

[RMDW01] Florence Reeder, Keith Miller, Jennifer Doyon, and John White. The Naming of Things and the Confusion of Tongues: an MT Metric. In Proceedings of the Workshop on MT Evaluation “Who did what to whom?” at MT Summit VIII, 2001.

[ZLH06] Liang Zhou, Chin-Yew Lin, and Eduard Hovy. Re-evaluating Machine Translation Results with Paraphrase Support. In Proceedings of EMNLP, 2006.