A Class-Based Agreement Model for Generating Accurately Inflected Translations ACL 2012 // Jeju
Spence Green Stanford University John DeNero Google
Local Agreement Error

Input: The car goes quickly.
Reference: (1) a.
Google Translate: (2) a.
Long-distance Agreement Error

Input: The one who is speaking is my wife.

Reference:
(3) a. celle  qui  parle  ,  c'est  ma  femme
              who  speak  ,  is     my  wife+F

Google Translate:
(4) a. celui  qui  parle  est  ma  femme
              who  speak  is   my  spouse+F
Agreement Errors: Really Annoying
Ref: John runs to his house.
MT:  John run to her house.
Agreement Errors in Phrase-Based MT

Agreement relations cross phrase boundaries
◮ Sparser n-gram counts
◮ LM may back off more often
Possible Solutions

Morphological generation, e.g. [Minkov et al. 2007]
◮ Useful when correct translations aren't in the phrase table

Our work: model agreement with a new feature
◮ Large phrase tables already contain many word forms
Key Idea: Morphological Word Classes

Noun: [CAT noun, AGR [GEN fem, NUM sg]]
Verb: [CAT verb, AGR [GEN fem, NUM sg, PER 3]]
Key Idea: Morphological Word Classes

noun+fem+sg
verb+fem+sg+3

Linearized feature structure is equally expressive, assuming a fixed order
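The linearization step above can be sketched in a few lines. This is an illustrative toy, not the paper's code; the attribute names and the fixed order are assumptions for the example.

```python
# Hypothetical fixed attribute order; any fixed order works, as the slide notes.
FIXED_ORDER = ["CAT", "GEN", "NUM", "PER"]

def linearize(fs: dict) -> str:
    """Flatten a feature structure into a class string under a fixed order,
    skipping attributes the word does not carry (e.g. PER on nouns)."""
    return "+".join(str(fs[k]) for k in FIXED_ORDER if k in fs)

noun = {"CAT": "noun", "GEN": "fem", "NUM": "sg"}
verb = {"CAT": "verb", "GEN": "fem", "NUM": "sg", "PER": 3}

print(linearize(noun))  # noun+fem+sg
print(linearize(verb))  # verb+fem+sg+3
```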
A Class-based Agreement Model
Agreement Model Formulation

Implemented as a decoder feature; when each hypothesis h ∈ H is extended:

  ŝ = segment(h)
  τ = tag(ŝ)
  q(h) = score(τ)
  return q(h)
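The three-step feature computation above can be sketched as a plain function. The stand-in models below (a whitespace splitter, a lookup tagger, a length-based score) are toys that mark where the CRF segmenter, CRF tagger, and class-sequence LM would plug in.

```python
# Skeleton of the agreement feature: segment, tag, then score the classes.
def agreement_feature(hypothesis, segment, tag, score):
    s_hat = segment(hypothesis)   # Step 1: segment the target string
    tau = tag(s_hat)              # Step 2: tag segments with morph classes
    return score(tau)             # Step 3: score the class sequence

# Toy stand-ins for the three models (illustrative only):
seg = lambda h: h.split()
tagger = lambda s: ["noun+fem+sg" if w == "voiture" else "verb+fem+sg+3"
                    for w in s]
lm = lambda tau: -0.5 * len(tau)  # e.g. a log-probability from a bigram LM

print(agreement_feature("voiture va", seg, tagger, lm))  # -1.0
```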
Step 1: Segmentation
Example (one Arabic token, segmented):

  Conj  Prt   Verb+Masc+3+Pl (+ object)
  and   will  they write      it
Step 1: Segmentation
Character-level CRF: p(ŝ | words)
Features: centered 5-character window
Label set:
◮ I: inside segment
◮ O: outside segment (whitespace)
◮ B: beginning of segment
◮ F: do not segment (punctuation, digits, ASCII)
Step 2: Tagging

Token-level CRF: p(τ | ŝ)
Features: current and previous words, affixes, etc.
Label set: morphological classes (89 for Arabic)
◮ Gender, number, person, definiteness

What about incomplete hypotheses?
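The feature templates named above (current and previous words, affixes) might look like this; the template names and the 3-character affix length are illustrative, not the paper's exact configuration.

```python
def token_features(tokens, i, affix_len=3):
    """Sketch of per-token CRF features: current word, previous word,
    and prefix/suffix strings (affix length is an assumed parameter)."""
    w = tokens[i]
    return {
        "w": w,
        "w_prev": tokens[i - 1] if i > 0 else "<s>",
        "prefix": w[:affix_len],
        "suffix": w[-affix_len:],
    }

print(token_features(["the", "cars"], 1))
# {'w': 'cars', 'w_prev': 'the', 'prefix': 'car', 'suffix': 'ars'}
```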
Step 3: Scoring

Problem: discriminative model score p(τ | ŝ) is not comparable across hypotheses
◮ MST parser score: works [Galley and Manning 2009]
◮ CRF score: fails [this paper]

Solution: generative scoring of class sequences
Step 3: Scoring

Simple bigram LM trained on gold class sequences:

  τ* = argmax_τ p(τ | ŝ)
  q(h) = p(τ*) = ∏_i p(τ*_i | τ*_{i−1})

Order of the scoring model depends on MT decoder design
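A minimal version of such a generative bigram score over class sequences is shown below. The add-one smoothing, start symbol, and toy training data are my choices for illustration, not necessarily the paper's; the point is only that agreeing class bigrams score higher than mismatched ones.

```python
import math
from collections import Counter

def train_bigram(sequences):
    """Train an add-one-smoothed bigram LM over class sequences and
    return a function computing the log-probability of a sequence."""
    bigrams, unigrams, vocab = Counter(), Counter(), set()
    for seq in sequences:
        seq = ["<s>"] + seq
        vocab.update(seq)
        for a, b in zip(seq, seq[1:]):
            bigrams[(a, b)] += 1
            unigrams[a] += 1
    V = len(vocab)

    def logprob(seq):
        seq = ["<s>"] + seq
        return sum(math.log((bigrams[(a, b)] + 1) / (unigrams[a] + V))
                   for a, b in zip(seq, seq[1:]))
    return logprob

# Toy gold class sequences in which noun and verb classes agree:
gold = [["noun+fem+sg", "verb+fem+sg+3"],
        ["noun+masc+sg", "verb+masc+sg+3"]]
score = train_bigram(gold)

# An agreeing sequence outscores a gender-mismatched one:
print(score(["noun+fem+sg", "verb+fem+sg+3"]) >
      score(["noun+fem+sg", "verb+masc+sg+3"]))  # True
```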
MT Decoder Integration

[Diagram: tagger CRF inside the decoder]

Hypothesis state: last segment + class LM history
Component Models (Arabic Only)

            Full (%)   Incremental (%)
Segmenter   98.6       –
Tagger      96.3       96.2

Data: Penn Arabic Treebank [Maamouri et al. 2004]
Setup: dev set, standard split [Rambow et al. 2005]
Translation Quality
Phrase-based decoder [Och and Ney 2004]
◮ Phrase frequency, lexicalized re-ordering model, etc.
Bitext: 502M English-Arabic tokens
LM: 4-gram from 600M Arabic tokens
Translation Quality: NIST Newswire
[Bar chart, BLEU-4 (uncased), baseline vs. +agreement:
 MT04 (tune): 18.1 → 18.9; MT02: 23.9 → 24.8; MT03: 18.9 → 20.3; MT05: 22.6 → 23.5]

Average gain: +1.04 BLEU (significant at p ≤ 0.01)
Translation Quality: NIST Mixed Genre
[Bar chart, BLEU-4 (uncased), baseline vs. +agreement:
 MT06: 14.7 → 15.0; MT08: 14.3 → 14.5]

Average gain: +0.29 BLEU (significant at p ≤ 0.02)
Human Evaluation

MT05 output: 74.3% of hypotheses differed from baseline
Sampled 100 sentence pairs; manually counted agreement errors

Result: 15.4% error reduction, p ≤ 0.01 (78 vs. 66)
Analysis: Phrase Table Coverage

Hypothesis: inflected forms are in the phrase table, but unused
Analysis: measure MT05 reference unigram coverage

[Bar chart: matching phrase pairs 67.8%; baseline unigram coverage 44.6%]
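The coverage measurement above amounts to a set intersection over reference tokens. A minimal sketch with toy data (the French example sentence, not the paper's Arabic test set):

```python
def unigram_coverage(reference_tokens, candidate_tokens):
    """Fraction of distinct reference unigrams that appear in the candidate
    token set (e.g. baseline output, or words reachable via phrase pairs)."""
    ref, cand = set(reference_tokens), set(candidate_tokens)
    return len(ref & cand) / len(ref)

ref = "celle qui parle est ma femme".split()
baseline_out = "celui qui parle est ma femme".split()

# The baseline misses only the feminine form "celle": 5 of 6 unigrams covered.
print(round(unigram_coverage(ref, baseline_out), 2))  # 0.83
```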
Conclusion: Implementation is Easy
You need:
Conclusion: Contributions

Translation quality improvement in a large-scale system

Classes and segmentation predicted during decoding
◮ Modeling flexibility

Foundation for structured language models
◮ Future work: long-distance relations
Segmenter: nlp.stanford.edu/software/
thanks.
References
Galley, M. and C. D. Manning (2009). "Quadratic-time dependency parsing for machine translation". In: ACL-IJCNLP.
Maamouri, M. et al. (2004). "The Penn Arabic Treebank: Building a large-scale annotated Arabic corpus". In: NEMLAR.
Minkov, E., K. Toutanova, and H. Suzuki (2007). "Generating Complex Morphology for Machine Translation". In: ACL.
Och, F. J. and H. Ney (2004). "The alignment template approach to statistical machine translation". In: Computational Linguistics 30.4, pp. 417–449.
Rambow, O. et al. (2005). Parsing Arabic Dialects. Tech. rep. Johns Hopkins University.