Hybrid Adaptation of Named Entity Recognition for Statistical - PDF document

Hybrid Adaptation of Named Entity Recognition for Statistical Machine Translation Vassilina Nikoulina, Ágnes Sándor, Marc Dymetman

Outline • Introduction • Approach NER integration within SMT NER adaptation for SMT NER prediction for SMT • Experiments • Conclusion

Introduction • Incorrect NE translation can seriously harm quality • Main problems caused by NEs in standard PBMT: Ambiguity: • Grant wonderful in Bridget Jones's Diary / Grant obtained by French university Detecting the NE is crucial for producing the right translation Sparsity: • Some named entities can be very sparse (eg. DATEs, UNITs, NAMEs), although they are often used in similar contexts Standard SMT training does not cope well with this situation

Our Approach • Adaptation of NER for better integration within SMT Additional rules on top of generic NER rule-based model • NE generalization: Replace NE with a place-holder (specific to the NE type) • NE translation: Translation of NE with specific NE-translator • NE-replacement predictor, for choosing between: Replacing NE by place-holder and using NE-translator 1. Leaving NE as is and using SMT baseline translation 2.

Example of proposed framework • Src: The Author , F. Mellozzini , carries out an in-depth analysis of the objectives of agricultural policy which have arisen during a meeting held in Rome by the Confederation of Agricultural Workers on 18 - 19 October . • Reduced Src: The Author , PERSON , carries out an in - depth analysis of the objectives of agricultural policy which have arisen during a meeting held in Rome by the ORGANIZATION on DATE . • Reduced Translation: L'auteur, PERSON, exerce une analyse approfondie des objectifs de la politique agricole qui ont ainsi présentée au cours de la réunion tenue à Rome par la ORGANIZATION en DATE.

Example of proposed framework • Reduced Translation: L'auteur, PERSON, exerce une analyse approfondie des objectifs de la politique agricole qui ont ainsi présentée au cours de la réunion tenue à Rome par la ORGANIZATION en DATE. • NE Translation: can be rule-based, dictionary-based, specific for different NE types etc. F. Mellozzini = F. Mellozzini Confederation of Agricultural Workers = Confédération des travailleurs agricoles 18 - 19 October = 18 - 19 octobre • Final Translation: L'auteur, F. Mellozzini, exerce une analyse approfondie des objectifs de la politique agricole qui ont ainsi présentée au cours de la réunion tenue à Rome par la Confédération des travailleurs agricoles en 18 - 19 octobre .

NER adaptation and prediction • NER errors may lead to decrease in translation quality • The internal structure of NE's should be adapted for SMT (different from structure required for IE) We propose post-processing rules on top of our baseline NER system • Not all NEs should be replaced by the place-holder: e.g. If the NE is frequent in the bilingual training data, then the baseline SMT may perform well in translating it We propose to learn a predictor for making the choice

Adaptation of NER for SMT • Many existing NER systems are created for Information Extraction (IE) • Translation works better with a minimal pattern: Modification of NER system so that it does not extract • common nouns • function words • – Advantages: Simplifies NE translation model • Reduces sparsity in phrase extraction •

Prediction model for NE replacement • Prediction model: 0/1 classifier deciding whether NE replacement is beneficial for final translation quality • Some features : NE type NE frequency in training data NE context in source Confidence in NE translation • In order to learn this classifier, we need to create some training data…

Creating a training set for the prediction model For each sentence s in a dev-set : • Translate s with the baseline SMT model : SMT(s) • For each ne found by NER in s : Replace ne with place-holder: s | ne Translate s | ne with the placeholder-enabled SMT model : SMT_NE ( s | ne ) Compare SMT(s) and SMT_NE ( s | ne ) relative to the reference translation (BLEU or TER) Label ne positive if the comparison strongly in favor of SMT_NE ( s | ne ), negative in the opposite case, neutral if difference is small Train the classifier on the positive/neutral/negative labels Note: This model can be generalized to a multiple-class classication problem, when different NE translators are available.

Overall Training of the NE-aware SMT system • Create reduced parallel corpus: Use NER on the source side of our bilingual corpus Project source NEs on the target (through word-alignment) Replace aligned NEs with a place-holder • This replacement is done only with probability alpha, so as to keep a proportion of NEs in their original form • Train reduced SMT model: This model will be able to deal not only with the place-holders, but also with the original form of frequent Named Entities • Train Prediction model for reduced SMT model

Experimental settings • English-French translation task • Data: titles and abstracts of scientific publications in Agricultural domain (European Project Organic.Lingua) • Baseline SMT: Moses with standard settings trained on ~150K in-domain parallel sentences • Baseline NER: Xerox Incremental Parser, rule-based • NE prediction model: SVM 3-class classifier (libsvm) 1 : replace with a place-holder; 0/-1 : do not replace • NE-specific translation model: a combination of two techniques Bilingual dictionary extracted by projection from the bilingual corpus When not found in this dictionary, baseline SMT system, but tuned on a set of parallel NE's

Experimental results Titles Abstracts BLEU TER BLEU TER 0.3135 0.6566 0.1148 0.8935 Baseline SMT NE-aware SMT, baseline NER 0.3213 0.6636 0.1211 0.9064 NE-aware SMT, adapted NER 0.3258 0.6605 0.1257 0.8968 NE-aware SMT, 0.3371 0.6523 0.1228 0.9050 baseline NER + NE prediction model NE-aware SMT, 0.3421 0.6443 0.1341 0.8935 adapted NER + NE prediction model

Conclusions and future work • Proposed framework for NE integration within SMT addressing sparsity issues • Adaptation of standard NER + Prediction of NE-replacement are beneficial for final translation quality • Future work: replace pipeline architecture with confusion network

Questions ? Also can be addressed to: Vassilina.Nikoulina@m4x.org

Hybrid Adaptation of Named Entity Recognition for Statistical - PDF document

Hybrid Adaptation of Named Entity Recognition for Statistical Machine Translation Vassilina Nikoulina, gnes Sndor, Marc Dymetman Outline Introduction Approach NER integration within SMT NER adaptation for SMT NER prediction for

Named Entity Recognition Using BERT and ELMo Group 8 : Mikaela Guerrero Vikash Kumar Nitya

Recycling Named Entity Taggers Unsupervised Domain and Language Adaptation for Named Entity

Hybrid Adaptation of Named Entity Recognition for Statistical Machine Translation Vassilina

Multi-Task Transfer Learning for Fine-Grained Named Entity Recognition Masato Hagiwara 1 , Ryuji

Named Entity WordNet *Istituto di Linguistica Computazionale (Pisa, Italy) ^University of

Hybrid Construction Hybrid Construction Hybrid Construction Hybrid Construction 1 VP

Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Minema Minema

Information Extraction Extracting limited forms of information from text Named entity

Efficient Dependency-Guided Named Entity Recognition Zhanming Jie Aldrian Obaja Muis Wei Lu

Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model for t he Dist

Hybrid Automobiles Hybrid Automobiles It switches easily between fuel, batteries, or both It

AIDA-light: High-Throughput Named-Entity Disambiguation Ba Dat Nguyen Johannes Hoffart Martin

Coastal Adaptation Kellie Fisher FCERM Senior Advisor Why Adaptation? Adaptation to a

HIPAA RESEARCH POLICY THE NEW STRUCTURE OF THE EMORY UNIVERSITY HYBRID COVERED ENTITY AND HOW IT

VI.3 Named Entity Reconciliation Problem: Same entity appears in Different spellings

Design Challenges for Entity Linking Xiao Ling , Sameer Singh, Daniel S. Weld Entity Linking

A Nation of Law 1789 1 1781 TIMELINE 1781 1789 1789 1781 overview Confederation

Predicting Structures: Conditional Models and Local Classifiers CS 6355: Structured Prediction 1

Hoare logic Lecture 4: A verifier for Hoare logic Jean Pichon-Pharabod University of Cambridge

Briefing on eHR Content By Veronica HUNG Health Informatician, eHRISO Domains Domains eHR

Supporting Open Science through a global network of open, digital repositories Kathleen Shearer,

Multiple Antenna Secret Broadcast over Multiple Antenna Secret Broadcast over Wireless Networks

OPES and E2E Encryption Should OPES be compatible with end-to- end encryption? Define

Trademark and Unfair Competition Law Slides 19: Surveys & More Confusion LAWS 7341-001

Hybrid Adaptation of Named Entity Recognition for Statistical - PDF document

Hybrid Adaptation of Named Entity Recognition for Statistical Machine Translation Vassilina Nikoulina, gnes Sndor, Marc Dymetman Outline Introduction Approach NER integration within SMT NER adaptation for SMT NER prediction for

Named Entity Recognition Using BERT and ELMo Group 8 : Mikaela Guerrero Vikash Kumar Nitya

Recycling Named Entity Taggers Unsupervised Domain and Language Adaptation for Named Entity

Hybrid Adaptation of Named Entity Recognition for Statistical Machine Translation Vassilina

Multi-Task Transfer Learning for Fine-Grained Named Entity Recognition Masato Hagiwara 1 , Ryuji

Named Entity WordNet *Istituto di Linguistica Computazionale (Pisa, Italy) ^University of

Hybrid Construction Hybrid Construction Hybrid Construction Hybrid Construction 1 VP

Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Minema Minema

Information Extraction Extracting limited forms of information from text Named entity

Efficient Dependency-Guided Named Entity Recognition Zhanming Jie Aldrian Obaja Muis Wei Lu

Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model for t he Dist

Hybrid Automobiles Hybrid Automobiles It switches easily between fuel, batteries, or both It

AIDA-light: High-Throughput Named-Entity Disambiguation Ba Dat Nguyen Johannes Hoffart Martin

Coastal Adaptation Kellie Fisher FCERM Senior Advisor Why Adaptation? Adaptation to a

HIPAA RESEARCH POLICY THE NEW STRUCTURE OF THE EMORY UNIVERSITY HYBRID COVERED ENTITY AND HOW IT

VI.3 Named Entity Reconciliation Problem: Same entity appears in Different spellings

Design Challenges for Entity Linking Xiao Ling , Sameer Singh, Daniel S. Weld Entity Linking

A Nation of Law 1789 1 1781 TIMELINE 1781 1789 1789 1781 overview Confederation

Predicting Structures: Conditional Models and Local Classifiers CS 6355: Structured Prediction 1

Hoare logic Lecture 4: A verifier for Hoare logic Jean Pichon-Pharabod University of Cambridge

Briefing on eHR Content By Veronica HUNG Health Informatician, eHRISO Domains Domains eHR

Supporting Open Science through a global network of open, digital repositories Kathleen Shearer,

Multiple Antenna Secret Broadcast over Multiple Antenna Secret Broadcast over Wireless Networks

OPES and E2E Encryption Should OPES be compatible with end-to- end encryption? Define

Trademark and Unfair Competition Law Slides 19: Surveys &amp; More Confusion LAWS 7341-001

Trademark and Unfair Competition Law Slides 19: Surveys & More Confusion LAWS 7341-001