Using Word Embeddings to Enforce Document-Level Lexical Consistency - PowerPoint PPT Presentation

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation Eva Martínez Garcia Carles Creus Cristina España-Bonet Lluís Màrquez EAMT 2017 – May 30th – Prague

Outline Motivation 1 Lexical Consistency 2 Experiments 3 Conclusions & Future Work 4

Outline Motivation 1 Document-Level Decoding Lexical Consistency 2 Experiments 3 Conclusions & Future Work 4

MOTIVATION Traditionally, MT systems are designed at sentence level Discourse information helps for more coherent translations SMT: recent work at Document Level: Usually focused on a specific phenomenon: pronominal anaphora, topic cohesion/coherence, lexical consistency, discourse connectives Post-process and re-ranking approaches Document-Level SMT decoders: Docent (Hardmeier et al. 2012, 2013) and Lehrer NMT: only some work introducing context information or tackling Document-Level phenomena 4

MOTIVATION: Sentence-Level Decoding 5

MOTIVATION: Document-Level Decoding 6

Outline Motivation 1 Lexical Consistency 2 Semantic Space Lexical Consistency Feature (SSLC) Lexical Consistency Change Operation (LCCO) Experiments 3 Conclusions & Future Work 4

Lexical Consistency: Our Approach Translations are more consistent when the same word appears translated into the same forms or into different forms with similar/related meaning throughout a document Goals Avoid inconsistent translations for the same word Handle lexical-choice problem 8

Lexical Consistency: Example 9

SSLC Feature Semantic Space Lexical Consistency Feature Inspired by Semantic Space Language Models (SSLM): - based on word embeddings - maximize the similarity between a word and its context Uses CBOW word2vec word embeddings trained on: - bilingual tokens ( target__source ) - monolingual tokens ( target ) 10

SSLC Feature SSLC scores each occurrence of an inconsistently translated source word depending on: - how distant the proposed translation is to the occurrence context - the best adequacy that could be obtained using another translation option (seen in the document) � � score ( w ) = sim ( � k ∈ occ ( w ) sim ( � w , ctxt w ) − max w k , ctxt w ) 11

SSLC Feature 12

LCCO Change Operation Lexical Consistency Change Operation Boost the decoding process applying several changes at a time & producing more consistent translation candidates LCCO works as follows: - Randomly chooses an inconsistently translated word - Randomly chooses one of its translation options used in the document - Retranslates its occurrences throughout the document 13

LCCO Change Operation 14

Outline Motivation 1 Lexical Consistency 2 Experiments 3 Automatic Evaluation Manual Evaluation Conclusions & Future Work 4

Experiments - Settings Word embeddings: - CBOW word2vec implementation - trained on: europarlv7, UN, MultiUN, subtitles2012 Corpus: - training: europarlv7 - development: newscommentary2009 - test: newscommentary2010 (119 documents) Baselines: Moses, Lehrer Extended systems: - using LCCO - using document-level features: SSLMs SSLC SSLMs+SSLC 16

Automatic Evaluation Development set Test set System TER ↓ BLEU ↑ METEOR ↑ TER ↓ BLEU ↑ METEOR ↑ M OSES 58.28 24.27 46.84 53.70 27.52 50.02 L EHRER 58.34 24.28 46.92 53.78 27.58 50.08 +SSLMs 58.01 24.36 46.91 53.49 27.48 50.10 27.61 +SSLC 58.38 24.26 46.90 53.77 50.07 +SSLMs+SSLC 57.99 24.39 46.95 53.50 27.50 50.07 L EHRER +LCCO 58.36 24.27 46.92 53.77 27.57 50.07 +SSLMs 58.04 24.35 46.92 53.43 27.60 50.15 +SSLC 58.36 24.25 46.89 53.81 27.59 50.07 +SSLMs+SSLC 58.06 24.34 46.93 53.46 27.57 50.12 - not statistically significat at 95 % of confidence - # diff. sentences: between 8 % − 42 % - LCCO applied on 8 % of the documents 17

Manual Evaluation: task 1 100 sentences randomly selected and randomly presented Translated by 17 different systems: - Moses - 8 Lehrer systems - 8 Lehrer + LCCO systems Task: ranking from best to worst sentence-level translation quality (allowing ties) 3 annotators, 70 % − 72 % of pairwise annotator agreement 18

Manual Evaluation: task 1 Results: Lehrer baselines are equivalent to Moses Lehrer+SSLC systems surpass Moses Bilingual information helps SSLC Best system: using SSLMs and SSLCbi together Same patterns when introducing LCCO 19

Manual Evaluation: task 2 Comparison between systems with and without LCCO: baseline, SSLC, SSLMs+SSLC 10 selected documents with lexical changes by LCCO Choose the document translation with the best lexical consistency and adequacy 20

Manual Evaluation: task 2 Comparison between systems with and without LCCO: baseline, SSLC, SSLMs+SSLC 10 selected documents with lexical changes by LCCO Choose the document translation with the best lexical consistency and adequacy Results : - 60 % of the time LCCO variants were preferred - 20 % of the time were ties Systems with LCCO provided better translations 20

Manual Evaluation: example source [...] Due to the choice of the camera and the equipment, these portraits remember the classic photos. [...] The passion for the portrait led Bauer to repeat the idea [...] reference [...] Son retratos que, debido a la selección de la cá- mara y del material recuerdan la fotografía clásica. [...] La pasión por los retratos de Bauer le llevó a repetir la idea [...] M OSES [...] Debido a la elección de la cámara y el equipo, estos retratos recordar el clásico fotos. [...] la pasión por el cuadro conducido Bauer a repetir la idea [...] L EHRER +LCCO [...] Debido a la elección de la cámara y el equipo, estos retratos recordar el clásico fotos. [...] la pasión por el retrato conducido Bauer a repetir la idea [...] 21

Manual Evaluation: example source A special desk was opened [...] “It has been in operation for over a week” respond the clerks at the desk [...] The desk is not overwhelmed with questions. reference [...] se abre una ventanilla especial [...] “Lleva funcio- nando una semana” responden los trabajadores tras ella [...] La ventanilla no logra disipar la avalancha de dudas. M OSES [...] un mostrador especial se inició [...] “Funciona desde hace más de una semana” responder los ujieres en la mesa [...] El escritorio no es, sin duda, cargado con preguntas. L EHRER +SSLC [...] una mesa especial se abre [...] “Funciona desde hace más de una semana” responder los ujieres en la mesa [...] El escritorio no es, sin duda, cargado con preguntas. L EHRER +LCCO [...] un mostrador especial se abre [...] “Funciona desde hace más de una semana” responder los ujieres en la ventanilla [...] El mostrador no es abrumado con preguntas. 22

Outline Motivation 1 Lexical Consistency 2 Experiments 3 Conclusions & Future Work 4

Conclusions We tackled lexical consistency at decoding time Introduced a new feature (SSLC) and a new change operation (LCCO) - SSLC uses word embeddings to measure lexical selection consistency - LCCO performs simultaneous lexical changes in a translation step thus generating more consistent translation candidates Results: - Automatic evaluation metrics do not capture system differences - Human evaluators prefer those systems with our strategies 24

Future Work Use information at lemma and seme level to identify inconsistent translations Work with NMT systems: - Develop post-process or re-ranking strategies - Introduce document-level information as input features - Explore new neural network architectures 25

Thank You! 26

Using Word Embeddings to Enforce Document-Level Lexical Consistency - PowerPoint PPT Presentation

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation Eva Martnez Garcia Carles Creus Cristina Espaa-Bonet Llus Mrquez EAMT 2017 May 30th Prague Outline Motivation 1 Lexical Consistency

Word Embeddings Natural Language Processing VU (706.230) - Andi Rexha 02/04/2020 Word Embeddings

Word embeddings Rappel Embeddings ( pas Word Embeddings ) Est une lookup table Formalisme:

Word Embeddings Revisited: Contextual Embeddings CS 6956: Deep Learning for NLP Overview

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Heterogeneous Lexical Resources MultiJEDI ERC 259234 Lexical Resource Lexical Resource Lexical

Embeddings @ Twitter Making ML easy with Embeddings !!! Sept 2018 Agenda 1 Team 2 Whats an

Word Embeddings Tutorial HILA GONEN PHD STUDENT AT YOAV GOLDBERGS LAB BAR ILAN UNIVERSITY

Mixed membership word embeddings: Corpus-specific embeddings without big data James Foulds

LEXICAL TYPOLOGY Peter Koch (Part I) Koch, Lexical typology, 2010-8-24 A. General introduction

Compilers Lexical Analysis Alex Aiken Lexical Analysis 1. Lexical Analysis 2. Parsing 3.

Symmetric Pattern Based Word Embeddings for Improved Word Similarity Prediction Roy Schwartz + ,

Dense Word Embeddings CMSC 470 Marine Carpuat Slides credit: Jurasky & Martin How to

Dense Word Embeddings CMSC 470 Marine Carpuat Slides credit: Jurasky & Martin How to

8. Ordinary Differential Equations Indispensable for many technical applications! 8. Ordinary

Approaches to Voting Credit for several visuals: Ariel D. Procaccia CSC2556 - Nisarg Shah 1

under Class Imbalance Aditya K. Menon 1 , Harikrishna Narasimhan 2 , Shivani Agarwal 2 and Sanjay

The Consistency Analysis of Secondary Index on Distributed

Consistency, Completeness, and Classicality Adam P renosil Institute of Computer Science,

Replication Distilled: Hazelcast Deep Dive Ensar Basri Kahveci Hazelcast Hazelcast The

Verifying Strong Eventual Consistency in -CRDTs Taylor Blau University of Washington June,

(In)consistency of the combinatorial codifferential Gantumur Tsogtgerel (McGill University) Joint

Sambuz

Useful Links

Newsletter

Mail Us

Using Word Embeddings to Enforce Document-Level Lexical Consistency - PowerPoint PPT Presentation

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation Eva Martnez Garcia Carles Creus Cristina Espaa-Bonet Llus Mrquez EAMT 2017 May 30th Prague Outline Motivation 1 Lexical Consistency

Word Embeddings Natural Language Processing VU (706.230) - Andi Rexha 02/04/2020 Word Embeddings

Word embeddings Rappel Embeddings ( pas Word Embeddings ) Est une lookup table Formalisme:

Word Embeddings Revisited: Contextual Embeddings CS 6956: Deep Learning for NLP Overview

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Word Embeddings CS 6956: Deep Learning for NLP Overview Representing meaning Word

Heterogeneous Lexical Resources MultiJEDI ERC 259234 Lexical Resource Lexical Resource Lexical

Embeddings @ Twitter Making ML easy with Embeddings !!! Sept 2018 Agenda 1 Team 2 Whats an

Word Embeddings Tutorial HILA GONEN PHD STUDENT AT YOAV GOLDBERGS LAB BAR ILAN UNIVERSITY

Mixed membership word embeddings: Corpus-specific embeddings without big data James Foulds

LEXICAL TYPOLOGY Peter Koch (Part I) Koch, Lexical typology, 2010-8-24 A. General introduction

Compilers Lexical Analysis Alex Aiken Lexical Analysis 1. Lexical Analysis 2. Parsing 3.

Symmetric Pattern Based Word Embeddings for Improved Word Similarity Prediction Roy Schwartz + ,

Dense Word Embeddings CMSC 470 Marine Carpuat Slides credit: Jurasky &amp; Martin How to

Dense Word Embeddings CMSC 470 Marine Carpuat Slides credit: Jurasky &amp; Martin How to

8. Ordinary Differential Equations Indispensable for many technical applications! 8. Ordinary

Approaches to Voting Credit for several visuals: Ariel D. Procaccia CSC2556 - Nisarg Shah 1

under Class Imbalance Aditya K. Menon 1 , Harikrishna Narasimhan 2 , Shivani Agarwal 2 and Sanjay

The Consistency Analysis of Secondary Index on Distributed

Consistency, Completeness, and Classicality Adam P renosil Institute of Computer Science,

Replication Distilled: Hazelcast Deep Dive Ensar Basri Kahveci Hazelcast Hazelcast The

Verifying Strong Eventual Consistency in -CRDTs Taylor Blau University of Washington June,

(In)consistency of the combinatorial codifferential Gantumur Tsogtgerel (McGill University) Joint

Sambuz

Useful Links

Newsletter

Mail Us

Dense Word Embeddings CMSC 470 Marine Carpuat Slides credit: Jurasky & Martin How to

Dense Word Embeddings CMSC 470 Marine Carpuat Slides credit: Jurasky & Martin How to