Machine Translation Proposal Pilot Objective: Cost : PEMT 27% - PowerPoint PPT Presentation

Oct 26, 2022 •239 likes •509 views

Machine Translation Proposal Pilot Objective: Cost : PEMT 27% savings over human translation Efficiency : PEMT 25% faster than human translation Quality : an acceptable score under 30 according to the Harmonized the TAUS Dynamic

Machine Translation Proposal
Pilot Objective: ● Cost : PEMT 27% savings over human translation ● Efficiency : PEMT 25% faster than human translation ● Quality : an acceptable score under 30 according to the Harmonized the TAUS Dynamic Quality Framework (DQF) and Multidimensional Quality Metrics (MQM)
Error Type Minor Major Critical Accuracy omission 1 2 3 mistranslation 1 2 3 untranslated 1 2 3 Terminology inconsistent with termbase 1 2 3 inconsistent use of terminology 1 2 3 Fluency grammar 1 2 3
Pilot Project Processes & Problems
Step 1: File Preparation PDF DOCX Delete the unnecessary content Delete the extra space Make the file clear; the alignment easier
Step2: Testing and Training Round A: tmx only Round B: adding related PDFs BLEU score increased
Step3 First Round of Human Evaluation ❖ 2 post-editors ❖ A sample of 1000 words extracted from one of the system ❖ Analyzed and gave the quality score first ❖ average PE time: 53 minutes
Step4: Tuning Failed to train the system put them into training data it works:)!
Step 5: Adding dictionary ❖ 512-page IMF glossary ❖ Converted from PDF to DOCX ❖ Cleaned up formats and terms ❖ Added it into the dictionary data ❖ Trained two systems
Problem Time consuming glossary clean-up
Problem Failed to add the glossary into dictionary data
Problem Lower BLEU score after adding the glossary
Step 6: The final round of human evaluation ❖ 2 post-editors ❖ A sample of 1000 words extracted from the system with the highest BLEU score(44.03) ❖ Analyzed and gave the quality score first ❖ Average PE time: 0.68 hours
Problem Mistranslated and untranslated MT due to incomplete manual cleanup
Pilot Project Results
85% Efficiency
71% Cost HT: ❖ Translation: $0.12/word ❖ Editing: $0.05/word PEMT: ❖ Post-editing: $0.05/word
31.5 Quality
Quality QA Error Score: 49 31.5 PE time for 1000 words: 53mins 40.8mins Comparison of two rounds of human evaluation
Lessons Learned
PDF formating cleanup
When in doubt, check the system ➔ The system IS objective ➔ TMX IS better than PDF
Content relevance is key
➔ “ Terminology ” and ” Fluency ” performance are improved ➔ Further data needs to be collected for assessment accuracy.
Thank you

Recommend

I want my MVP UX in the City - 20th April 2017 PILOT WORKS 1 Hello, I am Alastair from PILOT

I want my MVP UX in the City - 20th April 2017 PILOT WORKS 1 Hello, I am Alastair from PILOT WORKS 2 PILOT WORKS alastair@pilot.works What shall we build? 3 PILOT WORKS alastair@pilot.works PILOT WORKS alastair@pilot.works My

1.09k views • 78 slides

Statistical Machine Translation Nadir Durrani 21-November-2014 Machine Translation

Statistical Machine Translation Nadir Durrani 21-November-2014 Machine Translation www.uni-stuttart.de Problem: Automatic translation the foreign text: 2 Open Problems in Machine Translation www.uni-stuttart.de Ambiguity in translation

943 views • 44 slides

Introd u ction to machine translation MAC H IN E TR AN SL ATION IN P YTH ON Th u shan

Introd u ction to machine translation MAC H IN E TR AN SL ATION IN P YTH ON Th u shan Ganegedara Data Scientist and A u thor Machine translation MACHINE TRANSLATION IN PYTHON Machine translation MACHINE TRANSLATION IN PYTHON Co u rse o u

839 views • 38 slides

Machine Translation Machine Translation February 13, 2008 Andreas Eisele UdS Computerlinguistik

Machine Translation Machine Translation February 13, 2008 Andreas Eisele UdS Computerlinguistik & DFKI eisele@dfki.de Foundations of Language Science and Technology WS 2007/8 Machine Translation: Overview Machine Translation: Overview

219 views • 21 slides

Neural Machine Translation Gongbo Tang 8 October 2018 Outline Neural Machine Translation 1

Neural Machine Translation Gongbo Tang 8 October 2018 Outline Neural Machine Translation 1 Advances and Challenges 2 Gongbo Tang Neural Machine Translation 2/52 Neural Machine Translation Figure Recurrent neural network based NMT

907 views • 73 slides

11-731 Machine Translation Speech 2 Speech Translation Speech Translation Three part systems

11-731 Machine Translation Speech 2 Speech Translation Speech Translation Three part systems Three part systems ASR ASR - -> Translation > Translation - -> TTS > TTS System configurations System

288 views • 27 slides

Machine Translation Philipp Koehn 28 April 2020 Philipp Koehn Artificial Intelligence: Machine

Machine Translation Philipp Koehn 28 April 2020 Philipp Koehn Artificial Intelligence: Machine Translation 28 April 2020 Machine Translation: French (2012) 1 Philipp Koehn Artificial Intelligence: Machine Translation 28 April 2020 Machine

1.32k views • 114 slides

IKATAN PILOT INDONESIA Source: FAA (2004-2013) IKATAN PILOT INDONESIA Source: FAA IKATAN PILOT

Pr Prepared by: Ca Capt Si Sigit Sa Sasongko Technical Di Director 1 IKATAN PILOT INDONESIA Source: FAA (2004-2013) IKATAN PILOT INDONESIA Source: FAA IKATAN PILOT INDONESIA IKATAN PILOT INDONESIA IKATAN PILOT INDONESIA IKATAN PILOT

368 views • 20 slides

Statistical Machine Translation Statistical Machine Translation p Lecture 2 Theory and Praxis of

Components: Translation model, language model, decoder Statistical Machine Translation Lecture 2: Theory and Praxis of Decoding p Statistical Machine Translation Statistical Machine Translation p Lecture 2 Theory and Praxis of Decoding

541 views • 9 slides

Computer Aided Translation Philipp Koehn 30 April 2015 Philipp Koehn Machine Translation:

Computer Aided Translation Philipp Koehn 30 April 2015 Philipp Koehn Machine Translation: Computer Aided Translation 30 April 2015 Why Machine Translation? 1 Assimilation reader initiates translation, wants to know content user is

777 views • 49 slides

Computer Aided Translation Philipp Koehn 15 November 2018 Philipp Koehn Machine Translation:

Computer Aided Translation Philipp Koehn 15 November 2018 Philipp Koehn Machine Translation: Computer Aided Translation 15 November 2018 Why Machine Translation? 1 Assimilation reader initiates translation, wants to know content user

1.04k views • 66 slides

Machine Translation: Going Deep Philipp Koehn 4 June 2015 Philipp Koehn Machine Translation:

Machine Translation: Going Deep Philipp Koehn 4 June 2015 Philipp Koehn Machine Translation: Going Deep 4 June 2015 How do we Improve Machine Translation? 1 More data Better linguistically motivated models Better machine learning

1.21k views • 67 slides

Machine Translation Philipp Koehn 1 December 2015 Philipp Koehn Artificial Intelligence:

Machine Translation Philipp Koehn 1 December 2015 Philipp Koehn Artificial Intelligence: Machine Translation 1 December 2015 Machine Translation: Chinese 1 Philipp Koehn Artificial Intelligence: Machine Translation 1 December 2015 Machine

1.07k views • 96 slides

Neural Machine Translation II Refinements Philipp Koehn 17 October 2017 Philipp Koehn Machine

Neural Machine Translation II Refinements Philipp Koehn 17 October 2017 Philipp Koehn Machine Translation: Neural Machine Translation II Refinements 17 October 2017 Neural Machine Translation 1 <s> the house is big .

828 views • 44 slides

Representing Huge Translation Models Statistical Machine Translation parallel text + alignment

Representing Huge Translation Models Statistical Machine Translation parallel text + alignment Statistical Machine Translation extract rules parallel text + alignment Statistical Machine Translation score extract rules rules parallel

1.27k views • 109 slides

Global Translation Services Website translation using post-edited machine translation and

Global Translation Services Website translation using post-edited machine translation and crowdsourcing David Grunwald davidg@gts-translation.com twitter: @davegrun LinkedIn: davegrun March 31, 2011 David Grunwald, GTS About GTS Small

214 views • 17 slides

Automated Translation: How Does It Work? Stelios Piperidis Simon Krek ELRC, ILSP/Athena RC

Automated Translation: How Does It Work? Stelios Piperidis Simon Krek ELRC, ILSP/Athena RC Joef Stefan Institute ELRC Training Workshop in Slovenia, 08.12.2015 1 Machine Translation Agenda: Why MT: Volume, Quality and Cost? Why is

309 views • 30 slides

Overview Overview About MTM Driver Training Credentialing Electronic Trip Download

v Nebraska Transportation Provider Overview Overview About MTM Driver Training Credentialing Electronic Trip Download Reveal Claims MTM Culture The MTM Advantage Managing non-emergency medical transportation since

617 views • 15 slides

CMS Data Transfer tests towards LHC data taking CMS Data Transfer tests towards LHC data taking D

CMS Data Transfer tests towards LHC data taking CMS Data Transfer tests towards LHC data taking D Bonacorsi CMS Facilities Infrastructure Operations INFN CNAF Bologna Italy On behalf of the CMS experiment

772 views • 48 slides

Port of Vancouver USA Petroleum by Rail Terminal June 27, 2013 Overview Supplying the West

Port of Vancouver USA Petroleum by Rail Terminal June 27, 2013 Overview Supplying the West Coast with North American Petroleum Partnership of Tesoro and Savage Safety and Environmental Stewardship Tesoro Savage Petroleum Terminal

295 views • 26 slides

Integrated Knowledge Translation (IKT) Co-production of knowledge On-going relationship

Developing and improving indicators to capture the activity of Integrated Care Diabetes Clinical Nurse Specialists: a collaborative audit and feedback process Fiona Riordan School of Public Health, UCC Integrated Knowledge Translation (IKT)

344 views • 9 slides

Decision on Data Release Phase 3 Greg Cook Greg Cook Director, Market and Infrastructure Policy

Decision on Data Release Phase 3 Greg Cook Greg Cook Director, Market and Infrastructure Policy ISO Board of Governors Meeting General Session May 18-19, 2011 y , Phase 3 releases additional data to improve market efficiency and

78 views • 5 slides

spending Experience of UNCITRAL Samira Musayeva, UNCITRAL secretariat Relevance of the UNCITRAL

UNCITRAL United Nations Commission on International Trade Law Using E-Procurement data to measure the transparency and performance of public spending Experience of UNCITRAL Samira Musayeva, UNCITRAL secretariat Relevance of the UNCITRAL

357 views • 8 slides

PROTECTION GOALS FOR PRIVACY ENGINEERING Marit Hansen, Meiko Jensen, and Martin Rost

www.datenschutzzentrum.de PROTECTION GOALS FOR PRIVACY ENGINEERING Marit Hansen, Meiko Jensen, and Martin Rost International Workshop on Privacy Engineering May 21, 2015 Protection Goals for Privacy Engineering www.datenschutzzentrum.de

719 views • 35 slides