Introduction to Machine Translation CMSC 723 / LING 723 / INST 725 - - PowerPoint PPT Presentation

introduction to machine translation
SMART_READER_LITE
LIVE PREVIEW

Introduction to Machine Translation CMSC 723 / LING 723 / INST 725 - - PowerPoint PPT Presentation

Introduction to Machine Translation CMSC 723 / LING 723 / INST 725 Marine Carpuat Slides & figure credits: Philipp Koehn mt-class.org T odays topics Machine Translation Historical Background Machine Translation is an old idea


slide-1
SLIDE 1

Introduction to Machine Translation

CMSC 723 / LING 723 / INST 725 Marine Carpuat

Slides & figure credits: Philipp Koehn mt-class.org

slide-2
SLIDE 2

T

  • day’s topics

Machine Translation

  • Historical Background
  • Machine Translation is an old idea
  • Machine Translation Today
  • Use cases and method
  • Machine Translation Evaluation
slide-3
SLIDE 3

1947

When I look at an article in Russian, I say to myself: This is really written in English, but it has been coded in some strange

  • symbols. I will now

proceed to decode.

Warren Weaver

slide-4
SLIDE 4

1950s-1960s

  • 1954 Georgetown-IBM experiment
  • 250 words, 6 grammar rules
  • 1966 ALPAC report
  • Skeptical in research progress
  • Led to decreased US government funding for MT
slide-5
SLIDE 5

Rule based systems

  • Approach
  • Build dictionaries
  • Write transformation rules
  • Refine, refine, refine
  • Meteo system for weather

forecasts (1976)

  • Systran (1968), …
slide-6
SLIDE 6

1988

More about the IBM story: 20 years of bitext workshop

slide-7
SLIDE 7

Statistical Machine Translation

  • 1990s: increased research
  • Mid 2000s: phrase-based MT
  • (Moses, Google Translate)
  • Around 2010: commercial viability
  • Since mid 2010s: neural network models
slide-8
SLIDE 8

MT History: Hype vs. Reality

slide-9
SLIDE 9

How Good is Machine Translation? Chinese > English

slide-10
SLIDE 10

How Good is Machine Translation? French > English

slide-11
SLIDE 11

The Vauquois Triangle

slide-12
SLIDE 12

Learning from Data

  • What is the best translation?
  • Counts in parallel corpus (aka bitext)
  • Here European Parliament corpus
slide-13
SLIDE 13

Learning from Data

  • What is most fuent?
  • A language modeling problem!
slide-14
SLIDE 14

Word Alignment

slide-15
SLIDE 15

Phrase-based Models

  • Input segmented in phrases
  • Each phrase is translated in
  • utput language
  • Phrases are reordered
slide-16
SLIDE 16

Neural MT

slide-17
SLIDE 17

What is MT good (enough) for?

  • Assimilation: reader initiates translation, wants to know content
  • User is tolerant of inferior quality
  • Focus of majority of research
  • Communication: participants in conversation don’t speak same language
  • Users can ask questions when something is unclear
  • Chat room translations, hand-held devices
  • Often combined with speech recognition
  • Dissemination: publisher wants to make content available in other

languages

  • High quality required
  • Almost exclusively done by human translators
slide-18
SLIDE 18

Applications

slide-19
SLIDE 19

State of the Art (rough estimates)

slide-20
SLIDE 20

T

  • day’s topics

Machine Translation

  • Historical Background
  • Machine Translation is an old idea
  • Machine Translation Today
  • Use cases and method
  • Machine Translation Evaluation
slide-21
SLIDE 21

How good is a translation? Problem: no single right answer

slide-22
SLIDE 22

Evaluation

  • How good is a given machine translation system?
  • Many different translations acceptable
  • Evaluation metrics
  • Subjective judgments by human evaluators
  • Automatic evaluation metrics
  • Task-based evaluation
slide-23
SLIDE 23

Adequacy and Fluency

  • Human judgment
  • Given: machine translation output
  • Given: input and/or reference translation
  • Task: assess quality of MT output
  • Metrics
  • Adequacy: does the output convey the meaning of the input sentence? Is

part of the message lost, added, or distorted?

  • Fluency: is the output fluent? Involves both grammatical correctness and

idiomatic word choices.

slide-24
SLIDE 24

Fluency and Adequacy: Scales

slide-25
SLIDE 25
slide-26
SLIDE 26

Let’s try: rate fluency & adequacy on 1-5 scale

slide-27
SLIDE 27

Challenges in MT evaluation

  • No single correct answer
  • Human evaluators disagree
slide-28
SLIDE 28

Automatic Evaluation Metrics

  • Goal: computer program that computes quality of translations
  • Advantages: low cost, optimizable, consistent
  • Basic strategy
  • Given: MT output
  • Given: human reference translation
  • Task: compute similarity between them
slide-29
SLIDE 29

Precision and Recall of Words

slide-30
SLIDE 30

Precision and Recall of Words

slide-31
SLIDE 31

Word Error Rate

slide-32
SLIDE 32

WER example

slide-33
SLIDE 33

BLEU Bilingual Evaluation Understudy

slide-34
SLIDE 34

Multiple Reference Translations

slide-35
SLIDE 35

BLEU examples

slide-36
SLIDE 36

Semantics-aware metrics: e.g., METEOR

slide-37
SLIDE 37

Drawbacks of Automatic Metrics

  • All words are treated as equally relevant
  • Operate on local level
  • Scores are meaningless (absolute value not informative)
  • Human translators score low on BLEU
slide-38
SLIDE 38

Yet automatic metrics such as BLEU correlate with human judgement

slide-39
SLIDE 39

Caveats: bias toward statistical systems

slide-40
SLIDE 40

Automatic metrics

  • Essential tool for system development
  • Use with caution: not suited to rank systems of different types
  • Still an open area of research
  • Connects with semantic analysis
slide-41
SLIDE 41

T ask-Based Evaluation Post-Editing Machine Translation

slide-42
SLIDE 42

T ask-Based Evaluation Content Understanding T ests

slide-43
SLIDE 43

T

  • day’s topics

Machine Translation

  • Historical Background
  • Machine Translation is an old idea
  • Machine Translation Today
  • Use cases and method
  • Machine Translation Evaluation