1
CS 188: Artificial Intelligence
Spring 2006
Lecture 28: Machine Translation 5/2/2006
Dan Klein – UC Berkeley
Machine Translation: Examples Levels of Transfer
Interlingua Semantic Structure Semantic Structure Syntactic Structure Syntactic Structure Word Structure Word Structure Source Text Target Text Semantic Composition Semantic Decomposition Semantic Analysis Semantic Generation Syntactic Analysis Syntactic Generation Morphological Analysis Morphological Generation Semantic Transfer Syntactic Transfer Direct
(Vauquois triangle)
General Approaches
Rule
- b
ased approaches
Expert system style rewrite systems Interlingua methods (analyze and generate) Lexicons come from humans or dictionaries Can be very fast, and can accumulate a lot of knowledge over time (e.g. Systran)
Statistical approaches
Noisy channel systems Lower-level transfer Lexicons discovered using parallel corpora Require little human declaration of knowledge
The Coding View
“One naturally wonders if the problem of translation could conceivably be treated as a problem in cryptography. When I look at an article in Russian, I say: ‘This is really written in English, but it has been coded in some strange symbols. I will now proceed to decode.’ ”
Warren Weaver (1955:18, quoting a letter he wrote in 1947)
MT System Components
source P(e) e f decoder
- bserved
argmax P(e|f) = argmax P(f|e)P(e) e e e f best channel P(f|e)
Language Model Translation Model Finds an English translation which is both fluent and semantically faithful to the French source