Evalua&on of Machine Transla&on Quality
Marco Turchi
FBK Trento, Italy turchi@<k.eu Slides from the presenta&on by MaDeo Negri… and myself
Disclaimer
“More has been wriDen about MT evalua&on
- ver the past 50 years than about MT itself”
Hovy et al.: Principles of Context-Based Machine Transla7on Evalua7on. Machine Transla&on, 16, pp. 1–33, 2002 (aDributed to Yorick Wilks)
“It is impossible to write a comprehensive overview of the MT evalua&on literature”
Adam Lopez.: Sta7s7cal Machine Transla7on. ACM Compu&ng Surveys 40(3) pp. 1–49, August 2008.
MT Evalua&on, Trento, Doctoral School - April 2016
Outline
- Importance of MT Evalua&on
- Difficulty of MT Evalua&on
- Human evalua&on: fluency/adequacy
- Automa&c evalua&on:
– Reference-based: BLEU, TER, HTER (chosen among MANY others) – Reference-free: quality es&ma&on (es&ma&ng post-edi&ng effort)
MT Evalua&on, Trento, Doctoral School - April 2016
The importance of MT evalua&on
- Answering “How good is an MT system?” as a way to:
– Which system to use for a given task – Assess and compare systems’ performance – Define the state of the art – Drive system development and measure improvements – Decide whether to apply MT at all
- …Necessary (yes, not sufficient) condi&ons for progress in
any research field
- Difficult task!
MT Evalua&on, Trento, Doctoral School - April 2016