Word Segmentation and their Integration in Machine Translation
Advanced MT Seminar
ThuyLinh Nguyen
thuylinh@cs.cmu.edu
Advanced MT seminar – p. 1/1
Word Segmentation and their Integration in Machine Translation - - PowerPoint PPT Presentation
Word Segmentation and their Integration in Machine Translation Advanced MT Seminar ThuyLinh Nguyen thuylinh@cs.cmu.edu Advanced MT seminar p. 1/1 Word Segmentation Problems Advanced MT seminar p. 2/1 Word Segmentation for MT Use
ThuyLinh Nguyen
thuylinh@cs.cmu.edu
Advanced MT seminar – p. 1/1
Advanced MT seminar – p. 2/1
Advanced MT seminar – p. 3/1
Advanced MT seminar – p. 4/1
Advanced MT seminar – p. 4/1
Advanced MT seminar – p. 4/1
Advanced MT seminar – p. 5/1
Advanced MT seminar – p. 5/1
Advanced MT seminar – p. 5/1
Advanced MT seminar – p. 6/1
Advanced MT seminar – p. 7/1
Advanced MT seminar – p. 7/1
Advanced MT seminar – p. 8/1
Advanced MT seminar – p. 8/1
Advanced MT seminar – p. 9/1
Advanced MT seminar – p. 10/1
J 1
1 ,J
1 |cK 1
I 1
1,I
1|ˆ
J 1
Advanced MT seminar – p. 12/1
Advanced MT seminar – p. 13/1
Advanced MT seminar – p. 14/1
Advanced MT seminar – p. 15/1
Advanced MT seminar – p. 16/1
Advanced MT seminar – p. 17/1
Advanced MT seminar – p. 17/1
Advanced MT seminar – p. 17/1
Advanced MT seminar – p. 17/1
and integral-bit chinese text compression algorithms. Journal
228, 1999. Fuchun Peng, Fangfang Feng, and Andrew Mccallum. Chi- nese segmentation and new word detection using conditional random fields. In Proceedings of Coling 2004, pages 562– 568, Geneva, Switzerland, Aug FebruaryMarch–Aug Febru- aryJuly 2004. COLING. Richard Sproat, Chilin Shih, William Gale, and Nancy Chang. A stochastic finite-state word-segmentation algorithm for chi-
guistics, pages 66–73, 1994. URL #. Huihsin Tseng, Pichuan Chang, Galen Andrew, Daniel Jurafsky, and Christopher Manning. A condi- tional random field word segmenter. 2005. URL
http://www.aclweb.org/anthology-new/W/W06/
Xu. Integrated chinese word segmentation in statistical ma- chine translation. In Proceedings of the International Work- shop on Spoken Language Translation (IWSLT), pages 141– 147, Pittsburgh, PA, October 2005. 17-1
tation for statistical machine translation? In Proceedings of the Third SIGHAN Workshop on Chinese Language Learn- ing, pages 122–128, Barcelona, Spain, July 2004. 17-2