Language comparison through sparse multilingual word alignment
Thomas Mayer¹ Michael Cysouw²
¹ Research Unit Quantitative Language Comparison
Ludwig-Maximilians-Universität München thommy.mayer@gmail.com
² Research Center Deutscher Sprachatlas
2 / 20 Mayer and Cysouw: Language comparison through sparse multilingual word alignment
...
Sentence no. 25 (S25)
L1 why is there a need for a new world (English, en)
L2 warum brauchen wir eine neue welt (German, de)
L3 защо се нуждаем от нов свят (Bulgarian, bl)
L4 por qué se necesita un nuevo mundo (Spanish, es)
L5 għala hemm bżonn ta dinja ġdida (Maltese, mt)
L6 nukatae míehiã xexeme yeye (Ewe, ew)
...
Sentence no. 93 (S93)
L1 who will rule with jesus (English, en)
L2 wer wird mit jesus regieren (German, de)
L3 кой ще управлява с исус (Bulgarian, bl)
L4 quiénes gobernarán con jesús (Spanish, es)
L5 min se jaħkem ma ġesù (Maltese, mt)
L6 amekawoe aãu fia kple yesu (Ewe, ew)
...
     L1                                          L2                                          ...  Lm
S1   why is it often good to ask questions       warum ist es oft gut fragen zu stellen      ...
S2   why do many stop trying to find answers...  warum h...                                  ...
S3   why can we trust that god will undo...      warum k...                                  ...
S4   what does the name jehovah mean             was bedeutet der name jehova                ...
S5   what may we learn about jehovah...          was sagen folgende titel über jehova...     ...
S6   in what ways is the bible different...      warum ist die bibel ein ganz besonderes...  ...
S7   how can the bible help you cope...          wie kann uns die bibel bei pers...          ...
S8   why can you trust the prophecies...         warum kann man den prophezeiungen...        ...
S9   in what ways is the bible an exciting...    warum kann man sagen dass die bibel...      ...
S10  what impresses you about the...             was ist an der verbreitung der bibel...     ...
...  ...                                         ...                                         ...
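The sentence-by-language layout above lends itself to a sparse representation: each word form only needs to record the sentences it occurs in. The following is a minimal sketch of that idea, not the authors' implementation. The dictionary layout, the names `PARALLEL`, `occurrence_index`, and `cooccurrence`, and the use of raw shared-sentence counts are all assumptions made for illustration. The two sentences are S25 and S93 from the slides.

```python
from collections import defaultdict

# Toy parallel text: sentence id -> {language code -> tokenized sentence}.
# The sentences come from the slides; this data layout is an assumption.
PARALLEL = {
    "S25": {
        "en": "why is there a need for a new world",
        "de": "warum brauchen wir eine neue welt",
    },
    "S93": {
        "en": "who will rule with jesus",
        "de": "wer wird mit jesus regieren",
    },
}


def occurrence_index(parallel, lang):
    """Map each word form of `lang` to the set of sentence ids it occurs in."""
    index = defaultdict(set)
    for sid, versions in parallel.items():
        for word in versions.get(lang, "").split():
            index[word].add(sid)
    return dict(index)


def cooccurrence(index_a, index_b):
    """Count shared sentences for every word pair that co-occurs at least once."""
    counts = {}
    for word_a, sents_a in index_a.items():
        for word_b, sents_b in index_b.items():
            shared = len(sents_a & sents_b)
            if shared:
                counts[(word_a, word_b)] = shared
    return counts


en = occurrence_index(PARALLEL, "en")
de = occurrence_index(PARALLEL, "de")
pairs = cooccurrence(en, de)
```

Only word pairs that share at least one sentence are stored, so pairs such as ("world", "regieren") never appear in `pairs`. With more than two sentences, the raw counts would normally be replaced by an association measure before any alignment decision is made.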
[Figure: languages shown are Afrikaans, English, Welsh, German, Icelandic, Lithuanian, Polish, Russian, Ukrainian, Czech, Slovak, Slovenian, Croatian, Serbian, Albanian, Greek, Bulgarian, Romanian, Portuguese, Spanish, Catalan, French, Italian, Danish, Norwegian, Swedish, Dutch]
[Figure: languages shown are Albanian, Rarotongan, Maltese, Malagasy, Lithuanian, Iloko, Croatian, Chichewa, Bulgarian, German, Ponapean, Papiamento (Aruba), Papiamento (Curaçao), Dutch, Niuean, Miskito, Indonesian, Italian, Kiribati, French, English, Danish, Haitian Creole, Catalan, Afrikaans, Ateso, Fijian, Tuvaluan, Swedish, Guna, Hungarian, Quechua (Ancash), Kwanyama, Tumbuka, Chin (Hakha), Tswana, Spanish, Ndonga, Nyaneka, Greek, Finnish, Ewe, Dangme, Chitonga, Shona, Bicol, Xitshwa, Acholi, Luganda, Sepedi]