Creating Large-Scale Multilingual Cognate Tables
Winston Wu and David Yarowsky Center for Language and Speech Processing Johns Hopkins University
Creating Large-Scale Multilingual Cognate Tables Winston Wu and - - PowerPoint PPT Presentation
Creating Large-Scale Multilingual Cognate Tables Winston Wu and David Yarowsky Center for Language and Speech Processing Johns Hopkins University http://educationviews.org/wp-content/uploads/2013/06/world-bread-cognates-panis.jpg Cognates and
Winston Wu and David Yarowsky Center for Language and Speech Processing Johns Hopkins University
http://educationviews.org/wp-content/uploads/2013/06/world-bread-cognates-panis.jpg
Initial cluster with unweighted edit distance Alignment to get lexical translation probabilities Cluster with weighted distance function
azj: stol tat: ostal tat: tablis tuk: stol tuk: tablisa tur: tablo uig: ustel uzn: stol uzn: tablista
eng azj tat tuk tur uig uzn table stol stol stol table
ustel table tablo table tablis tablisa tablista
t -> t 0.600 t -> d 0.098 t -> c 0.061 t -> r 0.057 t -> p 0.019 t -> s 0.017 t -> l 0.017 t -> n 0.015 l -> l 0.747 l -> r 0.048 l -> n 0.024 l -> t 0.019 l -> o 0.018 l -> d 0.016 l -> c 0.015 l -> a 0.015 h -> h 0.529 h -> u 0.150 h -> NULL 0.140 h -> l 0.048 h -> a 0.032 h -> j 0.019 h -> o 0.017 h -> k 0.015
TAT UIG
multilingual cognate table construction