Phylogenetic trees II: Estimating distances, estimating trees from distances
Gerhard Jäger
Words, Bones, Genes, Tools
February 28, 2018
Gerhard Jäger Distance-based estimation WBGT 1 / 67
Background
Markov processes
Estimating distances
language       iso_code  gloss  global_id  local_id  transcription  cognate_class
ELFDALIAN      qov       woman  962        woman     ˈkɛ̀lɪŋg        woman:Ag
DUTCH          nld       woman  962        woman     vrɑu           woman:B
GERMAN         deu       woman  962        woman     fraŭ           woman:B
DANISH         dan       woman  962        woman     ˈg̥ʰvenə        woman:D
DANISH_FJOLDE            woman  962        woman     kvinʲ          woman:D
GUTNISH_LAU              woman  962        woman     ˈkvɪnːˌfolk    woman:D
LATIN          lat       woman  962        woman     ˈmulier        woman:E
LATIN          lat       woman  962        woman     ˈfeːmina       woman:G
ENGLISH        eng       woman  962        woman     wʊmən          woman:H
GERMAN         deu       woman  962        woman     vaĭp           woman:H
DANISH         dan       woman  962        woman     ˈd̥ɛːmə         woman:K
language       woman:Ag  woman:B  woman:D  woman:E  woman:G  woman:H  woman:K  · · ·
DANISH         0         0        1        0        0        0        1        · · ·
DANISH_FJOLDE  0         0        1        0        0        0        0        · · ·
DUTCH          0         1        0        0        0        0        0        · · ·
ELFDALIAN      1         0        0        0        0        0        0        · · ·
ENGLISH        0         0        0        0        0        1        0        · · ·
GERMAN         0         1        0        0        0        1        0        · · ·
GUTNISH_LAU    0         0        1        0        0        0        0        · · ·
LATIN          0         0        0        1        1        0        0        · · ·
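The step from the cognate table to the binary character matrix can be sketched in base R. This is a minimal sketch, not necessarily the preprocessing used for the lecture data: the data frame `cognates` and the name `char.matrix` are illustrative, and the rows are copied from the 'woman' entries listed above.

```r
# Toy excerpt of the cognate table: one row per (language, cognate class) entry.
cognates <- data.frame(
  language = c("ELFDALIAN", "DUTCH", "GERMAN", "DANISH", "DANISH_FJOLDE",
               "GUTNISH_LAU", "LATIN", "LATIN", "ENGLISH", "GERMAN", "DANISH"),
  cognate_class = c("woman:Ag", "woman:B", "woman:B", "woman:D", "woman:D",
                    "woman:D", "woman:E", "woman:G", "woman:H", "woman:H",
                    "woman:K"),
  stringsAsFactors = FALSE
)

# Cross-tabulate languages against cognate classes.
counts <- table(cognates$language, cognates$cognate_class)

# Any count above zero means the class is attested in that language.
char.matrix <- (unclass(counts) > 0) * 1L

print(char.matrix)
```

Each row is one language and each column one cognate class. A language with two words for the same concept, such as GERMAN here, gets a 1 in two columns.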
            Bengali  Breton  Bulgarian  Catalan  Czech  Danish  Dutch  English  French
Bengali     –        2.16    1.64       1.39     1.81   1.41    1.24   1.33     1.28
Breton      2.16     –       1.81       1.67     1.77   1.82    1.86   1.80     1.64
Bulgarian   1.64     1.81    –          1.55     0.34   1.44    1.52   1.31     1.56
Catalan     1.39     1.67    1.55       –        1.53   1.40    1.37   1.17     0.29
Czech       1.81     1.77    0.34       1.53     –      1.40    1.44   1.34     1.53
Danish      1.41     1.82    1.44       1.40     1.40   –       0.45   0.48     1.38
Dutch       1.24     1.86    1.52       1.37     1.44   0.45    –      0.51     1.31
English     1.33     1.80    1.31       1.17     1.34   0.48    0.51   –        1.09
French      1.28     1.64    1.56       0.29     1.53   1.38    1.31   1.09     –
German      1.25     1.72    1.45       1.39     1.40   0.43    0.27   0.49     1.28
Greek       1.57     2.09    1.74       1.72     1.85   1.64    1.69   1.64     1.71
Hindi       0.54     1.89    1.33       1.24     1.34   1.53    1.56   1.41     1.22
Icelandic   1.29     1.85    1.50       1.48     1.51   0.25    0.60   0.58     1.44
Irish       1.87     0.85    1.44       1.58     1.37   1.38    1.38   1.31     1.35
Italian     1.40     1.52    1.51       0.24     1.52   1.32    1.30   1.16     0.26
Lithuanian  2.22     1.66    0.84       1.22     0.83   1.34    1.41   1.25     1.19
Nepali      0.56     0.18    0.20       0.13     0.30   0.20    0.30   0.20     0.20
Polish      1.65     1.86    0.43       1.56     0.28   1.44    1.42   1.32     1.51
Portuguese  1.34     1.57    1.49       0.30     1.44   1.39    1.39   1.16     0.36
Romanian    1.32     1.05    1.19       0.32     1.19   1.12    1.09   1.00     0.28
Russian     1.64     1.73    0.34       1.49     0.29   1.38    1.45   1.26     1.44
Spanish     1.36     1.55    1.47       0.21     1.45   1.42    1.38   1.15     0.30
Swedish     1.43     1.87    1.49       1.41     1.44   0.15    0.49   0.57     1.43
Ukrainian   1.67     1.82    0.40       1.53     0.32   1.45    1.46   1.32     1.51
Welsh       2.08     0.38    1.39       1.19     1.41   1.00    1.08   1.15     1.02
            German  Greek  Hindi  Icelandic  Irish  Italian  Lithuanian  Nepali  Polish
Bengali     1.25    1.57   0.54   1.29       1.87   1.40     2.22        0.56    1.65
Breton      1.72    2.09   1.89   1.85       0.85   1.52     1.66        0.18    1.86
Bulgarian   1.45    1.74   1.33   1.50       1.44   1.51     0.84        0.20    0.43
Catalan     1.39    1.72   1.24   1.48       1.58   0.24     1.22        0.13    1.56
Czech       1.40    1.85   1.34   1.51       1.37   1.52     0.83        0.30    0.28
Danish      0.43    1.64   1.53   0.25       1.38   1.32     1.34        0.20    1.44
Dutch       0.27    1.69   1.56   0.60       1.38   1.30     1.41        0.30    1.42
English     0.49    1.64   1.41   0.58       1.31   1.16     1.25        0.20    1.32
French      1.28    1.71   1.22   1.44       1.35   0.26     1.19        0.20    1.51
German      –       1.65   1.46   0.61       1.30   1.28     1.30        0.20    1.38
Greek       1.65    –      1.53   1.68       1.70   1.60     1.74        0.41    1.85
Hindi       1.46    1.53   –      1.64       1.40   1.28     1.37        0.08    1.35
Icelandic   0.61    1.68   1.64   –          1.43   1.44     1.34        0.30    1.55
Irish       1.30    1.70   1.40   1.43       –      1.30     1.32        0.46    1.41
Italian     1.28    1.60   1.28   1.44       1.30   –        1.18        0.24    1.55
Lithuanian  1.30    1.74   1.37   1.34       1.32   1.18     –           0.81    0.78
Nepali      0.20    0.41   0.08   0.30       0.46   0.24     0.81        –       0.30
Polish      1.38    1.85   1.35   1.55       1.41   1.55     0.78        0.30    –
Portuguese  1.30    1.63   1.27   1.44       1.47   0.32     1.25        0.20    1.44
Romanian    1.00    1.36   0.96   1.18       1.00   0.26     1.20        0.22    1.19
Russian     1.36    1.78   1.34   1.46       1.41   1.48     0.84        0.20    0.32
Spanish     1.32    1.67   1.21   1.50       1.37   0.28     1.18        0.20    1.46
Swedish     0.50    1.68   1.60   0.30       1.38   1.36     1.41        0.20    1.46
Ukrainian   1.42    1.88   1.31   1.51       1.41   1.52     0.79        0.30    0.27
Welsh       0.94    1.12   0.96   1.20       0.54   1.02     0.69        0.69    1.39
            Portuguese  Romanian  Russian  Spanish  Swedish  Ukrainian  Welsh
Bengali     1.34        1.32      1.64     1.36     1.43     1.67       2.08
Breton      1.57        1.05      1.73     1.55     1.87     1.82       0.38
Bulgarian   1.49        1.19      0.34     1.47     1.49     0.40       1.39
Catalan     0.30        0.32      1.49     0.21     1.41     1.53       1.19
Czech       1.44        1.19      0.29     1.45     1.44     0.32       1.41
Danish      1.39        1.12      1.38     1.42     0.15     1.45       1.00
Dutch       1.39        1.09      1.45     1.38     0.49     1.46       1.08
English     1.16        1.00      1.26     1.15     0.57     1.32       1.15
French      0.36        0.28      1.44     0.30     1.43     1.51       1.02
German      1.30        1.00      1.36     1.32     0.50     1.42       0.94
Greek       1.63        1.36      1.78     1.67     1.68     1.88       1.12
Hindi       1.27        0.96      1.34     1.21     1.60     1.31       0.96
Icelandic   1.44        1.18      1.46     1.50     0.30     1.51       1.20
Irish       1.47        1.00      1.41     1.37     1.38     1.41       0.54
Italian     0.32        0.26      1.48     0.28     1.36     1.52       1.02
Lithuanian  1.25        1.20      0.84     1.18     1.41     0.79       0.69
Nepali      0.20        0.22      0.20     0.20     0.20     0.30       0.69
Polish      1.44        1.19      0.32     1.46     1.46     0.27       1.39
Portuguese  –           0.28      1.39     0.17     1.43     1.44       0.96
Romanian    0.28        –         1.13     0.24     1.13     1.20       0.69
Russian     1.39        1.13      –        1.41     1.43     0.22       1.23
Spanish     0.17        0.24      1.41     –        1.45     1.48       1.03
Swedish     1.43        1.13      1.43     1.45     –        1.46       1.06
Ukrainian   1.44        1.20      0.22     1.48     1.46     –          1.25
Welsh       0.96        0.69      1.23     1.03     1.06     1.25       –
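A pairwise distance matrix like this one is the input for distance-based tree estimation. As a hedged illustration, the sketch below runs UPGMA (average-linkage clustering) on a five-language excerpt of the distances tabulated above, using base R's `hclust`. The choice of languages and the variable names are illustrative, and the source does not say that this function was used to produce the lecture's trees.

```r
# Five Germanic languages; distances copied from the table above.
langs <- c("Danish", "Dutch", "English", "German", "Swedish")
d <- matrix(0, 5, 5, dimnames = list(langs, langs))
d["Danish",  "Dutch"]   <- 0.45
d["Danish",  "English"] <- 0.48
d["Danish",  "German"]  <- 0.43
d["Danish",  "Swedish"] <- 0.15
d["Dutch",   "English"] <- 0.51
d["Dutch",   "German"]  <- 0.27
d["Dutch",   "Swedish"] <- 0.49
d["English", "German"]  <- 0.49
d["English", "Swedish"] <- 0.57
d["German",  "Swedish"] <- 0.50

# Only the upper triangle was filled in; mirror it to get a symmetric matrix.
d <- d + t(d)

# UPGMA: repeatedly merge the two clusters with the smallest average distance.
upgma.tree <- hclust(as.dist(d), method = "average")

# Cluster distance at each merge; node heights in the tree are half of these.
print(upgma.tree$height)
```

On this excerpt, Danish and Swedish are merged first (0.15) and Dutch and German second (0.27). The two pairs are then joined, and English attaches last. Calling `plot(upgma.tree)` draws the resulting dendrogram.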
[Figure: tree with branch lengths. Tip labels: Bengali, Breton, Bulgarian, Catalan, Czech, Danish, Dutch, English, French, German, Greek, Hindi, Icelandic, Irish, Italian, Lithuanian, Nepali, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Ukrainian, Welsh]
[Figure: tree with branch lengths. Tip labels: Bengali, Breton, Bulgarian, Catalan, Czech, Danish, Dutch, English, French, German, Greek, Hindi, Icelandic, Irish, Italian, Lithuanian, Nepali, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Ukrainian, Welsh]
[Figure: tree with branch lengths. Tip labels: Bengali, Breton, Bulgarian, Catalan, Czech, Danish, Dutch, English, French, German, Greek, Hindi, Icelandic, Irish, Italian, Lithuanian, Nepali, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Ukrainian, Welsh]
[Figure: tree with branch lengths. Tip labels: Bengali, Breton, Bulgarian, Catalan, Czech, Danish, Dutch, English, French, German, Greek, Hindi, Icelandic, Irish, Italian, Lithuanian, Nepali, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Ukrainian, Welsh]
Working with phonetic strings
language    phonological form (IELex)   ASJP representation
Bengali
Bulgarian   muˈrɛ                       murE
Catalan     mar; maɾ; ma                mar; mar; ma
Czech       ˈmɔr̝ɛ                       morE
Danish      hɑw; søˀ                    how; se
Dutch       ze                          ze
English     si:                         si
French      mɛʀ                         mEr
German      ze:; ’o:t͜sea:n; me:ɐ̯        ze; otsean; mea
Greek       ˈθalaˌsa                    8alasa
Hindi
Icelandic   haːv/sjouːr                 hav; syour
Irish       ˈfˠæɾˠɟɪ                    fErCi
Italian     ˈmare                       mare
Lithuanian  ˈju:rɐ                      yura
Nepali
Polish      ˈmɔʐɛ                       moZE
Portuguese  maɾ                         mar
Romanian    ˈmare                       mare
Russian     ˈmɔrʲɛ                      morE
Spanish     maɾ                         mar
Swedish     hɑːv; ɧøː                   hov; Se
Ukrainian   ˈmɔrɛ                       morE
Welsh
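Once words are written as ASJP strings, they can be compared directly, without cognate judgments. One simple option is the Levenshtein (edit) distance, normalised by the length of the longer word. The sketch below uses base R's `adist` on a few of the strings listed above. It is an illustrative baseline only; the source does not state which string distance the lecture uses, and the variable names are assumptions.

```r
# ASJP strings for 'sea', copied from the table above (one form per language).
asjp <- c(Italian = "mare", Spanish = "mar", Russian = "morE",
          French = "mEr", English = "si", Dutch = "ze")

# adist() returns the Levenshtein distance for every pair of strings.
raw <- adist(asjp)
dimnames(raw) <- list(names(asjp), names(asjp))

# Normalise by the length of the longer string, so values lie in [0, 1].
len <- nchar(asjp)
ldn <- raw / outer(len, len, pmax)

print(round(ldn, 2))
```

ASJP symbols are case-sensitive ("E" and "e" are different sounds), which matches the default behaviour of `adist`. Averaging such word-level distances over many concepts gives one language-level distance per pair.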
[Figure: tree with branch lengths. Tip labels in plotting order: Greek, Bulgarian, Russian, Polish, Ukrainian, Czech, Icelandic, Swedish, Danish, English, Dutch, German, Catalan, Portuguese, Spanish, French, Italian, Breton, Romanian, Lithuanian, Irish, Hindi, Bengali, Welsh, Nepali]
[Figure: tree with branch lengths. Tip labels in plotting order: Greek, Bulgarian, Russian, Polish, Ukrainian, Czech, Icelandic, Swedish, Danish, English, Dutch, German, Catalan, Portuguese, Spanish, French, Italian, Breton, Romanian, Lithuanian, Irish, Hindi, Bengali, Welsh, Nepali]
Hands-on
Old Norse, Ancient Greek, Russian, Latin, Dutch, Old Church Slavonic
library(ape)
library(phangorn)
# Contrast matrix: '-' (missing) is compatible with both states '0' and '1'
contrasts <- matrix(data=c(1,0, 0,1, 1,1), ncol=2, byrow=TRUE)
dimnames(contrasts) <- list(c('0','1','-'), c('0','1'))
# Read the binary cognate data as a phyDat object with user-defined states
cognacy.data <- phyDat(read.nexus.data('ielex.bin.nex'), 'USER',
                       levels=c('0','1','-'), contrast=contrasts, ambiguity='-')
# Character matrix: one row per language, one column per cognate class
cognacy.matrix <- as.character(cognacy.data)
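A possible next step is to turn such a character matrix into pairwise distances. The sketch below computes a Hamming distance that skips missing entries. It uses a small made-up matrix, `toy`, in place of `cognacy.matrix`, so that it runs without the data file. The helper `hamming` and all values are illustrative assumptions, not part of the original exercise.

```r
# Toy stand-in for cognacy.matrix: rows are languages, columns are cognate
# classes, entries are '0' (absent), '1' (present) or '-' (missing).
toy <- rbind(
  A = c("1", "0", "1", "-", "0"),
  B = c("1", "1", "0", "0", "0"),
  C = c("0", "1", "-", "1", "1")
)

# Share of differing characters among those observed in both languages.
hamming <- function(x, y) {
  observed <- x != "-" & y != "-"
  mean(x[observed] != y[observed])
}

# Fill the full symmetric distance matrix, one pair of rows at a time.
n <- nrow(toy)
dist.matrix <- matrix(0, n, n, dimnames = list(rownames(toy), rownames(toy)))
for (i in seq_len(n)) {
  for (j in seq_len(n)) {
    dist.matrix[i, j] <- hamming(toy[i, ], toy[j, ])
  }
}

print(dist.matrix)
```

With the real data, the same loop would run over `cognacy.matrix`. The resulting matrix can then be passed to a distance-based tree method such as `nj()` from ape. phangorn also offers `dist.hamming()` for `phyDat` objects, though its handling of ambiguous states may differ from this sketch.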