Representation of a dissimilarity matrix using reticulograms
Pierre Legendre
Université de Montréal and
Vladimir Makarenkov
Université du Québec à Montréal
DIMACS Workshop on Reticulated Evolution, Rutgers University, September 20-21, 2004
Representation of a dissimilarity matrix using reticulograms Pierre - - PowerPoint PPT Presentation
Representation of a dissimilarity matrix using reticulograms Pierre Legendre Universit de Montral and Vladimir Makarenkov Universit du Qubec Montral DIMACS Workshop on Reticulated Evolution, Rutgers University, September 20-21,
DIMACS Workshop on Reticulated Evolution, Rutgers University, September 20-21, 2004
Root x y i j l(x,y) l(i,x) Set of present-day taxa X
y x
...
l i j
Q1 dij rij – ( )2
j X ∈
i X ∈
n n 1 – ( ) 2
–
1 – ( ) 2
–
= Q Q2 dij rij – ( )2
j X ∈
i X ∈
n n 1 – ( ) 2
–
n n 1 – ( ) 2
–
= AIC 2n 2 – ( ) 2n 3 – ( ) 2
–
2n 2 – ( ) 2n 3 – ( ) 2
–
= dij rij – ( )2
j X ∈
i X ∈
MDL 2n 2 – ( ) 2n 3 – ( ) 2
N ( ) log –
2n 2 – ( ) 2n 3 – ( ) 2
N ( ) log –
= dij rij – ( )2
j X ∈
i X ∈
among n taxa
unrooted reticulogram For initial unrooted binary tree: N = 2n–3 (2n–2)(2n–3)/2 is the number of branches in a completely interconnected, unrooted graph containing n taxa and (2n–2) nodes AIC: Akaike Information Criterion; MDL: Minimum Description Length.
Q1 dij rij – ( )2
j X ∈
i X ∈
n n 1 – ( ) 2
–
Var% 100 1 dij rij – ( )2
j X ∈
i X ∈
dij d – ( )2
j X ∈
i X ∈
× =
1 Example developed in Makarenkov and Legendre (2000).
1 2 3 4 5 6 7 8 9 10 11
0.000
0.089 0.000
0.104 0.106 0.000
0.161 0.171 0.166 0.000
0.182 0.189 0.189 0.188 0.000
0.232 0.243 0.237 0.244 0.247 0.000
0.233 0.251 0.235 0.247 0.239 0.036 0.000
0.249 0.268 0.262 0.262 0.257 0.084 0.093 0.000
0.256 0.249 0.244 0.241 0.242 0.124 0.120 0.123 0.000
0.273 0.284 0.271 0.284 0.269 0.289 0.293 0.287 0.287 0.000
0.322 0.321 0.314 0.303 0.309 0.314 0.316 0.311 0.319 0.320 0.000
0.308 0.309 0.293 0.293 0.296 0.282 0.289 0.298 0.287 0.285 0.252
Macaca sylvanus 22 Lemur catta 20 21 Saimiri sciureus 17 18 Macaca fascicularis 19 Macaca fuscata Macaca mulatta 16 Hylobates 15 Pongo 14 Gorilla 13 Pan Homo sapiens
Cercopithecoidea (Old World monkeys) Prosimii (Lemurs, tarsiers and lorises) Ceboidea (New World monkeys) Hominoidea (Apes and man)
Tarsius syrichta
1 Example developed in Legendre and
Makarenkov (2002).
50 100 150 200 KILOMETRES 50 100 150 200 MILES CONICAL PROJECTION
13 18 17 12 2 1 8 19 21 20 14 3 15 16 5 4 7 6 11 9
ROOT
40 38 41 42 39 37 36 34 35 33 32 31 24 23 30 25 29 26 27 28
10
Camin-Sokal tree with reticulations
1 Legendre, P. and V. Legendre. 1984. Postglacial dispersal of freshwater fishes in the Québec
2 Legendre, P. 1986. Reconstructing biogeographic history using phylogenetic-tree analysis of
community structure. Systematic Zoology 35: 68-80.
50 100 150 200 KILOMETRES 50 100 150 200 MILES CONICAL PROJECTION
13 18 17 12 2 1 8 19 21 20 14 3 15 16 5 4 7 6 11 9
ROOT
40 38 41 42 39 37 36 34 35 33 32 31 24 23 30 25 29 26 27 28
10
Camin-Sokal tree with reticulations
1 Example developed in Makarenkov and Legendre (2004).
1 2 3 4 5 6 7 8
0.0000
0.0258 0.0000
0.0248 0.0357 0.0000
0.1124 0.1215 0.1014 0.0000
0.0713 0.0804 0.0604 0.0920 0.0000
0.1270 0.1361 0.1161 0.1506 0.1033 0.0000
0.1299 0.1390 0.1190 0.1535 0.1128 0.1611 0.0000
0.1370 0.1461 0.1261 0.1606 0.1133 0.1442 0.1427 0.0000
12 13 14 11 10 9 Euglena Chrysophyte Chlamydomonas Chlorella Liverworth Rice Tobacco
0.0193 0.0089 0.0143 0.0106 0.0298 0.0333 0.0025 0.0728 0.01297 0.0093 0.0805 0.0777 0.0745 0.1352 0.1529
Cyanobacterium
Reticulogram
0.1285
58% 87% 63% 100% 100%
Splits graph
Chlorella Chl a+b Tobacco Chl a+b Chlamydomonas Chl a+b Euglena Chl a+b Rice Chl a+b
0.0075 0.0008 0.0104 0.0175 0.0070 0.0680 0.0035 0.0002 0.0033 0.0238 0.0671 0.0142 0.0087 0.0678 0.0629 0.0255
Cyanobacterium Chl a Liverworth Chl a+b Chrysophyte Chl a+c H H H H H H
1 Example developed in Makarenkov, Legendre and Desdevises (2004).
1 2 3 4 5 6
0.000
0.090 0.000
0.103 0.093 0.000
0.096 0.090 0.117 0.000
0.004 0.093 0.106 0.099 0.000
0.075 0.100 0.103 0.099 0.078 0.000
9
Apis cerana
8
Apis mellifera Ancestor of Apis mellifera Ancestor of Apis cerana
9
Apis cerana
8
Apis mellifera Apis dorsata
10
Apis koschevnikovi
7
Apis florea Apis andreniformis
NJ: 100% ML: 100% NJ: 88% ML: 89% NJ: 57% ML: 54% 0.0510 0.0901 0.0395 0.0535 0.1034 0.0399 0.0347 0.0091 0.0059 0.0037 0.0007
Least-squares loss Q Criterion Q2 Phylogenetic tree 0.000143 0.000024 + 1 reticulation 0.000104 0.000021 + 2 reticulations 0.000078 0.000020 (min)
1 Example developed in Legendre and Makarenkov (2002).
Populations C E J L M N O T Z C 0.0000 E 2.1380 0.0000 J 2.2713 2.9579 0.0000 L 1.7135 2.3927 1.7772 0.0000 M 1.5460 1.9818 2.4575 1.0125 0.0000 N 2.6979 3.3566 1.9900 1.8520 2.6954 0.0000 O 2.9985 3.6848 3.4484 2.4272 2.6816 2.3108 0.0000 T 2.3859 2.3169 2.4666 1.4545 1.7581 2.2105 2.5041 0.0000 Z 2.3107 2.3648 1.8086 1.6609 2.0516 2.2954 3.4301 2.0413 0.0000
3 km
E C La Houille River J L T N Z M O L a H u l l e R i v e r
50° N 5° E
10 11 16 12 15 13 14
50° N 5° E
1 Example developed in Legendre and Makarenkov (2002). 2 McDade, L. A. 1992. Hybrids and phylogenetic systematics II. The impact of hybrids on
cladistic analysis. Evolution 46: 1329-1346.
15 14 DEPP PANA 20 21 STOR 23 22 GOLF GRAC 24 SINC TERR 19 LING 16 LEON 17 DARI 18 CAMP HART
Ancestor
Species tree with 5 reticulation branches
Ancestor
16 15 DEPP PANA 21 17 LING 20 LEON 19 DARI 18 CAMP HART 22 STOR 24 23 GOLF GRAC 25 TERR 26 SINC DE+SI
Available in PDF at http://www.fas.umontreal.ca/biol/legendre/reprints/ and http://www.info.uqam.ca/~makarenv/trex.html
Authors: Vladimir Makarenkov Versions: Windows 9x/NT/2000/XP and Macintosh With contributions from A. Boc, P. Casgrain, A. B. Diallo, O. Gascuel, A. Guénoche, P.-A. Landry, F.-
Distance matrix between objects
1 2 3 4 5 6 7 8 9 10
________
1 Makarenkov, V. 2001. T-REX: reconstructing and visualizing phylogenetic trees
and reticulation networks. Bioinformatics 17: 664-668.
Distance matrix between objects
1 2 3 4 5 6 7 8 9 10
Distance matrix between objects containing missing values ? ? ?
1 2 3 4 5 6 7 8 9 10