Phylogenetic trees I Foundations, Distance-based inference Gerhard - - PowerPoint PPT Presentation

phylogenetic trees i foundations distance based inference
SMART_READER_LITE
LIVE PREVIEW

Phylogenetic trees I Foundations, Distance-based inference Gerhard - - PowerPoint PPT Presentation

Phylogenetic trees I Foundations, Distance-based inference Gerhard Jger Words, Bones, Genes, Tools February 28, 2018 Gerhard Jger Phylogenetic trees I WBGT 1 / 27 Background Background readings for this lecture Ewens and Grant (2005),


slide-1
SLIDE 1

Phylogenetic trees I Foundations, Distance-based inference

Gerhard Jäger Words, Bones, Genes, Tools February 28, 2018

Gerhard Jäger Phylogenetic trees I WBGT 1 / 27

slide-2
SLIDE 2

Background

Background readings for this lecture

Ewens and Grant (2005), sections 15.1–15.4 Nunn (2011), chapter 2

Gerhard Jäger Phylogenetic trees I WBGT 2 / 27

slide-3
SLIDE 3

Background

Why trees?

tree diagrams have long history in linguistics and life sciences:

taxonomies (from Aristotle to Linné) tree of life (Darwin) language family trees (Schleicher)

commonalities between biological and language family trees:

tree diagram represents a historical hypothesis internal nodes represent a historical reality, not just a taxonomic category

technical term for this kind of tree: phylogenetic tree (aka phylogeny)

Gerhard Jäger Phylogenetic trees I WBGT 3 / 27

slide-4
SLIDE 4

Defjnitions

Some defjnitions

Defjnition (Tree) An unrooted tree is a connected undirected acyclic weighted graph with positives

  • weights. In other words, an unrooted tree T is a triple (V, E, l) with

V is a fjnite set, the nodes or vertices, E ⊂ V × V , the set of edges, is symmetric, E+ (E’s transitive closure) is irrefmexive, E∗ = V × V , and l : E → R+ is a function assigning each edge a non-negative length. Remark: Unrooted trees might seem to be unintuitive data structures. Later on we will see though that often, estimating the unrooted version of a phylogeny is a quite difgerent task from estimating the location of the root. So it makes sense to separate the two problems.

Gerhard Jäger Phylogenetic trees I WBGT 4 / 27

slide-5
SLIDE 5

Defjnitions

Some more defjnitions

Defjnition The degree of node v is the number of edges containing v as a component. Nodes with degree 1 are called tips or leaves. An unrooted binary tree is an unrooted tree with all nodes having degree 3 or 1.

R u s s i a n A n c i e n t G r e e k Dutch

unrooted tree

O l d N

  • r

s e Old Church Slavonic L a t i n Ancient Greek Old Church Slavonic O l d N

  • r

s e Dutch Latin R u s s i a n unrooted binary tree

Gerhard Jäger Phylogenetic trees I WBGT 5 / 27

slide-6
SLIDE 6

Defjnitions

Even more defjnitions

Defjnition (Rooted trees) A rooted tree is a pair (T , v), where T is an unrooted tree and v is a designated vertex in T (its root). A rooted binary tree is an unrooted tree where exactly one node (the root) has degree 2 and all other nodes have degrees 1 or 3.

Ancient Greek Russian Latin Dutch Old Norse Old Church Slavonic rooted non-binary tree Ancient Greek Dutch Old Church Slavonic Latin Old Norse Russian rooted binary tree

Gerhard Jäger Phylogenetic trees I WBGT 6 / 27

slide-7
SLIDE 7

Defjnitions

Distances

Defjnition (Distances) Let T = (V, E, l) be a tree. Let d : V × V → R be the unique function such that for all a, b ∈ V : If (a, b) ∈ E, then d(a, b) = l(a, b). l(a, a) = 0. d(a, b) = d(b, a). l(a, b) = minc(l(a, c) + l(c, b)) Vulgo: d(a, b) is the length of the unique path between a and b.

Gerhard Jäger Phylogenetic trees I WBGT 7 / 27

slide-8
SLIDE 8

Ultrametric trees

Ultrametric trees

Defjnition (Ultrametric distance) d is an ultrametric distance if it is a metric (d(a, a) = 0, d(a, b) = d(b, a) ≥ 0, d(a, b) + d(b, c) ≥ d(a, c) with d(a, b) ≤ max{d(a, c), d(b, c)} Defjnition (Ultrametric tree) A rooted tree is ultrametric ifg all tips have the same distance from the root.

Irish Hindi Greek Portuguese French Nepali Swedish Catalan Breton Czech Polish Danish Spanish Bengali German Lithuanian Ukrainian Icelandic English Welsh Italian Bulgarian Dutch Romanian Russian

ultrametric tree Gerhard Jäger Phylogenetic trees I WBGT 8 / 27

slide-9
SLIDE 9

Ultrametric trees

Ultrametric trees

Theorem The pairwise distances between a set of taxa are ultrametric if and only if there is an ultrametric tree with the taxa as tips representing those distances. Proof: By induction over number of taxa. Unweighted Pair Group Method Using Arithmetic Averages (UPGMA) algorithm constructs ultrametric tree from pairwise distances.

Gerhard Jäger Phylogenetic trees I WBGT 9 / 27

slide-10
SLIDE 10

Ultrametric trees

UPGMA

Cluster distances Led A and B be two non-empty sets of taxa. d(A, B) . = 1 |A| × |B|

  • x∈A,y∈B

d(x, y)

Gerhard Jäger Phylogenetic trees I WBGT 10 / 27

slide-11
SLIDE 11

Ultrametric trees

UPGMA

UPGMA algorithm Initialization:

X ← the set of taxa. V ← X E ← ∅ h(x) = 0 ∀x ∈ X

Iteration:

while |X| > 1

{i, j} ← argx∈X,y∈X,x=y min d(x, y) X ← X \ {i, j} ∪ {{i, j}} V ← V ∪ {{i, j}} E ← E ∪ {({i, j}, i), ({i, j}, j)} h({i, j}) = d(i,j)/2 l({i, j}, i) = h({i, j}) − h(i) l({i, j}, j) = h({i, j}) − h(j) d({i, j}, k) = d(i,k)+d(j,k)/2

Gerhard Jäger Phylogenetic trees I WBGT 11 / 27

slide-12
SLIDE 12

Ultrametric trees

Example

English Dutch German Italian Dutch 3.0 German 3.0 2.0 Italian 8.0 8.0 8.0 Spanish 8.0 8.0 8.0 3.4

English Dutch Spanish German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-13
SLIDE 13

Ultrametric trees

Example

English Dutch German Italian Dutch 3.0 German 3.0 2.0 Italian 8.0 8.0 8.0 Spanish 8.0 8.0 8.0 3.4

English Dutch Spanish German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-14
SLIDE 14

Ultrametric trees

Example

English a Italian a 3.0 Italian 8.0 8.0 Spanish 8.0 8.0 3.4

English Dutch Spanish

a 1

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-15
SLIDE 15

Ultrametric trees

Example

English a Italian a 3.0 Italian 8.0 8.0 Spanish 8.0 8.0 3.4

English Dutch Spanish

a 1

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-16
SLIDE 16

Ultrametric trees

Example

b Italian Italian 8.0 Spanish 8.0 3.4

English Dutch Spanish

b a 1 1.5

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-17
SLIDE 17

Ultrametric trees

Example

b Italian Italian 8.0 Spanish 8.0 3.4

English Dutch Spanish

b a 1 1.5

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-18
SLIDE 18

Ultrametric trees

Example

b c 8.0

English Dutch Spanish

c b a 1 1.5 1.7

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-19
SLIDE 19

Ultrametric trees

Example

b c 8.0

English Dutch Spanish

c b a 1 1.5 1.7

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-20
SLIDE 20

Ultrametric trees

Example

English Dutch Spanish

c b a d 1 1.5 1.7 4

German Italian

h Gerhard Jäger Phylogenetic trees I WBGT 12 / 27

slide-21
SLIDE 21

Ultrametric trees

Doing it in Paup

start the command-line version of Paup: paup> load the distance matrix example1.nex into Paup paup> execute example1.nex compute UPGMA-tree paup> upgma save tree in Nexus format (no other choice available) paup> upgma treefile=example1.paup.upgma.tre \ replace=yes brlens=yes load tree again paup> gettrees file= example1.paup.upgma.tre save tree in Newick format paup> savetrees format=newick brlen=user file = \ example1.paup.upgma.tre replace=yes quit Paup paup> q view tree with Dendroscope and Figtree

Gerhard Jäger Phylogenetic trees I WBGT 13 / 27

slide-22
SLIDE 22

Ultrametric trees

Doing it in R

load library

library(phangorn)

Gerhard Jäger Phylogenetic trees I WBGT 14 / 27

slide-23
SLIDE 23

Ultrametric trees

Doing it in R

define distance matrix

taxa <- c('English','Dutch','German','Italian','Spanish') d <- as.dist(matrix(c(0.0,3.0,3.0,8.0,8.0, 3.0,0.0,2.0,8.0,8.0, 3.0,2.0,0.0,8.0,8.0, 8.0,8.0,8.0,0.0,3.4, 8.0,8.0,8.0,3.4,0.0 ), byrow=T,nrow=5, dimnames=list(taxa,taxa)))

Gerhard Jäger Phylogenetic trees I WBGT 14 / 27

slide-24
SLIDE 24

Ultrametric trees

Doing it in R

print(d) ## English Dutch German Italian ## Dutch 3.0 ## German 3.0 2.0 ## Italian 8.0 8.0 8.0 ## Spanish 8.0 8.0 8.0 3.4

Gerhard Jäger Phylogenetic trees I WBGT 14 / 27

slide-25
SLIDE 25

Ultrametric trees

Doing it in R

perform UPGMA

upgma.tree <- upgma(d) cophenetic(upgma.tree)-as.matrix(d) ## English Dutch German Italian Spanish ## English ## Dutch ## German ## Italian ## Spanish

Gerhard Jäger Phylogenetic trees I WBGT 14 / 27

slide-26
SLIDE 26

Ultrametric trees

Doing it in R

visualize result

plot(upgma.tree,type='cladogram') edgelabels(upgma.tree$edge.length)

English Dutch German Italian Spanish 1.7 1.7 1 1 1.5 0.5 2.5 2.3

write.tree(upgma.tree,'upgmaExample.tre')

Gerhard Jäger Phylogenetic trees I WBGT 14 / 27

slide-27
SLIDE 27

Ultrametric trees

If distances are not ultra-metric

UPGMA algorithm also works with distances which are not ultra-metric in this case it will not recover the correct distances tree topology may or may not be recovered

Gothic Italian English German Dutch Spanish 2,3 0,5 1,7 1,5 1 1,5 1 1 0,5 1,7

Gerhard Jäger Phylogenetic trees I WBGT 15 / 27

slide-28
SLIDE 28

Ultrametric trees

If distances are not ultra-metric

taxa <- c('German','Dutch','English', 'Spanish','Italian','Gothic') d <- as.dist(matrix(c(0,2,3,8,8,3, 2,0,3,8,8,3, 3,3,0,8,8,3, 8,8,8,0,3.4,6, 8,8,8,3.4,0,6, 3,3,3,6,6,0), byrow=T,nrow=6, dimnames=list(taxa,taxa)))

Gerhard Jäger Phylogenetic trees I WBGT 16 / 27

slide-29
SLIDE 29

Ultrametric trees

If distances are not ultra-metric

upgma.tree <- upgma(d) cophenetic(upgma.tree)-as.matrix(d) ## German Dutch English Spanish Italian Gothic ## German 0.0 0.0 0.0

  • 0.5
  • 0.5

0.0 ## Dutch 0.0 0.0 0.0

  • 0.5
  • 0.5

0.0 ## English 0.0 0.0 0.0

  • 0.5
  • 0.5

0.0 ## Spanish

  • 0.5
  • 0.5
  • 0.5

0.0 0.0 1.5 ## Italian

  • 0.5
  • 0.5
  • 0.5

0.0 0.0 1.5 ## Gothic 0.0 0.0 0.0 1.5 1.5 0.0

Gerhard Jäger Phylogenetic trees I WBGT 16 / 27

slide-30
SLIDE 30

Ultrametric trees

If distances are not ultra-metric

plot(upgma.tree,type='cladogram') edgelabels(round(upgma.tree$edge.length,2))

German Dutch English Spanish Italian Gothic 1.7 1.7 1 1 1.5 0.5 1.5 2.25 2.05

Gerhard Jäger Phylogenetic trees I WBGT 16 / 27

slide-31
SLIDE 31

Ultrametric trees

Neighbor Joining

If distances are derived from non-ultrametric distances, we can recover the correct unrooted tree. Most commonly used method: Neighbor Joining (NJ) (Saitou and Nei, 1987) Neighbors: Two tips are neighbors if the path between them consists

  • f only one node.

Gerhard Jäger Phylogenetic trees I WBGT 17 / 27

slide-32
SLIDE 32

Ultrametric trees

Neighbor Joining

suppose distances between N vertices are given auxiliary quantity: δ(x, y) = (N − 4)d(x, y) −

  • z∈{x,y}

(d(x, z) + d(y, z)) Theorem If d is derived from a tree T and δ(x, y) is minimal, then x and y are neighbors in T . Proof: See Ewens and Grant (2005), 15.4.

Gerhard Jäger Phylogenetic trees I WBGT 18 / 27

slide-33
SLIDE 33

Ultrametric trees

Neigbor Joining

Neighbor Joining algorithm Initialization:

X ← the set of taxa V ← X E ← ∅

Iteration:

while |X| > 1

δ(x, y) = (|X| − 4)d(x, y) −

z∈{x,y}(d(x, z) + d(y, z))

{i, j} ← argx∈X,y∈X,x=y min δ(x, y) V ← V ∪ {{i, j}} E ← E ∪ {({i, j}, i), ({i, j}, j)} l({i, j}, i) = 1

2d(i, j) + 1 2(|X|−2)

  • k∈X(d(i, k) − d(j, k))

l({i, j}, j) = 1

2d(i, j) + 1 2(|X|−2)

  • k∈X(d(j, k) − d(i, k))

d({i, j}, k) = 1

2(d(i, k) + d(j, k) − d(i, j))

X ← X \ {i, j} ∪ {{i, j}}

Gerhard Jäger Phylogenetic trees I WBGT 19 / 27

slide-34
SLIDE 34

Ultrametric trees

Example

d German Dutch English Spanish Italian Dutch 2.0 English 3.0 3.0 Spanish 8.0 8.0 8.0 Italian 8.0 8.0 8.0 3.4 Gothic 3.0 3.0 3.0 6.0 6.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-35
SLIDE 35

Ultrametric trees

Example

d German Dutch English Spanish Italian Dutch 2.0 English 3.0 3.0 Spanish 8.0 8.0 8.0 Italian 8.0 8.0 8.0 3.4 Gothic 3.0 3.0 3.0 6.0 6.0 δ German Dutch English Spanish Italian Dutch −40.0 English −37.0 −37.0 Spanish −25.4 −25.4 −26.4 Italian −25.4 −25.4 −26.4 −53.2 Gothic −33.0 −33.0 −34.0 −30.4 −30.4

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-36
SLIDE 36

Ultrametric trees

Example

d German Dutch English Spanish Italian Dutch 2.0 English 3.0 3.0 Spanish 8.0 8.0 8.0 Italian 8.0 8.0 8.0 3.4 Gothic 3.0 3.0 3.0 6.0 6.0 δ German Dutch English Spanish Italian Dutch −40.0 English −37.0 −37.0 Spanish −25.4 −25.4 −26.4 Italian −25.4 −25.4 −26.4 −53.2 Gothic −33.0 −33.0 −34.0 −30.4 −30.4

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-37
SLIDE 37

Ultrametric trees

Example

d German Dutch English Gothic Dutch 2.0 English 3.0 3.0 Gothic 3.0 3.0 3.0 a 6.3 6.3 6.3 4.3

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 . 7 1 . 7 a

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-38
SLIDE 38

Ultrametric trees

Example

d German Dutch English Gothic Dutch 2.0 English 3.0 3.0 Gothic 3.0 3.0 3.0 a 6.3 6.3 6.3 4.3 δ German Dutch English Gothic Dutch −22.6 English −20.6 −20.6 Gothic −18.6 −18.6 −19.6 a −18.6 −18.6 −19.6 −23.6

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 . 7 1 . 7 a

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-39
SLIDE 39

Ultrametric trees

Example

d German Dutch English Gothic Dutch 2.0 English 3.0 3.0 Gothic 3.0 3.0 3.0 a 6.3 6.3 6.3 4.3 δ German Dutch English Gothic Dutch −22.6 English −20.6 −20.6 Gothic −18.6 −18.6 −19.6 a −18.6 −18.6 −19.6 −23.6

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 . 7 1 . 7 a

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-40
SLIDE 40

Ultrametric trees

Example

d German Dutch English Dutch 2.0 English 3.0 3.0 b 3.5 2.5 2.5

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 . 7 1 . 7 0.5 3.8 a b

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-41
SLIDE 41

Ultrametric trees

Example

d German Dutch English Dutch 2.0 English 3.0 3.0 b 3.5 2.5 2.5 δ German Dutch English Dutch −11.0 English −10.0 −10.0 b −10.0 −10.0 −11.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 . 7 1 . 7 0.5 3.8 a b

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-42
SLIDE 42

Ultrametric trees

Example

d German Dutch English Dutch 2.0 English 3.0 3.0 b 3.5 2.5 2.5 δ German Dutch English Dutch −11.0 English −10.0 −10.0 b −10.0 −10.0 −11.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 . 7 1 . 7 0.5 3.8 a b

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-43
SLIDE 43

Ultrametric trees

Example

d English b b 2.5 c 2.0 1.5

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 1 . 7 1 . 7 0.5 3.8 a b c

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-44
SLIDE 44

Ultrametric trees

Example

d English b b 2.5 c 2.0 1.5 δ English b b −6.0 c −6.0 −6.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 1 . 7 1 . 7 0.5 3.8 a b c

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-45
SLIDE 45

Ultrametric trees

Example

d English b b 2.5 c 2.0 1.5 δ English b b −6.0 c −6.0 −6.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 1 . 7 1 . 7 0.5 3.8 a b c

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-46
SLIDE 46

Ultrametric trees

Example

d c d 0.5

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 . 5 1 1 . 7 1 1 . 7 0.5 3.8 a b c d

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-47
SLIDE 47

Ultrametric trees

Example

d c d 0.5 δ c d −1.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 . 5 1 1 . 7 1 1 . 7 0.5 3.8 a b c d

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-48
SLIDE 48

Ultrametric trees

Example

d c d 0.5 δ c d −1.0

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 . 5 1 1 . 7 1 1 . 7 0.5 3.8 a b c d

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-49
SLIDE 49

Ultrametric trees

Example

D u t c h E n g l i s h German I t a l i a n S p a n i s h Gothic 1 1 . 5 1 1 . 7 1 1 . 7 0.5 3.8 . 5 a b c d

Gerhard Jäger Phylogenetic trees I WBGT 20 / 27

slide-50
SLIDE 50

Ultrametric trees

Example

This is the correct unrooted tree. There is no way to locate the root just from the distance information. Generally, NJ will recover the correct unrooted tree if the distances are derived from a tree.

Gerhard Jäger Phylogenetic trees I WBGT 21 / 27

slide-51
SLIDE 51

Ultrametric trees

Doing it in Paup

start the command-line version of Paup: paup> load the distance matrix example2.nex into Paup paup> execute example2.nex compute NJ-tree paup> nj save tree in Nexus format (no other choice available) paup> nj treefile=example2.paup.nj.tre replace=yes \ brlens=yes load tree again paup> gettrees file= example2.paup.nj.tre save tree in Newick format paup> savetrees format=newick brlen=user file = \ example2.paup.nj.tre replace=yes quit Paup paup> q view tree with Dendroscope and Figtree

Gerhard Jäger Phylogenetic trees I WBGT 22 / 27

slide-52
SLIDE 52

Ultrametric trees

Doing it in R

library(phangorn) taxa <- c('German','Dutch','English', 'Spanish','Italian','Gothic') distMatrix <- matrix(c(0,2,3,8,8,3, 2,0,3,8,8,3, 3,3,0,8,8,3, 8,8,8,0,3.4,6, 8,8,8,3.4,0,6, 3,3,3,6,6,0), byrow=T,nrow=6, dimnames=list(taxa,taxa))

Gerhard Jäger Phylogenetic trees I WBGT 23 / 27

slide-53
SLIDE 53

Ultrametric trees

Doing it in R

d <- as.dist(distMatrix) nj.tree <- nj(d) cophenetic(nj.tree)[taxa,taxa]-as.matrix(d) ## German Dutch English Spanish ## German 0.000000e+00 0.000000e+00 -4.440892e-16 ## Dutch 0.000000e+00 0.000000e+00 -4.440892e-16 ## English -4.440892e-16 -4.440892e-16 0.000000e+00 ## Spanish 0.000000e+00 0.000000e+00 0.000000e+00 ## Italian 0.000000e+00 0.000000e+00 0.000000e+00 ## Gothic 8.881784e-16 8.881784e-16 8.881784e-16 ## Gothic ## German 8.881784e-16 ## Dutch 8.881784e-16 ## English 8.881784e-16 ## Spanish 0.000000e+00 ## Italian 0.000000e+00 ## Gothic 0.000000e+00

Gerhard Jäger Phylogenetic trees I WBGT 23 / 27

slide-54
SLIDE 54

Ultrametric trees

Doing it in R

plot(nj.tree,type='unrooted',use.edge.length=T) edgelabels(nj.tree$edge.length)

German Dutch English Spanish Italian Gothic 1 1 0.5 1 3.8 1.7 1.7 0.5 1.5

Gerhard Jäger Phylogenetic trees I WBGT 23 / 27

slide-55
SLIDE 55

Ultrametric trees

Where do we go from there?

in practice, “true” distances are never known → must be estimated ideally, we want to know/estimate distances in terms of historical time in practice, the best we can hope for are estimates of the amount of change whether or not historical and evolutionary time are proportional depends in how much rate of change varies across lineages

Gerhard Jäger Phylogenetic trees I WBGT 24 / 27

slide-56
SLIDE 56

Ultrametric trees

Where do we go from there?

ultrametric trees only make sense if

rate of change is (approximately) constant (“molecular clock assumption”) all taxa exist at the same point in time

as the fjrst condition is rarely fulfjlled, this NJ is usually superior to UPGMA However: for n taxa,

branch lengths in ultrametric tree have n − 1 degrees of freedom branch lengths in unrooted (non-ultrametric) tree have 2n − 3 degrees

  • f freedom

⇒ UPGMA is less prone to overfjtting than NJ

both UPGMA and NJ are computationally effjcient — O(n3) for naive implementations

Gerhard Jäger Phylogenetic trees I WBGT 25 / 27

slide-57
SLIDE 57

Exercises

Exercises: Theory

Exercises 15.1–15.5 (pages 535/536) from Ewens and Grant (2005)

Gerhard Jäger Phylogenetic trees I WBGT 26 / 27

slide-58
SLIDE 58

Exercises

Exercises: Programming

Install the R-packages ape and phangorn. Type in and run the R-code shown in these slides. Play around with modifjed distance matrices and difgerent options of the plot.phylo command for trees. Implement UPGMA and NJ yourself.

Gerhard Jäger Phylogenetic trees I WBGT 27 / 27

slide-59
SLIDE 59

Exercises

Ewens, W. and G. Grant (2005). Statistical Methods in Bioinformatics: An Introduction. Springer, New York. Nunn, C. L. (2011). The Comparative Approach in Evolutionary Anthropology and Biology. The University of Chicago Press, Chicago. Saitou, N. and M. Nei (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular biology and evolution, 4(4):406–425.

Gerhard Jäger Phylogenetic trees I WBGT 27 / 27