tree like reticulation networks
play

Tree-like reticulation networks Andrew R Francis Centre for - PowerPoint PPT Presentation

Tree-like reticulation networks Andrew R Francis Centre for Research in Mathematics University of Western Sydney Australia Phylomania 2014. Andrew R Francis (CRM @ UWS) November 2014 1 / 13 Andrew R. Francis, Mike Steel Tree-like


  1. Tree-like reticulation networks Andrew R Francis Centre for Research in Mathematics University of Western Sydney Australia Phylomania 2014. Andrew R Francis (CRM @ UWS) November 2014 1 / 13

  2. Andrew R. Francis, Mike Steel Tree-like reticulation networks - when do tree-like distances also support reticulate evolution? Mathematical Biosciences , in press (arXiv:1405.2965). Andrew R Francis (CRM @ UWS) November 2014 2 / 13

  3. Tree metrics ◮ A phylogenetic tree with edge weights defines a metric on the leaves: ◮ add weights on the unique path. 2 u v w y u 0 3 5 10 1 v 0 6 11 6 3 1 w 0 11 2 y 0 y u v w ◮ A metric that can be placed on a tree is called a tree metric. Andrew R Francis (CRM @ UWS) November 2014 3 / 13

  4. Tree metrics ◮ A phylogenetic tree with edge weights defines a metric on the leaves: ◮ add weights on the unique path. 2 u v w y u 0 3 5 10 1 v 0 6 11 6 3 1 w 0 11 2 y 0 y u v w ◮ A metric that can be placed on a tree is called a tree metric. ◮ A metric d is a tree metric if and only if it satisfies the four point condition: ◮ for all quartets of leaves { u , v , w , y } , two out of d ( u , v ) + d ( w , y ) , d ( u , w ) + d ( v , y ) , d ( u , y ) + d ( v , w ) are equal, and are greater than or equal to the other one. Andrew R Francis (CRM @ UWS) November 2014 3 / 13

  5. Tree metrics ◮ A phylogenetic tree with edge weights defines a metric on the leaves: ◮ add weights on the unique path. 2 u v w y u 0 3 5 10 1 v 0 6 11 6 3 1 w 0 11 2 y 0 y u v w ◮ A metric that can be placed on a tree is called a tree metric. ◮ A metric d is a tree metric if and only if it satisfies the four point condition: ◮ for all quartets of leaves { u , v , w , y } , two out of d ( u , v ) + d ( w , y ) , d ( u , w ) + d ( v , y ) , d ( u , y ) + d ( v , w ) are equal, and are greater than or equal to the other one. ◮ In the above we have 3 + 11 = 14, 5 + 11 = 16 and 10 + 6 = 16. Andrew R Francis (CRM @ UWS) November 2014 3 / 13

  6. Reticulated networks Any metric may be able to be represented on a reticulated network : tree vertices reticulation vertices

  7. Reticulated networks Any metric may be able to be represented on a reticulated network : tree vertices reticulation vertices HGT vertex hybridization vertex

  8. Reticulated networks Any metric may be able to be represented on a reticulated network : tree vertices reticulation vertices HGT vertex hybridization vertex tree arc reticulation arc Andrew R Francis (CRM @ UWS) November 2014 4 / 13

  9. So we have 1. 4PC satisfied = ⇒ there is a tree that can represent the metric. 2. 4PC not satisfied = ⇒ there may be a reticulated network that can represent the metric. (Note: the 4PC statement is an if-and-only-if). ◮ What’s missing? Andrew R Francis (CRM @ UWS) November 2014 5 / 13

  10. So we have 1. 4PC satisfied = ⇒ there is a tree that can represent the metric. 2. 4PC not satisfied = ⇒ there may be a reticulated network that can represent the metric. (Note: the 4PC statement is an if-and-only-if). ◮ What’s missing? ◮ The 4PC does not rule out a tree metric also being representable on a reticulated network. ◮ One side of the if-and-only-if is an existence statement. Andrew R Francis (CRM @ UWS) November 2014 5 / 13

  11. Take home message of this talk: ◮ A tree metric can also be represented on reticulated networks using average distances. ◮ We are able to characterise this precisely in some cases, but not all yet! . . . now to clarify what is meant by average distances . . . Andrew R Francis (CRM @ UWS) November 2014 6 / 13

  12. Metrics on reticulated networks. ◮ Let T ( N ) be the set of trees “displayed” by N . E.g. w 1 w 2 + w 4 w 3 w 5 a b c b w 1 w 2 1 − α α w 3 w 5 w 4 w 2 w 1 + w 3 a b c w 5 w 4 a b c ◮ For the purposes of distance, we treat a reticulated network N as the weighted sum of the trees in T ( N ): Andrew R Francis (CRM @ UWS) November 2014 7 / 13

  13. Metrics on reticulated networks. 1. HGT. d ( a , b ) = α ( w 1 + w 5 ) + (1 − α )( w 1 + w 2 + w 3 + w 5 ) w 2 w 3 = w 1 + w 5 + (1 − α )( w 2 + w 3 ) d ( b , c ) = w 4 + w 5 + α ( w 2 + w 3 ) 1 − α α w 1 w 4 d ( a , c ) = w 1 + w 2 + w 3 + w 4 w 5 a c b 2. Hybridization. d ( a , b ) = w 1 + w 5 + w 6 w 2 d ( b , c ) = α ( w 4 + w 6 ) + (1 − α )( w 2 + w 3 + w 4 + w 5 + w 6 ) w 3 = w 4 + w 6 + (1 − α )( w 2 + w 3 + w 5 ) w 5 d ( a , c ) = α ( w 1 + w 5 + w 4 )+ w 1 w 4 (1 − α )( w 1 + w 2 + w 3 + w 4 ) α w 6 = w 1 + w 4 + α w 5 + (1 − α )( w 2 + w 3 ) a c b Andrew R Francis (CRM @ UWS) November 2014 8 / 13

  14. Results Theorem Suppose that all the trees in T ( N ) are isomorphic as unrooted phylogenetic X-trees to some tree T. Then d N is a tree metric that is represented by T. ◮ For example, if there is a single reticulation near the root, the network is treelike. ◮ We can be more precise. Andrew R Francis (CRM @ UWS) November 2014 9 / 13

  15. Tree-like hybridization networks Theorem Let X be a finite set of taxa, and suppose d is a metric on X. ◮ If d is a binary tree metric, then it is a metric on a primitive 1-hybridization network N. ◮ If N is a hybridization network, and d is a tree metric on N, then N is either a tree, or is a primitive 1-hybridization network. Andrew R Francis (CRM @ UWS) November 2014 10 / 13

  16. Tree-like hybridization networks Theorem Let X be a finite set of taxa, and suppose d is a metric on X. ◮ If d is a binary tree metric, then it is a metric on a primitive 1-hybridization network N. ◮ If N is a hybridization network, and d is a tree metric on N, then N is either a tree, or is a primitive 1-hybridization network. Theorem For each tree metric on n leaves, there are 4( n − 3) 1-hybridization networks that realise the metric. Proof: A B C D A B C D B A C D A B C D A B D C Andrew R Francis (CRM @ UWS) November 2014 10 / 13

  17. Tree-like HGT networks ◮ Let N be an HGT network with T N the underlying tree (delete all reticulation arcs). Lemma If each reticulation arc in N is between adjacent tree-arcs of T N , then d N is tree-like on T N . ◮ This means we have huge numbers of reticulated (HGT) networks whose metrics are tree-like! Andrew R Francis (CRM @ UWS) November 2014 11 / 13

  18. Tree-like HGT networks ◮ Let N be an HGT network with T N the underlying tree (delete all reticulation arcs). Lemma If each reticulation arc in N is between adjacent tree-arcs of T N , then d N is tree-like on T N . ◮ This means we have huge numbers of reticulated (HGT) networks whose metrics are tree-like! Theorem Suppose d N is a metric from an HGT network N with a single reticulation arc. Then d N is tree-like if and only if that arc is either 1. from one arc to an adjacent arc, or 2. between a root arc and one of the two children of the other root arc. The only tree that harbours a representation for d N is T N . Andrew R Francis (CRM @ UWS) November 2014 11 / 13

  19. Strange magic ◮ There are 2-reticulated HGT networks N that can be represented on T N and (for other parameter settings) on a tree that is different from T N , even when the mixing distribution treats the two reticulations independently. c a b d α ′ e ∗ ∗ α * * 1 2 3 4 ◮ Setting α ≥ 1 2 , b ≥ a and α ′ a = (1 − α ′ ) c gives 14 | 23. Andrew R Francis (CRM @ UWS) November 2014 12 / 13

  20. Further questions 1. Is the following true: For any two binary phylogenetic X -trees T 1 and T 2 , is there an HGT network N for which T N = T 1 and yet where d N is representable on T 2 (mixing distribution given by the independence model)? Andrew R Francis (CRM @ UWS) November 2014 13 / 13

  21. Further questions 1. Is the following true: For any two binary phylogenetic X -trees T 1 and T 2 , is there an HGT network N for which T N = T 1 and yet where d N is representable on T 2 (mixing distribution given by the independence model)? 2. Let ρ ( d ) denote the minimum number of hybridizations required to represent d on a hybridization or an HGT network. ◮ What conditions characterise those metrics d with ρ ( d ) = 1? ◮ What about ρ ( d ) = k for any k ≥ 1? (We know about ρ ( d ) = 0: the 4PC). Andrew R Francis (CRM @ UWS) November 2014 13 / 13

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend