divergences David Penny Phylomania Nov 2013 The mathematicos - - PowerPoint PPT Presentation

divergences
SMART_READER_LITE
LIVE PREVIEW

divergences David Penny Phylomania Nov 2013 The mathematicos - - PowerPoint PPT Presentation

Loss of information at deeper divergences David Penny Phylomania Nov 2013 The mathematicos caused the problem!!! Now they should solve it! Okay, maybe we could help them, Here are some ideas Need relative not absolute - information the


slide-1
SLIDE 1

Loss of information at deeper divergences

David Penny Phylomania Nov 2013

The mathematicos caused the problem!!! Now they should solve it! Okay, maybe we could help them, Here are some ideas Need relative – not absolute - information

slide-2
SLIDE 2

the comfort zone

ML Int ML Rel Mlav

ML

MLan MP ML MLep MP MP popn classic phylogeny deep phylogeny

slide-3
SLIDE 3

can we go further back in time? Markov models - Loss of information

slide-4
SLIDE 4

damned eukaryotes!

slide-5
SLIDE 5

Calculated results, Δ ≤ ¼ + ne-qt

  • 0.2

0.2 0.4 0.6 0.8 1 1 10 100 1000 10000 0.01 0.005 0.002 0.001

slide-6
SLIDE 6

0% 20% 40% 60% 80% 100% 120% 0.1 1 10 percentage of trees correct d=0.001 d=0.100 d=0.500 d=1.000 d=2.000 d=5.000 infinite

  • 1. simulation results with covarion model
slide-7
SLIDE 7

number of internal edges correct, out of 6

neighbor joining, 9 taxa, 1000 columns, i.i.d.

0.5 1

5 8 13 20 32 50 80 125 200 320 500 790 1250 2000 millions of years (log scale) 6 5 4 3 2 1

simulation results with standard model

slide-8
SLIDE 8

1 idea, delete fast sites

If there were a mixture of a) faster evolving sites, and b) and we could identify them c) and remove them would that help go further back in time?

slide-9
SLIDE 9

deleting faster sites

slide-10
SLIDE 10

Ancestral Sequence Reconstruction

Giardia animals plants

slide-11
SLIDE 11

2, 3, testing Ancestral Sequence Reconstr- uction vaults 3-D info

slide-12
SLIDE 12

subgroups X and Y

a b c d e k l m n o

ax ay

subgroup X

subgroup Y

slide-13
SLIDE 13
  • 4. gene length vs similarity
slide-14
SLIDE 14

f1 a . . . . a . . . . . . f2 . . . . . a . . . . a . f3 . . . . . a . . . . . a f4 . . . . . a . . . . . . g1 . a a a a . a a . . . . g2 a a a a a . . a . . a . h1 . . a a a . . . a a . . h2 . . a . a . . . a a . . h3 a . . a a . . . a a . . i j 3 4 5 6 7 8 9 0 1 2

i j i j . . . a a . a a

upper bound = 17 lower bound = 12

?

13

Would weighting by incompatibilities help?

5, Weighting

slide-15
SLIDE 15

information from sequence order not used

Alignment Reordered Alignment

  • riginal sequence order

shuffled/reordered AIIFLNSALGPSPELFPIILATKVL ASAGPSPPATPLLIIIILLFFNEKV AIMFLNSALGPPTELFPVILATKVL ASAGPPTPATPLLIMVILLFFNEKV SIMFLNHTLNPTPELFPIILATETL SHTNPTPPATPLLIMIILLFFNEET TILFLNSSLGLQPEVTPTVLATKTL TSSGLQPPATPLLILTVLVTFNEKT TLLFLNSMLKPPSELFPIILATKTL TSMKPPSPATPLLLLIILLFFNEKT ALLFLNSTLNPPTELFPLILATKTL ASTNPPTPATPLLLLLILLFFNEKT AILFLNSFLNPPKEFFPIILATKIL ASFNPPKPATPLLILIILFFFNEKI

c! ways to reorder alignment shuffle by columns & by taxa

  • 6. So, could we use ‘words’
  • f 2, 3, 4, 5, … letters
  • 7. Alphabet

reduction