TITLE PAGE: Is protein sequence evolution constant over time? - - PowerPoint PPT Presentation

title page is protein sequence evolution constant over
SMART_READER_LITE
LIVE PREVIEW

TITLE PAGE: Is protein sequence evolution constant over time? - - PowerPoint PPT Presentation

TITLE PAGE: Is protein sequence evolution constant over time? Carolin Kosiol & Nick Goldman goldman@ebi.ac.uk http:/www.ebi.ac.uk/goldman MIEP08 12 June 2008 Are Markov process models appropriate convert to question about Markov


slide-1
SLIDE 1

TITLE PAGE: Is protein sequence evolution constant over time?

MIEP08 12 June 2008

Carolin Kosiol & Nick Goldman

goldman@ebi.ac.uk http:/www.ebi.ac.uk/goldman

slide-2
SLIDE 2

convert to question about Markov processes

Are Markov process models appropriate for protein sequence evolution?

slide-3
SLIDE 3

evidence of non-Markov behaviour

Amino Acid Substitution Matrices From Protein Blocks

  • S. Henikoff and J.G. Henikoff

Proceedings of the National Academy of Sciences

  • f the United States of America 89:10915–10919. 1992

Evidence of non-Markov evolution

  • f amino acid sequences

Tree-based Maximal Likelihood Substitution Matrices and Hidden Markov Models

  • G. Mitchison and R. Durbin

Journal of Molecular Evolution 41:1139–1151. 1995

Amino Acid Substitution During Functionally Constrained Divergent Evolution of Protein Sequences

S.A. Benner, M.A. Cohen and G.H. Gonnet Protein Engineering 7:1323–1332. 1994

slide-4
SLIDE 4

P(t) = exp(tQ) etc …

time probability of change (as a function of time) instantaneous rate matrix

slide-5
SLIDE 5

… and in pictures

time t ∞

P(t) t Q

= = = ≠ ≠ ≠

slide-6
SLIDE 6

We can estimate P(t) from data …

It is possible to infer P(t) from sequence data…

slide-7
SLIDE 7

We can estimate Q from P(t) …

…and possible to infer Q from P(t)

T N R M K L I H G E Q C D S P F W Y V A R N D C Q E G H I L K M F P S T W Y V A

WAG

slide-8
SLIDE 8

… and in pictures

time t “Matrix space”

slide-9
SLIDE 9

… and in pictures

time t “Matrix space”

not constant, according to Henikoff x 2 (BLOSUM) and Mitchison & Durbin

slide-10
SLIDE 10

Benner et al. evidence

Benner et al. found rate matrix elements varied with observed divergence They argued that the genetic code influences the matrix strongly at early stages

  • f divergence, while physicochemical

properties are dominant at later stages

slide-11
SLIDE 11

Mitchison & Durbin evidence

Mitchison & Durbin found the accumulation

  • f amino acid replacements that could be

generated by a single nucleotide change was inconsistent with a simple Markov process

(Mitchison & Durbin) (this study)

slide-12
SLIDE 12

time travel thought experiment (1)

slide-13
SLIDE 13

time travel thought experiment (2)

slide-14
SLIDE 14

AMPs definition

So, how will we explain the evidence of non-Markov behaviour? — the aggregated Markov process (AMP):

(codon evolution) Markov process (codon evolution) Deterministic function on states (genetic code) Non-Markov process (protein evolution)

time t

slide-15
SLIDE 15
  • ur AMP model
slide-16
SLIDE 16

AMPs are not Markov

Aggregated Markov processes are not Markov:

slide-17
SLIDE 17

Benner et al. evidence explained

Benner et al. evidence: this study:

slide-18
SLIDE 18

Mitchison & Durbin evidence explained

(Mitchison & Durbin) (this study)

Mitchison & Durbin evidence:

slide-19
SLIDE 19

“Proceed With Caution”

PROCEED WITH CAUTION

Are Markov process models appropriate for protein sequence evolution?

slide-20
SLIDE 20

4 take home messages (TBC)

Things to remember from Nick’s talk:

evolution should look the same whether we study it 100MYA or 1MYA or 1YA or today or tomorrow or … published evidence of non-Markov protein evolution can be explained by a time-independent codon model-based AMP we may proceed with current approaches to sequence evolution based on Markov models! possible consequences: non-Markov evolution of: protein sequences purine/pyrimidine (R/Y) encoded DNA (nucleotide-based AMP)