Complexity in Evolutionary Processes Peter Schuster Institut fr - - PowerPoint PPT Presentation

complexity in evolutionary processes
SMART_READER_LITE
LIVE PREVIEW

Complexity in Evolutionary Processes Peter Schuster Institut fr - - PowerPoint PPT Presentation

Complexity in Evolutionary Processes Peter Schuster Institut fr Theoretische Chemie, Universitt Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA 7th Vienna Central European Seminar on Particle Physics and Quantum Field


slide-1
SLIDE 1
slide-2
SLIDE 2

Complexity in Evolutionary Processes

Peter Schuster

Institut für Theoretische Chemie, Universität Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA 7th Vienna Central European Seminar on Particle Physics and Quantum Field Theory Vienna, 26.– 28.11.2010

slide-3
SLIDE 3

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-4
SLIDE 4

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-5
SLIDE 5

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-6
SLIDE 6

1 , ;

1 1 1

= = + =

− +

F F F F F

n n n

Leonardo da Pisa „Fibonacci“ ~1180 – ~1240 Thomas Robert Malthus 1766 – 1834

1, 2 , 4 , 8 ,16 , 32 , 64, 128 , ... geometric progression exponential growth

n n

f ⎟ ⎟ ⎠ ⎞ ⎜ ⎜ ⎝ ⎛ + ≈ 2 5 1 5 1

The history of exponential growth

slide-7
SLIDE 7

Three necessary conditions for Darwinian evolution are: 1. Multiplication, 2. Variation, and 3. Selection. Darwin discovered the principle of natural selection from empirical observations in nature.

slide-8
SLIDE 8

Pierre-François Verhulst, 1804-1849

( )

t r

e x C x C x t x C x x r dt dx

− + = ⎟ ⎠ ⎞ ⎜ ⎝ ⎛ − = ) ( ) ( ) ( ) ( , 1

The logistic equation, 1828

slide-9
SLIDE 9

1 .

1 1 2

= − = f f f s

Two variants with a mean progeny of ten or eleven descendants

slide-10
SLIDE 10

Numbers N1(n) and N2(n) N1(0) = 9999 , N2(0) = 1 ; s = 0.1 , 0.02 , 0.01

Selection of advantageous mutants in populations of N = 10 000 individuals

slide-11
SLIDE 11

( )

Φ r x x C Φ x r x r C x x r x C x x r x − = = ≡ − = ⇒ ⎟ ⎠ ⎞ ⎜ ⎝ ⎛ − = dt d : 1 , ) t ( dt d 1 dt d

Darwin

[ ]

( ) ( )

∑ ∑ ∑

= = =

= − = − = = = =

n i i i j j n i i i j j j n i i i i n

x f Φ Φ f x x f f x x C x x

1 1 1 2 1

; dt d 1 ; X : X , , X , X K

( )

{ }

var 2 2 dt d

2 2

≥ = > < − > < = f f f Φ

Generalization of the logistic equation to n variables yields selection

slide-12
SLIDE 12

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-13
SLIDE 13

Taq = thermus aquaticus

Accuracy of replication: Q = q1 · q2 · q3 · … · qn

The logics of DNA replication

slide-14
SLIDE 14

Point mutation

slide-15
SLIDE 15

Manfred Eigen 1927 -

∑ ∑ ∑

= = =

= = − =

n i i n i i i j i n i ji j

x x f Φ n j Φ x x W x

1 1 1

, , 2 , 1 ; dt d K

Mutation and (correct) replication as parallel chemical reactions

  • M. Eigen. 1971. Naturwissenschaften 58:465,
  • M. Eigen & P. Schuster.1977. Naturwissenschaften 64:541, 65:7 und 65:341
slide-16
SLIDE 16

∑ ∑ ∑ ∑

= = = =

= = − = − =

n i i n i i i j i i n i ji j i n i ji j

x x f Φ n j Φ x x f Q Φ x x W x

1 1 1 1

, , 2 , 1 ; dt d K

Factorization of the value matrix W separates mutation and fitness effects.

slide-17
SLIDE 17

integrating factor transformation eigenvalue problem

Solution of the mutation-selection equation

slide-18
SLIDE 18

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-19
SLIDE 19

Error rate p = 1-q

0.00 0.05 0.10

Quasispecies Uniform distribution

Stationary population or quasispecies as a function

  • f the mutation or error

rate p

slide-20
SLIDE 20

The no-mutational backflow or zeroth order approximation

slide-21
SLIDE 21

quasispecies

The error threshold in replication and mutation

slide-22
SLIDE 22

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-23
SLIDE 23

single peak landscape

„Rugged“ fitness landscapes

slide-24
SLIDE 24

Error threshold on the single peak landscape

slide-25
SLIDE 25

linear and multiplicative landscape

Smooth fitness landscapes

slide-26
SLIDE 26

The linear fitness landscape shows no error threshold

slide-27
SLIDE 27

Make things as simple as possible, but not simpler !

Albert Einstein

Albert Einstein‘s razor, precise refence is unknown.

slide-28
SLIDE 28

Sewall Wright. 1931. Evolution in Mendelian populations. Genetics 16:97-159.

  • - --. 1932. The roles of mutation, inbreeding, crossbreeding,

and selection in evolution. In: D.F.Jones, ed. Proceedings of the Sixth International Congress on Genetics, Vol.I. Brooklyn Botanical Garden. Ithaca, NY, pp. 356-366.

  • - --. 1988. Surfaces of selective value revisited.

The American Naturalist 131:115-131.

slide-29
SLIDE 29

Build-up principle of binary sequence spaces

slide-30
SLIDE 30
slide-31
SLIDE 31

single peak landscape

Rugged fitness landscapes

  • ver individual binary

sequences with n = 10

„realistic“ landscape

slide-32
SLIDE 32

Error threshold: Individual sequences n = 10, = 2, s = 491 and d = 0, 1.0, 1.875

slide-33
SLIDE 33

d = 0.100

Case I: Strong Quasispecies n = 10, f0 = 1.1, fn = 1.0, s = 919

d = 0.200

slide-34
SLIDE 34

d = 0.100

Case III: Multiple transitions n = 10, f0 = 1.1, fn = 1.0, s = 637

d = 0.195

slide-35
SLIDE 35

d = 0.199

Case III: Multiple transitions n = 10, f0 = 1.1, fn = 1.0, s = 637

d = 0.200

slide-36
SLIDE 36

Paul E. Phillipson, Peter Schuster. (2009) Modeling by nonlinear differential equations. Dissipative and conservative processes. World Scientific, Singapore, pp.9-60.

slide-37
SLIDE 37

W = G

  • F

1 , 1 largest eigenvalue and eigenvector

diagonalization of matrix W „ complicated but not complex “ fitness landscape mutation matrix „ complex “ ( complex )

sequence

  • structure

„ complex “

mutation selection

Complexity in molecular evolution

slide-38
SLIDE 38

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-39
SLIDE 39

N = 4n NS < 3n Criterion: Minimum free energy (mfe) Rules: _ ( _ ) _ {AU,CG,GC,GU,UA,UG} A symbolic notation of RNA secondary structure that is equivalent to the conventional graphs

slide-40
SLIDE 40
slide-41
SLIDE 41

The inverse folding algorithm searches for sequences that form a given RNA secondary structure under the minimum free energy criterion.

slide-42
SLIDE 42

What is neutrality ?

Selective neutrality = = several genotypes having the same fitness. Structural neutrality = = several genotypes forming molecules with the same structure.

slide-43
SLIDE 43

A mapping and its inversion

  • Gk =

( ) | ( ) =

  • 1

U

  • S

I S

k j j k

I

( ) = I S

j k Space of genotypes: = { I

S I I I I I S S S S S

1 2 3 4 N 1 2 3 4 M

, , , , ... , } ; Hamming metric Space of phenotypes: , , , , ... , } ; metric (not required) N M = {

slide-44
SLIDE 44

many genotypes

  • ne phenotype
slide-45
SLIDE 45

AUCAAUCAG GUCAAUCAC GUCAAUCAU GUCAAUCAA G U C A A U C C G G U C A A U C G G GUCAAUCUG G U C A A U G A G G U C A A U U A G GUCAAUAAG GUCAACCAG G U C A A G C A G GUCAAACAG GUCACUCAG G U C A G U C A G GUCAUUCAG GUCCAUCAG GUCGAUCAG GUCUAUCAG GUGAAUCAG GUUAAUCAG GUAAAUCAG GCCAAUCAG GGCAAUCAG GACAAUCAG UUCAAUCAG CUCAAUCAG

GUCAAUCAG

One-error neighborhood

The surrounding of GUCAAUCAG in sequence space

slide-46
SLIDE 46

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-47
SLIDE 47

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-48
SLIDE 48

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-49
SLIDE 49

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-50
SLIDE 50

GGCUAUCGUAUGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUAGACG GGCUAUCGUACGUUUACUCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGCUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCCAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUGUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAACGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCUGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCACUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGUCCCAGGCAUUGGACG GGCUAGCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCGAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGCCUACGUUGGACCCAGGCAUUGGACG

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-51
SLIDE 51

Number Mean Value Variance Std.Dev. Total Hamming Distance: 150000 11.647973 23.140715 4.810480 Nonzero Hamming Distance: 99875 16.949991 30.757651 5.545958 Degree of Neutrality: 50125 0.334167 0.006961 0.083434 Number of Structures: 1000 52.31 85.30 9.24 1 (((((.((((..(((......)))..)))).))).))............. 50125 0.334167 2 ..(((.((((..(((......)))..)))).)))................ 2856 0.019040 3 ((((((((((..(((......)))..)))))))).))............. 2799 0.018660 4 (((((.((((..((((....))))..)))).))).))............. 2417 0.016113 5 (((((.((((.((((......)))).)))).))).))............. 2265 0.015100 6 (((((.(((((.(((......))).))))).))).))............. 2233 0.014887 7 (((((..(((..(((......)))..)))..))).))............. 1442 0.009613 8 (((((.((((..((........))..)))).))).))............. 1081 0.007207 9 ((((..((((..(((......)))..))))..)).))............. 1025 0.006833 10 (((((.((((..(((......)))..)))).))))).............. 1003 0.006687 11 .((((.((((..(((......)))..)))).))))............... 963 0.006420 12 (((((.(((...(((......)))...))).))).))............. 860 0.005733 13 (((((.((((..(((......)))..)))).)).)))............. 800 0.005333 14 (((((.((((...((......))...)))).))).))............. 548 0.003653 15 (((((.((((................)))).))).))............. 362 0.002413 16 ((.((.((((..(((......)))..)))).))..))............. 337 0.002247 17 (.(((.((((..(((......)))..)))).))).).............. 241 0.001607 18 (((((.(((((((((......))))))))).))).))............. 231 0.001540 19 ((((..((((..(((......)))..))))...))))............. 225 0.001500 20 ((....((((..(((......)))..)))).....))............. 202 0.001347 G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G

Shadow – Surrounding of an RNA structure in shape space: AUGC alphabet, chain length n=50

slide-52
SLIDE 52

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-53
SLIDE 53

Stochastic phenomena in evolutionary processes

ODEs (in population genetics) describe expectation values in infinite populations. 1. Finite population size effects 2. Low numbers of individual species 3. Selective neutrality Every mutant starts from a single copy. Populations drift randomly in the space of neutral variants.

slide-54
SLIDE 54

probabilistic notion of particle numbers Xj master equation flow reactor

slide-55
SLIDE 55

Evolution of RNA molecules as a Markow process

slide-56
SLIDE 56

Evolution of RNA molecules as a Markow process

slide-57
SLIDE 57

Evolution of RNA molecules as a Markow process

slide-58
SLIDE 58

Evolution of RNA molecules as a Markow process

slide-59
SLIDE 59

RNA replication and mutation as a multitype branching process

slide-60
SLIDE 60
slide-61
SLIDE 61

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-62
SLIDE 62

Population size Ne = 10000 , s = 0

Stochastic population genetics of neutral, asexually reproducing species

slide-63
SLIDE 63

Motoo Kimura‘s population genetics of neutral evolution. Evolutionary rate at the molecular level. Nature 217: 624-626, 1955. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

slide-64
SLIDE 64

The average time of replacement of a dominant genotype in a population is the reciprocal mutation rate, 1/, and therefore independent of population size.

Fixation of mutants in neutral evolution (Motoo Kimura, 1955)

slide-65
SLIDE 65

Is the Kimura scenario correct for frequent mutations?

Fixation of mutants in neutral evolution (Motoo Kimura, 1955)

slide-66
SLIDE 66

5 . ) ( ) ( lim

2 1

= =

p x p x

p

dH = 1

a p x a p x

p p

− = =

→ →

1 ) ( lim ) ( lim

2 1

dH = 2 dH ≥3

1 ) ( lim , ) ( lim

  • r

) ( lim , 1 ) ( lim

2 1 2 1

= = = =

→ → → →

p x p x p x p x

p p p p

Random fixation in the sense of Motoo Kimura Pairs of neutral sequences in replication networks

  • P. Schuster, J. Swetina. 1988. Bull. Math. Biol. 50:635-650
slide-67
SLIDE 67

A fitness landscape including neutrality

slide-68
SLIDE 68

Neutral network: Individual sequences n = 10, = 1.1, d = 1.0

slide-69
SLIDE 69

Consensus sequence of a quasispecies of two strongly coupled sequences of Hamming distance dH(Xi,,Xj) = 1.

slide-70
SLIDE 70

Neutral network: Individual sequences n = 10, = 1.1, d = 1.0

slide-71
SLIDE 71

Consensus sequence of a quasispecies of two strongly coupled sequences of Hamming distance dH(Xi,,Xj) = 2.

slide-72
SLIDE 72

N = 7 Neutral networks with increasing : = 0.10, s = 229

Adjacency matrix

slide-73
SLIDE 73

1. Exponential growth and selection 2. Evolution as replication and mutation 3. A phase transition in evolution 4. Fitness landscapes as source of complexity 5. Molecular landscapes from biopolymers 6. The role of stochasticity 7. Neutrality and selection 8. Computer simulation of evolution

slide-74
SLIDE 74

Computer simulation using Gillespie‘s algorithm: Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Selection constraint: Population size, N = # RNA molecules, is controlled by the flow Mutation rate: p = 0.001 / site replication N N t N ± ≈ ) ( The flowreactor as a device for studies

  • f evolution in vitro and in silico
slide-75
SLIDE 75

Evolution in silico

  • W. Fontana, P. Schuster,

Science 280 (1998), 1451-1455

slide-76
SLIDE 76

Phenylalanyl-tRNA as target structure Structure of randomly chosen initial sequence

slide-77
SLIDE 77

In silico optimization in the flow reactor: Evolutionary Trajectory

slide-78
SLIDE 78

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations change the molecular structure Neutral point mutations leave the molecular structure unchanged

Neutral genotype evolution during phenotypic stasis

slide-79
SLIDE 79

Evolutionary trajectory Spreading of the population

  • n neutral networks

Drift of the population center in sequence space

slide-80
SLIDE 80
slide-81
SLIDE 81
slide-82
SLIDE 82

Coworkers

Peter Stadler, Bärbel M. Stadler, Universität Leipzig, GE Walter Fontana, Harvard Medical School, MA Ivo L.Hofacker, Christoph Flamm, Universität Wien, AT Martin Nowak, Harvard University, MA Christian Reidys, Nankai University, Tien Tsin, China Christian Forst, Los Alamos National Laboratory, NM Kurt Grünberger, Michael Kospach , Andreas Wernitznig, Stefanie Widder, Stefan Wuchty, Jan Cupal, Stefan Bernhart, Lukas Endler, Ulrike Langhammer, Rainer Machne, Ulrike Mückstein, Erich Bornberg-Bauer, Universität Wien, AT Thomas Wiehe, Ulrike Göbel, Walter Grüner, Stefan Kopp, Jaqueline Weber, Institut für Molekulare Biotechnologie, Jena, GE

Universität Wien

slide-83
SLIDE 83

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Wiener Wissenschafts-, Forschungs- und Technologiefonds (WWTF) Project No. Mat05 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Contracts No. 98-0189, 12835 (NEST) Austrian Genome Research Program – GEN-AU: Bioinformatics Network (BIN) Österreichische Akademie der Wissenschaften Siemens AG, Austria Universität Wien and the Santa Fe Institute

Universität Wien

slide-84
SLIDE 84

Thank you for your attention!

slide-85
SLIDE 85

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-86
SLIDE 86