Mathematical Modeling of Evolution Solved and Open Problems Peter - - PowerPoint PPT Presentation

mathematical modeling of evolution
SMART_READER_LITE
LIVE PREVIEW

Mathematical Modeling of Evolution Solved and Open Problems Peter - - PowerPoint PPT Presentation

Mathematical Modeling of Evolution Solved and Open Problems Peter Schuster Institut fr Theoretische Chemie, Universitt Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA Emerging Modeling Methodologies in Medicine and


slide-1
SLIDE 1
slide-2
SLIDE 2

Mathematical Modeling of Evolution

Solved and Open Problems Peter Schuster

Institut für Theoretische Chemie, Universität Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA Emerging Modeling Methodologies in Medicine and Biology Edinburgh, 20.– 24.07.2009

slide-3
SLIDE 3

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-4
SLIDE 4

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-5
SLIDE 5

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-6
SLIDE 6

Three necessary conditions for Darwinian evolution are: 1. Multiplication, 2. Variation, and 3. Selection. Biologists distinguish the genotype – the genetic information – and the phenotype – the organisms and all its properties. The genotype is unfolded in development and yields the phenotype. Variation operates on the genotype – through mutation and recombination – whereas the phenotype is the target of selection. Without human intervention natural selection is based on the number

  • f fertile progeny in forthcoming generations that is called fitness.

Question: Is Darwinian evolution optimizing fitness?

slide-7
SLIDE 7

{ }

∞ → → = = =

∑ =

t for t x n j f f t N t N t x

m j m n i i j j

1 ) ( , , 2 , 1 ; max ) ( ) ( ) (

1

K

Reproduction of organisms or replication of molecules as the basis of selection

slide-8
SLIDE 8

( )

f x f x n i f x dt dx

n j j j n i i i i i

= = = = − =

∑ ∑

= = 1 1

; 1 ; , , 2 , 1 , φ φ L

Selection equation: [Xi] = xi 0 , fi 0 mean fitness or dilution flux, φ (t), is a non-decreasing function of time,

( )

{ }

var

2 2 1

≥ = − = = ∑

=

f f f dt dx f dt d

i n i i

φ

solutions are obtained by integrating factor transformation

( ) ( ) ( ) ( )

( )

n i t f x t f x t x

j n j j i i i

, , 2 , 1 ; exp exp

1

L = ⋅ ⋅ =

=

The mean reproduction rate or mean fitness, (t), is optimized in populations.

slide-9
SLIDE 9

Gregor Mendel, 1822-1884

Mendel‘s rules of inheritance: white and red colors of flowers

slide-10
SLIDE 10

Ronald Fisher, 1890-1962, mathematician, statistician, and founder of population genetics.

Ronald Aylmer Fisher and the other scholars of population genetics, John Burdon Sanderson Haldane, and Sewall Wright, reconciled the theory

  • f natural selection with Mendelian

genetics.

Ronald A Fisher, The genetical theory of natural selection (1930). Sewall Wright, Evolution in Mendelian populations, (1931). JBS Haldane, The causes of evolution (1932).

slide-11
SLIDE 11

Sexual reproduction and recombination

slide-12
SLIDE 12

Fisher‘s selection equation: [Xi] = xi 0 , gij 0 , gij = gji

( )

( )

f x f x x g x g f x n i f x x g x dt dx

n i i i i j n n j i ij j n j ij i n i i i i n j j ij i i

= = = = = = − = − =

∑ ∑ ∑ ∑ ∑

= = = = = = 1 , 1 , 1 1 1 1

; ; 1 , , 2 , 1 ; φ φ φ L

mean fitness or dilution flux, φ (t), is a non-decreasing function of time,

( ) { }

var

2 2 1

≥ = − = = ∑

= i i n i i

f f f dt dx f dt dφ

Fisher‘s fundamental theorem of natural selection is valid for independent genes (single locus model) and autosomal symmetry, gij = gji .

slide-13
SLIDE 13

The symmetric three-allele case

slide-14
SLIDE 14

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-15
SLIDE 15

1977 1988 1971

Chemical kinetics of molecular evolution

slide-16
SLIDE 16

Accuracy of replication: Q = q1 · q2 · q3 · … · qn

Template induced nucleic acid synthesis proceeds from 5‘-end to 3‘-end

slide-17
SLIDE 17

Kinetics of RNA replication

C.K. Biebricher, M. Eigen, W.C. Gardiner, Jr. Biochemistry 22:2544-2559, 1983

slide-18
SLIDE 18

1 1 2 2 2 1

and x f dt dx x f dt dx = =

2 1 2 1 2 1 2 1 2 1 2 1

, , , , f f f f x f x = − = + = = = ξ ξ η ξ ξ ζ ξ ξ

ft ft

e t e t ) ( ) ( ) ( ) ( ζ ζ η η = =

Complementary replication as the simplest molecular mechanism of reproduction

slide-19
SLIDE 19

Replication and mutation are parallel chemical reactions.

slide-20
SLIDE 20

Chemical kinetics of replication and mutation as parallel reactions

slide-21
SLIDE 21

1 Q

1 ji =

∑ =

N i

Chemical kinetics of replication and mutation as parallel reactions

slide-22
SLIDE 22

1 Q

1 ji =

∑ =

N i

Chemical kinetics of replication and mutation as parallel reactions

slide-23
SLIDE 23

Factorization of the value matrix W separates mutation and fitness effects.

slide-24
SLIDE 24

Mutation-selection equation: [Ii] = xi 0, fi 0, Qij 0 solutions are obtained after integrating factor transformation by means

  • f an eigenvalue problem

f x f x n i x x f Q dt dx

n j j j n i i i j j n j ij i

= = = = − =

∑ ∑ ∑

= = = 1 1 1

; 1 ; , , 2 , 1 , φ φ L

( ) ( ) ( ) ( ) ( )

) ( ) ( ; , , 2 , 1 ; exp exp

1 1 1 1

∑ ∑ ∑ ∑

= = − = − =

= = ⋅ ⋅ ⋅ ⋅ =

n i i ki k n j k k n k jk k k n k ik i

x h c n i t c t c t x L l l λ λ

{ } { } { }

n j i h H L n j i L n j i Q f W

ij ij ij i

, , 2 , 1 , ; ; , , 2 , 1 , ; ; , , 2 , 1 , ;

1

L L l L = = = = = = ÷

{ }

1 , , 1 , ;

1

− = = Λ = ⋅ ⋅

n k L W L

k

L λ

slide-25
SLIDE 25

Fitness landscapes showing error thresholds

slide-26
SLIDE 26

q p p p Q

H ij H ij

d d n ij

− = − ≅

1 ; ) 1 (

Error threshold: Individual sequences n = 10, = 2 and d = 0, 1.0, 1.85

slide-27
SLIDE 27

Quasispecies

Driving virus populations through threshold

The error threshold in replication

slide-28
SLIDE 28
slide-29
SLIDE 29

Three necessary conditions for Darwinian evolution are: 1. Multiplication, 2. Variation, and 3. Selection.

Charles Darwin, 1809-1882

All three conditions are fulfilled not only by cellular organisms but also by nucleic acid molecules – DNA or RNA – in suitable cell-free experimental assays:

Darwinian evolution in the test tube

slide-30
SLIDE 30

Application of molecular evolution to problems in biotechnology

slide-31
SLIDE 31

Artificial evolution in biotechnology and pharmacology G.F. Joyce. 2004. Directed evolution of nucleic acid enzymes. Annu.Rev.Biochem. 73:791-836.

  • C. Jäckel, P. Kast, and D. Hilvert. 2008. Protein design by

directed evolution. Annu.Rev.Biophys. 37:153-173. S.J. Wrenn and P.B. Harbury. 2007. Chemical evolution as a tool for molecular discovery. Annu.Rev.Biochem. 76:331-349.

slide-32
SLIDE 32

constant level sets of

Selection of quasispecies with f1 = 1.9, f2 = 2.0, f3 = 2.1, and p = 0.01 , parametric plot on S3

slide-33
SLIDE 33

Phenomenon Optimization of fitness Unique selection outcome Selection yes yes Recombination and selection Independent genes yes no Recombination and selection Interacting genes no no Mutation and selection no yes

The Darwinian mechanism of variation and selection is a very powerful optimization heuristic.

The Darwinian mechanism and optimization of fitness

slide-34
SLIDE 34

W = G

  • F

0 , 0 largest eigenvalue and eigenvector

diagonalization of matrix W „ complicated but not complex “ fitness landscape mutation matrix „ complex “ ( complex )

sequence

  • structure

„ complex “

mutation selection

Complexity in molecular evolution

slide-35
SLIDE 35

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-36
SLIDE 36

O CH2 OH O O P O O O

N1

O CH2 OH O P O O O

N2

O CH2 OH O P O O O

N3

O CH2 OH O P O O O

N4

N A U G C

k =

, , ,

3' - end 5' - end Na Na Na Na

5'-end 3’-end

GCGGAU AUUCGC UUA AGUUGGGA G CUGAAGA AGGUC UUCGAUC A ACCA GCUC GAGC CCAGA UCUGG CUGUG CACAG

RNA structure The molecular phenotype

slide-37
SLIDE 37

N = 4n NS < 3n Criterion: Minimum free energy (mfe) Rules: _ ( _ ) _ {AU,CG,GC,GU,UA,UG} A symbolic notation of RNA secondary structure that is equivalent to the conventional graphs

slide-38
SLIDE 38

The inverse folding algorithm searches for sequences that form a given RNA structure.

slide-39
SLIDE 39

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-40
SLIDE 40

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-41
SLIDE 41

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-42
SLIDE 42

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-43
SLIDE 43

GGCUAUCGUAUGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUAGACG GGCUAUCGUACGUUUACUCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGCUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCCAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUGUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAACGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCUGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCACUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGUCCCAGGCAUUGGACG GGCUAGCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCGAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGCCUACGUUGGACCCAGGCAUUGGACG

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G

One error neighborhood – Surrounding of an RNA molecule of chain length n=50 in sequence and shape space

slide-44
SLIDE 44

Number Mean Value Variance Std.Dev. Total Hamming Distance: 150000 11.647973 23.140715 4.810480 Nonzero Hamming Distance: 99875 16.949991 30.757651 5.545958 Degree of Neutrality: 50125 0.334167 0.006961 0.083434 Number of Structures: 1000 52.31 85.30 9.24 1 (((((.((((..(((......)))..)))).))).))............. 50125 0.334167 2 ..(((.((((..(((......)))..)))).)))................ 2856 0.019040 3 ((((((((((..(((......)))..)))))))).))............. 2799 0.018660 4 (((((.((((..((((....))))..)))).))).))............. 2417 0.016113 5 (((((.((((.((((......)))).)))).))).))............. 2265 0.015100 6 (((((.(((((.(((......))).))))).))).))............. 2233 0.014887 7 (((((..(((..(((......)))..)))..))).))............. 1442 0.009613 8 (((((.((((..((........))..)))).))).))............. 1081 0.007207 9 ((((..((((..(((......)))..))))..)).))............. 1025 0.006833 10 (((((.((((..(((......)))..)))).))))).............. 1003 0.006687 11 .((((.((((..(((......)))..)))).))))............... 963 0.006420 12 (((((.(((...(((......)))...))).))).))............. 860 0.005733 13 (((((.((((..(((......)))..)))).)).)))............. 800 0.005333 14 (((((.((((...((......))...)))).))).))............. 548 0.003653 15 (((((.((((................)))).))).))............. 362 0.002413 16 ((.((.((((..(((......)))..)))).))..))............. 337 0.002247 17 (.(((.((((..(((......)))..)))).))).).............. 241 0.001607 18 (((((.(((((((((......))))))))).))).))............. 231 0.001540 19 ((((..((((..(((......)))..))))...))))............. 225 0.001500 20 ((....((((..(((......)))..)))).....))............. 202 0.001347 G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G

Shadow – Surrounding of an RNA structure in shape space: AUGC alphabet, chain length n=50

slide-45
SLIDE 45

many genotypes

  • ne phenotype
slide-46
SLIDE 46

A mapping and its inversion

  • Gk =

( ) | ( ) =

  • 1

U

  • S

I S

k j j k

I

( ) = I S

j k Space of genotypes: = { I

S I I I I I S S S S S

1 2 3 4 N 1 2 3 4 M

, , , , ... , } ; Hamming metric Space of phenotypes: , , , , ... , } ; metric (not required) N M = {

slide-47
SLIDE 47

Degree of neutrality of neutral networks and the connectivity threshold

slide-48
SLIDE 48

A multi-component neutral network formed by a rare structure: < cr

slide-49
SLIDE 49

A connected neutral network formed by a common structure: > cr

slide-50
SLIDE 50

RNA 9:1456-1463, 2003

Evidence for neutral networks and shape space covering

slide-51
SLIDE 51

Evidence for neutral networks and intersection of aptamer functions

slide-52
SLIDE 52

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-53
SLIDE 53

Motoo Kimuras population genetics of neutral evolution. Evolutionary rate at the molecular level. Nature 217: 624-626, 1955. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

slide-54
SLIDE 54

The average time of replacement of a dominant genotype in a population is the reciprocal mutation rate, 1/, and therefore independent of population size.

Is the Kimura scenario correct for frequent mutations?

slide-55
SLIDE 55
slide-56
SLIDE 56

5 . ) ( ) ( lim

2 1

= =

p x p x

p

dH = 1

a p x a p x

p p

− = =

→ →

1 ) ( lim ) ( lim

2 1

dH = 2 dH ≥3

1 ) ( lim , ) ( lim

  • r

) ( lim , 1 ) ( lim

2 1 2 1

= = = =

→ → → →

p x p x p x p x

p p p p

Random fixation in the sense of Motoo Kimura Pairs of genotypes in neutral replication networks

slide-57
SLIDE 57
slide-58
SLIDE 58

for comparison: = 0, = 1.1, d = 0

Neutral network: Individual sequences n = 10, = 1.1, d = 1.0

slide-59
SLIDE 59

Consensus sequence of a quasispecies of two strongly coupled sequences of Hamming distance dH(Xi,,Xj) = 1.

slide-60
SLIDE 60

Neutral network: Individual sequences n = 10, = 1.1, d = 1.0

slide-61
SLIDE 61

Consensus sequence of a quasispecies of two strongly coupled sequences of Hamming distance dH(Xi,,Xj) = 2.

slide-62
SLIDE 62

N = 7

Computation of sequences in the core of a neutral network

slide-63
SLIDE 63

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-64
SLIDE 64

Extension of the notion of structure

slide-65
SLIDE 65

GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG (((((.((((..(((......)))..)))).))).))............. -7.30 ..........((((((.((....((((.....))))...))...)))))) -6.70 ..........((((((.((....(((((...)))))...))...)))))) -6.60 ..(((.((((..(((......)))..)))).)))..((((...))))... -6.10 (((((.((((..(((......)))..)))).))).))..(........). -6.00 (((((.((((..((........))..)))).))).))............. -6.00 .(((.((..((((..((......))..))))..))....)))........ -6.00 GGCUAUCGUACGUUUACACAAAAGUCUACGUUGGACCCAGGCAUUGGACG (((((.((((..(((......)))..)))).))).))............. -7.30 .(((.((..((((..((......))..))))..))....)))........ -6.50 .(((.....((((..((......))..))))((....)))))........ -6.30 ..(((.((((..(((......)))..)))).)))..((((...))))... -6.10 (((((.((((..(((......)))..)))).))).))..(........). -6.00 (((((.((((..((........))..)))).))).))............. -6.00 .(((...((((((..((......))..))))...))...)))........ -6.00 GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAAUGGACG (((((.((((..(((......)))..)))).))).))............. -7.30 ..(((.((((..(((......)))..)))).)))..(((.....)))... -7.20 ..........((((((.((....((((.....))))...))...)))))) -6.70 ..........((((((.((....(((((...)))))...))...)))))) -6.60 (((((.((((..(((......)))..)))).))).))((.....)).... -6.50 (.(((.((((..(((......)))..)))).))).)(((.....)))... -6.30 .((((.((((..(((......)))..)))).))).)(((.....)))... -6.30 .....(((.((((..((......))..)))))))..(((.....)))... -6.30 (.(((.((((..(((......)))..)))).)))..(((.....))).). -6.10 .....((..((((..((......))..))))..)).(((.....)))... -6.10 ......(((.((((...((....((((.....))))...)).)))).))) -6.10 (((((.((((..(((......)))..)))).))).))..(........). -6.00 (((((.((((..((........))..)))).))).))............. -6.00 .(((.((..((((..((......))..))))..))....)))........ -6.00 ......(((.((((...((....(((((...)))))...)).)))).))) -6.00

slide-66
SLIDE 66

JN1LH

1D 1D 1D 2D 2D 2D R R R

G GGGUGGAAC GUUC GAAC GUUCCUCCC CACGAG CACGAG CACGAG

  • 28.6 kcal·mol
  • 1

G/

  • 31.8 kcal·mol
  • 1

G G G G G G C C C C C C A A U U U U G G C C U U A A G G G C C C A A A A G C G C A A G C /G

  • 28.2 kcal·mol
  • 1

G G G G G G GG CCC C C C C C U G G G G C C C C A A A A A A A A U U U U U G G C C A A

  • 28.6 kcal·mol
  • 1

3 3 3 13 13 13 23 23 23 33 33 33 44 44 44

5' 5' 3’ 3’

J.H.A. Nagel, C. Flamm, I.L. Hofacker, K. Franke, M.H. de Smit, P. Schuster, and C.W.A. Pleij. Structural parameters affecting the kinetic competition of RNA hairpin formation. Nucleic Acids Res. 34:3568-3576, 2006.

An RNA switch

slide-67
SLIDE 67

A ribozyme switch

E.A.Schultes, D.B.Bartel, Science 289 (2000), 448-452

slide-68
SLIDE 68

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis--virus (B)

slide-69
SLIDE 69

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-70
SLIDE 70

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-71
SLIDE 71

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-72
SLIDE 72

Computer simulation using Gillespie‘s algorithm: Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Selection constraint: Population size, N = # RNA molecules, is controlled by the flow Mutation rate: p = 0.001 / site replication N N t N ± ≈ ) ( The flowreactor as a device for studies

  • f evolution in vitro and in silico
slide-73
SLIDE 73

Evolution in silico

  • W. Fontana, P. Schuster,

Science 280 (1998), 1451-1455

slide-74
SLIDE 74

Phenylalanyl-tRNA as target structure Structure of randomly chosen initial sequence

slide-75
SLIDE 75

In silico optimization in the flow reactor: Evolutionary Trajectory

slide-76
SLIDE 76

Randomly chosen initial structure Phenylalanyl-tRNA as target structure

slide-77
SLIDE 77

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations change the molecular structure Neutral point mutations leave the molecular structure unchanged

Neutral genotype evolution during phenotypic stasis

slide-78
SLIDE 78
slide-79
SLIDE 79
slide-80
SLIDE 80

A sketch of optimization on neutral networks

slide-81
SLIDE 81

Is the degree of neutrality in GC space much lower than in AUGC space ? Statistics of RNA structure optimization: P. Schuster, Rep.Prog.Phys. 69:1419-1477, 2006

slide-82
SLIDE 82

Number Mean Value Variance Std.Dev. Total Hamming Distance: 150000 11.647973 23.140715 4.810480 Nonzero Hamming Distance: 99875 16.949991 30.757651 5.545958 Degree of Neutrality: 50125 0.334167 0.006961 0.083434 Number of Structures: 1000 52.31 85.30 9.24 1 (((((.((((..(((......)))..)))).))).))............. 50125 0.334167 2 ..(((.((((..(((......)))..)))).)))................ 2856 0.019040 3 ((((((((((..(((......)))..)))))))).))............. 2799 0.018660 4 (((((.((((..((((....))))..)))).))).))............. 2417 0.016113 5 (((((.((((.((((......)))).)))).))).))............. 2265 0.015100 6 (((((.(((((.(((......))).))))).))).))............. 2233 0.014887 7 (((((..(((..(((......)))..)))..))).))............. 1442 0.009613 8 (((((.((((..((........))..)))).))).))............. 1081 0.007207 9 ((((..((((..(((......)))..))))..)).))............. 1025 0.006833 10 (((((.((((..(((......)))..)))).))))).............. 1003 0.006687 11 .((((.((((..(((......)))..)))).))))............... 963 0.006420 12 (((((.(((...(((......)))...))).))).))............. 860 0.005733 13 (((((.((((..(((......)))..)))).)).)))............. 800 0.005333 14 (((((.((((...((......))...)))).))).))............. 548 0.003653 15 (((((.((((................)))).))).))............. 362 0.002413 16 ((.((.((((..(((......)))..)))).))..))............. 337 0.002247 17 (.(((.((((..(((......)))..)))).))).).............. 241 0.001607 18 (((((.(((((((((......))))))))).))).))............. 231 0.001540 19 ((((..((((..(((......)))..))))...))))............. 225 0.001500 20 ((....((((..(((......)))..)))).....))............. 202 0.001347 Number Mean Value Variance Std.Dev. Total Hamming Distance: 50000 13.673580 10.795762 3.285691 Nonzero Hamming Distance: 45738 14.872054 10.821236 3.289565 Degree of Neutrality: 4262 0.085240 0.001824 0.042708 Number of Structures: 1000 36.24 6.27 2.50 1 (((((.((((..(((......)))..)))).))).))............. 4262 0.085240 2 ((((((((((..(((......)))..)))))))).))............. 1940 0.038800 3 (((((.(((((.(((......))).))))).))).))............. 1791 0.035820 4 (((((.((((.((((......)))).)))).))).))............. 1752 0.035040 5 (((((.((((..((((....))))..)))).))).))............. 1423 0.028460 6 (.(((.((((..(((......)))..)))).))).).............. 665 0.013300 7 (((((.((((..((........))..)))).))).))............. 308 0.006160 8 (((((.((((..(((......)))..)))).))))).............. 280 0.005600 9 (((((.((((..(((......)))..)))).))).))...(((....))) 278 0.005560 10 (((((.(((...(((......)))...))).))).))............. 209 0.004180 11 (((((.((((..(((......)))..)))).))).)).(((......))) 193 0.003860 12 (((((.((((..(((......)))..)))).))).))..(((.....))) 180 0.003600 13 (((((.((((..((((.....)))).)))).))).))............. 180 0.003600 14 ..(((.((((..(((......)))..)))).)))................ 176 0.003520 15 (((((.((((.((((.....))))..)))).))).))............. 175 0.003500 16 ((((( (((( ((( ))) ))))))))) 167 0 003340

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G C C C C G G G C C G G G G G C G C G C GG GCC GG CGGC G CGGC GG G G GG G G G G C G G C C

Shadow – Surrounding of an RNA structure in shape space – AUGC and GC alphabet

slide-83
SLIDE 83

1. Darwin, Mendel, and evolutionary optimization 2. Evolution as an exercise in chemical kinetics 3. Genotype – phenoytype mappings in biopolymers 4. Neutrality in evolution 5. Extending the notion of structure 6. Simulation of molecular evolution 7. Some origins of complexity in biology

slide-84
SLIDE 84

The bacterial cell as an example for the simplest form of autonomous life Escherichia coli genome: 4 million nucleotides 4460 genes The structure of the bacterium Escherichia coli

slide-85
SLIDE 85

1 2 3 4 5 6 7 8 9 10 11 12 Regulatory protein or RNA Enzyme Metabolite Regulatory gene Structural gene

A model genome with 12 genes

Sketch of a genetic and metabolic network

slide-86
SLIDE 86

A B C D E F G H I J K L 1

Biochemical Pathways

2 3 4 5 6 7 8 9 10

The reaction network of cellular metabolism published by Boehringer-Ingelheim.

slide-87
SLIDE 87

Evolution does not design with the eyes of an engineer, evolution works like a tinkerer.

François Jacob. The Possible and the Actual. Pantheon Books, New York, 1982, and Evolutionary tinkering. Science 196 (1977), 1161-1166.

slide-88
SLIDE 88
  • D. Duboule, A.S. Wilkins. 1998.

The evolution of ‚bricolage‘. Trends in Genetics 14:54-59.

slide-89
SLIDE 89

Common ancestor

A model for the genome duplication in yeast 100 million years ago

Manolis Kellis, Bruce W. Birren, and Eric S. Lander. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617-624, 2004

slide-90
SLIDE 90

Common ancestor

A model for the genome duplication in yeast 100 million years ago

Manolis Kellis, Bruce W. Birren, and Eric S. Lander. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617-624, 2004

slide-91
SLIDE 91

Common ancestor

A model for the genome duplication in yeast 100 million years ago

Manolis Kellis, Bruce W. Birren, and Eric S. Lander. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617-624, 2004

slide-92
SLIDE 92

Common ancestor

A model for the genome duplication in yeast 100 million years ago

Manolis Kellis, Bruce W. Birren, and Eric S. Lander. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617-624, 2004

slide-93
SLIDE 93

Common ancestor

Kluyveromyces waltii

A model for the genome duplication in yeast 100 million years ago

Manolis Kellis, Bruce W. Birren, and Eric S. Lander. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617-624, 2004

slide-94
SLIDE 94

The difficulty to define the notion of „gene”. Helen Pearson, Nature 441: 399-401, 2006

slide-95
SLIDE 95

ENCODE Project Consortium. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447:799-816, 2007

ENCODE stands for ENCyclopedia Of DNA Elements.

slide-96
SLIDE 96

Coworkers

Walter Fontana, Harvard Medical School, MA Matin Nowak, Harvard University, MA Christoph Flamm, Ivo L.Hofacker, Andreas Svrček-Seiler, Universität Wien, AT Peter Stadler, Bärbel Stadler, Universität Leipzig, GE Sebastian Bonhoeffer, ETH Zürich, CH Christian Reidys, Nankai University, Tien Tsin, CN Christian Forst, Los Alamos National Laboratory, NM Kurt Grünberger, Michael Kospach, Andreas Wernitznig, Stefanie Widder, Stefan Wuchty, Universität Wien, AT Jan Cupal, Ulrike Langhammer, Ulrike Mückstein, Jörg Swetina, Universität Wien, AT Ulrike Göbel, Walter Grüner, Stefan Kopp, Jaqueline Weber, Institut für Molekulare Biotechnologie, Jena, GE

Universität Wien

slide-97
SLIDE 97

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Wiener Wissenschafts-, Forschungs- und Technologiefonds (WWTF) Project No. Mat05 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Contracts No. 98-0189, 12835 (NEST) Austrian Genome Research Program – GEN-AU Siemens AG, Austria Universität Wien and the Santa Fe Institute

Universität Wien

slide-98
SLIDE 98

Thank you for your attention !

slide-99
SLIDE 99

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-100
SLIDE 100