How Nature Circumvents Low Probabilities: The Molecular Basis of - - PowerPoint PPT Presentation

how nature circumvents low probabilities the molecular
SMART_READER_LITE
LIVE PREVIEW

How Nature Circumvents Low Probabilities: The Molecular Basis of - - PowerPoint PPT Presentation

How Nature Circumvents Low Probabilities: The Molecular Basis of Information and Complexity Peter Schuster Institut fr Theoretische Chemie Universitt Wien, Austria Nonlinearity, Fluctuations, and Complexity Brussels, 16. 19.03.2005


slide-1
SLIDE 1
slide-2
SLIDE 2

How Nature Circumvents Low Probabilities: The Molecular Basis of Information and Complexity Peter Schuster

Institut für Theoretische Chemie Universität Wien, Austria

Nonlinearity, Fluctuations, and Complexity Brussels, 16.– 19.03.2005

slide-3
SLIDE 3

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-4
SLIDE 4

Lysozyme – A small protein molecule Protein folding: Levinthal’s paradox How can Nature find the native conformation in the folding process? Evolution: Wigner’s paradox How can Nature find the optimal sequence of a protein in the evolutionary optimization process? n = 130 amino acid residues 6130 = 1.44 10101 conformations 20130 = 1.36 10169 sequences

slide-5
SLIDE 5

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-6
SLIDE 6

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-7
SLIDE 7

The golf course landscape

Levinthal’s paradox

K.A. Dill, H.S. Chan, Nature Struct. Biol. 4:10-19

slide-8
SLIDE 8

The pathway landscape

The pathway solution to Levinthal’s paradox

K.A. Dill, H.S. Chan, Nature Struct. Biol. 4:10-19

slide-9
SLIDE 9

The folding funnel

The answer to Levinthal’s paradox

K.A. Dill, H.S. Chan, Nature Struct. Biol. 4:10-19

slide-10
SLIDE 10

A more realistic folding funnel

The answer to Levinthal’s paradox

K.A. Dill, H.S. Chan, Nature Struct. Biol. 4:10-19

slide-11
SLIDE 11

An “all (or many) paths lead to Rome” situation

N … native conformation A reconstructed free energy surface for lysozyme folding:

C.M. Dobson, A. Šali, and M. Karplus, Angew.Chem.Internat.Ed. 37: 868-893, 1988

slide-12
SLIDE 12
slide-13
SLIDE 13

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-14
SLIDE 14

Earlier abstract of the ‚Origin of Species‘

Alfred Russell Wallace, 1823-1913 Charles Robert Darwin, 1809-1882

The two competitors in the formulation of evolution by natural selection

slide-15
SLIDE 15

dx / dt = x - x x

i i i j j

; Σ = 1 ; i,j f f

i j

Φ Φ fi Φ = ( = Σ x

  • i

)

j j

x =1,2,...,n [I ] = x 0 ;

i i

i =1,2,...,n ; Ii I1 I2 I1 I2 I1 I2 I i I n I i I n I n

+ + + + + +

(A) + (A) + (A) + (A) + (A) + (A) + fn fi f1 f2 I m I m I m

+

(A) + (A) + fm fm fj = max { ; j=1,2,...,n} xm(t) 1 for t

  • [A] = a = constant

Reproduction of organisms or replication of molecules as the basis of selection

slide-16
SLIDE 16

( )

{ }

var

2 2 1

≥ = − = = ∑

=

f f f dt dx f dt d

i n i i

φ

Selection equation: [Ii] = xi 0 , fi > 0 Mean fitness or dilution flux, φ (t), is a non-decreasing function of time, Solutions are obtained by integrating factor transformation

( )

f x f x n i f x dt dx

n j j j n i i i i i

= = = = − =

∑ ∑

= = 1 1

; 1 ; , , 2 , 1 , φ φ L

( ) ( ) ( ) ( )

( )

n i t f x t f x t x

j n j j i i i

, , 2 , 1 ; exp exp

1

L = ⋅ ⋅ =

∑ =

slide-17
SLIDE 17

s = ( f2-f1) / f1; f2 > f1 ; x1(0) = 1 - 1/N ; x2(0) = 1/N

200 400 600 800 1000 0.2 0.4 0.6 0.8 1 Time [Generations] Fraction of advantageous variant s = 0.1 s = 0.01 s = 0.02

Selection of advantageous mutants in populations of N = 10 000 individuals

slide-18
SLIDE 18

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-19
SLIDE 19

Ij In I2 Ii I1 I j I j I j I j I j I j

+ + + + +

(A) + fj Qj1 fj Qj2 fj Qji fj Qjj fj Qjn Q (1- )

ij

  • d(i,j)

d(i,j)

=

l

p p

p .......... Error rate per digit d(i,j) .... Hamming distance between Ii and Ij ........... Chain length of the polynucleotide l

dx / dt = x - x x

i j j i j j

Σ

; Σ = 1 ; f f x

j j j i

Φ Φ = Σ Qji Qij

Σi

= 1 [A] = a = constant [Ii] = xi 0 ;

  • i =1,2,...,n ;

Chemical kinetics of replication and mutation as parallel reactions

slide-20
SLIDE 20

Mutation-selection equation: [Ii] = xi 0, fi > 0, Qij 0 Solutions are obtained after integrating factor transformation by means of an eigenvalue problem

f x f x n i x x Q f dt dx

n j j j n i i i j n j ji j i

= = = = − =

∑ ∑ ∑

= = = 1 1 1

; 1 ; , , 2 , 1 , φ φ L

( ) ( ) ( ) ( ) ( )

) ( ) ( ; , , 2 , 1 ; exp exp

1 1 1 1

∑ ∑ ∑ ∑

= = − = − =

= = ⋅ ⋅ ⋅ ⋅ =

n i i ki k n j k k n k jk k k n k ik i

x h c n i t c t c t x L l l λ λ

{ } { } { }

n j i h H L n j i L n j i Q f W

ij ij ij i

, , 2 , 1 , ; ; , , 2 , 1 , ; ; , , 2 , 1 , ;

1

L L l L = = = = = = ÷

{ }

1 , , 1 , ;

1

− = = Λ = ⋅ ⋅

n k L W L

k

L λ

slide-21
SLIDE 21

e1 e1 e3 e3 e2 e2

l0 l1 l2

x3 x1 x2

The quasispecies on the concentration simplex S3= {

}

1 ; 3 , 2 , 1 ,

3 1

= = ≥

∑ =

i i i

x i x

slide-22
SLIDE 22

Error rate p = 1-q

0.00 0.05 0.10

Quasispecies Uniform distribution Quasispecies as a function of the replication accuracy q

slide-23
SLIDE 23

space Sequence Concentration

Master sequence Mutant cloud “Off-the-cloud” mutations

The molecular quasispecies in sequence space

slide-24
SLIDE 24

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-25
SLIDE 25
slide-26
SLIDE 26

Computer simulation of RNA optimization

Walter Fontana and Peter Schuster, Biophysical Chemistry 26:123-147, 1987 Walter Fontana, Wolfgang Schnabl, and Peter Schuster, Phys.Rev.A 40:3301-3321, 1989

slide-27
SLIDE 27
slide-28
SLIDE 28

Walter Fontana, Wolfgang Schnabl, and Peter Schuster, Phys.Rev.A 40:3301-3321, 1989

slide-29
SLIDE 29

Evolution in silico

  • W. Fontana, P. Schuster,

Science 280 (1998), 1451-1455

slide-30
SLIDE 30

Mapping from sequence space into structure space and into function

slide-31
SLIDE 31

Neutral networks are sets of sequences forming the same object in a phenotype space. The neutral network Gk is, for example, the pre- image of the structure Sk in sequence space: Gk = -1(Sk) π{j | (Ij) = Sk} The set is converted into a graph by connecting all sequences of Hamming distance one. Neutral networks of small biomolecules can be computed by exhaustive folding of complete sequence spaces, i.e. all RNA sequences of a given chain length. This number, N=4n , becomes very large with increasing length, and is prohibitive for numerical computations. Neutral networks can be modelled by random graphs in sequence

  • space. In this approach, nodes are inserted randomly into sequence

space until the size of the pre-image, i.e. the number of neutral sequences, matches the neutral network to be studied.

slide-32
SLIDE 32

λj = 27 = 0.444 ,

/

12 λk = (k)

j

| | Gk

λ κ

cr = 1 -

  • 1 (

1)

/ κ- λ λ

k cr . . . .

> λ λ

k cr . . . .

< network is connected Gk network is connected not Gk Connectivity threshold: Alphabet size : = 4

  • AUGC

G S S

k k k

= ( ) | ( ) =

  • 1

U

  • I

I

j j

  • cr

2 0.5 3 0.423 4 0.370

GC,AU GUC,AUG AUGC

Mean degree of neutrality and connectivity of neutral networks

slide-33
SLIDE 33

A connected neutral network formed by a common structure

slide-34
SLIDE 34

Giant Component

A multi-component neutral network formed by a rare structure

slide-35
SLIDE 35

Properties of RNA sequence to secondary structure mapping

  • 1. More sequences than structures
slide-36
SLIDE 36

Properties of RNA sequence to secondary structure mapping

  • 1. More sequences than structures
slide-37
SLIDE 37

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures

slide-38
SLIDE 38

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures

n = 100, stem-loop structures n = 30

RNA secondary structures and Zipf’s law

slide-39
SLIDE 39

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures

slide-40
SLIDE 40

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures

slide-41
SLIDE 41

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures 4. Neutral networks of common structures are connected

slide-42
SLIDE 42

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures 4. Neutral networks of common structures are connected

slide-43
SLIDE 43

S{ = ( ) I{ f S

{ {

ƒ = ( )

S{ f{ I{

Mutation Genotype-Phenotype Mapping Evaluation of the Phenotype

Q{

j

I1 I2 I3 I4 I5 In

Q

f1 f2 f3 f4 f5 fn

I1 I2 I3 I4 I5 I{ In+1 f1 f2 f3 f4 f5 f{ fn+1

Q

Evolutionary dynamics including molecular phenotypes

slide-44
SLIDE 44

Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Selection constraint: Population size, N = # RNA molecules, is controlled by the flow Mutation rate: p = 0.001 / site replication N N t N ± ≈ ) ( The flowreactor as a device for studies of evolution in vitro and in silico

slide-45
SLIDE 45

f0 f f1 f2 f3 f4 f6 f5 f7

Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Evaluation of RNA secondary structures yields replication rate constants

slide-46
SLIDE 46

5'-End 3'-End

70 60 50 40 30 20 10

Randomly chosen initial structure Phenylalanyl-tRNA as target structure

slide-47
SLIDE 47

In silico optimization in the flow reactor: Evolutionary trajectory Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-48
SLIDE 48

10 08 12 14 Time (arbitrary units) Average structure distance to target dS

  • 500

250 20 10

Uninterrupted presence Evolutionary trajectory Number of relay step

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations Neutral point mutations

Neutral genotype evolution during phenotypic stasis

slide-49
SLIDE 49

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-50
SLIDE 50

Evolutionary trajectory Spreading of the population through diffusion on the neutral network Velocity of the population center in sequence space

slide-51
SLIDE 51

Spread of population in sequence space during a quasistationary epoch: t = 150

slide-52
SLIDE 52

Spread of population in sequence space during a quasistationary epoch: t = 170

slide-53
SLIDE 53

Spread of population in sequence space during a quasistationary epoch: t = 200

slide-54
SLIDE 54

Spread of population in sequence space during a quasistationary epoch: t = 350

slide-55
SLIDE 55

Spread of population in sequence space during a quasistationary epoch: t = 500

slide-56
SLIDE 56

Spread of population in sequence space during a quasistationary epoch: t = 650

slide-57
SLIDE 57

Spread of population in sequence space during a quasistationary epoch: t = 820

slide-58
SLIDE 58

Spread of population in sequence space during a quasistationary epoch: t = 825

slide-59
SLIDE 59

Spread of population in sequence space during a quasistationary epoch: t = 830

slide-60
SLIDE 60

Spread of population in sequence space during a quasistationary epoch: t = 835

slide-61
SLIDE 61

Spread of population in sequence space during a quasistationary epoch: t = 840

slide-62
SLIDE 62

Spread of population in sequence space during a quasistationary epoch: t = 845

slide-63
SLIDE 63

Spread of population in sequence space during a quasistationary epoch: t = 850

slide-64
SLIDE 64

Spread of population in sequence space during a quasistationary epoch: t = 855

slide-65
SLIDE 65
slide-66
SLIDE 66

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-67
SLIDE 67
slide-68
SLIDE 68

A ribozyme switch

E.A.Schultes, D.B.Bartel, Science 289 (2000), 448-452

slide-69
SLIDE 69

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis--virus (B)

slide-70
SLIDE 70

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-71
SLIDE 71

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-72
SLIDE 72

1. Solutions to Levinthal‘s paradox 2. Selection as solution to low probabilities in evolution 3. Origin of information by mutation and selection 4. Evolution of RNA phenotypes 5. The role of neutrality in molecular evolution 6. An experiment with RNA molecules 7. Summary

slide-73
SLIDE 73

Mount Fuji

Example of a smooth landscape on Earth

slide-74
SLIDE 74

Dolomites Bryce Canyon

Examples of rugged landscapes on Earth

slide-75
SLIDE 75

Genotype Space Fitness

Start of Walk End of Walk

Evolutionary optimization in absence of neutral paths in sequence space

slide-76
SLIDE 76

Genotype Space F i t n e s s

Start of Walk End of Walk Random Drift Periods Adaptive Periods

Evolutionary optimization including neutral paths in sequence space

slide-77
SLIDE 77

Grand Canyon

Example of a landscape on Earth with ‘neutral’ ridges and plateaus

slide-78
SLIDE 78

Conformational and mutational landscapes of biomolecules as well as fitness landscapes of evolutionary biology are rugged.

Genotype Space Fitness Start of Walk End of Walk

Adaptive or non-descending walks on rugged landscapes end commonly at one of the low lying local maxima.

Genotype Space Fitness Start of Walk End of Walk

Selective neutrality in the form of neutral networks plays an active role in evolutionary optimization and enables populations to reach high local maxima or even the global optimum.

slide-79
SLIDE 79

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Project No. EU-980189 Austrian Genome Research Program – GEN-AU Siemens AG, Austria Universität Wien and the Santa Fe Institute

Universität Wien

slide-80
SLIDE 80

Coworkers

Walter Fontana, Harvard Medical School, MA Christian Reidys, Christian Forst, Los Alamos National Laboratory, NM Peter Stadler, Bärbel Stadler, Universität Leipzig, GE Jord Nagel, Kees Pleij, Universiteit Leiden, NL Ivo L.Hofacker, Christoph Flamm, Universität Wien, AT Andreas Wernitznig, Michael Kospach, Universität Wien, AT Ulrike Langhammer, Ulrike Mückstein, Stefanie Widder Jan Cupal, Kurt Grünberger, Andreas Svrcek-Seiler, Stefan Wuchty Stefan Bernhart, Lukas Endler Ulrike Göbel, Institut für Molekulare Biotechnologie, Jena, GE Walter Grüner, Stefan Kopp, Jaqueline Weber

Universität Wien

slide-81
SLIDE 81

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-82
SLIDE 82