Networks from Replicating Molecules Peter Schuster Institut fr - - PowerPoint PPT Presentation

networks from replicating molecules
SMART_READER_LITE
LIVE PREVIEW

Networks from Replicating Molecules Peter Schuster Institut fr - - PowerPoint PPT Presentation

Networks from Replicating Molecules Peter Schuster Institut fr Theoretische Chemie, Universitt Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA Workshop on Networks, Complexity, and Competition Bled, 02. 04.05.2008


slide-1
SLIDE 1
slide-2
SLIDE 2

Networks from Replicating Molecules

Peter Schuster

Institut für Theoretische Chemie, Universität Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA

Workshop on Networks, Complexity, and Competition Bled, 02.– 04.05.2008

slide-3
SLIDE 3

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-4
SLIDE 4

1. Replication and selection 2. Mutation, quasispecies and error thresholds 3. Sequences, structures and neutrality 4. Realistic fitness landscapes 5. Replicating networks 6. RNA structure optimization 7. Experiments with RNA

slide-5
SLIDE 5
  • 1. Replication and selection

2. Mutation, quasispecies and error thresholds 3. Sequences, structures and neutrality 4. Realistic fitness landscapes 5. Replicating networks 6. RNA structure optimization 7. Experiments with RNA

slide-6
SLIDE 6

James D. Watson, 1928-, and Francis H.C. Crick, 1916-2004 Nobel prize 1962

1953 – 2003 fifty years double helix The three-dimensional structure of a short double helical stack of B-DNA

slide-7
SLIDE 7

Base complementarity and conservation of genetic information

slide-8
SLIDE 8

‚Replication fork‘ in DNA replication The mechanism of DNA replication is ‚semi-conservative‘

slide-9
SLIDE 9

Complementary replication is the simplest copying mechanism

  • f RNA.

Complementarity is determined by Watson-Crick base pairs: GC and A=U

slide-10
SLIDE 10

Chemical kinetics of molecular evolution

  • M. Eigen, P. Schuster, `The Hypercycle´, Springer-Verlag, Berlin 1979
slide-11
SLIDE 11

Complementary replication as the simplest molecular mechanism of reproduction

slide-12
SLIDE 12

Equation for complementary replication: [Ii] = xi 0 , fi > 0 ; i=1,2 Solutions are obtained by integrating factor transformation

f x f x f x x f dt dx x x f dt dx = + = − = − =

2 2 1 1 2 1 1 2 1 2 2 1

, , φ φ φ

( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( )

2 1 2 2 1 1 2 2 2 1 1 1 1 2 1 1 2 1 2 1 1 , 2 2 , 1

, ) ( ) ( ) ( , ) ( ) ( ) ( exp ) ( exp ) ( exp exp f f f x f x f x f x f t f f f t f f f t f t f f t x = − = + = − ⋅ − − ⋅ + − ⋅ + ⋅ = γ γ γ γ γ γ ) ( exp as ) ( and ) (

2 1 1 2 2 1 2 1

→ − + → + → ft f f f t x f f f t x

slide-13
SLIDE 13

Kinetics of RNA replication

C.K. Biebricher, M. Eigen, W.C. Gardiner, Jr. Biochemistry 22:2544-2559, 1983

slide-14
SLIDE 14

Reproduction of organisms or replication of molecules as the basis of selection

slide-15
SLIDE 15

( )

{ }

var

2 2 1

≥ = − = = ∑

=

f f f dt dx f dt d

i n i i

φ

Selection equation: [Ii] = xi 0 , fi > 0 Mean fitness or dilution flux, φ (t), is a non-decreasing function of time, Solutions are obtained by integrating factor transformation

( )

f x f x n i f x dt dx

n j j j n i i i i i

= = = = − =

∑ ∑

= = 1 1

; 1 ; , , 2 , 1 , φ φ L

( ) ( ) ( ) ( )

( )

n i t f x t f x t x

j n j j i i i

, , 2 , 1 ; exp exp

1

L = ⋅ ⋅ =

=

slide-16
SLIDE 16

Selection between three species with f1 = 1, f2 = 2, and f3 = 3

slide-17
SLIDE 17

1. Replication and selection

  • 2. Mutation, quasispecies and error thresholds

3. Sequences, structures and neutrality 4. Realistic fitness landscapes 5. Replicating networks 6. RNA structure optimization 7. Experiments with RNA

slide-18
SLIDE 18

Variation of genotypes through mutation and recombination

slide-19
SLIDE 19

Variation of genotypes through mutation

slide-20
SLIDE 20

Chemical kinetics of replication and mutation as parallel reactions

slide-21
SLIDE 21

The replication-mutation equation

slide-22
SLIDE 22

Mutation-selection equation: [Ii] = xi 0, fi > 0, Qij 0 Solutions are obtained after integrating factor transformation by means

  • f an eigenvalue problem

f x f x n i x x f Q dt dx

n j j j n i i i j j n j ij i

= = Φ = = Φ − =

∑ ∑ ∑

= = = 1 1 1

; 1 ; , , 2 , 1 , L

( ) ( ) ( ) ( ) ( )

) ( ) ( ; , , 2 , 1 ; exp exp

1 1 1 1

∑ ∑ ∑ ∑

= = − = − =

= = ⋅ ⋅ ⋅ ⋅ =

n i i ki k n j k k n k jk k k n k ik i

x h c n i t c t c t x L l l λ λ

{ } { } { }

n j i h H L n j i L n j i Q f W

ij ij ij i

, , 2 , 1 , ; ; , , 2 , 1 , ; ; , , 2 , 1 , ;

1

L L l L = = = = = = ÷

{ }

1 , , 1 , ;

1

− = = Λ = ⋅ ⋅

n k L W L

k

L λ

slide-23
SLIDE 23

Variation of genotypes through point mutation

slide-24
SLIDE 24
slide-25
SLIDE 25

Formation of a quasispecies in sequence space

slide-26
SLIDE 26

Formation of a quasispecies in sequence space

slide-27
SLIDE 27

Formation of a quasispecies in sequence space

slide-28
SLIDE 28

Formation of a quasispecies in sequence space

slide-29
SLIDE 29

Uniform distribution in sequence space

slide-30
SLIDE 30

Quasispecies

Driving virus populations through threshold

The error threshold in replication

slide-31
SLIDE 31
slide-32
SLIDE 32

Quasispecies as a function of the mutation rate p f0 = = 10 Single peak fitness landscape: 1 and

2 1

= = = =

N

f f f f f K

n N i i i

N I x f x f κ σ = − =

∑ =

; sequence master ) 1 (

1

K

slide-33
SLIDE 33

1. Replication and selection 2. Mutation, quasispecies and error thresholds

  • 3. Sequences, structures and neutrality

4. Realistic fitness landscapes 5. Replicating networks 6. RNA structure optimization 7. Experiments with RNA

slide-34
SLIDE 34

N = 4n NS < 3n Criterion: Minimum free energy (mfe) Rules: _ ( _ ) _ {AU,CG,GC,GU,UA,UG} A symbolic notation of RNA secondary structure that is equivalent to the conventional graphs

slide-35
SLIDE 35

The inverse folding algorithm searches for sequences that form a given RNA structure.

slide-36
SLIDE 36
slide-37
SLIDE 37

Sequence space of binary sequences of chain length n = 5

slide-38
SLIDE 38

Sequence space of binary sequences of chain length n = 5

slide-39
SLIDE 39

Sequence space of binary sequences of chain length n = 5

slide-40
SLIDE 40

GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-41
SLIDE 41

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-42
SLIDE 42 G C U A U C G U A C G U U A C A G U C U A C G U G G A C A G G C A U U G A C G G G C U A U C G U A C G U A C A A A A G U C U A C G U U G A C A G G C A U G G A C G

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCC AAAGUCUACGUUGGACCCAGGCAUUGGACG

G

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-43
SLIDE 43

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCC AAAGUCUACGUUGGACCCAGGCAUUGGACG

G

G G C U A U C G U A C G U U U A C C

G

A AA G U C U A C G U U G G A C C C A G G C A U U G G A C G C

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-44
SLIDE 44

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGG CCCAGGCAUUGGACG

U

GGCUAUCGUACGUUUACCC AAAGUCUACGUUGGACCCAGGCAUUGGACG

G

G G C U A U C G U A C G U U U A C C

G

A AA G U C U A C G U U G G A C C C A G G C A U U G G A C G C

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGU C C C A G G C A U U G G A C G

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-45
SLIDE 45

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCA UGGACG

C

GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGG CCCAGGCAUUGGACG

U

GGCUAUCGUACGUUUACCC AAAGUCUACGUUGGACCCAGGCAUUGGACG

G

G G C U A U C G U A C G U U U A C C

G

A AA G U C U A C G U U G G A C C C A G G C A U U G G A C G C

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGU C C C A G G C A U U G G A C G

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGG A C C C AG G C A

C

U G G A C G

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-46
SLIDE 46

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCA UGGACG

C

GGCUAUCGUACGU UACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG

G

GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGG CCCAGGCAUUGGACG

U

GGCUAUCGUACGUUUACCC AAAGUCUACGUUGGACCCAGGCAUUGGACG

G

G G C U A U C G U A C G U U U A C C

G

A AA G U C U A C G U U G G A C C C A G G C A U U G G A C G C

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGU C C C A G G C A U U G G A C G

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGG A C C C AG G C A

C

U G G A C G

G G C U A U C G U A C G U

G

U A C C C A A A A G U C U A C G U U G G ACC C A G G C A U U G G A C G

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-47
SLIDE 47

GGCUAUCGUAUGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUAGACG GGCUAUCGUACGUUUACUCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGCUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCCAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUGUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAACGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCUGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCACUGGACG GGCUAUCGUACGUUUACCCAAAAGUCUACGUUGGUCCCAGGCAUUGGACG GGCUAGCGUACGUUUACCCAAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCGAAAGUCUACGUUGGACCCAGGCAUUGGACG GGCUAUCGUACGUUUACCCAAAAGCCUACGUUGGACCCAGGCAUUGGACG

G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G

One error neighborhood – Surrounding of an RNA molecule in sequence and shape space

slide-48
SLIDE 48

Number Mean Value Variance Std.Dev. Total Hamming Distance: 150000 11.647973 23.140715 4.810480 Nonzero Hamming Distance: 99875 16.949991 30.757651 5.545958 Degree of Neutrality: 50125 0.334167 0.006961 0.083434 Number of Structures: 1000 52.31 85.30 9.24 1 (((((.((((..(((......)))..)))).))).))............. 50125 0.334167 2 ..(((.((((..(((......)))..)))).)))................ 2856 0.019040 3 ((((((((((..(((......)))..)))))))).))............. 2799 0.018660 4 (((((.((((..((((....))))..)))).))).))............. 2417 0.016113 5 (((((.((((.((((......)))).)))).))).))............. 2265 0.015100 6 (((((.(((((.(((......))).))))).))).))............. 2233 0.014887 7 (((((..(((..(((......)))..)))..))).))............. 1442 0.009613 8 (((((.((((..((........))..)))).))).))............. 1081 0.007207 9 ((((..((((..(((......)))..))))..)).))............. 1025 0.006833 10 (((((.((((..(((......)))..)))).))))).............. 1003 0.006687 11 .((((.((((..(((......)))..)))).))))............... 963 0.006420 12 (((((.(((...(((......)))...))).))).))............. 860 0.005733 13 (((((.((((..(((......)))..)))).)).)))............. 800 0.005333 14 (((((.((((...((......))...)))).))).))............. 548 0.003653 15 (((((.((((................)))).))).))............. 362 0.002413 16 ((.((.((((..(((......)))..)))).))..))............. 337 0.002247 17 (.(((.((((..(((......)))..)))).))).).............. 241 0.001607 18 (((((.(((((((((......))))))))).))).))............. 231 0.001540 19 ((((..((((..(((......)))..))))...))))............. 225 0.001500 20 ((....((((..(((......)))..)))).....))............. 202 0.001347 G G C U A U C G U A C G U U U A C C C AA AAG UC UACG U UGGA CC C A GG C A U U G G A C G

Shadow – Surrounding of an RNA structure in shape space – AUGC alphabet

slide-49
SLIDE 49

1. Replication and selection 2. Mutation, quasispecies and error thresholds 3. Sequences, structures and neutrality

  • 4. Realistic fitness landscapes

5. Replicating networks 6. RNA structure optimization 7. Experiments with RNA

slide-50
SLIDE 50

Fitness landscapes showing error thresholds

slide-51
SLIDE 51

n N i i i

N I x f x f κ σ = − =

∑ =

; sequence master ) 1 (

1

K

Error threshold: Error classes and individual sequences n = 10 and = 2

slide-52
SLIDE 52

Error threshold: Individual sequences n = 10, = 2 and d = 0, 1.0, 1.85

slide-53
SLIDE 53

1. Replication and selection 2. Mutation, quasispecies and error thresholds 3. Sequences, structures and neutrality 4. Realistic fitness landscapes

  • 5. Replicating networks

6. RNA structure optimization 7. Experiments with RNA

slide-54
SLIDE 54
slide-55
SLIDE 55

Error threshold: Individual sequences n = 10, = 1.1, d = 1.0

slide-56
SLIDE 56

Error threshold: Individual sequences n = 10, = 1.1, d = 1.0

slide-57
SLIDE 57
slide-58
SLIDE 58

Error threshold: Individual sequences n = 10, = 1.1, d = 1.0

slide-59
SLIDE 59

Error threshold: Individual sequences n = 10, = 1.1, d = 1.0

slide-60
SLIDE 60
slide-61
SLIDE 61

Neutral networks with increasing

n = 10, = 1.1, d = 1.0

slide-62
SLIDE 62

N = 7 Neutral networks with increasing

slide-63
SLIDE 63

N = 24 Neutral networks with increasing

slide-64
SLIDE 64

N = 68 Neutral networks with increasing

slide-65
SLIDE 65

1. Replication and selection 2. Mutation, quasispecies and error thresholds 3. Sequences, structures and neutrality 4. Realistic fitness landscapes 5. Replicating networks

  • 6. RNA structure optimization

7. Experiments with RNA

slide-66
SLIDE 66

Stochastic simulation of evolution

  • f RNA molecules
slide-67
SLIDE 67

Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Selection constraint: Population size, N = # RNA molecules, is controlled by the flow Mutation rate: p = 0.001 / site replication N N t N ± ≈ ) ( The flowreactor as a device for studies of evolution in vitro and in silico

slide-68
SLIDE 68

Randomly chosen initial structure Phenylalanyl-tRNA as target structure

slide-69
SLIDE 69

In silico optimization in the flow reactor: Evolutionary Trajectory

slide-70
SLIDE 70

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations change the molecular structure Neutral point mutations leave the molecular structure unchanged

Neutral genotype evolution during phenotypic stasis

slide-71
SLIDE 71
slide-72
SLIDE 72
slide-73
SLIDE 73

A sketch of optimization on neutral networks

slide-74
SLIDE 74

1. Replication and selection 2. Mutation, quasispecies and error thresholds 3. Sequences, structures and neutrality 4. Realistic fitness landscapes 5. Replicating networks 6. RNA structure optimization

  • 7. Experiments with RNA
slide-75
SLIDE 75

Evolutionary design of RNA molecules

D.B.Bartel, J.W.Szostak, In vitro selection of RNA molecules that bind specific ligands. Nature 346 (1990), 818-822 C.Tuerk, L.Gold, SELEX - Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science 249 (1990), 505-510 D.P.Bartel, J.W.Szostak, Isolation of new ribozymes from a large pool of random sequences. Science 261 (1993), 1411-1418 R.D.Jenison, S.C.Gill, A.Pardi, B.Poliski, High-resolution molecular discrimination by RNA. Science 263 (1994), 1425-1429

  • Y. Wang, R.R.Rando, Specific binding of aminoglycoside antibiotics to RNA. Chemistry &

Biology 2 (1995), 281-290 Jiang, A. K. Suri, R. Fiala, D. J. Patel, Saccharide-RNA recognition in an aminoglycoside antibiotic-RNA aptamer complex. Chemistry & Biology 4 (1997), 35-50

slide-76
SLIDE 76

An example of selection of molecules with predefined properties in laboratory experiments

slide-77
SLIDE 77

The SELEX-technique for evolutionary design of strongly binding molecules called aptamers

slide-78
SLIDE 78

tobramycin

A A A A A C C C C C C C C G G G G G G G G U U U U U U

5’- 3’-

A A A A A U U U U U U C C C C C C C C G G G G G G G G

5’-

  • 3’

RNA aptamer

Formation of secondary structure of the tobramycin binding RNA aptamer with KD = 9 nM

  • L. Jiang, A. K. Suri, R. Fiala, D. J. Patel, Saccharide-RNA recognition in an aminoglycoside antibiotic-

RNA aptamer complex. Chemistry & Biology 4:35-50 (1997)

slide-79
SLIDE 79

The three-dimensional structure of the tobramycin aptamer complex

  • L. Jiang, A. K. Suri, R. Fiala, D. J. Patel,

Chemistry & Biology 4:35-50 (1997)

slide-80
SLIDE 80

Application of molecular evolution to problems in biotechnology

slide-81
SLIDE 81

A ribozyme switch

E.A.Schultes, D.B.Bartel, Science 289 (2000), 448-452

slide-82
SLIDE 82

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis--virus (B)

slide-83
SLIDE 83

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-84
SLIDE 84

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-85
SLIDE 85

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Wiener Wissenschafts-, Forschungs- und Technologiefonds (WWTF) Project No. Mat05 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Contracts No. 98-0189, 12835 (NEST) Austrian Genome Research Program – GEN-AU: Bioinformatics Network (BIN) Österreichische Akademie der Wissenschaften Siemens AG, Austria Universität Wien and the Santa Fe Institute

Universität Wien

slide-86
SLIDE 86

Coworkers

Peter Stadler, Bärbel M. Stadler, Universität Leipzig, GE Paul E. Phillipson, University of Colorado at Boulder, CO Heinz Engl, Philipp Kügler, James Lu, Stefan Müller, RICAM Linz, AT Jord Nagel, Kees Pleij, Universiteit Leiden, NL Walter Fontana, Harvard Medical School, MA Christian Reidys, Christian Forst, Los Alamos National Laboratory, NM Ulrike Göbel, Walter Grüner, Stefan Kopp, Jaqueline Weber, Institut für Molekulare Biotechnologie, Jena, GE Ivo L.Hofacker, Christoph Flamm, Andreas Svrček-Seiler, Universität Wien, AT Kurt Grünberger, Michael Kospach , Andreas Wernitznig, Stefanie Widder, Stefan Wuchty, Universität Wien, AT Jan Cupal, Stefan Bernhart, Lukas Endler, Ulrike Langhammer, Rainer Machne, Ulrike Mückstein, Hakim Tafer, Thomas Taylor, Universität Wien, AT

Universität Wien

slide-87
SLIDE 87

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-88
SLIDE 88