RNA A Magic Molecule* Peter Schuster Institut fr Theoretische - - PowerPoint PPT Presentation

rna a magic molecule
SMART_READER_LITE
LIVE PREVIEW

RNA A Magic Molecule* Peter Schuster Institut fr Theoretische - - PowerPoint PPT Presentation

RNA A Magic Molecule* Peter Schuster Institut fr Theoretische Chemie und Molekulare Strukturbiologie der Universitt Wien 38th Winter Seminar Biophysical Chemistry, Molecular Biology, and Cybernetics of Cell Function Klosters, 15.


slide-1
SLIDE 1
slide-2
SLIDE 2

RNA – A Magic Molecule*

Peter Schuster Institut für Theoretische Chemie und Molekulare Strukturbiologie der Universität Wien 38th Winter Seminar Biophysical Chemistry, Molecular Biology, and Cybernetics of Cell Function Klosters, 15.– 28.01.2003

* Larry Gold at the conference „Frontiers of Life“, Blois (France), 1991

slide-3
SLIDE 3

RNA

RNA as scaffold for supramolecular complexes

ribosome ? ? ? ? ?

RNA as adapter molecule

GAC ... CUG ...

leu genetic code

RNA as transmitter of genetic information

DNA

...AGAGCGCCAGACUGAAGAUCUGGAGGUCCUGUGUUC...

messenger-RNA protein transcription translation RNA as

  • f genetic information

working copy

RNA as carrier of genetic information RNA RNA viruses and retroviruses as information carrier in evolution and evolutionary biotechnology in vitro

RNA as catalyst ribozyme

The RNA DNA protein world as a precursor of the current + biology

RNA as regulator of gene expression

gene silencing by small interfering RNAs

RNA is modified by epigenetic control RNA RNA editing Alternative splicing of messenger RNA is the catalytic subunit in

supramolecular complexes

Functions of RNA molecules

slide-4
SLIDE 4

1. Introduction 2. A few experiments 3. Analysing neutral networks 4. Mechanisms of neutral evolution

slide-5
SLIDE 5

1. Introduction 2. A few experiments 3. Analysing neutral networks 4. Mechanisms of neutral evolution

slide-6
SLIDE 6 O CH2 OH O O P O O O

N1

O CH2 OH O P O O O

N2

O CH2 OH O P O O O

N3

O CH2 OH O P O O O

N4

N A U G C

k =

, , ,

3' - end 5' - end Na Na Na Na

RNA

nd 3’-end

GCGGAU AUUCGC UUA AGUUGGGA G CUGAAGA AGGUC UUCGAUC A ACCA GCUC GAGC CCAGA UCUGG CUGUG CACAG 3'-end 5’-end

70 60 50 40 30 20 10

Definition of RNA structure

5'-e

slide-7
SLIDE 7

Definition and physical relevance of RNA secondary structures

RNA secondary structures are listings of Watson-Crick and GU wobble base pairs, which are free of knots and pseudokots. „Secondary structures are folding intermediates in the formation of full three-dimensional structures.“ D.Thirumalai, N.Lee, S.A.Woodson, and D.K.Klimov. Annu.Rev.Phys.Chem. 52:751-762 (2001):

slide-8
SLIDE 8

5'-End 5'-End 5'-End 3'-End 3'-End 3'-End

70 60 50 40 30 20 10

GCGGAU AUUCGC UUA AGDDGGGA M CUGAAYA AGMUC TPCGAUC A ACCA GCUC GAGC CCAGA UCUGG CUGUG CACAG

Sequence Secondary structure Tertiary structure Symbolic notation

The RNA secondary structure is a listing of GC, AU, and GU base pairs. It is understood in contrast to the full 3D-

  • r tertiary structure at the resolution of atomic coordinates. RNA secondary structures are biologically relevant.

They are, for example, conserved in evolution and they are intermediates in RNA folding.

slide-9
SLIDE 9

CGTCGTTACAATTTA GTTATGTGCGAATTC CAAATT AAAA ACAAGAG..... CGTCGTTACAATTTA GTTATGTGCGAATTC CAAATT AAAA ACAAGAG..... G A G T A C A C

Hamming distance d (S ,S ) =

H 1 2

4 d (S ,S ) = 0

H 1 1

d (S ,S ) = d (S ,S )

H H 1 2 2 1

d (S ,S ) d (S ,S ) + d (S ,S )

H H H 1 3 1 2 2 3

  • (i)

(ii) (iii)

The Hamming distance induces a metric in sequence space

slide-10
SLIDE 10

Sk I. = ( ) ψ fk f Sk = ( )

Sequence space Shape space Real numbers

Functions Secondary structures RNA sequences Mapping of RNA sequences into structures and structures into functions

slide-11
SLIDE 11

Reference for postulation and in silico verification of neutral networks

slide-12
SLIDE 12

A connected neutral network

slide-13
SLIDE 13

Reference for the definition of the intersection and the proof of the intersection theorem

slide-14
SLIDE 14

:

  • C0

C1 :

  • C0

C1

G0 G1

Structure S Structure S

1

The intersection of two compatible sets is always non empty: C0 C1 π

slide-15
SLIDE 15

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G G G G G G G G G G G G G G G G G G G C C C C C C C C U U U U U U G G G G G C C C C C C C C C C C C C U U U A A A A A A A A A A U

3’- end

Minimum free energy conformation S0 Suboptimal conformation S1

C G

A sequence at the intersection of two neutral networks is compatible with both structures

slide-16
SLIDE 16 5.10

2

2.90

8 14 15 18

2.60

17 23 19 27 22 38 45 25 36 33 39 40

3.10

43

3.40

41

3.30 7.40

5 3 7

3.00

4 10 9

3.40

6 13 12

3.10

11 21 20 16 28 29 26 30 32 42 46 44 24 35 34 37 49

2.80

31 47 48

S0 S1

Barrier tree of a sequence which switches between two conformations

5.90
slide-17
SLIDE 17

1. Introduction 2. A few experiments 3. Analysing neutral networks 4. Mechanisms of neutral evolution

slide-18
SLIDE 18

Hammerhead ribozyme – The smallest RNA based catalyst

H.W.Pley, K.M.Flaherty, D.B.McKay, Three dimensional structure of a hammerhead

  • ribozyme. Nature 372 (1994), 68-74

W.G.Scott, J.T.Finch, A.Klug, The crystal structures of an all-RNA hammerhead ribozyme: A proposed mechanism for RNA catalytic cleavage. Cell 81 (1995), 991-1002 J.E.Wedekind, D.B.McKay, Crystallographic structures of the hammerhead ribozyme: Relationship to ribozyme folding and catalysis. Annu.Rev.Biophys.Biomol.Struct. 27 (1998), 475-502 G.E.Soukup, R.R.Breaker, Design of allosteric hammerhead ribozymes activated by ligand- induced structure stabilization. Structure 7 (1999), 783-791

slide-19
SLIDE 19

Hammerhead ribozyme: The smallest known catalytically active RNA molecule

Cleavage site

OH OH OH ppp 5' 5' 3' 3'

RNA DNA

slide-20
SLIDE 20

theophylline

Allosteric effectors:

FMN = flavine mononucleotide H10 – H12 theophylline H14 Self-splicing allosteric ribozyme H13

Hammerhead ribozymes with allosteric effectors

slide-21
SLIDE 21

Nature , 323-325, 1999 402

Catalytic activity in the AUG alphabet

slide-22
SLIDE 22

O O O O H H H H H H H H H N N N N N N N N N O O H N N H O N N N N N N N

G=U (U=A) A=U U=G

O N

Base pairs in the AUG alphabet

slide-23
SLIDE 23
slide-24
SLIDE 24

Nature , 841-844, 2002 420

Catalytic activity in the DU alphabet

slide-25
SLIDE 25

2 2 6 5 6 8 C ’

1

C ’

1

5 4 4 4 2 9 7 6 3 3 2 1 1

The 2,6-diamino purine – uracil, DU, base pair

slide-26
SLIDE 26 2 2 6 5 6 8 C ’ 1 C ’ 1 5 4 4 6 2 9 7 4 3 3 2 1 1 54.4 55.7 10.72 Å 2 2 6 5 6 8 C ’ 1 C ’ 1 5 4 4 4 2 9 7 6 3 3 2 1 1 2 2 6 5 6 8 C ’ 1 C ’ 1 5 4 4 4 2 9 7 6 3 3 1 1 56.2 57.4 10.44 Å

A = U G C

  • D U
  • Three Watson-Crick type base pairs
slide-27
SLIDE 27
slide-28
SLIDE 28

A ribozyme switch

E.A.Schultes, D.B.Bartel, One sequence, two ribozymes: Implication for the emergence of new ribozyme folds. Science 289 (2000), 448-452

slide-29
SLIDE 29

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis-

  • virus (B)
slide-30
SLIDE 30

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-31
SLIDE 31

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-32
SLIDE 32

1. Introduction 2. A few experiments 3. Analysing neutral networks 4. Mechanisms of neutral evolution

slide-33
SLIDE 33

RNA Minimum Free Energy Structures

Efficient algorithms based on dynamical programming are available for computation of secondary structures for given

  • sequences. Inverse folding algorithms compute sequences

for given secondary structures.

M.Zuker and P.Stiegler. Nucleic Acids Res. 9:133-148 (1981) Vienna RNA Package: http:www.tbi.univie.ac.at (includes inverse folding, suboptimal structures, kinetic folding, etc.) I.L.Hofacker, W. Fontana, P.F.Stadler, L.S.Bonhoeffer, M.Tacker, and P. Schuster. Mh.Chem. 125:167-188 (1994)

slide-34
SLIDE 34

Statistics of RNA structures from random sequences over different nucleotide alphabets

Walter Fontana, Danielle A. M. Konings, Peter F. Stadler, Peter Schuster, Statistics of RNA secondary structures. Biopolymers 33 (1993), 1389-1404 Peter Schuster, Walter Fontana, Peter F. Stadler, Ivo L. Hofacker, From sequences to shapes and back: A case study in RNA secondary structures. Proc.Roy.Soc.London B 255 (1994), 279-284 Ivo L. Hofacker, Peter Schuster, Peter F. Stadler, Combinatorics of RNA secondary structures. Discr.Appl.Math. 89 (1998), 177-207

slide-35
SLIDE 35

O O O H H H H H H N N N N O O H N N H O N N N N N N N

G=U U=G

O H H H N N N N N

(U=A) A=U

O N

O O H H H H H N N N N N N N

(C G)

  • G C
  • The six base pairing alphabets built from natural nucleotides A, U, G, and C
slide-36
SLIDE 36

O O O H H H H H H N N N N O O H N N H O N N N N N N N

G=U U=G

O H H H N N N N N

(U=A) A=U

O N

O O H H H H H N N N N N N N

(C G)

  • G C
  • The six base pairing alphabets built from natural nucleotides A, U, G, and C
slide-37
SLIDE 37

Recursion formula for the number of acceptable RNA secondary structures

slide-38
SLIDE 38

Computed numbers of minimum free energy structures over different alphabets

  • P. Schuster, Molecular insights into evolution of phenotypes. In: J. Crutchfield & P.Schuster,

Evolutionary Dynamics. Oxford University Press, New York 2003, pp.163-215.

slide-39
SLIDE 39

5'-End 3'-End

70 60 50 40 30 20 10

RNA clover-leaf secondary structures

  • f sequences with chain length n=76

tRNAphe

slide-40
SLIDE 40

UUUAGCCAGCGCGAGUCGUGCGGACGGGGUUAUCUCUGUCGGGCUAGGGCGC GUGAGCGCGGGGCACAGUUUCUCAAGGAUGUAAGUUUUUGCCGUUUAUCUGG UUAGCGAGAGAGGAGGCUUCUAGACCCAGCUCUCUGGGUCGUUGCUGAUGCG CAUUGGUGCUAAUGAUAUUAGGGCUGUAUUCCUGUAUAGCGAUCAGUGUCCG GUAGGCCCUCUUGACAUAAGAUUUUUCCAAUGGUGGGAGAUGGCCAUUGCAG

Minimum free energy criterion Inverse folding

1st 2nd 3rd trial 4th 5th

The inverse folding algorithm searches for sequences that form a given RNA secondary structure under the minimum free energy criterion.

slide-41
SLIDE 41

Initial trial sequences Target sequence Stop sequence of an unsucessful trial Intermediate compatible sequences

Approach to the target structure in the inverse folding algorithm

slide-42
SLIDE 42 5'-End 5'-End 5'-End 5'-End 3'-End 3'-End 3'-End 3'-End 70 70 70 70 60 60 60 60 50 50 50 50 40 40 40 40 30 30 30 30 20 20 20 20 10 10 10 10

A B C D

RNA clover-leaf secondary structures of sequences with chain length n=76

slide-43
SLIDE 43 5'-End 5'-End 5'-End 5'-End 3'-End 3'-End 3'-End 3'-End 70 70 70 70 60 60 60 60 50 50 50 50 40 40 40 40 30 30 30 30 20 20 20 20 10 10 10 10

Alphabet AU AUG AUGC UGC GC

  • - -
  • - -

790 570 64 6

  • - -

4 2

  • 900

630 89 15

  • - -

24 8

  • 940

710 84 10

  • - -

30 6

  • 960

740 77 5

  • Number of successful inverse foldings out of 1000 trials

Search for clover-leef structures by means of the inverse folding algorithm

slide-44
SLIDE 44

Theory of sequence – structure mappings

  • P. Schuster, W.Fontana, P.F.Stadler, I.L.Hofacker, From sequences to shapes and back:

A case study in RNA secondary structures. Proc.Roy.Soc.London B 255 (1994), 279-284 W.Grüner, R.Giegerich, D.Strothmann, C.Reidys, I.L.Hofacker, P.Schuster, Analysis of RNA sequence structure maps by exhaustive enumeration. I. Neutral networks. Mh.Chem. 127 (1996), 355-374 W.Grüner, R.Giegerich, D.Strothmann, C.Reidys, I.L.Hofacker, P.Schuster, Analysis of RNA sequence structure maps by exhaustive enumeration. II. Structure of neutral networks and shape space covering. Mh.Chem. 127 (1996), 375-389 C.M.Reidys, P.F.Stadler, P.Schuster, Generic properties of combinatory maps. Bull.Math.Biol. 59 (1997), 339-397 I.L.Hofacker, P. Schuster, P.F.Stadler, Combinatorics of RNA secondary structures. Discr.Appl.Math. 89 (1998), 177-207 C.M.Reidys, P.F.Stadler, Combinatory landscapes. SIAM Review 44 (2002), 3-54

slide-45
SLIDE 45

Sequence-structure relations are highly complex and only the simplest case can be studied. An example is the folding of RNA sequences into RNA structures represented in course-grained form as secondary structures. The RNA sequence-structure relation is understood as a mapping from the space of RNA sequences into a space of RNA structures.

slide-46
SLIDE 46

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Phenotype space Non-negative numbers Mapping from sequence space into phenotype space and into function

slide-47
SLIDE 47

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Phenotype space Non-negative numbers

slide-48
SLIDE 48

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Phenotype space Non-negative numbers

The pre-image of the structure Sk in sequence space is the neutral network Gk

slide-49
SLIDE 49

Neutral networks are sets of sequences forming the same structure. Gk is the pre-image of the structure Sk in sequence space: Gk =

  • 1(Sk) π{

j |

(Ij) = Sk} The set is converted into a graph by connecting all sequences of Hamming distance one. Neutral networks of small RNA molecules can be computed by exhaustive folding of complete sequence spaces, i.e. all RNA sequences of a given chain length. This number, N=4n , becomes very large with increasing length, and is prohibitive for numerical computations. Neutral networks can be modelled by random graphs in sequence

  • space. In this approach, nodes are inserted randomly into sequence

space until the size of the pre-image, i.e. the number of neutral sequences, matches the neutral network to be studied.

slide-50
SLIDE 50

Random graph approach to neutral networks Sketch of sequence space Step 00

slide-51
SLIDE 51

Random graph approach to neutral networks Sketch of sequence space Step 01

slide-52
SLIDE 52

Random graph approach to neutral networks Sketch of sequence space Step 02

slide-53
SLIDE 53

Random graph approach to neutral networks Sketch of sequence space Step 03

slide-54
SLIDE 54

Random graph approach to neutral networks Sketch of sequence space Step 04

slide-55
SLIDE 55

Random graph approach to neutral networks Sketch of sequence space Step 05

slide-56
SLIDE 56

Random graph approach to neutral networks Sketch of sequence space Step 10

slide-57
SLIDE 57

Random graph approach to neutral networks Sketch of sequence space Step 15

slide-58
SLIDE 58

Random graph approach to neutral networks Sketch of sequence space Step 25

slide-59
SLIDE 59

Random graph approach to neutral networks Sketch of sequence space Step 50

slide-60
SLIDE 60

Random graph approach to neutral networks Sketch of sequence space Step 75

slide-61
SLIDE 61

Random graph approach to neutral networks Sketch of sequence space Step 100

slide-62
SLIDE 62

λj = 27 ,

/

12 λk = (k)

j

| | Gk

λ κ

cr = 1 - -1 (

1)

/ κ- λ λ

k cr . . . .

> λ λ

k cr . . . .

< network is connected Gk network is connected not Gk Connectivity threshold: Alphabet size : = 4

  • AUGC

G S S

k k k

= ( ) | ( ) =

  • 1

U

  • I

I

j j

  • cr

2 0.5 3 0.4226 4 0.3700

Mean degree of neutrality and connectivity of neutral networks

slide-63
SLIDE 63

Giant Component

A multi-component neutral network

slide-64
SLIDE 64

A connected neutral network

slide-65
SLIDE 65

5'-End 3'-End

70 60 50 40 30 20 10

Alphabet Degree of neutrality AUGC UGC GC 0.27 0.07

  • 0.26 0.07
  • 0.06 0.03
  • Computated degree of neutrality for the tRNA neutral network
slide-66
SLIDE 66

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C G G U C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C G C G G G G G G G G G G G G G G G G G G G G G G G C U C C C G C C C C C C U U U U U U G G G G G G G G G G G G G G G C C C C C C C C C C C C C C C C C C C C C U U U U U U A A A A A A A A A A A A A A A U U U

C

  • m

p a t i b l e I n c

  • m

p a t i b l e

5’-end 5’-end 5’-end 3’-end 3’-end 3’-end

Definition of compatibility of sequences and structures

slide-67
SLIDE 67

Structure

slide-68
SLIDE 68

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C G G U C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G G G G G G G G G G G G G G G C U C C C C C C U U U U G G G G G G G G G G C C C C C C C C C C C C C C U U U U A A A A A A A A A A U U

Compatible sequences Structure

5’-end 5’-end 3’-end 3’-end

slide-69
SLIDE 69

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C G C G G G G G G G G G C G C C U U G G G G G C C C C C C C U U A A A A A U

Structure Incompatible sequence

5’-end 3’-end

slide-70
SLIDE 70

G C

k k

Gk

Neutral network Compatible set Ck The compatible set Ck of a structure Sk consists of all sequences which form Sk as its minimum free energy structure (neutral network Gk) or one of its suboptimal structures.

slide-71
SLIDE 71

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G G G G G G G G G G G G G G G G G G G C C C C C C C C U U U U U U G G G G G C C C C C C C C C C C C C U U U A A A A A A A A A A U

3’- end

Minimum free energy conformation S0 Suboptimal conformation S1

C G

A sequence at the intersection of two neutral networks is compatible with both structures

slide-72
SLIDE 72

:

  • C1

C2 :

  • C1

C2

G1 G2

The intersection of two compatible sets is always non empty: C1 C2 π

slide-73
SLIDE 73

1. Introduction 2. A few experiments 3. Analysing neutral networks 4. Mechanisms of neutral evolution

slide-74
SLIDE 74

Optimization of RNA molecules in silico

W.Fontana, P.Schuster, A computer model of evolutionary optimization. Biophysical Chemistry 26 (1987), 123-147 W.Fontana, W.Schnabl, P.Schuster, Physical aspects of evolutionary optimization and

  • adaptation. Phys.Rev.A 40 (1989), 3301-3321

M.A.Huynen, W.Fontana, P.F.Stadler, Smoothness within ruggedness. The role of neutrality in adaptation. Proc.Natl.Acad.Sci.USA 93 (1996), 397-401 W.Fontana, P.Schuster, Continuity in evolution. On the nature of transitions. Science 280 (1998), 1451-1455 W.Fontana, P.Schuster, Shaping space. The possible and the attainable in RNA genotype- phenotype mapping. J.Theor.Biol. 194 (1998), 491-515 B.M.R. Stadler, P.F. Stadler, G.P. Wagner, W. Fontana, The topology of the possible: Formal spaces underlying patterns of evolutionary change. J.Theor.Biol. 213 (2001), 241-274

slide-75
SLIDE 75

Stock Solution Reaction Mixture

Fitness function: fk = / [+ dS

(k)]

  • dS

(k) = ds(Ik,I

) The flowreactor as a device for studies of evolution in vitro and in silico

slide-76
SLIDE 76

5'-End 3'-End

70 60 50 40 30 20 10

Randomly chosen initial structure Phenylalanyl-tRNA as target structure

slide-77
SLIDE 77

s p a c e Sequence Concentration

Master sequence Mutant cloud “Off-the-cloud” mutations

The molecular quasispecies in sequence space

slide-78
SLIDE 78

S{ = ( ) I{ f S

{ {

ƒ = ( )

S{ f{ I{

Mutation Genotype-Phenotype Mapping Evaluation of the Phenotype

Q{

j

I1 I2 I3 I4 I5 In

Q

f1 f2 f3 f4 f5 fn

I1 I2 I3 I4 I5 I{ In+1 f1 f2 f3 f4 f5 f{ fn+1

Q

Evolutionary dynamics including molecular phenotypes

slide-79
SLIDE 79

In silico optimization in the flow reactor: Trajectory (biologists‘ view) Time (arbitrary units) A v e r a g e d i s t a n c e f r

  • m

i n i t i a l s t r u c t u r e 5

  • d
  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-80
SLIDE 80

In silico optimization in the flow reactor: Trajectory (physicists‘ view) Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-81
SLIDE 81

44

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Endconformation of optimization

slide-82
SLIDE 82

44 43

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of the last step 43 44

slide-83
SLIDE 83

44 43 42

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of last-but-one step 42 43 ( 44)

slide-84
SLIDE 84

44 43 42 41

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of step 41 42 ( 43 44)

slide-85
SLIDE 85

44 43 42 41 40

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of step 40 41 ( 42 43 44)

slide-86
SLIDE 86

44 43 42 41 40 39 Evolutionary process Reconstruction

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of the relay series

slide-87
SLIDE 87

Transition inducing point mutations Neutral point mutations

Change in RNA sequences during the final five relay steps 39 44

slide-88
SLIDE 88

In silico optimization in the flow reactor: Trajectory and relay steps Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

Relay steps

slide-89
SLIDE 89

In silico optimization in the flow reactor: Uninterrupted presence Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory Uninterrupted presence

Relay steps

slide-90
SLIDE 90

10 08 12 14 Time (arbitrary units) Average structure distance to target dS

  • 500

250 20 10

Uninterrupted presence Evolutionary trajectory Number of relay step

Transition inducing point mutations Neutral point mutations

Neutral genotype evolution during phenotypic stasis

slide-91
SLIDE 91

18 19 20 21 26 28 29 31

Time (arbitrary units)

750 1000 1250

Average structure distance to target dS

  • 30

20 10

Uninterrupted presence Evolutionary trajectory 35 30 25 20 Number of relay step

A random sequence of minor or continuous transitions in the relay series

slide-92
SLIDE 92

18 19 20 21 26 28 29 31

A random sequence of minor or continuous transitions in the relay series

slide-93
SLIDE 93

Elongation of Stacks Shortening of Stacks Opening of Constrained Stacks

Multi- loop

Minor or continuous transitions: Occur frequently on single point mutations

slide-94
SLIDE 94

In silico optimization in the flow reactor: Uninterrupted presence Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory Uninterrupted presence

Relay steps

slide-95
SLIDE 95

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

38 37 36 Main transition leading to clover leaf

Reconstruction of a main transitions 36 37 ( 38)

slide-96
SLIDE 96

In silico optimization in the flow reactor: Main transitions Main transitions Relay steps Time (arbitrary units) Average structure distance to target d S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-97
SLIDE 97

Shift Roll-Over Flip Double Flip

a a b a a b α α α α β β

Closing of Constrained Stacks

Multi- loop

Main or discontinuous transitions: Structural innovations, occur rarely on single point mutations

slide-98
SLIDE 98

In silico optimization in the flow reactor Time (arbitrary units) Average structure distance to target d S

500 750 1000 1250 250 50 40 30 20 10

Relay steps Main transitions

Uninterrupted presence Evolutionary trajectory

slide-99
SLIDE 99

Statistics of evolutionary trajectories

Population size N Number of replications < n >

rep

Number of transitions < n >

tr

Number of main transitions < n >

dtr

The number of main transitions or evolutionary innovations is constant.

slide-100
SLIDE 100

00 09 31 44

Three important steps in the formation of the tRNA clover leaf from a randomly chosen initial structure corresponding to three main transitions.

slide-101
SLIDE 101

Stable tRNA clover leaf structures built from binary, GC-only, sequences exist. The corresponding sequences are readily found through inverse folding. Optimization by mutation and selection in the flow reactor has so far always been unsuccessful.

5'-End 3'-End

70 60 50 40 30 20 10

The neutral network of the tRNA clover leaf in GC sequence space is not connected, whereas to the corresponding neutral network in AUGC sequence space is very close to the critical connectivity threshold,

cr . Here, both inverse folding

and optimization in the flow reactor are successful.

The success of optimization depends on the connectivity of neutral networks.

slide-102
SLIDE 102

Main results of computer simulations of molecular evolution

  • No trajectory was reproducible in detail. Sequences of target structures were always
  • different. Nevertheless solutions of the same quality are almost always achieved.
  • Transitions between molecular phenotypes represented by RNA structures can be

classified with respect to the induced structural changes. Highly probable minor transitions are opposed by main transitions with low probability of occurrence.

  • Main transitions represent important innovations in the course of evolution.
  • The number of minor transitions decreases with increasing population size.
  • The number of main transitions or evolutionary innovations is approximately

constant for given start and stop structures.

  • Not all known structures are accessible through evolution in the flow reactor. An

example is the tRNA clover leaf for GC-only sequences.

slide-103
SLIDE 103

Coworkers

Walter Fontana, Santa Fe Institute, NM Christian Reidys, Christian Forst, Los Alamos National Laboratory, NM Peter Stadler, Universität Leipzig, GE Ivo L.Hofacker, Christoph Flamm, Universität Wien, AT Bärbel Stadler, Andreas Wernitznig, Universität Wien, AT Michael Kospach, Ulrike Langhammer, Ulrike Mückstein, Stefanie Widder Jan Cupal, Kurt Grünberger, Andreas Svrček-Seiler, Stefan Wuchty Ulrike Göbel, Institut für Molekulare Biotechnologie, Jena, GE Walter Grüner, Stefan Kopp, Jaqueline Weber