The Indispensibility of Neutrality in Molecular Evolution Peter - - PowerPoint PPT Presentation

the indispensibility of neutrality in molecular evolution
SMART_READER_LITE
LIVE PREVIEW

The Indispensibility of Neutrality in Molecular Evolution Peter - - PowerPoint PPT Presentation

The Indispensibility of Neutrality in Molecular Evolution Peter Schuster Institut fr Theoretische Chemie und Molekulare Strukturbiologie der Universitt Wien Max-Planck-Institut fr Physik Komplexer Systeme Dresden, 05.07.2004 Web-Page for


slide-1
SLIDE 1
slide-2
SLIDE 2

The Indispensibility of Neutrality in Molecular Evolution Peter Schuster Institut für Theoretische Chemie und Molekulare Strukturbiologie der Universität Wien Max-Planck-Institut für Physik Komplexer Systeme Dresden, 05.07.2004

slide-3
SLIDE 3

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-4
SLIDE 4

dx / dt = x - x x

i i i j j

; Σ = 1 ; i,j f f

i j

Φ Φ fi Φ = ( = Σ x

  • i

)

j j

x =1,2,...,n [I ] = x 0 ;

i i

i =1,2,...,n ; Ii I1 I2 I1 I2 I1 I2 I i I n I i I n I n

+ + + + + +

(A) + (A) + (A) + (A) + (A) + (A) + fn fi f1 f2 I m I m I m

+

(A) + (A) + fm fm fj = max { ; j=1,2,...,n} xm(t) 1 for t

  • [A] = a = constant

Reproduction of organisms or replication of molecules as the basis of selection

slide-5
SLIDE 5

James D. Watson and Francis H.C. Crick Nobel prize 1962 1953 – 2003 fifty years double helix Base pairs: A = T and G C Stacking of base pairs in nucleic acid double helices (B-DNA)

slide-6
SLIDE 6

G G G C C C C C C G G G C C C G G G C C C G G G G G G C C C

Plus Strand Plus Strand Plus Strand Minus Strand Minus Strand Minus Strand

3' 3' 3' 5' 5' 5' 5' 5' 5' 3' 3' 3'

+

Direct replication of DNA is a higly complex copying mechanism involving more than ten different protein molecules. Complementarity is determined by Watson-Crick base pairs: A=T and GC

slide-7
SLIDE 7

( )

{ }

var

2 2 1

≥ = − = = ∑

=

f f f dt dx f dt d

i n i i

φ

Selection equation: [Ii] = xi 0 , fi > 0 Mean fitness or dilution flux, φ (t), is a non-decreasing function of time, Solutions are obtained by integrating factor transformation

( )

f x f x n i f x dt dx

n j j j n i i i i i

= = = = − =

∑ ∑

= = 1 1

; 1 ; , , 2 , 1 , φ φ L

( ) ( ) ( ) ( )

( )

n i t f x t f x t x

j n j j i i i

, , 2 , 1 ; exp exp

1

L = ⋅ ⋅ =

∑ =

slide-8
SLIDE 8

s = ( f2-f1) / f1; f2 > f1 ; x1(0) = 1 - 1/N ; x2(0) = 1/N

200 400 600 800 1000 0.2 0.4 0.6 0.8 1 Time [Generations] Fraction of advantageous variant s = 0.1 s = 0.01 s = 0.02

Selection of advantageous mutants in populations of N = 10 000 individuals

slide-9
SLIDE 9

„...Variations neither useful not injurious would not be affected by natural selection, and would be left either a fluctuating element, as perhaps we see in certain polymorphic species, or would ultimately become fixed, owing to the nature of the organism and the nature of the conditions. ...“

Charles Darwin, Origin of species (1859)

slide-10
SLIDE 10

Motoo Kimura’s population genetics of neutral evolution. Evolutionary rate at the molecular level. Nature 217: 624-626, 1955. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

slide-11
SLIDE 11

Motoo Kimura. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

slide-12
SLIDE 12

CGTCGTTACAATTTA GTTATGTGCGAATTC CAAATT AAAA ACAAGAG..... CGTCGTTACAATTTA GTTATGTGCGAATTC CAAATT AAAA ACAAGAG..... G A G T A C A C

Hamming distance d (I ,I ) =

H 1 2

4 d (I ,I ) = 0

H 1 1

d (I ,I ) = d (I ,I )

H H 1 2 2 1

d (I ,I ) d (I ,I ) + d (I ,I )

H H H 1 3 1 2 2 3

  • (i)

(ii) (iii)

The Hamming distance between genotypes induces a metric in sequence space

slide-13
SLIDE 13

Motoo Kimura. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

slide-14
SLIDE 14

The molecular clock of evolution Motoo Kimura. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

slide-15
SLIDE 15

Bacterial Evolution

  • S. F. Elena, V. S. Cooper, R. E. Lenski. Punctuated evolution caused by selection of

rare beneficial mutants. Science 272 (1996), 1802-1804

  • D. Papadopoulos, D. Schneider, J. Meier-Eiss, W. Arber, R. E. Lenski, M. Blot.

Genomic evolution during a 10,000-generation experiment with bacteria. Proc.Natl.Acad.Sci.USA 96 (1999), 3807-3812

slide-16
SLIDE 16

24 h 24 h

Serial transfer of Escherichia coli cultures in Petri dishes

1 day 6.67 generations 1 month 200 generations

  • 1 year 2400 generations
  • lawn of E.coli

nutrient agar

slide-17
SLIDE 17

1 year

Epochal evolution of bacteria in serial transfer experiments under constant conditions

  • S. F. Elena, V. S. Cooper, R. E. Lenski. Punctuated evolution caused by selection of rare beneficial mutants.

Science 272 (1996), 1802-1804

slide-18
SLIDE 18

2000 4000 6000 8000 Time 5 10 15 20 25 Hamming distance to ancestor Generations

Variation of genotypes in a bacterial serial transfer experiment

  • D. Papadopoulos, D. Schneider, J. Meier-Eiss, W. Arber, R. E. Lenski, M. Blot. Genomic evolution during a

10,000-generation experiment with bacteria. Proc.Natl.Acad.Sci.USA 96 (1999), 3807-3812

slide-19
SLIDE 19

No new principle will declare itself from below a heap of facts.

Sir Peter Medawar, 1985

slide-20
SLIDE 20
slide-21
SLIDE 21

In evolution variation occurs on genotypes but selection operates on the phenotype. Mappings from genotypes into phenotypes are highly complex objects. The only computationally accessible case is in the evolution of RNA molecules. The mapping from RNA sequences into secondary structures and function, sequence structure fitness, is used as a model for the complex relations between genotypes and phenotypes. Fertile progeny measured in terms of fitness in population biology is determined quantitatively by replication rate constants of RNA molecules.

Population biology Molecular genetics Evolution of RNA molecules Genotype Genome RNA sequence Phenotype Organism RNA structure and function Fitness Reproductive success Replication rate constant

The RNA model

slide-22
SLIDE 22

O CH2 OH O O P O O O

N1

O CH2 OH O P O O O

N2

O CH2 OH O P O O O

N3

O CH2 OH O P O O O

N4

N A U G C

k =

, , ,

3' - end 5' - end Na Na Na Na

5'-end 3’-end

GCGGAU AUUCGC UUA AGUUGGGA G CUGAAGA AGGUC UUCGAUC A ACCA GCUC GAGC CCAGA UCUGG CUGUG CACAG 3'-end 5’-end

70 60 50 40 30 20 10

Definition of RNA structure

slide-23
SLIDE 23

5'-End 5'-End 3'-End 3'-End

70 60 50 40 30 20 10 GCGGAUUUAGCUCAGDDGGGAGAGCMCCAGACUGAAYAUCUGGAGMUCCUGUGTPCGAUCCACAGAAUUCGCACCA

Sequence Secondary structure Base pairs: A = U , G C , and G - U

slide-24
SLIDE 24

Definition and physical relevance of RNA secondary structures

RNA secondary structures are listings of Watson-Crick and GU wobble base pairs, which are free of knots and pseudokots. „Secondary structures are folding intermediates in the formation of full three-dimensional structures.“ D.Thirumalai, N.Lee, S.A.Woodson, and D.K.Klimov. Annu.Rev.Phys.Chem. 52:751-762 (2001):

slide-25
SLIDE 25

RNA sequence RNA structure

  • f minimal free

energy

RNA folding: Structural biology, spectroscopy of biomolecules, understanding molecular function Empirical parameters Biophysical chemistry: thermodynamics and kinetics

Sequence, structure, and design

Inverse folding of RNA: Biotechnology, design of biomolecules with predefined structures and functions

slide-26
SLIDE 26

G G G G G G G G G G G G G G G G U U U U U U U U U U U A A A A A A A A A A A A U C C C C C C C C C C C C 5’-end 3’-end

S1

(h)

S9

(h)

F r e e e n e r g y G

  • Minimum of free energy

Suboptimal conformations

S0

(h) S2

(h)

S3

(h)

S4

(h)

S7

(h)

S6

(h)

S5

(h)

S8

(h)

The minimum free energy structures on a discrete space of conformations

slide-27
SLIDE 27

UUUAGCCAGCGCGAGUCGUGCGGACGGGGUUAUCUCUGUCGGGCUAGGGCGC GUGAGCGCGGGGCACAGUUUCUCAAGGAUGUAAGUUUUUGCCGUUUAUCUGG UUAGCGAGAGAGGAGGCUUCUAGACCCAGCUCUCUGGGUCGUUGCUGAUGCG CAUUGGUGCUAAUGAUAUUAGGGCUGUAUUCCUGUAUAGCGAUCAGUGUCCG GUAGGCCCUCUUGACAUAAGAUUUUUCCAAUGGUGGGAGAUGGCCAUUGCAG

Criterion of Minimum Free Energy

Sequence Space Shape Space

slide-28
SLIDE 28

Reference for postulation and in silico verification of neutral networks

slide-29
SLIDE 29

Evolution in silico

  • W. Fontana, P. Schuster,

Science 280 (1998), 1451-1455

slide-30
SLIDE 30

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers Mapping from sequence space into structure space and into function

slide-31
SLIDE 31

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers

slide-32
SLIDE 32

Sk I. = ( ) ψ

Sequence space Structure space

slide-33
SLIDE 33

Sk I. = ( ) ψ

Sequence space Structure space

The pre-image of the structure Sk in sequence space is the neutral network Gk

slide-34
SLIDE 34

Neutral networks are sets of sequences forming the same object in a phenotype space. The neutral network Gk is, for example, the pre- image of the structure Sk in sequence space: Gk = -1(Sk) π{j | (Ij) = Sk} The set is converted into a graph by connecting all sequences of Hamming distance one. Neutral networks of small biomolecules can be computed by exhaustive folding of complete sequence spaces, i.e. all RNA sequences of a given chain length. This number, N=4n , becomes very large with increasing length, and is prohibitive for numerical computations. Neutral networks can be modelled by random graphs in sequence

  • space. In this approach, nodes are inserted randomly into sequence

space until the size of the pre-image, i.e. the number of neutral sequences, matches the neutral network to be studied.

slide-35
SLIDE 35

λj = 27 = 0.444 ,

/

12 λk = (k)

j

| | Gk

λ κ

cr = 1 -

  • 1 (

1)

/ κ- λ λ

k cr . . . .

> λ λ

k cr . . . .

< network is connected Gk network is connected not Gk Connectivity threshold: Alphabet size : = 4

  • AUGC

G S S

k k k

= ( ) | ( ) =

  • 1

U

  • I

I

j j

  • cr

2 0.5 3 0.423 4 0.370

GC,AU GUC,AUG AUGC

Mean degree of neutrality and connectivity of neutral networks

slide-36
SLIDE 36

A connected neutral network formed by a common structure

slide-37
SLIDE 37

Giant Component

A multi-component neutral network formed by a rare structure

slide-38
SLIDE 38

Gk Neutral Network

Structure S

k

Gk C

  • k

Compatible Set Ck

The compatible set Ck of a structure Sk consists of all sequences which form Sk as its minimum free energy structure (the neutral network Gk) or one of its suboptimal structures.

slide-39
SLIDE 39

Structure

slide-40
SLIDE 40

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G

Compatible sequence Structure

5’-end 3’-end

slide-41
SLIDE 41

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G G G G G G G G C C C C G G G G C C C C C C C U A U U G U A A A A U

Compatible sequence Structure

5’-end 3’-end

slide-42
SLIDE 42

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G G G G G G G G C C C C U U G G G G G C C C C C C C U U A A A A A U

Compatible sequence Structure

5’-end 3’-end

Single nucleotides: A U G C , , ,

Single bases pairs are varied independently

slide-43
SLIDE 43

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G G G C C C C G G G G C C G G G G G C C C C C U A U U G U A A A A U

Compatible sequence Structure

5’-end 3’-end

Base pairs: AU , UA GC , CG GU , UG

Base pairs are varied in strict correlation

slide-44
SLIDE 44

Structure S Structure S

1

The intersection of two compatible sets is always non empty: C0 C1

slide-45
SLIDE 45

Reference for the definition of the intersection and the proof of the intersection theorem

slide-46
SLIDE 46

5.10 5.90

2 8

14 15 18 17 23 19 27 22 38 45 25 36 33 39 40 43 41

3.30 7.40

5 3 7 4 10 9 6

13 12

3.10

11 21 20 16 28 29 26 30 32 42 46 44 24 35 34 37 49 31 47 48

S0 S1

Kinetic folding

S0 S1 S2 S3 S4 S5 S6 S7 S8 S10 S9

Suboptimal structures

lim t finite folding time

A typical energy landscape of a sequence with two (meta)stable comformations

slide-47
SLIDE 47

G G G C C C G C C G C C C G C C C G C G G G G C

Plus Strand Plus Strand Minus Strand Plus Strand 3' 3' 3' 3' 5' 3' 5' 5' 5'

Point Mutation Insertion Deletion

GAA AA UCCCG GAAUCC A CGA GAA AA UCCCGUCCCG GAAUCCA

The origins of changes in RNA sequences are replication errors called mutations.

slide-48
SLIDE 48

5'-End 5'-End 5'-End 3'-End 3'-End 3'-End

70 60 50 40 30 20 10 GCGGAUUUAGCUCAGDDGGGAGAGCMCCAGACUGAAYAUCUGGAGMUCCUGUGTPCGAUCCACAGAAUUCGCACCA

Sequence Secondary structure Symbolic notation

  • A symbolic notation of RNA secondary structure that is equivalent to the conventional graphs
slide-49
SLIDE 49

Hamming distance d (S ,S ) =

H 1 2

4 d (S ,S ) = 0

H 1 1

d (S ,S ) = d (S ,S )

H H 1 2 2 1

d (S ,S ) d (S ,S ) + d (S ,S )

H H H 1 3 1 2 2 3

  • (i)

(ii) (iii)

The Hamming distance between structures in parentheses notation forms a metric in structure space

slide-50
SLIDE 50

f0 f f1 f2 f3 f4 f6 f5 f7

Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Evaluation of RNA secondary structures yields replication rate constants

slide-51
SLIDE 51

Stock Solution Reaction Mixture

Replication rate constant: fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Selection constraint: Population size, N = # RNA molecules, is controlled by the flow Mutation rate: p = 0.001 / site replication N N t N ± ≈ ) ( The flowreactor as a device for studies of evolution in vitro and in silico

slide-52
SLIDE 52

5'-End 3'-End

70 60 50 40 30 20 10

Randomly chosen initial structure Phenylalanyl-tRNA as target structure

slide-53
SLIDE 53

space Sequence Concentration

Master sequence Mutant cloud “Off-the-cloud” mutations

The molecular quasispecies in sequence space

slide-54
SLIDE 54

S{ = ( ) I{ f S

{ {

ƒ = ( )

S{ f{ I{

Mutation Genotype-Phenotype Mapping Evaluation of the Phenotype

Q{

j

I1 I2 I3 I4 I5 In

Q

f1 f2 f3 f4 f5 fn

I1 I2 I3 I4 I5 I{ In+1 f1 f2 f3 f4 f5 f{ fn+1

Q

Evolutionary dynamics including molecular phenotypes

slide-55
SLIDE 55

In silico optimization in the flow reactor: Evolutionary trajectory Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-56
SLIDE 56

44

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Final conformation of optimization

slide-57
SLIDE 57

44 43

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of the last step 43 44

slide-58
SLIDE 58

44 43 42

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of last-but-one step 42 43 ( 44)

slide-59
SLIDE 59

44 43 42 41

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of step 41 42 ( 43 44)

slide-60
SLIDE 60

44 43 42 41 40

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of step 40 41 ( 42 43 44)

slide-61
SLIDE 61

44 43 42 41 40 39 Evolutionary process Reconstruction

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of the relay series

slide-62
SLIDE 62

Transition inducing point mutations Neutral point mutations

Change in RNA sequences during the final five relay steps 39 44

slide-63
SLIDE 63

10 08 12 14 Time (arbitrary units) Average structure distance to target dS

  • 500

250 20 10

Uninterrupted presence Evolutionary trajectory Number of relay step

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations Neutral point mutations

Neutral genotype evolution during phenotypic stasis

slide-64
SLIDE 64

AUGC GC Movies of optimization trajectories over the AUGC and the GC alphabet

slide-65
SLIDE 65

Runtime of trajectories F r e q u e n c y

1000 2000 3000 4000 5000 0.05 0.1 0.15 0.2

Statistics of the lengths of trajectories from initial structure to target (AUGC-sequences)

slide-66
SLIDE 66

Alphabet Runtime Transitions Main transitions

  • No. of runs

AUGC 385.6 22.5 12.6 1017 GUC 448.9 30.5 16.5 611 GC 2188.3 40.0 20.6 107

Mean population size: N = 3000 ; mutation rate: p = 0.001 Statistics of trajectories and relay series (mean values of log-normal distributions).

AUGC neutral networks of tRNAs are near the connectivity threshold, GC neutral networks are way below.

slide-67
SLIDE 67

5'-End 5'-End 5'-End 5'-End 3'-End 3'-End 3'-End 3'-End

70 70 70 70 60 60 60 60 50 50 50 50 40 40 40 40 30 30 30 30 20 20 20 20 10 10 10 10

Alphabet Degree of neutrality

AU AUG AUGC UGC GC

  • -
  • -

0.275 0.064 0.263 0.071 0.052 0.033

  • -

0.217 0.051 0.279 0.063 0.257 0.070

  • 0.057 0.034
  • 0.073 0.032

0.201 0.056 0.313 0.058 0.250 0.064 0.068 0.034

  • Degree of neutrality of cloverleaf RNA secondary structures over different alphabets
slide-68
SLIDE 68

Stable tRNA clover leaf structures built from binary, GC-only, sequences exist. The corresponding sequences are found through inverse folding. Optimization by mutation and selection in the flow reactor turned out to be a hard problem.

5'-End 3'-End

70 60 50 40 30 20 10

The neutral network of the tRNA clover leaf in GC sequence space is not connected, whereas to the corresponding neutral network in AUGC sequence space is close to the connectivity threshold, cr . Here, both inverse folding and optimization in the flow reactor are much more effective than with GC sequences.

The hardness of the structure optimization problem depends on the connectivity of neutral networks.

slide-69
SLIDE 69

Initial state Target Extinction

Replication, mutation and dilution

slide-70
SLIDE 70

10 12 14 16 18 20 22 Population size 0.2 0.4 0.6 0.8 1 P r

  • b

a b i l i t y t

  • r

e a c h t h e t a r g e t s t r u c t u r e

AUGC GC

Probability of a single trajectory to reach the target structure

slide-71
SLIDE 71
slide-72
SLIDE 72
slide-73
SLIDE 73
slide-74
SLIDE 74
slide-75
SLIDE 75

A ribozyme switch

E.A.Schultes, D.B.Bartel, Science 289 (2000), 448-452

slide-76
SLIDE 76

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis--virus (B)

slide-77
SLIDE 77

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-78
SLIDE 78

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-79
SLIDE 79

Structure S Structure S

1

The intersection of two compatible sets is always non empty: C0 C1

slide-80
SLIDE 80
  • J. H. A. Nagel, C. Flamm, I. L. Hofacker, K. Franke, M. H. de Smit, P. Schuster, and
  • C. W. A. Pleij. Structural parameters affecting the kinetic competition of RNA hairpin

formation, in press 2004.

  • J. H. A. Nagel, J. Møller-Jensen, C. Flamm, K. J. Öistämö, J. Besnard, I. L. Hofacker,
  • A. P. Gultyaev, M. H. de Smit, P. Schuster, K. Gerdes and C. W. A. Pleij. The refolding

mechanism of the metastable structure in the 5’-end of the hok mRNA of plasmid R1, submitted 2004.

slide-81
SLIDE 81

J.H.A. Nagel, C. Flamm, I.L. Hofacker, K. Franke, M.H. de Smit, P. Schuster, and C.W.A. Pleij. Structural parameters affecting the kinetic competition of RNA hairpin formation, in press 2004.

JN2C

A A A G A A A U U U C U U U U U U U U U U U U U UC U U U U U U G G G G G G G G G C C C C C A G A A A U G G G C C C G G C A A G A G C G C A G A A G G C C C

5' 5' 3' 3'

CUGUUUUUGCA U AGCUUCUGUUG GCAGAAGC GCAGAAGC

  • 19.5 kcal·mol
  • 1
  • 21.9 kcal·mol
  • 1

A A A B B B C C C

3 3 3 15 15 15 36 36 36 24 24 24

slide-82
SLIDE 82

JN1LH

1D 1D 1D 2D 2D 2D R R R

G GGGUGGAAC GUUC GAAC GUUCCUCCC CACGAG CACGAG CACGAG

  • 28.6 kcal·mol
  • 1

G/

  • 31.8 kcal·mol
  • 1

G G G G G G C C C C C C A A U U U U G G C C U U A A G G G C C C A A A A G C G C A A G C /G

  • 28.2 kcal·mol
  • 1

G G G G G G GG CCC C C C C C U G G G G C C C C A A A A A A A A U U U U U G G C C A A

  • 28.6 kcal·mol
  • 1

3 3 3 13 13 13 23 23 23 33 33 33 44 44 44

5' 5' 3’ 3’

J.H.A. Nagel, C. Flamm, I.L. Hofacker, K. Franke, M.H. de Smit, P. Schuster, and C.W.A. Pleij. Structural parameters affecting the kinetic competition of RNA hairpin formation, in press 2004.

slide-83
SLIDE 83

4 5 8 9 11

1 9 2 2 4 2 5 2 7 3 3 3 4

36

38 39 41 46 47

3

49

1

2 6 7 10

1 2 1 3 1 4 1 5 1 6 1 7 1 8 2 1 22 2 3 2 6 2 8 2 9 3 3 1 32 3 5 3 7

40

4 2 4 3 44 45 48 50

  • 26.0
  • 28.0
  • 30.0
  • 32.0
  • 34.0
  • 36.0
  • 38.0
  • 40.0
  • 42.0
  • 44.0
  • 46.0
  • 48.0
  • 50.0

2.77 5.32 2 . 9 3.4 2.36 2 . 4 4 2.44 2.44 1.46 1.44 1.66

1.9

2.14

2.51 2.14 2.51

2 . 1 4 1 . 4 7

1.49

3.04 2.97 3.04 4.88 6.13 6 . 8 2.89

Free energy [kcal / mole]

J1LH barrier tree

slide-84
SLIDE 84

Riboswitches

Jord H. A. Nagel and Cornelius W. A. Pleij. Self-induced structural switches in RNA. Biochimie 84 (2002), 913-923 Wade Winkler, Ali Nahvi, and Ronald R. Breaker. Thiamine derivatives bind messenger RNA directly to regulate bacterial gene expression. Nature 419 (2002), 952-956 Ronald Micura and Claudia Höbartner. On Secondary Structure Rearrangements and Equilibria of Small RNAs. Nature 419 (2002), 952-956 Alexey G. Vitreschak, Dimitry A. Rodionov, Andrey A. Mironov, and Mikhail S. Gefland. Riboswitches: The oldest mechanism for the regulation of gene expression? Trends in Genetics 20 (2004), 44-50 Jeffrey E. Barrick, Keith A. Corbino, Wade C. Winkler, Ali Nahvi, Maumita Mandal, Jennifer Collins, Mark Lee, Adam Roth, Narasimhan Sundarasan, Inbal Jona, J. Kenneth Wickiser, and Ronald R. Breaker. New RNA motifs suggest an expanded scope for riboswitches in bacterial genetic control. Proc.Natl.Acad.Sci.USA 101 (2004), 6421-6426

slide-85
SLIDE 85

RNA as metabolite sensor and translation regulator: The figure represents part of a messenger RNA that encodes a bacterial protein involved in the biosynthesis of vitamin B1 (thiamine pyrophosphate, ThiPP). This part contains a region involved in sensing ThiPP; the so-called Shine–Dalgarno (SD) sequence, which is recognized by the ribosome; and a sequence that marks where the enzyme- encoding portion of the mRNA begins. a, Winkler et al.1 have found that in the absence of ThiPP, the sensor region adopts a conformation that exposes the Shine–Dalgarno

  • sequence. This would

allow the ribosome to bind and begin translation. b, Binding to ThiPP causes the sensor region to change shape, obscuring the Shine–Dalgarno

  • sequence. Thus ThiPP controls the production
  • f one of its biosynthetic enzymes directly via

a sequence within the enzyme-encoding mRNA.

Jack W. Szostak, RNA gets a grip on translation. Nature 419:890-891, 2002

slide-86
SLIDE 86

Schematic representation of the proposed mechanism for TPP- dependent deactivation of thiM translation: In the absence of TPP, the P8* pairing is formed between the anti-SD element and the anti-anti-SD

  • element. This conformation permits

the SD sequence to interact with the ribosome, and thus translation

  • proceeds. In the presence of TPP

(blue), the obligate formation of the P1 stem sequesters a portion of the anti-anti-SD element, and therefore the complete P8 stem also forms. This precludes ribosome access to the SD element, which inhibits

  • translation. Complementary sequence

elements that form P1 and P8 are depicted in green and orange, respectively.

Wade Winkler, Ali Nahvi, and Ronald R. Breaker, Thiamine derivatives bind messenger RNA directly to regulate bacterial gene expression. Nature 419:952-956, 2002

slide-87
SLIDE 87

Mount Fuji

Example of a smooth landscape on Earth

slide-88
SLIDE 88

Dolomites Bryce Canyon

Examples of rugged landscapes on Earth

slide-89
SLIDE 89

Genotype Space Fitness

Start of Walk End of Walk

Evolutionary optimization in absence of neutral paths in sequence space

slide-90
SLIDE 90

Genotype Space F i t n e s s

Start of Walk End of Walk Random Drift Periods Adaptive Periods

Evolutionary optimization including neutral paths in sequence space

slide-91
SLIDE 91

Grand Canyon

Example of a landscape on Earth with ‘neutral’ ridges and plateaus

slide-92
SLIDE 92

Conformational and mutational landscapes of biomolecules as well as fitness landscapes of evolutionary biology are rugged.

Genotype Space Fitness Start of Walk End of Walk

Adaptive or non-descending walks on rugged landscapes end commonly at one of the low lying local maxima.

Genotype Space Fitness Start of Walk End of Walk

Selective neutrality in the form of neutral networks plays an active role in evolutionary optimization and enables populations to reach high local maxima or even the global optimum.

slide-93
SLIDE 93

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093, 13887, and 14898 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Project No. EU-980189 Siemens AG, Austria Österreichische Akademie der Wissenschaften Universität Wien The software for producing RNA movies was developed by Robert Giegerich and coworkers at the Universität Bielefeld

Universität Wien Österreichische Akademie der Wissenschaften

slide-94
SLIDE 94

Coworkers

Walter Fontana, Harvard Medical School, Boston, MA Christian Reidys, Christian Forst, Los Alamos National Laboratory, NM Peter Stadler, Bärbel Stadler, Universität Leipzig, GE Ivo L.Hofacker, Christoph Flamm, Andreas Svrček-Seiler, Universität Wien, AT Kurt Grünberger, Stefan Müller, Stefanie Widder, Stefan Wuchty, Andreas Wernitznig, Michael Kospach, Jan Cupal, Stefan Bernhard, Lukas Endler, Ulrike Langhammer, Ulrike Mückstein, Universität Wien AT Ulrike Göbel, Walter Grüner, Stefan Kopp, Jaqueline Weber Institut für Molekulare Biotechnologie, Jena, GE

Universität Wien

slide-95
SLIDE 95

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-96
SLIDE 96