Biology Meets Chemistry Molecular Evolution and Systems Biology at - - PowerPoint PPT Presentation

biology meets chemistry
SMART_READER_LITE
LIVE PREVIEW

Biology Meets Chemistry Molecular Evolution and Systems Biology at - - PowerPoint PPT Presentation

Biology Meets Chemistry Molecular Evolution and Systems Biology at the Cross-Roads Peter Schuster Institut fr Theoretische Chemie, Universitt Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA Mini-Symposium on Theoretical


slide-1
SLIDE 1
slide-2
SLIDE 2

Biology Meets Chemistry

Molecular Evolution and Systems Biology at the Cross-Roads

Peter Schuster

Institut für Theoretische Chemie, Universität Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA

Mini-Symposium on Theoretical Biology ETH-Zürich, 04.07.2005

slide-3
SLIDE 3

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-4
SLIDE 4

Genotype, Genome Phenotype

Unfolding of the genotype

Highly specific environmental conditions Developmental program

Collection of genes

Evolution

slide-5
SLIDE 5

The macroscopic biologists‘ nightmare: The conquest of biology by chemists and physicists and, eventually, by mathematicians and computer scientists has become reality in the last fifty years.

slide-6
SLIDE 6

The macroscopic biologists‘ revenche: Chemists and physicists don‘t know any biology, and this is true even more for mathematicians and computer scientists. So they all have to learn biology.

The victory of the live sciences, 2005.

slide-7
SLIDE 7

Genotype, Genome

GCGGATTTAGCTCAGTTGGGAGAGCGCCAGACTGAAGATCTGGAGGTCCTGTGTTCGATCCACAGAATTCGCACCA

Phenotype

Unfolding of the genotype

Highly specific environmental conditions Biochemistry molecular biology structural biology molecular evolution molecular genetics systems biology bioinfomatics

Max Perutz Hemoglobin sequence Gerhard Braunitzer Molecular evolution Linus Pauling and Emile Zuckerkandl The exciting RNA story evolution of RNA molecules, ribozymes and splicing, the idea of an RNA world, selection of RNA molecules, RNA editing, the ribosome is a ribozyme, small RNAs and RNA switches.

Omi Omics

‘the new biology is the chemistry of living matter’ James D. Watson und Francis H.C. Crick

slide-8
SLIDE 8

GCGGATTTAGCTCAGTTGGGAGAGCGCCAGACTGAAGATCTGGAGGTCCTGTGTTCGATCCACAGAATTCGCACCA

1 2 3 4 5 6 7 8 9 10 11 12 Regulatory protein or RNA Enzyme Metabolite Regulatory gene Structural gene

A model genome with 12 genes

Genotype RNA secondary structure RNA spatial structure Genetic and metabolic network

Three different genotype-phenotype mappings

slide-9
SLIDE 9

1. RNA phenotypes 2. Genotype-phenotype mappings 3. Evolution on neutral networks 4. Genetic and metabolic networks 5. A glimpse of chemical kinetics and dynamics 6. How do model metabolisms evolve?

slide-10
SLIDE 10
  • 1. RNA phenotypes

2. Genotype-phenotype mappings 3. Evolution on neutral networks 4. Genetic and metabolic networks 5. A glimpse of chemical kinetics and dynamics 6. How do model metabolisms evolve?

slide-11
SLIDE 11

Complementary replication is the simplest copying mechanism of RNA. Complementarity is determined by Watson-Crick base pairs: GC and A=U

slide-12
SLIDE 12

RNA sample Stock solution: Q RNA-replicase, ATP, CTP, GTP and UTP, buffer

  • Time

1 2 3 4 5 6 69 70 The serial transfer technique applied to RNA evolution in vitro

slide-13
SLIDE 13

Reproduction of the original figure of the serial transfer experiment with Q RNA β D.R.Mills, R,L,Peterson, S.Spiegelman, . Proc.Natl.Acad.Sci.USA (1967), 217-224 An extracellular Darwinian experiment with a self-duplicating nucleic acid molecule 58

slide-14
SLIDE 14

Decrease in mean fitness due to quasispecies formation

The increase in RNA production rate during a serial transfer experiment

slide-15
SLIDE 15

RNA structure determines fitness in RNA evolution experiments

slide-16
SLIDE 16

Definition and physical relevance of RNA secondary structures

RNA secondary structures are listings of Watson-Crick and GU wobble base pairs, which are free of knots and

  • pseudokots. This definition allows for rigorous

mathematical analysis by means of combinatorics. „Secondary structures are folding intermediates in the formation of full three-dimensional structures.“ Secondary structures have been and still are frequently used to predict and discuss RNA function. D.Thirumalai, N.Lee, S.A.Woodson, and D.K.Klimov. Annu.Rev.Phys.Chem. 52:751-762 (2001):

slide-17
SLIDE 17

5'-End

5'-End 5'-End 3'-End 3'-End

3'-End

70 60 50 40 30 20 10 GCGGAUUUAGCUCAGDDGGGAGAGCMCCAGACUGAAYAUCUGGAGMUCCUGUGTPCGAUCCACAGAAUUCGCACCA

Sequence Secondary structure Symbolic notation

  • A symbolic notation of RNA secondary structure that is equivalent to the conventional graphs
slide-18
SLIDE 18

RNA sequence RNA structure

  • f minimal free

energy

RNA folding: Structural biology, spectroscopy of biomolecules, understanding molecular function Empirical parameters Biophysical chemistry: thermodynamics and kinetics

Sequence, structure, and design

slide-19
SLIDE 19

G G G G G G G G G G G G G G G G U U U U U U U U U U U A A A A A A A A A A A A U C C C C C C C C C C C C 5’-end 3’-end

S1

(h)

S9

(h)

F r e e e n e r g y G

  • Minimum of free energy

Suboptimal conformations

S0

(h) S2

(h)

S3

(h)

S4

(h)

S7

(h)

S6

(h)

S5

(h)

S8

(h)

The minimum free energy structures on a discrete space of conformations

slide-20
SLIDE 20

RNA sequence RNA structure

  • f minimal free

energy

RNA folding: Structural biology, spectroscopy of biomolecules, understanding molecular function Inverse Folding Algorithm Iterative determination

  • f a sequence for the

given secondary structure

Sequence, structure, and design

Inverse folding of RNA: Biotechnology, design of biomolecules with predefined structures and functions

slide-21
SLIDE 21

The Vienna RNA-Package: A library of routines for folding, inverse folding, sequence and structure alignment, kinetic folding, cofolding, …

slide-22
SLIDE 22

1. RNA phenotypes

  • 2. Genotype-phenotype mappings

3. Evolution on neutral networks 4. Genetic and metabolic networks 5. A glimpse of chemical kinetics and dynamics 6. How do model metabolisms evolve?

slide-23
SLIDE 23
slide-24
SLIDE 24

A mapping and its inversion

  • Gk =

( ) | ( ) =

  • 1

U

  • S

I S

k j j k

I

( ) = I S

j k Space of genotypes: = { I

S I I I I I S S S S S

1 2 3 4 N 1 2 3 4 M

, , , , ... , } ; Hamming metric Space of phenotypes: , , , , ... , } ; metric (not required) N M = {

slide-25
SLIDE 25

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers Mapping from sequence space into structure space and into function

slide-26
SLIDE 26

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers

slide-27
SLIDE 27

Sk I. = ( ) ψ

Sequence space Structure space

slide-28
SLIDE 28

The pre-image of the structure Sk in sequence space is the neutral network Gk

slide-29
SLIDE 29

Degree of neutrality of neutral networks and the connectivity threshold

slide-30
SLIDE 30

Giant Component

A multi-component neutral network formed by a rare structure: < cr

slide-31
SLIDE 31

A connected neutral network formed by a common structure: > cr

slide-32
SLIDE 32

Reference for postulation and in silico verification of neutral networks

slide-33
SLIDE 33

Properties of RNA sequence to secondary structure mapping

  • 1. More sequences than structures
slide-34
SLIDE 34

Properties of RNA sequence to secondary structure mapping

  • 1. More sequences than structures
slide-35
SLIDE 35

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures

slide-36
SLIDE 36

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures

n = 100, stem-loop structures n = 30

RNA secondary structures and Zipf’s law

slide-37
SLIDE 37

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures

slide-38
SLIDE 38

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures

slide-39
SLIDE 39

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures 4. Neutral networks of common structures are connected

slide-40
SLIDE 40

Properties of RNA sequence to secondary structure mapping 1. More sequences than structures 2. Few common versus many rare structures 3. Shape space covering of common structures 4. Neutral networks of common structures are connected

slide-41
SLIDE 41

RNA 9:1456-1463, 2003

Evidence for neutral networks and shape space covering

slide-42
SLIDE 42

Evidence for neutral networks and intersection of apatamer functions

slide-43
SLIDE 43

A ribozyme switch

E.A.Schultes, D.B.Bartel, Science 289 (2000), 448-452

slide-44
SLIDE 44

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis--virus (B)

slide-45
SLIDE 45

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-46
SLIDE 46

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-47
SLIDE 47

1. RNA phenotypes 2. Genotype-phenotype mappings

  • 3. Evolution on neutral networks

4. Genetic and metabolic networks 5. A glimpse of chemical kinetics and dynamics 6. How do model metabolisms evolve?

slide-48
SLIDE 48

Evolution in silico

  • W. Fontana, P. Schuster,

Science 280 (1998), 1451-1455

slide-49
SLIDE 49

Replication rate constant (Fitness): fk = / [ + dS

(k)]

dS

(k) = dH(Sk,S)

Selection pressure: The population size, N = # RNA moleucles, is determined by the flux: Mutation rate: p = 0.001 / Nucleotide Replication N N t N ± ≈ ) ( The flow reactor as a device for studying the evolution of molecules in vitro and in silico.

slide-50
SLIDE 50
slide-51
SLIDE 51
slide-52
SLIDE 52
slide-53
SLIDE 53

In silico optimization in the flow reactor: Evolutionary Trajectory

slide-54
SLIDE 54

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations change the molecular structure Neutral point mutations leave the molecular structure unchanged

Neutral genotype evolution during phenotypic stasis

slide-55
SLIDE 55

Evolutionary trajectory Spreading of the population

  • n neutral networks

Drift of the population center in sequence space

slide-56
SLIDE 56

Spreading and evolution of a population on a neutral network: t = 150

slide-57
SLIDE 57

Spreading and evolution of a population on a neutral network : t = 170

slide-58
SLIDE 58

Spreading and evolution of a population on a neutral network : t = 200

slide-59
SLIDE 59

Spreading and evolution of a population on a neutral network : t = 350

slide-60
SLIDE 60

Spreading and evolution of a population on a neutral network : t = 500

slide-61
SLIDE 61

Spreading and evolution of a population on a neutral network : t = 650

slide-62
SLIDE 62

Spreading and evolution of a population on a neutral network : t = 820

slide-63
SLIDE 63

Spreading and evolution of a population on a neutral network : t = 825

slide-64
SLIDE 64

Spreading and evolution of a population on a neutral network : t = 830

slide-65
SLIDE 65

Spreading and evolution of a population on a neutral network : t = 835

slide-66
SLIDE 66

Spreading and evolution of a population on a neutral network : t = 840

slide-67
SLIDE 67

Spreading and evolution of a population on a neutral network : t = 845

slide-68
SLIDE 68

Spreading and evolution of a population on a neutral network : t = 850

slide-69
SLIDE 69

Spreading and evolution of a population on a neutral network : t = 855

slide-70
SLIDE 70

1. RNA phenotypes 2. Genotype-phenotype mappings 3. Evolution on neutral networks

  • 4. Genetic and metabolic networks

5. A glimpse of chemical kinetics and dynamics 6. How do model metabolisms evolve?

slide-71
SLIDE 71

GCGGATTTAGCTCAGTTGGGAGAGCGCCAGACTGAAGATCTGGAGGTCCTGTGTTCGATCCACAGAATTCGCACCA

1 2 3 4 5 6 7 8 9 10 11 12 Regulatory protein or RNA Enzyme Metabolite Regulatory gene Structural gene

A model genome with 12 genes

Genotype RNA secondary structure RNA spatial structure Genetic and metabolic network

Three different genotype-phenotype mappings

slide-72
SLIDE 72

The search for more complex phenotypes inevitably leads from evolvable molecules to genetic regulation and metabolism. The simplest systems of this kind are artificial regulatory systems on plasmids that can be expressed and studied in Escherichia coli cells.

slide-73
SLIDE 73

1 2 3 4 5 6 7 8 9 10 11 12 Regulatory protein or RNA Enzyme Metabolite Regulatory gene Structural gene

A model genome with 12 genes

Sketch of a genetic and metabolic network

slide-74
SLIDE 74

1 2 3 4 5 6 7 8 9 10 11 12 Regulatory protein or RNA Enzyme Metabolite Regulatory gene Structural gene

A model genome with 12 genes

Genetic regulatory network Metabolic network

Proposal of a new name: Genetic and metabolic network

slide-75
SLIDE 75

A B C D E F G H I J K L 1

Biochemical Pathways

2 3 4 5 6 7 8 9 10

The reaction network of cellular metabolism published by Boehringer-Ingelheim.

slide-76
SLIDE 76

The citric acid

  • r Krebs cycle

(enlarged from previous slide).

slide-77
SLIDE 77

1. RNA phenotypes 2. Genotype-phenotype mappings 3. Evolution on neutral networks 4. Genetic and metabolic networks

  • 5. A glimpse of chemical kinetics and dynamics

6. How do model metabolisms evolve?

slide-78
SLIDE 78

Time t Concentration xi (t)

Sequences

Vienna RNA Package

Structures and kinetic parameters Stoichiometric equations

SBML – systems biology markup language

Kinetic differential equations

ODE Integration by means of CVODE

Solution curves

A + B X 2 X Y Y + X D

y x k d y x k x k y y x k x k b a k x b a k b a

3 3 2 2 3 2 2 1 1

t d d t d d t d d t d d t d d = − = − − = − = =

The elements of the simulation tool MiniCellSim

SBML: Bioinformatics 19:524-531, 2003; CVODE: Computers in Physics 10:138-143, 1996

slide-79
SLIDE 79

ATGCCTTATACGGCAGTCAGGTGCACCATT...GGC TACGGAATATGCCGTCAGTCCACGTGGTAA...CCG DNA string genotype environment mRNA Protein RNA

Metabolism

RNA and protein structures enzymes and small molecules Recycling of molecules cell membrane nutrition waste genotype-p e h p mapping e y not genetic regulation network metabolic reaction network transport system

The regulatory logic of MiniCellSym

slide-80
SLIDE 80

The model regulatory gene in MiniCellSim

slide-81
SLIDE 81

The model structural gene in MiniCellSim

slide-82
SLIDE 82

Cross-regulation of two genes

slide-83
SLIDE 83

2 , 1 , ) ( : Repression ) ( : Activation

n n n

= + = + = j i p K K p F p K p p F

j j i j j j i

Gene regulatory binding functions

slide-84
SLIDE 84

2 P 2 2 P 2 2 1 P 2 1 P 1 1 2 Q 2 1 2 Q 2 2 1 Q 1 2 1 Q 1 1

) ( ) ( p d q k dt dp p d q k dt dp q d p F k dt dq q d p F k dt dq − = − = − = − =

2 2 1 1 2 2 1 1 2 1

] P [ , ] P [ , ] Q [ , ] Q [ . const ] G [ ] G [ p p q q g = = = = = = = 2 , 1 , ) ( : Repression ) ( : Activation

n n n

= + = + = j i p K K p F p K p p F

j j i j j j i

P 2 Q 2 P 2 Q 2 2 P 1 Q 1 P 1 Q 1 1 1 2 2 2 1 2 2 1 1 1

, ) ( , )) ( ( : points Stationary d d k k d d k k p F p p F F p = = = = − ϑ ϑ ϑ ϑ ϑ

Qualitative analysis of cross-regulation of two genes

slide-85
SLIDE 85

) , ( ) ε ( ) ε ( ) ε ( ) ε (

2 1 P 2 P 1 Q 2 Q 1 P 2 P 1 Q 2 Q 1

p p k k k k D D d d d d Γ − = = + + + + +

Eigenvalues of the Jacobian of the cross-regulatory two gene system

slide-86
SLIDE 86

) , ( ) ε ( ) ε ( ) ε ( ) ε (

2 1 P 2 P 1 Q 2 Q 1 P 2 P 1 Q 2 Q 1

p p k k k k D D d d d d Γ − = = + + + + +

Eigenvalues of the Jacobian of the cross-regulatory two gene system

slide-87
SLIDE 87

2 P 2 P 1 Q 2 Q 1 P 2 P 1 P 2 Q 2 P 1 Q 2 P 2 Q 1 P 1 Q 1 Q 2 Q 1 Hopf P 2 P 1 Q 2 Q 1 trans

) ( ) )( )( )( )( )( ( d d d d d d d d d d d d d d d d D d d d d D + + + + + + + + + = − =

slide-88
SLIDE 88

Regulatory dynamics at D < DHopf , act.-repr., n=3

slide-89
SLIDE 89

Regulatory dynamics at D > DHopf , act.-repr., n=3

slide-90
SLIDE 90

Hill coefficient: n Act.-Act. Act.-Rep. Rep.-Rep. 1 S , E S S 2 E , B(E,P) S S , B(P1,P2) 3 E , B(E,P) S , O S , B(P1,P2) 4 E , B(E,P) S , O S , B(P1,P2)

slide-91
SLIDE 91

An example analyzed and simulated by MiniCellSim

The repressilator: M.B. Ellowitz, S. Leibler. A synthetic oscillatory network of transcriptional

  • regulators. Nature 403:335-338, 2002
slide-92
SLIDE 92

Stable stationary state Limit cycle oscillations Fading oscillations caused by a stable heteroclinic orbit Hopf bifurcation Bifurcation to May-Leonhard system Increasing inhibitor strength

slide-93
SLIDE 93

P1 P2 P3

start start

The repressilator limit cycle

slide-94
SLIDE 94

P1 P2 P2 P2 P3

Stable heteroclinic orbit Unstable heteroclinic orbit

1 1 2 2 2<0 2>0 2=0

Bifurcation from limit cycle to stable heteroclinic orbit at

The repressilator heteroclinic orbit

slide-95
SLIDE 95

1. RNA phenotypes 2. Genotype-phenotype mappings 3. Evolution on neutral networks 4. Genetic and metabolic networks 5. A glimpse of chemical kinetics and dynamics

  • 6. How do model metabolisms evolve?
slide-96
SLIDE 96

Evolutionary time: 0000 Number of genes 12 : + 06 structural 06 regulatory Number of interactions 15 : + + 04 inhibitory + 10 activating 1 self-activating

A genabolic network formed from a genotype of n = 200 nucleotides

slide-97
SLIDE 97

100 1000 10000 1e+05 5 10 15 20 TF00 TF01 TF02 TF03 SP04 TF05 SP06 SP07 SP08 SP09 TF10 SP11

Evolutionary time 0000 , initial network : Intracellular time Stationary state Intracellular time scale Evolutionary time scale [generations]: 0000 initial network

slide-98
SLIDE 98

Evolution of a genabolic network:

Initial genome: random sequence of length n = 200, AUGC alphabet Gene length: n = 25 Simulation with mutation rate: p = 0.01 Evolutionary time unit >> intracellular time unit

slide-99
SLIDE 99

Number of genes: total / structural genes regulatory genes

slide-100
SLIDE 100

Evolution of a genabolic network:

Initial genome: random sequence of length n = 200, AUGC alphabet Gene length: n = 25 Simulation with mutation rate: p = 0.01 Evolutionary time unit >> intracellular time unit Recorded events: (i) Loss of a gene through corruption of the start signal “TA” (analogue of the “TATA Box”), (ii) creation of a gene, (iii) change in the edges through mutation-induced changes in the affinities of translation products to the binding sites, and (iv) change in the class of genes (tf sp).

slide-101
SLIDE 101

Statistics of one thousand generations Total number of genes: 11.67 2.69 Regulatory genes: 5.97 2.22 Structural genes: 5.70 2.17

slide-102
SLIDE 102

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Wiener Wissenschafts-, Forschungs- und Technologiefonds (WWTF) Project No. Mat05 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Contracts No. 98-0189, 12835 (NEST) Austrian Genome Research Program – GEN-AU Siemens AG, Austria Universität Wien, Austrian Academy of Sciences, and the Santa Fe Institute

Universität Wien

slide-103
SLIDE 103

Coworkers

Walter Fontana, Harvard Medical School, MA Christian Forst, Christian Reidys, Los Alamos National Laboratory, NM Peter Stadler, Bärbel Stadler, Universität Leipzig, GE Sebastian Bonhoeffer, ETH Zürich, Erich Bornberg-Bauer, Universität Münster, Martin Nowak, Harvard University, Thomas Wiehe, Universität Köln Jord Nagel, Kees Pleij, Universiteit Leiden, NL Heinz Engl, Stefan Müller, Josef Schiko, Johann Radon-Institut für Angewandte und Computergestützte Mathematik der Österreichischen Akademie der Wissenschaften, Linz, AT Christoph Flamm, Ivo L.Hofacker, Andreas Svrček-Seiler, Universität Wien, AT Kurt Grünberger, Michael Kospach, Andreas Wernitznig, Stefanie Widder, Michael Wolfinger, Stefan Wuchty,Universität Wien, AT Stefan Bernhart, Jan Cupal, Lukas Endler, Ulrike Langhammer, Rainer Machne, Ulrike Mückstein, Hakim Tafer, Universität Wien, AT Ulrike Göbel, Walter Grüner, Stefan Kopp, Jaqueline Weber, Institut für Molekulare Biotechnologie, Jena, GE

Universität Wien

slide-104
SLIDE 104

Afterthought in occasion of Sebastian Bonhoeffer’s promotion to full professor at the ETH

slide-105
SLIDE 105

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-106
SLIDE 106