Stochastic hybrid models for DNA replication in the fission yeast - - PowerPoint PPT Presentation

stochastic hybrid models for dna replication in the
SMART_READER_LITE
LIVE PREVIEW

Stochastic hybrid models for DNA replication in the fission yeast - - PowerPoint PPT Presentation

Stochastic hybrid models for DNA replication in the fission yeast John Lygeros Automatic Control Laboratory, ETH Zrich www.control.ethz.ch Outline 1. Hybrid and stochastic hybrid systems 2. Reachability & randomized methods 3. DNA


slide-1
SLIDE 1

Automatic Control Laboratory, ETH Zürich

www.control.ethz.ch

Stochastic hybrid models for DNA replication in the fission yeast

John Lygeros

slide-2
SLIDE 2

Outline

  • 1. Hybrid and stochastic hybrid systems
  • 2. Reachability & randomized methods
  • 3. DNA replication

– DNA replication in the cell cycle – A stochastic hybrid model – Simulation results for the fission yeast – Analysis

  • 4. Summary
slide-3
SLIDE 3

Hybrid dynamics

Discrete and continuous interactions Air traffic

Flight plan FMS modes Aircraft motion Networked control Network topology Quantization Network delays Controlled state

Multi-agent Biology

Coordination communication Agent motion Gene activation/ inhibition Protein concentration fluctuation

slide-4
SLIDE 4

Hybrid dynamics

  • Both continuous and discrete state and input
  • Interleaving of discrete and continuous

– Evolve continuously – Then take a jump – Then evolve continuously again – Etc.

  • Tight coupling

– Discrete evolution depends on continuous state – Continuous evolution depends on discrete state

slide-5
SLIDE 5

Hybrid systems

Air traffic

Networked control

Multi-agent Biology

Flight plan FMS modes Network topology Quantization Coordination communication Gene activation/ inhibition Aircraft motion Network delays Controlled state Agent motion Protein concentration fluctuation

Computation

  • Automata
  • Languages

Control

  • ODE
  • Trajectories

Hybrid systems = Computation & Control

slide-6
SLIDE 6

But what about uncertainty?

  • Hybrid systems allow uncertainty in

– Continuous evolution direction – Discrete & continuous state destinations – Choice between flowing and jumping

  • “Traditionally” uncertainty worst case

– “Non‐deterministic” – Yes/No type questions – Robust control – Pursuit evasion game theory

  • May be too coarse for some applications
slide-7
SLIDE 7

Example: Air traffic safety

Is a fatal accident possible in the current air traffic system? YES! Is this an interesting question? NO! What it is the probability

  • f a fatal accident?

How can this probability be reduced? Much more difficult!

slide-8
SLIDE 8

Stochastic hybrid systems

  • Answering (or even asking) these questions

requires additional complexity

  • Richer models to allow probabilities

– Continuous evolution (e.g. SDE) – Discrete transition timing (Markovian, forced) – Discrete transition destination (transition kernel)

  • Stochastic hybrid systems

Shameless plug: H.A.P. Blom and J. Lygeros (eds.), “Stochastic hybrid systems: Theory and safety critical applications”, Springer‐Verlag, 2006 C.G. Cassandras and J. Lygeros (eds.), “Stochastic hybrid systems”, CRC Press, 2006

slide-9
SLIDE 9

Computation

  • Automata
  • Languages

Control

  • ODE
  • Trajectories

Hybrid systems = Computation & Control Stochastic analysis

  • Stochastic DE
  • Martingales

Stochastic Hybrid Systems

slide-10
SLIDE 10

Outline

  • 1. Hybrid and stochastic hybrid systems
  • 2. Reachability & randomized methods
  • 3. DNA replication

– DNA replication in the cell cycle – A stochastic hybrid model – Simulation results for the fission yeast – Analysis

  • 4. Summary
slide-11
SLIDE 11

Reachability: Stochastic HS

State space Terminal states Initial states Estimate “measure”

  • f this set, P
slide-12
SLIDE 12

Monte‐Carlo simulation

  • Exact solutions impossible
  • Numerical solutions computationally intensive
  • Assume we have a simulator for the system

– Can generate trajectories of the system – With the right probability distribution

  • “Algorithm”

– Simulate the system N times – Count number of times terminal states reached (M) – Estimate reach probability P by ˆ

M P N =

slide-13
SLIDE 13
  • Moreover …
  • Simulating more we get as close as we like
  • “Fast” growth with ε slow growth with δ
  • No. of simulations independent of state size
  • Time needed for each simulation dependent on it
  • Have to give up certainty

Convergence

ˆ as P P N → → ∞

2

1 2 ln 2 N ε δ ⎛ ⎞ ≥ ⎜ ⎟ ⎝ ⎠ ˆ Probability that is at most as long as P P ε δ − ≥

  • It can be shown that
slide-14
SLIDE 14

Not as naïve as it sounds

  • Efficient implementations

– Interacting particle systems, parallelism

  • With control inputs

– Expected value cost – Randomized optimization problem – Asymptotic convergence – Finite sample bounds

  • Parameter identification

– Randomized optimization problem

  • Can randomize deterministic problems
slide-15
SLIDE 15

Outline

  • 1. Hybrid and stochastic hybrid systems
  • 2. Reachability & randomized methods
  • 3. DNA replication

– DNA replication in the cell cycle – A stochastic hybrid model – Simulation results for the fission yeast – Analysis

  • 4. Summary
slide-16
SLIDE 16

Credits

  • ETH Zurich:

– John Lygeros – K. Koutroumpas

  • U. of Patras:

– Zoe Lygerou – S. Dimopoulos – P. Kouretas – I. Legouras

  • Rockefeller U.:

– Paul Nurse – C. Heichinger – J. Wu

www.hygeiaweb.gr HYGEIA FP6‐NEST‐04995

slide-17
SLIDE 17

Systems biology

  • Mathematical modeling
  • f biological processes

at the molecular level

  • Genes proteins and

their interactions

  • Abundance of data

– Micoarray – Imaging and microscopy – Gene reporter systems, bioinformatics, robotics

slide-18
SLIDE 18

Systems biology

  • Models based on biologist intuition
  • Can “correlate” large data sets
  • Model predictions

– Highlight “gaps” in understanding – Motivate new experiments

Model Experiments

Understanding

slide-19
SLIDE 19

Cell cycle

S G2 G1 M

“Gap” Synthesis Mitosis Segregation

+

Replication

G1

slide-20
SLIDE 20

Process needs to be tightly regulated

Metastatic colon cancer Normal cell

slide-21
SLIDE 21

Origins of replication

slide-22
SLIDE 22

Regulatory biochemical network

  • CDK activity sets cell cycle pace [Nurse et.al.]
  • Complex biochemical network, ~12 proteins,

nonlinear dynamics [Novak et.al.]

Hybrid Process!

slide-23
SLIDE 23

Process “mechanics”

  • Discrete

– Firing of origins – Passive replication by adjacent origin

  • Continuous

– Forking: replication movement along genome – Speed depends on location along genome

  • Stochastic

– Location of origins (where?) – Firing of origins (when?)

slide-24
SLIDE 24

Different organisms, different strategies

  • Bacteria and budding yeast

– Specific sequences that act as origins – With very high efficiency (>95%) – Process very deterministic

  • Frog and fly embryos

– Any position along genome can act as an origin – Random number of origins fire – Random patterns of replication

  • Most eukaryots (incl. humans and S. pombe)

– Origin sequences have certain characteristics – Fire randomly with some “efficiency”

  • N. Rind, “DNA replication timing: random thoughts about origin firing”,

Nature cell biology, 8(12), pp. 1313‐1316, December 2006

slide-25
SLIDE 25

Model data

  • Split genome into pieces

– Chromosomes – May have to split further

  • For each piece need:

– Length in bases – # of potential origins of replication (n) – p(x) p.d.f. of origin positions on genome – λ(x) firing rate of origin at position x – v(x) forking speed at position x

slide-26
SLIDE 26

Stochastic terms

  • Extract origin positions
  • Extract firing time, Ti, of origin i

P{Ti > t} = eàõ(Xi)t

Xi ø p(x), i = 1, . . ., n Xi xi‐ xi+ Xi+1

slide-27
SLIDE 27

Different “modes”

PreR RB RR RL PostR PassR

Origin i

slide-28
SLIDE 28

Discrete dynamics (origin i)

PreRi RBi RLi RRi PassRi Guards depend on

  • Ti, xi+, xi‐
  • xi‐1+, xi+1‐

PostRi

slide-29
SLIDE 29

Continuous dynamics (origin i)

  • Progress of forking process
  • P. Kouretas, K. Koutroumpas, J. Lygeros, and Z. Lygerou, “Stochastic

hybrid modeling of biochemical processes,” in Stochastic Hybrid Systems (C. Cassandras and J. Lygeros, eds.), no. 24 in Control Engineering, pp. 221–248, Boca Raton: CRC Press, 2006

x ç +

i =

v(Xi + x+

i )

if q(i) ∈ {RB, RR}

  • therwise

(

x ç à

i =

v(Xi à xà

i )

if q(i) ∈ {RB, RL}

  • therwise

(

slide-30
SLIDE 30

Fission yeast model

  • Instantiate: Schizzosacharomyces pombe

– Fully sequenced [Bahler et.al.] – ~12 Mbases, in 3 chromosomes – Exclude

  • Telomeric regions of all chromosomes
  • Centromeres of chromosomes 2 & 3

– 5 DNA segments to model

  • Remaining data from experiments

– C. Heichinger & P. Nurse

  • C. Heichinger, C.J. Penkett, J. Bahler, P. Nurse, “Genome

wide characterization of fission yeast DNA replication

  • rigins”, EMBO Journal, vol. 25, pp. 5171-5179, 2006
slide-31
SLIDE 31

Experimental data input

  • 863 origins
  • Potential origin locations known, p(x) trivial
  • “Efficiency”, FPi, for each origin, i

– Fraction of cells where origin observed to fire – Firing probability – Assuming 20 minute nominal S‐phase

  • Fork speed constant, v(x)=3kbases/minute

FPi = R

20 õieàõitdt ⇒ õi = à 20 ln(1àFPi)

slide-32
SLIDE 32

Simulation

  • Piecewise Deterministic Process [Davis]
  • Model size formidable

– Up to 1726 continuous states – Up to 6863 discrete states

  • Monte‐Carlo simulation in Matlab

– Model probabilistic, each simulation different – Run 1000 simulations, collect statistics

  • Check statistical model predictions against

independent experimental evidence

– S. phase duration – Number of firing origins

slide-33
SLIDE 33

Example runs

Created by

  • K. Koutroumpas
slide-34
SLIDE 34

MC estimate: efficiency

Close to experimental

slide-35
SLIDE 35

MC estimate: S‐phase duration

Empirical: 19 minutes!

slide-36
SLIDE 36

MC estimate: Max inter‐origin dist.

Random gap problem

slide-37
SLIDE 37

Possible explanations

  • Efficiencies used in model are wrong

– System identification to match efficiencies – Not a solution, something will not fit

  • Speed approximation inaccurate

– “Filtering” of raw experimental data – Not a solution, something will not fit

  • Inefficient origins play important role

– Motivation for bioinformatic study – AT content, asymmetry, inter‐gene, … – Also chromatin structure – Not a solution

slide-38
SLIDE 38

Possible explanations (not!)

Increasing efficiency Increasing fork speed

slide-39
SLIDE 39

Possible explanations

  • DNA replication continues into G2 phase

– Circumstantial evidence S phase may be longer – Use model to guide DNA combing experiments

200 400 600 800 1000 50 100 150 200 250 300 Distribution of ORIs that end replication after 95% of the total replication ORIs of Chr1 ORIs of Chr2 ORIs of Chr3 Iterations

slide-40
SLIDE 40

Possible explanations

  • Firing propensity

redistribution

– Limiting “factor” binding to potential origins – Factor released on firing or passive replication – Can bind to pre‐replicating

  • rigins

– Propensity to fire increases in time

Factor x

slide-41
SLIDE 41

Firing propensity redistribution

slide-42
SLIDE 42

Re‐replication

Created by

  • K. Koutroumpas
slide-43
SLIDE 43

Outline

  • 1. Hybrid and stochastic hybrid systems
  • 2. Reachability & randomized methods
  • 3. DNA replication

– DNA replication in the cell cycle – A stochastic hybrid model – Simulation results for the fission yeast – Analysis

  • 4. Summary
slide-44
SLIDE 44

Concluding remarks

  • DNA replication in cell cycle

– Develop SHS model based on biological intuition & experimental data – Code model for specific organism and simulate – Exposed gaps in intuition – Suggested new questions and experiments

  • Simple model gave rise to many studies

– System identification for efficiencies, filtering for fork speed estimation, bioinformatics origin selection criteria – DNA combing to detect G2 replication – Theoretical analysis – Extensions: re‐replication

  • Promote understanding, e.g.

– Why do some organisms prefer deterministic origin positions?