Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic - - PowerPoint PPT Presentation

multilevel hybrid split step implicit tau leap for
SMART_READER_LITE
LIVE PREVIEW

Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic - - PowerPoint PPT Presentation

Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic Reaction Networks (SRNs) Chiheb Ben Hammouda, Alvaro Moraes and Ra ul Tempone DCSE Fall School on ROM and UQ, November 48, 2019 1/24 Outline 1 Introduction 2 Multilevel Hybrid


slide-1
SLIDE 1

Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic Reaction Networks (SRNs)

Chiheb Ben Hammouda, Alvaro Moraes and Ra´ ul Tempone DCSE Fall School on ROM and UQ, November 4–8, 2019

1/24

slide-2
SLIDE 2

Outline

1 Introduction 2 Multilevel Hybrid Split Step Implicit Tau-Leap (SSI-TL)

Estimator

3 Numerical Results 2/24

slide-3
SLIDE 3

SRNs applications: epidemic processes [Anderson and Kurtz, 2015] and virus kinetics [Hensel et al., 2009],. . .

Biological Models In-vivo population control: the expected number of proteins.

Figure 1.1: DNA transcription and mRNA translation [Briat et al., 2015]

Chemical reactions the expected number of molecules.

Figure 1.2: Chemical reaction network [Briat et al., 2015]

3/24

slide-4
SLIDE 4

Statement of the Forward Problem

A stochastic reaction network (SRN) is a continuous-time Markov Chain X defined on a probability space (Ω, F, P) X = (X1, . . . , Xd) : [0, T] × Ω → Zd

+

described by J reactions channels, Rj := (νj, aj), where νj ∈ Zd

+: stoichiometric vector (state change vector)

aj : Rd

+ → R+: Propensity (jump intensity) functions such that

P

  • X(t + ∆t) = x + νj
  • X(t) = x
  • = aj(x)∆t + o (∆t) , j = 1, . . . , J.

(1) Goal: Given i) an initial state X0 = x0, ii) a smooth scalar observable of X, g : Rd → R, iii) a user-selected tolerance, TOL, and iv) a confidence level 1 − α close to 1, provide accurate estimator ˆ Q of E [g(X(T))] such that P

  • E [g(X(T))] − ˆ

Q

  • < TOL
  • > 1 − α,

(2) with near-optimal expected computational work and for a class of systems characterized by having simultaneously fast and slow time scales (Stiff systems).

4/24

slide-5
SLIDE 5

Statement of the Forward Problem

A stochastic reaction network (SRN) is a continuous-time Markov Chain X defined on a probability space (Ω, F, P) X = (X1, . . . , Xd) : [0, T] × Ω → Zd

+

described by J reactions channels, Rj := (νj, aj), where νj ∈ Zd

+: stoichiometric vector (state change vector)

aj : Rd

+ → R+: Propensity (jump intensity) functions such that

P

  • X(t + ∆t) = x + νj
  • X(t) = x
  • = aj(x)∆t + o (∆t) , j = 1, . . . , J.

(1) Goal: Given i) an initial state X0 = x0, ii) a smooth scalar observable of X, g : Rd → R, iii) a user-selected tolerance, TOL, and iv) a confidence level 1 − α close to 1, provide accurate estimator ˆ Q of E [g(X(T))] such that P

  • E [g(X(T))] − ˆ

Q

  • < TOL
  • > 1 − α,

(2) with near-optimal expected computational work and for a class of systems characterized by having simultaneously fast and slow time scales (Stiff systems).

4/24

slide-6
SLIDE 6

Simulation of SRNs

Pathwise-Exact methods:

◮ Stochastic simulation algorithm (SSA) [Gillespie, 1976]. ◮ Modified next reaction algorithm (MNRA) [Anderson, 2007].

Pathwise-approximate methods:

◮ Explicit tau-leap [Gillespie, 2001, J. Aparicio, 2001]. ◮ Drift-implicit tau-leap (explicit-TL) [Rathinam et al., 2003b]. ◮ Split step implicit tau-leap (SSI-TL)[Ben Hammouda et al., 2016]. 5/24

slide-7
SLIDE 7

Motivation and Contribution

MLMC estimator using explicit TL [Anderson and Higham, 2012]. In systems characterized by having simultaneously fast and slow time scales, exact methods and the explicit-TL method can be very slow and numerically unstable. We propose in [Ben Hammouda et al., 2016] an efficient MLMC method that uses SSI-TL approximation at levels where the explicit-TL method is not applicable due to numerical stability issues.

6/24

slide-8
SLIDE 8

The Explicit-TL Method [Gillespie, 2001, J. Aparicio, 2001]

From Kurtz’s random time-change representation X(t) = x0 +

J

  • j=1

Yj t

t0

λj(X(s))ds

  • νj,

(3) where Yj are independent unit-rate Poisson processes, we obtain the explicit-TL method (kind of forward Euler approximation): Given Zexp(t) = z ∈ Zd

+,

Zexp(t + τ) = z +

J

  • j=1

Pj   aj(z)τ

λj

   νj, Pj(λj) are independent Poisson random variables with rate λj. Caveat: The explicit-TL is not adequate when dealing with stiff problems ( Numerical stability ⇒ τ exp

threshold ≪ 1.)

7/24

slide-9
SLIDE 9

Split Step Implicit Tau-Leap (SSI-TL) Method I

The explicit-TL scheme, where z = Zexp(t), can be rewritten as follows: Zexp(t + τ) = z +

J

  • j=1

Pj (aj(z)τ) νj = z +

J

  • j=1

(Pj (aj(z)τ) − aj(z)τ + aj(z)τ) νj = z +

J

  • j=1

aj(z)τνj

  • drift

+

J

  • j=1

(Pj (aj(z)τ) − aj(z)τ) νj

  • zero-mean noise

. (4)

8/24

slide-10
SLIDE 10

Split Step Implicit Tau-Leap (SSI-TL) Method II

The idea of SSI-TL method is to take only the drift part as implicit while the noise part is left explicit. Let us define z = Zimp(t) and define Zimp(t + τ) through the following two steps: y = z +

J

  • j=1

aj (y) τνj (Drift-Implicit step) (5) Zimp(t + τ) = y +

J

  • j=1

(Pj(aj(y)τ) − aj(y)τ) νj = z +

J

  • j=1

Pj(aj(y)τ)νj (Tau-leap step)

9/24

slide-11
SLIDE 11

Multilevel Monte Carlo [Giles, 2008]

Goal: Reduce the variance of the standard Monte Carlo estimator. A hierarchy of nested meshes of the time interval [0, T], indexed by ℓ = 0, 1, . . . , L. hℓ = M−ℓh0: The size of the subsequent time steps for levels ℓ ≥ 1 , where M>1 is a given integer constant and h0 the step size used at level ℓ = 0. Zℓ : The approximate process generated using a step size of hℓ. Consider now the following telescoping decomposition of E [g(ZL(T))]: E [g(ZL(T))] = E [g(Z0(T))] +

L

  • ℓ=1

E [g(Zℓ(T)) − g(Zℓ−1(T))] . (6) By defining          ˆ Q0 :=

1 N0 N0

  • n0=1

g(Z0,[n0](T)) ˆ Qℓ :=

1 Nℓ Nℓ

  • nℓ=1
  • g(Zℓ,[nℓ](T)) − g(Zℓ−1,[nℓ](T))
  • ,

(7) we arrive at the unbiased MLMC estimator, ˆ Q, of E [g(ZL(T))]: ˆ Q :=

L

  • ℓ=0

ˆ Qℓ. (8)

10/24

slide-12
SLIDE 12

Multilevel Hybrid SSI-TL I

Our multilevel hybrid SSI-TL estimator is defined as: ˆ Q := ˆ QLimp

c

+

Lint−1

  • ℓ=Limp

c

+1

ˆ Qℓ + ˆ QLint +

L

  • ℓ=Lint+1

ˆ Qℓ, (9) where                              ˆ QLimp

c

:=

1 Ni,Limp

c

Ni,Limp

c

  • n=1

g(Zimp

Limp

c

,[n](T))

ˆ Qℓ :=

1 Nii,ℓ Nii,ℓ

  • nℓ=1
  • g(Zimp

ℓ,[nℓ](T)) − g(Zimp ℓ−1,[nℓ](T))

  • , Limp

c

+ 1 ≤ ℓ ≤ Lint − 1 ˆ QLint :=

1 Nie,Lint Nie,Lint

  • n=1
  • g(Zexp

Lint,[n](T)) − g(Zimp Lint−1,[n](T))

  • ˆ

Qℓ :=

1 Nee,ℓ Nee,ℓ

  • nℓ=1
  • g(Zexp

ℓ,[nℓ](T)) − g(Zexp ℓ−1,[nℓ](T))

  • , Lint + 1 ≤ ℓ ≤ L.

(10)

11/24

slide-13
SLIDE 13

Coupling (Idea) [Kurtz, 1982, Anderson and Higham, 2012]

To couple two Poisson random variables, P1(λ1), P2(λ2), with rates λ1 and λ2, respectively, we define λ⋆ := min{λ1, λ2} and we consider the decomposition

  • P1(λ1) := Q(λ⋆) + Q1(λ1 − λ⋆)

P2(λ2) := Q(λ⋆) + Q2(λ2 − λ⋆) where Q(λ⋆), Q1(λ1 − λ⋆) and Q2(λ2 − λ⋆) are three independent Poisson random variables. We have small variance between the coupled processes

Var [P1(λ1) − P2(λ2)] = Var

  • Q1(λ1 − λ⋆) − Q2(λ2 − λ⋆)
  • =| λ1 − λ2 | .

Observe: If P1(λ1) and P2(λ2) are independent, then, we have a larger variance Var [P1(λ1) − P2(λ2)] = λ1 + λ2.

12/24

slide-14
SLIDE 14

Multilevel Hybrid SSI-TL II

ˆ Q := ˆ QLimp

c

+

Lint−1

  • ℓ=Limp

c

+1

ˆ Qℓ + ˆ QLint +

L

  • ℓ=Lint+1

ˆ Qℓ, (11)

1 Limp

c

, the coarsest discretization level.

2 Lint, the interface level. 3 L, the finest discretization level. 4 N := {Ni,Limp c

, {Nii,ℓ}Lint−1

ℓ=Limp

c

+1, Nie,Lint, {Nee,ℓ}L ℓ=Lint+1}, the number of

samples per level.

13/24

slide-15
SLIDE 15

Estimation procedure I

Coarsest discretization level, Limp

c

, is determined by the numerical stability constraint of our MLMC estimator, two conditions must be satisfied: The stability of a single path: determined by a linearized stability analysis of the backward Euler method applied to the deterministic ODE model corresponding to our system. The stability of the variance of the coupled paths of our MLMC estimator: expressed by Var

  • g(ZLimp

c

+1)−g(ZLimp

c

)

  • ≪ Var
  • g(ZLimp

c

)

  • .

14/24

slide-16
SLIDE 16

Estimation procedure II

Total number of levels, L, and the set of the number of samples per level, N, are selected to satisfy the accuracy constraint given by (2), with near-optimal expected computational work. As a result, the MLMC algorithm should bound the bias and the statistical error as follows [Collier et al., 2014]:

  • E
  • g(X(T)) − ˆ

Q

  • ≤ (1 − θ) TOL,

(12) Var

  • ˆ

Q

θ TOL Cα 2 (13) for some given confidence parameter, Cα, such that Φ(Cα) = 1 − α/2 ; here, Φ is the cumulative distribution function of a standard normal random variable.

15/24

slide-17
SLIDE 17

Estimation procedure III

The finest discretization level, L, is determined by satisfying relation (12) for θ = 1

2, implying

|Bias(L) := E [g(X(T)) − g(ZL(T))]| < TOL 2 . (14) In our numerical experiments, we use the following approximation (see [Giles, 2008]) Bias(L) ≈ E [g(ZL(T)) − g(ZL−1(T))] . (15)

16/24

slide-18
SLIDE 18

Estimation procedure IV

The interface level, Lint and optimal number of samples per level, N:

1 The first step is to solve (16), for a fixed value of the interface level, Lint

       min

N WLint(N)

s.t. Cα

  • L
  • ℓ=Limp

c

N−1

Vℓ ≤ TOL

2 ,

(16) where Vℓ = Var [g(Zℓ(T)) − g(Zℓ−1(T)] is estimated by the extrapolation

  • f the sample variances obtained from the coarsest levels (due to the

presence of large kurtosis) and WLint is the expected computational cost of the MLMC estimator.

2 Let us denote N∗(Lint) as the solution of (16). Then, the optimal value of

the switching parameter, Lint, should be chosen to minimize the expected computational work; that is, the value Lint∗ that solves    min

Lint WLint(N∗(Lint))

s.t. Lexp

c

≤ Lint ≤ L.

17/24

slide-19
SLIDE 19

Multilevel Hybrid SSI-TL: Results I

This example was studied in [Rathinam et al., 2003a] and is given by the following reaction set S1

c1

→ S3, S3

c2

→ S1 S1 + S2

c3

→ S1 + S4. (17) The corresponding propensity functions are a1(X) = c1X1, a2(X) = c2(K − X1), a3(X) = c3X1X2. where K = X1 + X3. c = (c1, c2, c3) = (105, 105, 5 × 10−3), X(0) = (104, 102) and K = 2.104. We are interested in approximating E [X2(T)], where T = 0.01 seconds. This setting implies that the stability limit of the explicit-TL is τ lim

exp ≈ 10−5.

17/24

slide-20
SLIDE 20

Multilevel Hybrid SSI-TL: Results II

Method / TOL 0.02 0.01 0.005 Multilevel explicit-TL 890 (9) 5300 (45) 2.2e+04 (96) Multilevel SSI-TL 240 (0.8) 110 (3) 5.3e+02 (7) W exp

MLMC/W SSI MLMC

37.04 47.33 41.27

Table 1: Comparison of the expected total work for the different methods (in seconds) using 100 multilevel runs. (·) refers to the standard deviation. The last row shows that the multilevel SSI-TL estimator outperforms by about 40 times the multilevel explicit-TL estimator in terms of computational work.

18/24

slide-21
SLIDE 21

Multilevel Hybrid SSI-TL: Results III

10

−3

10

−2

10

−1

10 10

2

10

4

10

6

10

8

Comparison of the expected total work for the different methods for different values of tolerance TOL TOL Expected total work (W) (seconds)

Multilevel SSI−TL (Lc

imp=0)

Linear reference with slope −2.25 Multilevel explicit TL (L

c exp=11)

Linear reference with slope −2.37 MC SSI−TL Linear reference with slope −2.77 y=TOL−2 (log(TOL))2 y=TOL−3

Figure 3.1: Comparison of the expected total work for the different methods with different values of tolerance (TOL) using 100 multilevel runs. The computational work, W, of both multilevel SSI-TL and multilevel explit-TL methods is of O

  • TOL−2(log(TOL))2

compared to O

  • TOL−3

for the MC SSI-TL.

19/24

slide-22
SLIDE 22

Multilevel Hybrid SSI-TL: Results IV

10

−3

10

−2

10

−1

10

−4

10

−3

10

−2

10

−1

TOL Total error TOL vs. Global error 1 % 1 % 2 % 5 %

Figure 3.2: TOL versus the actual computational error. The numbers above the straight line show the percentage of runs that had errors larger than the required tolerance.

20/24

slide-23
SLIDE 23

Summary

Our proposed estimator is useful in systems with the presence of slow and fast timescales (stiff systems): obtained substantial gains with respect to the multilevel explit-TL [Anderson and Higham, 2012]. For large values of TOL, the multilevel SSI-TL method has the same order of computational work as does the multilevel explit-TL method (O

  • TOL−2 log(TOL)2

) [Anderson et al., 2014], but with a smaller constant.

21/24

slide-24
SLIDE 24

Future work

Implementing Hybridization techniques involving methods that deal with non-negativity of species [Moraes et al., 2015b] and adaptive methods introduced in [Karlsson and Tempone, 2011, Moraes et al., 2015a, Lester et al., 2015], which allows us to construct adaptive hybrid multilevel estimators by switching between SSI-TL, explicit-TL and exact SSA. Extending the τ-ROCK method [Abdulle et al., 2010] to the multilevel setting and compare it with the multilevel hybrid SSI-TL method. The main challenge of this task is to couple two consecutive paths based on the τ-ROCK method. Addressing spatial inhomogeneities described, for instance, by graphs and/or continuum volumes through the use of Multi-Index Monte Carlo technology [Haji-Ali et al., 2015].

22/24

slide-25
SLIDE 25

References I

Abdulle, A., Hu, Y., and Li, T. (2010). Chebyshev methods with discrete noise: the tau-rock methods.

  • J. Comput. Math, 28(2):195–217.

Anderson, D. and Higham, D. (2012). Multilevel Monte Carlo for continuous Markov chains, with applications in biochemical kinetics. SIAM Multiscal Model. Simul., 10(1). Anderson, D. F. (2007). A modified next reaction method for simulating chemical systems with time dependent propensities and delays. The Journal of Chemical Physics, 127(21):214107. Anderson, D. F., Higham, D. J., and Sun, Y. (2014). Complexity of multilevel Monte Carlo tau-leaping. 52(6):3106–3127.

23/24

slide-26
SLIDE 26

References II

Anderson, D. F. and Kurtz, T. G. (2015). Stochastic analysis of biochemical systems. Springer. Ben Hammouda, C., Moraes, A., and Tempone, R. (2016). Multilevel hybrid split-step implicit tau-leap. Numerical Algorithms, pages 1–34. Brauer, F. and Castillo-Chavez, C. (2011). Mathematical Models in Population Biology and Epidemiology (Texts in Applied Mathematics). Springer, 2nd edition. Briat, C., Gupta, A., and Khammash, M. (2015). A Control Theory for Stochastic Biomolecular Regulation.

  • Paris. SIAM.

24/24

slide-27
SLIDE 27

References III

Collier, N., Haji-Ali, A.-L., Nobile, F., von Schwerin, E., and Tempone, R. (2014). A continuation multilevel monte carlo algorithm. BIT Numerical Mathematics, 55(2):399–432. Giles, M. (2008). Multi-level Monte Carlo path simulation. Operations Research, 53(3):607–617. Gillespie, D. T. (1976). A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. Journal of Computational Physics, 22:403–434.

25/24

slide-28
SLIDE 28

References IV

Gillespie, D. T. (2001). Approximate accelerated stochastic simulation of chemically reacting systems. Journal of Chemical Physics, 115:1716–1733. Haji-Ali, A.-L., Nobile, F., and Tempone, R. (2015). Multi-index monte carlo: when sparsity meets sampling. Numerische Mathematik, pages 1–40. Hensel, S., Rawlings, J., and Yin, J. (2009). Stochastic kinetic modeling of vesicular stomatitis virus intracellular growth. Bulletin of Mathematical Biology, 71(7):1671–1692.

26/24

slide-29
SLIDE 29

References V

  • J. Aparicio, H. S. (2001).

Population dynamics: Poisson approximation and its relation to the langevin process. Physical Review Letters, page 4183. Karlsson, J. and Tempone, R. (2011). Towards automatic global error control: Computable weak error expansion for the tau-leap method. Monte Carlo Methods and Applications, 17(3):233–278. Kurtz, T. (1982). Representation and approximation of counting processes. In Advances in Filtering and Optimal Stochastic Control, volume 42 of Lecture Notes in Control and Information Sciences, pages 177–191. Springer Berlin Heidelberg.

27/24

slide-30
SLIDE 30

References VI

Lester, C., Yates, C. A., Giles, M. B., and Baker, R. E. (2015). An adaptive multi-level simulation algorithm for stochastic biological systems. The Journal of chemical physics, 142(2):024113. Moraes, A., Tempone, R., and Vilanova, P. (2015a). Multilevel adaptive reaction-splitting simulation method for stochastic reaction networks. preprint arXiv:1406.1989. Moraes, A., Tempone, R., and Vilanova, P. (2015b). Multilevel hybrid chernoff tau-leap. BIT Numerical Mathematics, pages 1–51.

28/24

slide-31
SLIDE 31

References VII

Rathinam, M., Petzold, L., Cao, Y., and Gillespie, D. T. (2003a). Stiffness in stochastic chemically reacting systems: the implicit tau-leaping method. Journal of Chemical Physics, 119(24):12784–12794. Rathinam, M., Petzold, L. R., Cao, Y., and Gillespie, D. T. (2003b). Stiffness in stochastic chemically reacting systems: The implicit tau-leaping method. The Journal of Chemical Physics, 119(24):12784–12794. Srivastava, R., You, L., Summers, J., and Yin, J. (2002). Stochastic vs. deterministic modeling of intracellular viral kinetics. Journal of Theoretical Biology, 218(3):309–321.

29/24

slide-32
SLIDE 32

Thank you for your attention

30/24