Bayesian Emulation and Calibration of a Dynamic Epidemic Model for - - PowerPoint PPT Presentation

bayesian emulation and calibration of a dynamic epidemic
SMART_READER_LITE
LIVE PREVIEW

Bayesian Emulation and Calibration of a Dynamic Epidemic Model for - - PowerPoint PPT Presentation

Bayesian Emulation and Calibration of a Dynamic Epidemic Model for H1N1 Influenza Marian Farah 1 Paul Birrell 1 , Stefano Conti 2 , Daniela De Angelis 1 , 2 1 MRC Biostatistics Unit, Cambridge, UK 2 Health Protection Agency, London, UK ICERM


slide-1
SLIDE 1

Bayesian Emulation and Calibration of a Dynamic Epidemic Model for H1N1 Influenza

Marian Farah1

Paul Birrell1, Stefano Conti2, Daniela De Angelis1,2

1MRC Biostatistics Unit, Cambridge, UK 2Health Protection Agency, London, UK

ICERM Bayesian Nonparametrics Workshop September 19, 2012

slide-2
SLIDE 2

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Motivation

  • Tracking and predicting the behavior of an emerging

epidemic is essential for a prompt public health response.

  • Inferential goals:
  • What is happening? i.e., real-time estimation of the

epidemic parameters.

  • What is going to happen next? i.e., forecasting the

(short-term) evolution of the epidemic.

  • What happened? i.e., “reconstructing” the epidemic by

estimating its parameters and evolution dynamics.

  • Noisy time-series data coming from different sources.
slide-3
SLIDE 3

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

1 Introduction: Epidemic modeling 2 Emulation and calibration of epidemic models 3 Preliminary results 4 Discussion

slide-4
SLIDE 4

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Introduction

slide-5
SLIDE 5

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Epidemic modeling

  • Transmission model:

S(t)

getting infected

→ E(t)

latent period

→ I(t)

infectious period

→ R(t)

  • Transmission depends on the virulence, the mixing

patterns in the population, and the transition rates among the S, E, I, and R states.

  • Transmission dynamics are typically described by a system
  • f differential equations.
slide-6
SLIDE 6

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Birrell et al. (2011) H1N1 model

S(t)

η1,η2

− → E(t)

η3

− → I(t)

η4

− → R(t)

slide-7
SLIDE 7

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Birrell et al. (2011) H1N1 model

S(t)

η1,η2

− → E(t)

η3

− → I(t)

η4

− → R(t)

η5 ↓ incubation

Expected # of symptomatic individuals

η6 ↓ propensity to consult doctor

Expected # of doctor consultations

↓ delay in reporting

Expected # of reported cases, µ(η, t)

slide-8
SLIDE 8

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Birrell et al. (2011) H1N1 model

S(t)

η1,η2

− → E(t)

η3

− → I(t)

η4

− → R(t)

η5 ↓ incubation

Expected # of symptomatic individuals

η6 ↓ propensity to consult doctor

Expected # of doctor consultations

↓ delay in reporting

Expected # of reported cases, µ(η, t)

  • η = (η1, . . . , η6): underlying parameters of the epidemic.
  • Proportion of symptomatic cases, propensity to consult,

exponential growth rate, expected infectious period, a measure of the initial number of infected individuals, population interaction parameters.

slide-9
SLIDE 9

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Computational challenge

  • The likelihood of reported data, z(t), t = 1, . . . , T,

depends on µ.

  • p(η | z{1:T}, µ) ∝

T

  • t=1

p

  • z(t); µ(η, t)
  • × p(η)
  • µ(η, t) must be computed at every MCMC iteration.
  • µ is computationally expensive.
slide-10
SLIDE 10

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Computational challenge

  • The likelihood of reported data, z(t), t = 1, . . . , T,

depends on µ.

  • p(η | z{1:T}, µ) ∝

T

  • t=1

p

  • z(t); µ(η, t)
  • × p(η)
  • µ(η, t) must be computed at every MCMC iteration.
  • µ is computationally expensive.
  • What about an efficient estimate?
slide-11
SLIDE 11

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Computer simulator

specify inputs η = X run code

  • utputs

X =      x1,1 . . . x1,6 x2,1 . . . x2,6 . . . . . . . . . xn,1 . . . xn,6      →

Birrell et al. (2011) →

µ(x1, 1), . . . , µ(x1, T) µ(x2, 1), . . . , µ(x2, T) µ(x2, t) . . . µ(xn, 1), . . . , µ(xn, T)

slide-12
SLIDE 12

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Computer simulator

specify inputs η = X run code

  • utputs

X =      x1,1 . . . x1,6 x2,1 . . . x2,6 . . . . . . . . . xn,1 . . . xn,6      →

Birrell et al. (2011) →

µ(x1, 1), . . . , µ(x1, T) µ(x2, 1), . . . , µ(x2, T) µ(x2, t) . . . µ(xn, 1), . . . , µ(xn, T)

50 100 150 200 250 2 4 6 8 10 12x 10

4

time µ(η,t)

50 100 150 200 250 2 4 6 8 10 12

time log µ(η,t)

slide-13
SLIDE 13

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Calibration and Emulation

  • Calibration: (e.g., Higdon et al., 2004)

Posterior inference for η through the simulator, µ, and “field” observed data z(t), Observed = Reality + Error Observed = Simulator + bias + Error z↑ µ↑ b↑

  • p(η, b | z{1:T}, µ) ∝

T

  • t=1

p

  • z(t); µ(η, t) + b
  • × p(η)p(b)
  • Emulation: (e.g., Kennedy and O’Hagan, 2000)

Estimating a slow computer simulator output, µ, using fast statistical model (an emulator), say ˆ µ.

slide-14
SLIDE 14

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Calibration and Emulation

  • Idea: (e.g., Bayarri et al., 2007a)

Replace the slow simulator output, µ, with the fast emulator estimation, ˆ µ, and obtain posterior inference for η through

  • p(η, b | z{1:T}, ˆ

µ) ∝

T

  • t=1

p

  • z(t); ˆ

µ(η, t) + b

  • × p(η)p(b)
slide-15
SLIDE 15

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Emulation and calibration of dynamic models

slide-16
SLIDE 16

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Emulation review

  • A deterministic computer simulator is a function f (·)

that maps input x to a unique output y = f (x).

  • The function f (·) is treated as unknown and given a prior.
  • Likelihood: data are runs of the simulator, given a

design over the input space, e.g., Latin Hypercube.

  • Emulator: the posterior (predictive) distribution of f (·).
slide-17
SLIDE 17

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

The Gaussian process

y(x) ∼ GP( m(x), v c(x, x′) )

m(·), v, and c(·, ·) are the mean, variance, & correlation function (e.g., Neal 1998; Rasmussen & Williams 2006).

−3 −2 −1 1 2 3 −5 −4 −3 −2 −1 1 2 3 4 5 6

x y

slide-18
SLIDE 18

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

2 4 6 8 −2 2 4 6 8 10

input

  • uput

20

  • utput function

f (x) = x + 3sin(x/2)

slide-19
SLIDE 19

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

2 4 6 8 −2 2 4 6 8 10

input

  • uput
  • utput function

simulator data prior realizations

slide-20
SLIDE 20

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

2 4 6 8 −2 2 4 6 8 10

input

  • uput
  • utput function

simulator data prior realizations 95% posterior region

slide-21
SLIDE 21

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

2 4 6 8 −2 2 4 6 8 10

input

  • uput
  • utput function

simulator data prior realizations 95% posterior region

slide-22
SLIDE 22

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Calibration review

  • Simulator: Specify x → f (x).
  • For x = η, f (η) simulates a physical system.
  • η is uncertain.
  • Calibration: solving the inverse-problem, i.e., η | z, f (·).
  • If f (·) is computationally expensive, it is emulated.
  • Priors for η and f (·).
  • Likelihood: data come from field observations and

simulator runs.

slide-23
SLIDE 23

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

2 4 6 8 −2 2 4 6 8 10

input

  • uput

6 8

  • utput function

simulator data field data

slide-24
SLIDE 24

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

2 4 6 8 −2 2 4 6 8 10

input

  • uput

6 8

  • utput function

simulator data field data z ∼ N(f (η), σ2 = 0.32) η ∼ N(2, 0.052)

slide-25
SLIDE 25

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Toy example

1 2 3 4 5 2 4 6 8

η density

truth prior posterior

  • Assuming σ2 is known.
slide-26
SLIDE 26

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • yt(xi) = f (xi, t) is the simulator output at input point xi

and time t. x1 − → y1(x1), y2(x1), . . . , yT(x1) x2 − → y1(x2), y2(x2), . . . , yT(x2) . . . . . . . . . . . . xn − → y1(xn), y2(xn), . . . , yT(xn)

slide-27
SLIDE 27

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • yt(xi) = f (xi, t) is the simulator output at input point xi

and time t. x1 − → y1(x1), y2(x1), . . . , yT(x1) x2 − → y1(x2), y2(x2), . . . , yT(x2) . . . . . . . . . . . . xn − → y1(xn), y2(xn), . . . , yT(xn)

  • Need to model three types of interdependencies:

1 over the input space. 2 over time within each time series. 3 across series of different input points.

slide-28
SLIDE 28

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • Modeling dependence over the input space alone

Typically using a Gaussian process prior for outputs. y(x) ∼ GP( m(x), v c(x, x′) )

slide-29
SLIDE 29

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • Modeling dependence over the input space alone

Typically using a Gaussian process prior for outputs. y(x) ∼ GP( m(x), v c(x, x′) )

  • Modeling dependence for a single time series

Typically, TVAR(p) model is used; e.g., p = 1, yt(x) = φt yt−1(x) + ǫt(x), ǫt(x) ∼ N(0, vt), φt = φt−1 + ωt, ωt ∼ N(0, wt).

slide-30
SLIDE 30

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • Linking across time series for different inputs using a

multivariate TVAR(p) model (Liu and West, 2009),

   yt (x1) . . . yt (xn)    =    yt−1 (x1) · · · yt−p (x1) . . . ... . . . yt−1 (xn) · · · yt−p (xn)       φ1,t . . . φp,t    +    ǫt(x1) . . . ǫt(xn)   

slide-31
SLIDE 31

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • Linking across time series for different inputs using a

multivariate TVAR(p) model (Liu and West, 2009),

   yt (x1) . . . yt (xn)    =    yt−1 (x1) · · · yt−p (x1) . . . ... . . . yt−1 (xn) · · · yt−p (xn)       φ1,t . . . φp,t    +    ǫt(x1) . . . ǫt(xn)   

Cov

  • ǫt(xi), ǫt(xj)
  • = vt c(xi, xj)
  • c(xi, xj) is the (i, j) element in the n × n correlation

matrix induced by the Gaussian process.

slide-32
SLIDE 32

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • Linking across time series for different inputs using a

multivariate TVAR(p) model (Liu and West, 2009),

   yt (x1) . . . yt (xn)    =    yt−1 (x1) · · · yt−p (x1) . . . ... . . . yt−1 (xn) · · · yt−p (xn)       φ1,t . . . φp,t    +    ǫt(x1) . . . ǫt(xn)   

Cov

  • ǫt(xi), ǫt(xj)
  • = vt c(xi, xj)
  • c(xi, xj) is the (i, j) element in the n × n correlation

matrix induced by the Gaussian process.

  • φt = φt−1 + ωt, where φt = (φ1t, . . . , φpt)′.
slide-33
SLIDE 33

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Birrell et al. (2011) simulator

0.2 0.4 0.6 0.8 1 2 4 6 8 10 12

t = 20 x2 log µ

0.2 0.4 0.6 0.8 1 2 4 6 8 10 12

t = 40 x2 log µ

0.2 0.4 0.6 0.8 1 2 4 6 8 10 12

t = 85 x2 log µ

0.2 0.4 0.6 0.8 1 2 4 6 8 10 12

t = 140 x2 log µ

  • x2: Exponential growth rate.
slide-34
SLIDE 34

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Birrell et al. (2011) simulator

50 100 150 200 250 2 4 6 8 10 12

time log µ(η,t)

slide-35
SLIDE 35

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Dynamic emulation

  • Extending Liu and West (2009)
  • Modeling input-dependent trends:

yt(x) = φt yt−1(x) + h(x)βt + ǫt

  • Modeling systematic temporal trend:

yt(x) = θt + φt yt−1(x) + h(x)βt + ǫt   θt φt βt   =   θt−1 φt−1 βt−1   +   ω1t ω2t ω3t  

  • Posterior inference through Forward-Filtering

Backward-Sampling.

slide-36
SLIDE 36

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Calibration

  • Two sources of data:
  • Simulator data: Ds = {(yt, x); t = 1, . . . , T}. Model

parameters are specified as inputs x.

  • “Field” observed epidemic data DF = {zt; t = 1, . . . , T}.

Model parameters, η, are unknown.

  • Two-stage calibration (e.g., Bayarri et al., 2007b)
  • Stage 1: Estimate the emulator model parameters using
  • nly Ds.
  • Stage 2: Model zt using a parametric distribution centered
  • n the emulator model. Then, conditional on

stage 1, estimate p(η | DF, Ds).

slide-37
SLIDE 37

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Results

slide-38
SLIDE 38

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Validating the emulator

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

50 100 150 200 250 −5 5 10 15

time log µ

  • Simulation runs (black), emulator’s median & 95% region (red).
  • Plots based on a MVTVAR(1) and Gaussian correlation function.
slide-39
SLIDE 39

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Calibration

  • Generated synthetic epidemic data.
  • Set η = η0. Then, z ∼Poisson
  • µ(η0, t)
  • 50

100 150 200 250 −2 2 4 6 8

time log observation

slide-40
SLIDE 40

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Calibration

0.2 0.4 0.6 0.8 1 0.5 1 1.5 2 2.5

η1

0.2 0.4 0.6 0.8 1 1 2 3 4 5 6

η2

0.2 0.4 0.6 0.8 1 0.5 1 1.5 2 2.5 3

η3

0.2 0.4 0.6 0.8 1 0.5 1 1.5 2 2.5 3 3.5

η4

0.2 0.4 0.6 0.8 1 1 2 3 4 5

η5

0.2 0.4 0.6 0.8 1 0.5 1 1.5 2

η6

  • Truth

— Prior

  • η2 is exponential growth rate, and η5 is effect of summer

holiday on population interaction.

slide-41
SLIDE 41

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

Discussion

slide-42
SLIDE 42

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

  • What we have done:
  • Estimation of epidemic dynamics by combining a

statistical emulator with reported epidemic data.

  • Dynamic emulation through modeling dependencies

across time and epidemic parameter space.

  • Still to do:
  • Consider different age groups in the population.
  • Incorporate additional sources of information.
  • Real-time calibration and forecasting using epidemic data.
  • . . .
slide-43
SLIDE 43

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

  • What we have done:
  • Estimation of epidemic dynamics by combining a

statistical emulator with reported epidemic data.

  • Dynamic emulation through modeling dependencies

across time and epidemic parameter space.

  • Still to do:
  • Consider different age groups in the population.
  • Incorporate additional sources of information.
  • Real-time calibration and forecasting using epidemic data.
  • . . .

Thank you!

slide-44
SLIDE 44

Dynamic Bayesian modeling for epidemics Marian Farah Biostatistics Cambridge Outline Introduction Methods Results Discussion

References:

  • Bayarri, M., Berger, J., Paulo, R., Sacks, J., Cafeo, J., Cavendish, J., Lin,

C., and Tu, J. (2007a), “A framework for validation of computer models,” Technometrics, 49, 138–154.

  • Bayarri, M. J., Berger, J. O., Cafeo, J., Garcia-Donato, G., Liu, F., Palomo,

J., Parthasarathy, R., Paulo, R., Sacks, J., and Walsh, D. (2007b), “Computer model validation with functional output,” Annals of Statistics, 35, 1874–1906.

  • Birrell, P. J., Ketsetzis, G., Gay, N. J., Cooper, B. S., Presanis, A. M.,

Harris, R. J., Charlett, A., Zhang, X.-S., White, P. J., Pebody, R. G., and De Angelis, D. (2011), “Bayesian modeling to unmask and predict infuenza A/H1N1pdm dynamics in London,” Proceedings of the National Academy

  • f Sciences.
  • Kennedy, M. C. and O’Hagan, A. (2000), “Predicting the output from a

complex computer code when fast approximations are available,” Biometrika, 87, 1–13.

  • Liu, F. and West, M. (2009), “A Dynamic Modelling Strategy for Bayesian

Computer Model Emulation,” Bayesian Analysis, 4, 393–412.