Particle Markov Chain Monte Carlo Methods in Marine Biogeochemistry - - PowerPoint PPT Presentation

particle markov chain monte carlo methods in marine
SMART_READER_LITE
LIVE PREVIEW

Particle Markov Chain Monte Carlo Methods in Marine Biogeochemistry - - PowerPoint PPT Presentation

Particle Markov Chain Monte Carlo Methods in Marine Biogeochemistry Lawrence Murray and Emlyn Jones CSIRO Mathematics, Informatics and Statistics Outline Case studies in marine biogeochemistry and generic challenges. Conventional


slide-1
SLIDE 1

Particle Markov Chain Monte Carlo Methods in Marine Biogeochemistry

Lawrence Murray and Emlyn Jones CSIRO Mathematics, Informatics and Statistics

slide-2
SLIDE 2

Lawrence Murray PMCMC in marine biogeochemistry : 2 of 41

Outline

  • Case studies in marine biogeochemistry and generic challenges.
  • Conventional state-space models
  • Collapsed state-space models (to deal with the absence of a

closed-form transition density)

  • The particle marginal Metropolis-Hastings (PMMH) sampler in

this context.

  • Results and implications.

Acknowledgements: Eddy Campbell (CSIRO), John Parslow (CSIRO), Nugzar Margvelashvili (CSIRO), Noel Cressie (Ohio State University).

slide-3
SLIDE 3

Lawrence Murray PMCMC in marine biogeochemistry : 3 of 41

Marine biogeochemical model

N

(DIN)

P

(Phytoplankton)

Z

(Zooplankton)

D

(Detritus)

Sea Surface Base of Mixed Layer Phytoplankton Growth Zooplankton Grazing Zooplankton Mortality Messy Feeding Remineralisation Mixing Sinking

slide-4
SLIDE 4

Lawrence Murray PMCMC in marine biogeochemistry : 4 of 41

Issues, issues, issues...

  • These models tend to have long memory and don’t mix well.
  • Available observations may be sparse.
  • The model may be strongly nonlinear, ...
  • ...it might even be chaotic.
  • The transition density is unlikely to have a closed form.
slide-5
SLIDE 5

Lawrence Murray PMCMC in marine biogeochemistry : 5 of 41

Conventional state-space model

Y2 YT ... U1 U2 Y1 XT UT X2 X0 X1 ... ...

slide-6
SLIDE 6

Lawrence Murray PMCMC in marine biogeochemistry : 6 of 41

Conventional state-space model

Sampling with sequential Monte Carlo (auxiliary particle filter)... For particle i at time t, extend xi

t−1 by drawing xi t ∼ q(xt) and

weight with:

wi

t = p(yt|xi t)p(xi t|xi t−1)

q(xi

t)

wi

t−1.

slide-7
SLIDE 7

Lawrence Murray PMCMC in marine biogeochemistry : 7 of 41

Conventional state-space model

Sampling with sequential Monte Carlo (auxiliary particle filter)... For particle i at time t, extend xi

t−1 by drawing xi t ∼ q(xt) and

weight with:

wi

t = p(yt|xi t)p(xi t|xi t−1)

q(xi

t)

wi

t−1.

We don’t have that!

slide-8
SLIDE 8

Lawrence Murray PMCMC in marine biogeochemistry : 8 of 41

Conventional state-space model

Y2 YT ... U1 U2 Y1 XT UT X2 X0 X1 ... ...

slide-9
SLIDE 9

Lawrence Murray PMCMC in marine biogeochemistry : 9 of 41

Collapsed state-space model

YT ... U1 U2 UT Y1 X0 Y2 ...

slide-10
SLIDE 10

Lawrence Murray PMCMC in marine biogeochemistry : 10 of 41

Collapsed state-space model

Sampling with sequential Monte Carlo (auxiliary particle filter)... For particle i at time t, extend ui

1:t−1 by drawing ui t ∼ q(ut) and

weight with:

wi

t = p(yt|ui 1:t, xi 0)p(ui t|ui 1:t−1, xi 0)

q(ui

t)

wi

t−1.

slide-11
SLIDE 11

Lawrence Murray PMCMC in marine biogeochemistry : 11 of 41

Collapsed state-space model

Sampling with sequential Monte Carlo (auxiliary particle filter)... For particle i at time t, extend ui

1:t−1 by drawing ui t ∼ q(ut) and

weight with:

wi

t = p(yt|ui 1:t, xi 0)p(ui t)

q(ui

t)

wi

t−1.

We do have that!

slide-12
SLIDE 12

Lawrence Murray PMCMC in marine biogeochemistry : 12 of 41

What should q(·) be?

  • PF0: bootstrap
  • PF1: as PF0 plus one-step single-pilot lookahead and resample.
  • MUPF0: UKF run offline, use time marginals pN(ut|y1:t) as

proposals.

  • MUPF1: as MUPF0 plus one-step single-pilot lookahead and

resample.

  • CUPF0: UKF conditioned on each particle, so use pN(ut|u1:t−1, y1:t).
  • CUPF1: as CUPF0 plus UKF lookahead and resample.

(UKF = Unscented Kalman Filter)

slide-13
SLIDE 13

Lawrence Murray PMCMC in marine biogeochemistry : 13 of 41

Particle marginal Metropolis-Hastings

The target (posterior) density is π(u1:T, x0, θ|y1:T), factorised as either:

π1(u1:T, x0|θ, y1:T)π2(θ|y1:T)

  • r

π1(u1:T|x0, θ, y1:T)π2(x0, θ|y1:T).

In either case π1 is targeted with an auxiliary particle filter, π2 with Metropolis-Hastings [Andrieu et al., 2010, Pitt et al., 2011]. The second factorisation should be preferred when prior information

  • ver x0 is scarce, so that importance sampling of it will be degenerate.
slide-14
SLIDE 14

Lawrence Murray PMCMC in marine biogeochemistry : 14 of 41

PZ model

Simple phytoplankton-zooplankton model [Jones et al., 2010]. Lotka-Volterra with stochastic growth rate and quadratic mortality term.

dP dt = αtP − cPZ dZ dt = ecPZ − mlZ − mqZ2 αt ∼ N(µ, σ). P is observed with noise. Simulated data used here.

slide-15
SLIDE 15

Lawrence Murray PMCMC in marine biogeochemistry : 15 of 41

PZ model: state estimate

1 2 3 4 5 6 7 20 40 60 80 100 P t Prior Posterior Observed Truth 0.5 1 1.5 2 2.5 20 40 60 80 100 Z

  • 2
  • 1

1 2 20 40 60 80 100 α t

slide-16
SLIDE 16

Lawrence Murray PMCMC in marine biogeochemistry : 16 of 41

NPZD model

The model features nine noise terms, ξi for i = 1, . . . , 9, each coupled to a univariate autoregressive process Bi.

Bi(t + ∆t) = Bi(t) · (1 − ∆t/τP) + (µi + PDF · σiξi) · ∆t/τP,

where ∆t is a discrete time step (one day), µi a parameter to be estimated, PDF a common diversity factor to be estimated, σi a prescribed scaling factor, and τP a common characteristic time scale, also prescribed.

slide-17
SLIDE 17

Lawrence Murray PMCMC in marine biogeochemistry : 17 of 41

NPZD model

A multiplicative temperature correction Tc is applied to all rate processes, for which a Q10 formulation for dependence on temperature, T , is used:

Tc = Q(T−Tref)/10

10

,

where Tref is a reference temperature, and Q10 a prescribed constant.

slide-18
SLIDE 18

Lawrence Murray PMCMC in marine biogeochemistry : 18 of 41

NPZD model

The zooplankton grazing rate (gr) is dependent on the phytoplankton concentration (zooplankton functional response):

gr = Tc · IZ · Aυ (1 + Aυ) ,

(1) where υ is a given power.

slide-19
SLIDE 19

Lawrence Murray PMCMC in marine biogeochemistry : 19 of 41

NPZD model

The relative availability of phytoplankton, A, is

A = ClZ · P IZ ; IZ is the maximum zooplankton ingestion rate (mg P per mg Z

per day); ClZ is the maximum clearance rate (volume in m3 swept clear per mg Z per day). For υ = 1, this takes the form of a Type-2 functional response (standard rectangular hyperbola) [Holling, 1966], and for υ > 1 a Type-3 sigmoid functional response.

slide-20
SLIDE 20

Lawrence Murray PMCMC in marine biogeochemistry : 20 of 41

NPZD model

A quadratic formulation for zooplankton mortality is adopted after Steele [1976] and Steele and Henderson [1992]:

m = Tc · mQ · Z,

where the quadratic mortality rate mQ has units of d−1(mgZm−3)−1.

slide-21
SLIDE 21

Lawrence Murray PMCMC in marine biogeochemistry : 21 of 41

NPZD model

The detrital remineralization rate is dependent only on temperature:

r = Tc · rD,

where rD prescribes the remineralisation rate at a reference temperature.

slide-22
SLIDE 22

Lawrence Murray PMCMC in marine biogeochemistry : 22 of 41

NPZD model

The phytoplankton specific growth rate, g, depends on temperature,

T , available light or irradiance, E, and dissolved inorganic nutrient, N: g = Tc · gmax · hE · hN/(hE + hN).

slide-23
SLIDE 23

Lawrence Murray PMCMC in marine biogeochemistry : 23 of 41

NPZD model

The light-limitation factor is given by

hE = 1 − exp(−α · λmax · E/gmax),

where α is the initial slope of the photosynthesis versus irradiance curve (mg C mg Chla−1 mol photon−1 m2), and λmax is the maximum

Chla : C ratio (mg Chla mg C−1).

slide-24
SLIDE 24

Lawrence Murray PMCMC in marine biogeochemistry : 24 of 41

NPZD model

E is the mean photosynthetic available radiation (PAR) in the mixed

layer and is given by

E = E0.(1 − exp(−Kz))/Kz,

where E0 is the mean daily photosynthetically available radiation (PAR) just below the air-sea interface and Kz is given by

Kz = (KW + aCh · Chla) · MLD.

(2)

KW is attenuation due to the seawater and aCh.

slide-25
SLIDE 25

Lawrence Murray PMCMC in marine biogeochemistry : 25 of 41

NPZD model

The nutrient-limitation factor is given by

hN = N (gmax · Tc/aN) + N ,

where aN is the maximum specific affinity for nitrogen uptake (d−1 mg N −1 m3).

slide-26
SLIDE 26

Lawrence Murray PMCMC in marine biogeochemistry : 26 of 41

NPZD model

The phytoplankton N : C ratio, χ, predicted by the model is given by

χ = χmin · hE + χmax · hN hE + hN ,

where χmin and χmax are the prescribed minimum and maximum

N : C ratios (mg N mg C−1).

slide-27
SLIDE 27

Lawrence Murray PMCMC in marine biogeochemistry : 27 of 41

NPZD model

The equations governing interactions between the remaining state variables {P, Z, D, N} are:

dP dt = g · P − gr · Z + κ MLD · (BCP − P) dZ dt = EZ · gr · Z − m · Z dD dt = (1 − EZ) · fD · gr · Z + m · Z − r · D − SD · D MLD + κ MLD · (BCD − D) dN dt = −g · P + (1 − EZ) · (1 − fD) · gr · Z + r · D + κ MLD · (BCN − N).

slide-28
SLIDE 28

Lawrence Murray PMCMC in marine biogeochemistry : 28 of 41

State estimation (observed)

1971 1972 1973 1974 1975 50 100 150 200 250 300 350

DIN (µ g N l−1)

1971 1972 1973 1974 1975 −8 −6 −4 −2 2 4

ln(Chla (µ g Chla l−1))

Prior−95% Posterior−95% Observations Prior−median Posterior−median

slide-29
SLIDE 29

Lawrence Murray PMCMC in marine biogeochemistry : 29 of 41

State estimation (unobserved)

1971 1972 1973 1974 1975 −4 −2 2 4 6

ln(P(µ g N l−1)

1971 1972 1973 1974 1975 −2 −1 1 2 3

ln(Z(µ g N l−1)

Prior−95% Posterior−95% Prior−median Posterior−median 1971 1972 1973 1974 1975 −6 −4 −2 2 4

ln(D(µ g N l−1)

slide-30
SLIDE 30

Lawrence Murray PMCMC in marine biogeochemistry : 30 of 41

Parameter estimation

−4 −3.5 −3 1 2 ln(KW) −4.5 −4 −3.5 0.5 1 ln(aCh) 2 4 6 8 0.2 0.4 sD −0.8 −0.6 −0.4 2 4 ln(fD) −2.5 −2 −1.5 −1 −0.5 0.5 1 ln(PDF) −2.5 −2 −1.5 −1 −0.5 0.5 1 ln(ZDF) −1 1 2 0.2 0.4 0.6 0.8 ln(µg

max)

−4.5 −4 −3.5 −3 −2.5 0.5 1 ln(µλ

max)

−2 −1.5 −1 −0.5 0.5 1 ln(µR

N

) −4 −2 0.2 0.4 ln(µa

N

) 1 2 3 0.2 0.4 0.6 ln(µI

Z

) −4 −2 2 0.2 0.4 0.6 ln(µCl

Z

) −1.5 −1 −0.5 0.5 1 1.5 ln(µE

Z

) −3 −2 −1 0.2 0.4 0.6 0.8 ln(µr

D

) −6 −4 −2 0.2 0.4 ln(µm

Q

) Posterior Prior

slide-31
SLIDE 31

Lawrence Murray PMCMC in marine biogeochemistry : 31 of 41

State forecast

07/74 02/75 09/75 50 100 150 200 250

DIN(µ g N l

−1)

07/74 02/75 09/75 −4 −2 2 4 6

ln(P(µ g N l

−1))

07/74 02/75 09/75 −2 −1 1 2 3 4

ln(Z(µ g N l

−1))

07/74 02/75 09/75 −8 −6 −4 −2 2 4 6

ln(D(µ g N l

−1))

07/74 02/75 09/75 −8 −6 −4 −2 2 4

ln(Chla(µ g N l

−1))

Prior−95% Posterior−95% Forecast−95% Observations Prior−median Posterior−median Forecast−median

slide-32
SLIDE 32

Lawrence Murray PMCMC in marine biogeochemistry : 32 of 41

PZ model: convergence

1 1.002 1.004 1.006 1.008 1.01 1.012 5000 10000 15000 20000 Rp Step PF0 MUPF0 CUPF0 PF1 MUPF1 CUPF1 1 1.002 1.004 1.006 1.008 1.01 1.012 5000 10000 15000 20000 Step PF0 MUPF0 CUPF0 PF1 MUPF1 CUPF1

Convergence rates of Markov chains for the PZ case study, (left) particle-matched, and (right) compute-matched. Each line shows the evolution of the ˆ

Rp statistic of Brooks and Gelman

[1998] for a particular method.

slide-33
SLIDE 33

Lawrence Murray PMCMC in marine biogeochemistry : 33 of 41

NPZD model: convergence

1 1.2 1.4 1.6 1.8 2 2.2 2.4 2000 4000 6000 8000 10000 Step PF0 MUPF0 CUPF0 PF1 MUPF1 CUPF1 1 1.2 1.4 1.6 1.8 2 2.2 2.4 2000 4000 6000 8000 10000 Step PF0 MUPF0 CUPF0 PF1 MUPF1 CUPF1

Convergence rates of Markov chains for the NPZD case study, (left) particle-matched, and (right) compute-matched. Each line shows the evolution of the ˆ

Rp statistic of Brooks and

Gelman [1998] for a particular method.

slide-34
SLIDE 34

Lawrence Murray PMCMC in marine biogeochemistry : 34 of 41

PZ model: acceptance rates

σ PF0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 MUPF0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 CUPF0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 σ µ PF1 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 µ MUPF1 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 µ CUPF1 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1

slide-35
SLIDE 35

Lawrence Murray PMCMC in marine biogeochemistry : 35 of 41

PZ model: acceptance rates, compute-matched

σ PF0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 MUPF0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 CUPF0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 σ µ PF1 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 µ MUPF1 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 µ CUPF1 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1

slide-36
SLIDE 36

Lawrence Murray PMCMC in marine biogeochemistry : 36 of 41

NPZD model: acceptance rates

0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Cumulative density Conditional acceptance rate (CAR) PF0 MUPF0 CUPF0 PF1 MUPF1 CUPF1 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Cumulative density Conditional acceptance rate (CAR) PF0 MUPF0 CUPF0 PF1 MUPF1 CUPF1

Empirical cdfs of the acceptance rates for each method on the NPZD case study, (left) particle-matched, and (right) compute-matched.

slide-37
SLIDE 37

Lawrence Murray PMCMC in marine biogeochemistry : 37 of 41

NPZD model: parameters vs acceptance rates

KW 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 aCh 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 SD 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 fD 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 PDF 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 ZDF 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µgmax 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µλmax 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µRN 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µaN 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µIZ 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µClZ 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µEZ 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µrD 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 µmQ 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1

slide-38
SLIDE 38

Lawrence Murray PMCMC in marine biogeochemistry : 38 of 41

Lessons for PMMH

  • Initialisation is extremely important, sampler will fail if initialised

in a region where the acceptance rate is low.

  • A particularly informative prior distribution may bias the posterior

into regions where the acceptance rate is low.

  • Is the rule-of-thumb 23% acceptance rate appropriate in this

context?

slide-39
SLIDE 39

Lawrence Murray PMCMC in marine biogeochemistry : 39 of 41

Summary

  • Environmental models such as these in marine biogeochemistry

pose significant challenges to Monte Carlo methodology.

  • One particular challenge is the absence of a closed-form transition
  • density. The collapsed state-space model facilitates one means
  • f improving sampler performance without needing this.
  • The particle marginal Metropolis-Hastings sampler can have

significant mixing and acceptance rate problems, initialisation and careful consideration of the prior distribution are important.

slide-40
SLIDE 40

Lawrence Murray PMCMC in marine biogeochemistry : 40 of 41

References

  • C. Andrieu, A. Doucet, and R. Holenstein. Particle Markov chain Monte Carlo methods. Journal
  • f the Royal Statistical Society Series B, 72:269–302, 2010.
  • S. P

. Brooks and A. Gelman. General methods for monitoring convergence of iterative

  • simulations. Journal of Computational and Graphical Statistics, 7:434–455, 1998.
  • C. S. Holling. The functional response of predators to prey density and its role in mimicry and

population regulation. Memoirs of the Entomology Society of Canada, 45:60, 1966.

  • E. Jones, J. Parslow, and L. M. Murray. A Bayesian approach to state and parameter estimation

in a phytoplankton-zooplankton model. Australian Meteorological and Oceanographic Journal, 59(SP):7–16, 2010.

  • M. K. Pitt, R. S. Silva, P

. Giordani, and R. Kohn. Auxiliary particle filtering within adaptive Metropolis-Hastings sampling. 2011. URL http://arxiv.org/abs/1006.1914.

  • J. Steele. Role of predation in ecosystem models. Marine Biology, 35(1):9–11, 1976. ISSN

0025-3162.

  • J. Steele and E. Henderson. The role of predation in plankton models. Journal of Plankton

Research, 14(1):157–172, 1992.

slide-41
SLIDE 41

CSIRO Mathematics, Informatics and Statistics Lawrence Murray Phone: +61 8 9333 6480 Email: lawrence.murray@csiro.au Web: www.cmis.csiro.au Contact Us Phone: 1300 363 400 or +61 3 9545 2176 Email: enquiries@csiro.au Web: www.csiro.au