Ensemble transform filters for geophysical data assimilation - - PowerPoint PPT Presentation

ensemble transform filters for geophysical data
SMART_READER_LITE
LIVE PREVIEW

Ensemble transform filters for geophysical data assimilation - - PowerPoint PPT Presentation

Ensemble transform filters for geophysical data assimilation Sebastian Reich in collaboration with Georg Gottwald (University of Sydney), Kay Bergemann, Seoleun Shin (Uni Potsdam), Eugenia Kalnay, Kayo Ide, Javier Amezcua (U of Maryland) 1.


slide-1
SLIDE 1

Ensemble transform filters for geophysical data assimilation

Sebastian Reich in collaboration with Georg Gottwald (University of Sydney), Kay Bergemann, Seoleun Shin (Uni Potsdam), Eugenia Kalnay, Kayo Ide, Javier Amezcua (U of Maryland)

slide-2
SLIDE 2
  • 1. Data assimilation (DA)

Text Nature Text empirical evidence/ experiments Text Text statistical models Text Text Bayesian data assimilation Text state/parameter estimation, model selection, prediction mathematical/ computational models theory & hypotheses/ scientific modeling

slide-3
SLIDE 3
  • 2. A simplified mathematical problem statement

1) Mathematical model ˙

x = f(x)

2) Random initial conditions x(0), i.e. x(0) ∼ ρ0 for some given probability density function (PDF) ρ0(x). 3) Observations/measurements

y(tj) = Hx(tj) + ηj, y ∈ Rk, H ∈ Rk×n,

k < n, at discrete times tj > 0, j = 1, . . . , J, subject to random measurement errors: ηj ∼ N(0, R).

slide-4
SLIDE 4

Liouville (forecast): x = (x1, x2), πpost(x, tj) → πprior(x, tj+1) Bayes (assimilation): y = x1 + η, πprior(x, tj+1) → πpost(x, tj+1) (figures courtesy Chris Jones)

slide-5
SLIDE 5
  • 3. Ensemble propagation and particle filters

Particle/Monte-Carlo methods. Given a collection

X(t) = [x1(t) | x2(t) | · · · | xm(t)] ∈ Rn×m

  • f m independent solutions of

˙

x = f(x)

with initial conditions xi(0) ∼ ρ0(x). The empirical measure ρem(x, t) =

m

  • i=1

αiδ(x − xi(t)) is an exact (weak) solution to Liouville’s equation (δ(·) Dirac’s delta function, αi > 0 (constant) weights).

slide-6
SLIDE 6

Data assimilation for particle approximation at tj. Bayes’ formula: ρ(x, t+

j ) = m

  • i=1

αi δ(x − xi(tj)). with new weights αi ⇒ π(y(tj)|xi(tj)) × αi. Problem: Particles are still sampled from the prior and not from the posterior PDF. The mismatch is compensated for by non- uniform weight factors (importance sampling). Near mutual sin- gularity of measures leads to degeneracy of weights.

slide-7
SLIDE 7
  • 4. Assimilation as a continuous deformation of probability I

Dynamics and assimilation are not easily compatible in a numer- ical sense. We have to shift focus somehow. Here is our basic line of attack: We view the change of measure induced by Bayes’ theorem as an (optimal) transportation problem. See Crisan and Xiong (2010) for a related approach in the con- text of continuous time filtering.

slide-8
SLIDE 8

Bayes (assimilation): πprior(x, tj+1) → πpost(x, tj+1) Task: Shuffle blue prior into red posterior: x’ = T(x) s.t. πpost(T(x))|DT(x)| = πpr(x).

slide-9
SLIDE 9

Negative log-likelihood: L = (Hx − y(tj))TR−1(Hx − y(tj))/2. Bayes’ theorem: πpost(x) = π(x|y(tj)) ∝ πpr(x)e−L = πpr(x)

N

  • n=1

e−L/N Limit N → ∞, Taylor expansion in ∆s = 1/N: ∂ρ ∂s = −ρ (L − Eρ[L]), s ∈ [0, 1], ρ(x, 0) = πpr(x).

slide-10
SLIDE 10

“Kushner-Stratonovich” equation for intermittent data assimila- tion (Reich, 2011): ∂ρ ∂t = −∇x(ρf) −

  • j≥1

δǫ(t − tj)ρ(L − ¯ L) with δǫ(·) is a (compact) mollifier to the Dirac delta function with support [tj − ǫ, tj + ǫ] and ¯ L = Eρ[L]. If Ψ(x) is a smooth observable, then d dt ¯ Ψ = ∇xΨ · f −

  • j≥1

δǫ(t − tj)(Ψ − ¯ Ψ)(L − ¯ L) One can in particular set ρ = ρem, Ψ = x or Ψ = (x − ¯ x)(x − ¯ x)T.

slide-11
SLIDE 11
  • 4. Assimilation as a continuous deformation of probability II

For demonstration purposes, consider a single degree of freedom: We got to transport particles distributed according to the heap

  • n the left (prior) such that they form the heap on the right

(posterior). Direct approach: Monge-Ampere, nonlinear elliptic PDF.

slide-12
SLIDE 12

Goal: Find vector field g(x; ρ) ∈ L2(dρ, Rn) such that ∇x · (ρg) = ρ

L − ¯

L

.

The vector field g is not unique. Find minimizer with respect to the norm (see Otto, 2001, for an application in gradient flow dynamics):

  • ∂ρ

∂s

  • 2

ρ

= inf

v∈L2(dρ,Rn)

  • dρ vTMv : ∂ρ

∂s + ∇x · (ρv) = 0

  • .

Gradient flow: g(x; ρ) = M−1∇xψ(x; ρ).

slide-13
SLIDE 13

−10 −5 5 10 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 position probability density functions prior likelihood posterior −8 −6 −4 −2 2 4 6 8 −0.1 −0.05 0.05 0.1 0.15 position error in posterior probability density functions EGMF, M=2000 EnKF, M=2000 RHF, M=2000 0.2 0.4 0.6 0.8 1 −8 −6 −4 −2 2 4 6 8 ficticious time s particle positions particle dynamics under EGMF assimilation step 0.2 0.4 0.6 0.8 1 −8 −6 −4 −2 2 4 6 8 ficticious time s particle positions particle dynamics under EnKF assimilation step

slide-14
SLIDE 14

Formulation of a complete dynamics-assimilation step (Reich, 2010): ˙

x = f(x) + δǫ(t − tj)M−1∇xψ(x; ρ),

(1) ρt = −∇x · (ρ˙

x)

(2) closed by the elliptic PDE ∇x · (ρM−1∇xψ) = ρ

L − ¯

L

.

(3) for t ∈ [tj − ǫ, tj + ǫ]. This system constitutes an example of a Vlasov-McKean system (infinite-dimensional interacting particle system).

slide-15
SLIDE 15

Algorithmic summary of data assimilation step: 1) Use the ensemble of solutions xi(t), i = 1, . . . , m, to define a statistical model ˜ ρ(x; {xi(t)}). 2) Solve ∇x · (˜ ρM−1∇xψ) = ˜ ρ

  • L − E˜

ρ[L]

  • (4)

numerically or by quadrature for the potential ψ. 3) Propagate particles under vector field g = M−1∇ψ (gradient flow) .

slide-16
SLIDE 16

Choices for ˜ ρ include (a) Gaussian PDF parametrized by the ensemble mean and co- variance matrix (ensemble Kalman filter (EnKF)) (b) Gaussian mixture models/ Gaussian kernel density estima-

  • tors. Resulting elliptic PDE can be solved analytically!

(c) Empirical PDF and weak form of McKean-Vlasov; e.g. find vector field such that ensemble mean and covariance matrix are consistently propagated (different from EnKF!) The second and third approach give rise to new filters (Reich 2011/2012).

slide-17
SLIDE 17
  • 6. Numerics I: Langevin dynamics under bistable potential

Second-order Langevin dynamics under a one-dimensional double- well potential V (q): dv = −V ′(q)dt − γvdt + √ 2σdw(t), dq = vdt, w(t) denotes standard Brownian motion, potential V (q) given by

−10 −5 5 10 −1 1 2 3 4 5 q variable potential energy

slide-18
SLIDE 18

Assimilate velocities v only (!); posterior PDFs in positions q:

10 5 5 10 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 position posterior probability density function 10 5 5 10 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 position posterior probability density function

slide-19
SLIDE 19
slide-20
SLIDE 20

Results from EGMF (M = 50 ensemble members):

2000 4000 6000 8000 10000 −10 −5 5 10 continuous ensemble Gaussian mixture filter position 2000 4000 6000 8000 10000 −10 −5 5 10 time position truth filtered

RMSE = 1.9148 for EGMF compared to 2.331 for the EnKF. A bi-Gaussian is used in 25% of the assimilation steps.

slide-21
SLIDE 21
  • 7. First and second-order filter consistency

Bayes’ theorem implies ¯ Ψ = Eπpr[Ψe−L]

Eπpr[e−L]

for the posterior expectation value of an observable Ψ. Set πpr = ρem (ensemble), Ψ = x (ensemble mean) or Ψ = (x − ¯ x)(x − ¯ x)T (ensemble covariance matrix). The ensemble Kalman filter (EnKF) is not consistent with re- spect to the Bayesian posterior mean and covariance matrix. Example for classic moment closure problem: d ds ¯ Ψ = −(Ψ − ¯ Ψ)(L − ¯ L).

slide-22
SLIDE 22

Simple example: x ∈ R, obs y = x + η, prior with mean ¯ x = 0 and variance P: Ψ = x2 → ¯ Ψ = P Then d dsP = −1 2(x2 − P)(x − y)2 = 1 2

  • x4 − P 2 − 2x3y
  • For Gaussians: x4 = 3P 2, x3 = 0 (EnKF closure).
slide-23
SLIDE 23

Appropriate corrections proposed by Lei & Bickel (2011) (NLEAF) and Reich (2011) (MEnKF). Requires computing the square root

  • f covariance matrix.

Basic idea: xp

i are the proposal ensemble members with mean ¯

xp and covari- ance matrix ¯ P p. These can be obtained from a standard EnKF. The Bayesian posterior mean and covariance are denoted by ¯ x and ¯ P, respectively. The analysed ensemble members are now given by xa

i = ¯

x + ¯ P 1/2( ¯ P p)−1/2(xp

i − ¯

xp).

slide-24
SLIDE 24

Lorenz-63 model: Mean RMSE over 2000 assimilation cycles with tj = 0.05 j for the Lorenz-63 model using five different filter algorithms and three different ensemble sizes. FD (filter divergence) is being used whenever the mean RMSE exceeds the observation error R = 4. EnKF MEnKF1 MEnKF2 NLEAF1 NLEAF2 M = 10 0.4405 1.2045 FD FD FD M = 40 0.3004 0.3140 0.2510 0.3772 FD M = 400 0.3272 0.3262 0.2375 0.3037 0.2336

slide-25
SLIDE 25

Continuous & consistent update: d ds¯ x = −L∆x (ensemble mean), and d ds∆x = −1 2(L − ¯ L)∆x − 1 2L∆x (ensemble deviations) with ∆x = x − ¯ x. Ensemble filter equations for assimilation at tj: d dtxi = f(xi) − 1 2δǫ(t − tj)

  • (Li − ¯

L)∆xi + L∆x

  • .

EnKF closure seems preferable. Work in progress ...

slide-26
SLIDE 26
  • 8. Summary

Ensemble deformation techniques lead to a dynamical systems interpretation of data assimilation and provide an alternative to particle filters with (stochastic) resampling (sequential Monte Carlo methods). Future work:

1) analyse filter algorithms from dynamical systems point of view, 2) minimum variance estimate (EnKF) vs. MAP estimate (4DVar) 3) smoothing and parameter estimation, model selection 4) data assimilation for multi-scale problems, 5) assimilation of information aside from “observations”, 6) links to inverse problems (e.g. L1 regularization)

slide-27
SLIDE 27

Publications:

1) Bergemann, Gottwald, Reich: Ensemble propagation and continuous ma- trix factorization algorithms, QJ, 2009. 2) Bergemann, Reich: A localization technique for ensemble Kalman filters, QJ, 2010. 3) Bergemann, Reich: A mollified ensemble Kalman filter, QJ, 2010. 4) Reich: A dynamical systems framework for intermittent data assimilation, BIT, 2011. 5) Reich: A Gaussian-mixture ensemble transform filter, QJ, 2011. 6) Gottwald, Mitchell, Reich: Controlling overestimation of error covariance in ensemble Kalman filters with sparse observations: A variance limiting Kalman filter, MWR, 2011. 7) Bergemann, Reich: An ensemble Kalman-Bucy filter for continuous data assimilation, submitted. 8) Amezcua, Kalnay, Ide, Reich: Using the ensemble Kalman-Bucy filter in an ensemble framework, submitted. 9) Reich, Shin: On the consistency of ensemble transform filter formulations, submitted.