Background-error correlation modelling in variational assimilation - - PowerPoint PPT Presentation

background error correlation modelling in variational
SMART_READER_LITE
LIVE PREVIEW

Background-error correlation modelling in variational assimilation - - PowerPoint PPT Presentation

. Background-error correlation modelling in variational assimilation using a diffusion equation, with application to oceanography . Anthony Weaver and Isabelle Mirouze CERFACS, Toulouse October 28, 2011 Large-Scale Inverse Problems and


slide-1
SLIDE 1

. .

Background-error correlation modelling in variational assimilation using a diffusion equation, with application to oceanography

Anthony Weaver and Isabelle Mirouze

CERFACS, Toulouse October 28, 2011

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-2
SLIDE 2

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-3
SLIDE 3

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-4
SLIDE 4

Applications of data assimilation in oceanography

Providing initial conditions for climate forecasts.

▶ Monthly, seasonal, multi-annual, decadal. ▶ Global models. ▶ Mainly low resolution (∼1◦).

Providing initial conditions for ocean forecasts (with a focus on mesoscale eddies).

▶ Days to weeks. ▶ Global, regional and coastal models. ▶ Modest/high resolution (∼1/4◦/∼1/12◦+).

Reconstructing the history of the ocean (reanalysis).

▶ Mainly global models. ▶ Mainly low/modest resolution for multi-decadal reanalysis. Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-5
SLIDE 5

The core components of the global ocean observing system

Temperature from moorings

  • Temp. and salinity from Argo floats

SST from satellites, ships, buoys SSH from satellite altimeters

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-6
SLIDE 6

Applications of ocean data assimilation

Detecting decadal variability and trends due to global warming in

  • cean reanalyses.

¡

(From Balmaseda et al. (2010), European COMBINE project)

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-7
SLIDE 7

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-8
SLIDE 8

Variational data assimilation: I

We consider the general 4D-Var assimilation problem: min

x∈Rn J[x] = 1

2 (x − xb)T B−1 (x − xb)

  • Jb

+ 1 2 ( G (x) − yo )TR−1( G (x) − yo )

  • Jo

where yo =     . . . yo

i

. . .     , G(x) =     . . . Gi(x) . . .     =     . . . Hi(M(ti, t0)(x)) . . .     Assimilation window t0 ≤ ti ≤ tN (order of days in the ocean). x = initial condition for temp., salinity, velocity, SSH on 3D grid. dim(x) = n ∼ 106 − 108. dim(yo) = m ∼ 105 − 106.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-9
SLIDE 9

Variational data assmilation: II

We transform the 4D-Var problem into a more convenient form: min

  • x∈Rn J[

x ] = 1 2 ( x − xb)T B−1 ( x − xb) + 1 2 ( G( x) − yo )TR−1( G( x) − yo ) where

  • G(x) =

    . . .

  • Gi(x)

. . .     =     . . . Hi(M(ti, t0)(K( x))) . . .    

  • x = K −1(x) is the initial condition transformed into dynamically

decorrelated variables.

  • B is assumed block diagonal wrt to the transformed variables.

The solution is xa = K( xa) where xa is the minimum of J.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-10
SLIDE 10

Variational data assimilation: III

We use a solution algorithm based on incremental 4D-Var (Gauss-Newton): min

δ x(k)∈Rn J(k)[δ

x(k)] = 1 2(δ x(k) − δ xb,(k−1))T B−1 (δ x(k) − δ xb,(k−1)) + 1 2( G(k−1)δ x(k) − δyo,(k−1))TR−1( Gk−1δ x(k) − δyo,(k−1)) where

  • G(k−1) =

    . . . H(k−1)

i

M(k−1)(ti, t0) K(k−1) . . .     ≈     . . . ∂ Gi/∂ x|

x= x(k−1)

. . .     δ xb,(k−1) = xb − x(k−1) and δyo,(k−1) = yo − G( x(k−1)). Inner problem: δ x(k) found approximately using conjugate gradients. Outer problem: very few iterations k are affordable in practice. Setting M(k−1)(ti, t0) = I gives a (much cheaper!) 3D-Var algorithm (widely used in ocean DA).

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-11
SLIDE 11

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-12
SLIDE 12

Characteristics of the background-error covariance matrix B

Specification of B−1 is not required when B is used as a preconditioner.

  • B is an enormous matrix that is difficult to estimate and represent.

▶ Need simplifying assumptions to reduce number of tunable parameters. ▶ Need to account for inhomogeneous and anisotropic structures. ▶ Need computationally efficient operators that can run in parallel. ▶ Need to deal with complex boundaries in the ocean. ▶ Need to apply with complicated grids referenced to the thin-spherical

shell geometry (S2 × R1) of the Earth.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-13
SLIDE 13

Basic properties of B: I

On each CG iteration we need to multiply a vector ψ in state space by B. For a given block Bi of B this multiplication in R3 can be interpreted as

  • Biψi →

R3

  • Bi(x, x′) ψi(x′) dx′

where x = (x, y, z)T and dx = dx dy dz.1

  • Bi(x, x′) is a symmetric and positive definite (covariance) function:
  • Bi(x, x′) = σi(x) σi(x′)
  • std devs

correlation

  • Ci(x, x′)

with Ci(x, x) = 1 Remark: Evaluating the integral equation numerically is in general prohibitively expensive for large-scale problems !

1Not to be confused with the state vector x on previous slides! Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-14
SLIDE 14

Basic properties of B: II

Multiplication by σi(x) is easy. Rescale the variables ψi(x) = σi(x) ψi(x). The difficult computation is Ci ψi → ∫

R3 Ci(x, x′)

ψi(x′) dx′. For the ocean (and atmosphere) we need to define a valid correlation

  • perator on the spherical space S2

Ci ψi → ∫

S2 Ci(λ, ϕ, λ′, ϕ′)

ψi(λ′, ϕ′) a2 cos ϕ′ dλ′ dϕ′ where a is the Earth’s radius and (λ, ϕ) are geographical coordinates.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-15
SLIDE 15

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-16
SLIDE 16

Correlation operators based on implicit diffusion: I

Consider the linear partial differential equation on S2 (no boundaries) γ−1

h

( 1 − L2∇2)M ψi(λ, ϕ) = ψi(λ, ϕ) (1) where ∇2 ≡ 1 a2 cos ϕ ∂ ∂ϕ ( cos ϕ ∂ ∂ϕ ) + 1 a2 cos2 ϕ ∂2 ∂λ2 , γ is a constant and M a positive integer.

  • Eq. (1) results from an implicit “time”-discretization of the diffusion

eqn from t = 0 to t = M∆t: ∂η ∂t − κ∇2η = 0 where η(λ, ϕ, 0) = γ ψi(λ, ϕ) , η(λ, ϕ, t) = ψi(λ, ϕ) and L2 = κ∆t.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-17
SLIDE 17

Correlation operators based on implicit diffusion: II

The solution of Eq. (1) is a correlation operator whose kernel is an isotropic correlation function that depends on the angular separation θ between (λ, ϕ) and (λ′, ϕ′): C h(θ) =

n=0

ch

n P0 n(cos θ)

where P0

n are the Legendre polynomials (with P0 n(1) = √2n + 1),

ch

n = γh

√2n + 1 4πa2 ( 1 + L2 a2 n(n + 1) )

−M

> 0 ⇒ positive definite γh = 4πa2 ( ∞ ∑

n=0

√ 2n + 1 ch

n

)

−1

⇒ C h(0) = 1 Dh = √ − 2 ∇2C h|θ=0 = a √ 2 ∑∞

n=0 n(n + 1)√2n + 1 ch n

is a standard definition of length-scale (“Daley” length-scale).

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-18
SLIDE 18

Correlation operators based on implicit diffusion: III

In the limit as M → ∞ such that Dh = D is fixed then C h(θ) ≈ exp ( −r2/2D2) where r(θ) = a √ 2 (1 − cos θ) is chordal distance. (λ, ϕ) (λ′, ϕ′) θ r(θ) a The Gaussian correlation operator can also be approximated with an explicit diffusion scheme but the stability criterion makes it generally less practical than the implicit scheme. The key idea is to iterate a discretized implicit or explicit diffusion

  • perator to avoid the expensive direct evaluation of an integral eqn.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-19
SLIDE 19

Examples of isotropic implicit-diffusion kernels on S2

1000 2000 3000 4000 −0.2 0.2 0.4 0.6 0.8 1 Distance (km) Correlation Ch(θ), Dh = 500 km M=3 M=4 M=10 Gaussian 10 10

1

10

2

10

−8

10

−6

10

−4

10

−2

Wavenumber (n) Variance (2n+1)1/2 ch

n, Dh = 500 km

M=3 M=4 M=10 Gaussian Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-20
SLIDE 20

Correlation operators based on implicit diffusion: IV

Now consider the linear partial differential equation on Rd (no boundaries) γ−1

w

( 1 − L2∇2)M ψi(x) = ψi(x) where x = (x1, . . . , xd)T, and ∇2 ≡ ∂2 ∂x2

1

+ · · · + ∂2 ∂x2

d

. With foresight we set the normalization constant to γw = 2dπd/2 Γ(M) Γ(M − d/2) Ld. where Γ is the Gamma function.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-21
SLIDE 21

Correlation operators based on implicit diffusion: V

The solution is a correlation operator whose kernel is from the Whittle-Matérn family (Guttorp and Gneiting 2006): C w

d (r) =

21−M+d/2 Γ(M − d/2) ( r L )M−d/2 KM−d/2 ( r L ) where KM−d/2 are modified Bessel functions of the second kind, and r = |x − x′| = √ (x1 − x′

1)2 + . . . + (xd − x′ d)2.

The Daley length-scale is Dw = √ − d ∇2C w

d

  • r=0

= √ 2M − d − 2 L where M > (d + 1)/2 if d odd, or M > (d + 2)/2 if d even.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-22
SLIDE 22

Comparison of isotropic implicit-diffusion kernels on S2

2000 4000 6000 8000 0.2 0.4 0.6 0.8 1 Distance (km) Correlation M = 4, Dh = 500, 1000, 2000, 4000 km Ch(r) Cw

2(r)

Cw

3(r)

For length-scales of interest, diffusion on S2 can be well approximated by diffusion on R2 (but C w

2 (r) is not a valid correlation function on S2 !).

= ⇒ Dh ≈ Dw and γh ≈ γw for d = 2.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-23
SLIDE 23

Implicit diffusion near boundaries: I

Example: implicit diffusion with M = 2 (SOAR function) and L = 3.

1 5 10 15 20 25 30 0.5 1 1.5 2 Location Correlation Dirac at z1 Dirac at z4 Dirac at z7

(a) Neumann BCs

1 5 10 15 20 25 30 0.5 1 1.5 2 Location Correlation Dirac at z1 Dirac at z4 Dirac at z7

(b) Dirichlet BCs

The amplitude of the correlation function is overestimated with Neumann BCs and underestimated with Dirichlet BCs.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-24
SLIDE 24

Implicit diffusion near boundaries: II

Example: implicit diffusion with M = 2 (SOAR function) and L = 3.

(a) Neumann BCs (b) Dirichlet BCs

Computing appropriate normalization factors near the boundary corrects the amplitude of the correlation function but distorts the correlation shape.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-25
SLIDE 25

Implicit diffusion near boundaries: III

Example: implicit diffusion with M = 2 (SOAR function) and L = 3.

1 5 10 15 20 25 30 0.5 1 1.5 2 Location Correlation Dirac at z1 Dirac at z4 Dirac at z7

(a) Average of Neumann and Dirichlet

Transparent boundaries can be simulated by defining the correlation

  • perator as an average of Neumann and Dirichlet solutions, but at the

expense of doubling the cost of the operator.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-26
SLIDE 26

Solving the implicit diffusion equation: I

The implicit diffusion equation requires the solution of a large linear system AMψ = (I − D)M ψ = ψ (1) where D → ∇ · L2∇ with L = L(x) to account for geographically varying length-scales. Let L−1 = A so that the (unnormalized) solution can be written as ψ = LM ψ. Direct methods for solving Eq. (1) are not practical in 2D or 3D, but can be applied in 1D (e.g., using Cholesky factorization).

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-27
SLIDE 27

Solving the implicit diffusion equation: II

There are several ways to build a 2D or 3D correlation matrix from a product of self-adjoint 1D operators, but special care is needed to ensure self-adjointness since these operators do not commute in general. To construct a self-adjoint 3D correlation operator, with an easily identifiable “square-root” factor, we make use of the following properties of the 1D diffusion operators: L = W−1 LT W = L∗ LM = LM/2 LM/2 = LM/2 W−1(LT)M/2 W where W is a diagonal matrix of local grid-size elements, and M is assumed to be even.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-28
SLIDE 28

A 3D correlation matrix based on 1D implicit diffusion operators Formulate each 1D (x, y, z) correlation block as a symmetric product Cx = 1 2Λ1/2

x

( LM/2

Nx W−1 x

( LT

Nx

)M/2 + LM/2

Dx W−1 x

( LT

Dx

)M/2) Λ1/2

x

etc. where N and D denote Neumann and Dirichlet BCs. Multiply out different terms in the product C = Cx Cy Cz and interchange “square-root” factors to force symmetry: C ≈ 1 8Λ1/2 { LM/2

Nx LM/2 Ny LM/2 Nz W−1(

LT

Nz

)M/2 (LT

Ny ) M/2(

LT

Nx

)M/2 + LM/2

Nx LM/2 Ny LM/2 Dz W−1(

LT

Dz

)M/2 (LT

Ny ) M/2(

LT

Nx

)M/2 + . . . + LM/2

Dx LM/2 Dy LM/2 Dz W−1(

LT

Dz

)M/2 (LT

Dy ) M/2(

LT

Dx

)M/2}Λ1/2 W = WxWyWz and Λ = ΛxΛyΛz are diagonal matrices of volume elements and normalization factors.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-29
SLIDE 29

How many iterations should we use?

Matérn with M = 2 2×1D implicit diffusion with M = 2 Matérn with M = 10 2×1D implicit diffusion with M = 10

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-30
SLIDE 30

Representing correlations with geographically varying length-scales

Zonal length scales (degs) for temperature (T) estimated from ensemble perturbations T-T correlations at selected points

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-31
SLIDE 31

Limitations of the 2 × 1D separability assumption: I

Unphysical features can appear when the length-scale is large compared to the local geometry. It is advantageous to alternate the smoothing directions more frequently:

C = Λ1/2 ( LM/2m

x

LM/2m

y

)m W−1 (( LT

y

)M/2m ( LT

x

)M/2m)m Λ1/2

Example: 2 × 1D implicit diffusion, with M = 10, near a small island.

(b) C with m = 1 (c) C with m = 5

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-32
SLIDE 32

Limitations of the 2 × 1D separability assumption: II

SST bias increments obtained with a 2 × 1D implicit diffusion: M = 12, m = 1. (Courtesy of James While, Met Office)

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-33
SLIDE 33

Limitations of the 2 × 1D separability assumption: III

SST bias increments obtained with a 2 × 1D implicit diffusion: M = 12, m = 3. (Courtesy of James While, Met Office)

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-34
SLIDE 34

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-35
SLIDE 35

Accounting for anisotropy using tensors: some definitions

Aspect tensor A: For a correlation function C( r ) that depends on the (non-dimensional) distance r between locations x and x′ then

  • r = ∥

r ∥A−1 = √ (x − x′)TA−1(x − x′)

▶ Isotropic case: A = L2 I and

r = |x − x′| /L.

Correlation Hessian H and “Daley” tensor D: H = D−1 = − ∇∇TC( r )

  • r=0

▶ Isotropic case: D = D2 I where D is the Daley length-scale.

Diffusion tensor κ: ∂η ∂t − ∇ · κ∇η = 0

▶ Rescaled tensor: L = ∆t κ after “temporal” discretization.

All these tensors are assumed to be symmetric and positive definite (and hence invertible).

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-36
SLIDE 36

Why are these different tensors of interest?

The normalized kernel of a diffusion operator with constant κ is a correlation function C( r ) with known analytical form. From earlier we know that:

▶ The diffusion kernel of an explicit scheme approximates a Gaussian. ▶ The diffusion kernel of an M-step implicit scheme is a member of the

Whittle-Matérn or Matérn correlation family.

Link to ensemble estimation.

▶ The Hessian H, and hence D, can be estimated from ensemble

statistics (see later).

▶ H can in turn be related to the aspect tensor A of the Gaussian and

Matérn functions.

▶ A can in turn be related to κ (or L) of the explicit or implicit diffusion

  • perator.

Estimating H(x) at each grid-point x and using it to define κ(x) in the diffusion operator allows us to model anisotropic and inhomogeneous correlation functions.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-37
SLIDE 37

Representing anisotropic correlations with an implicit scheme

The anisotropic implicit diffusion problem with constant tensor L is a straightforward extension of the isotropic problem: γ−1

w (1 − ∇ · L∇)M ψ(x) =

ψ(x) As before, the solution leads to the Whittle-Matérn correlation family C w

d (

r ) with

  • r =

√ (x − x′)T A−1 (x − x′) A = L = 1 2M − d − 2 D = H−1 = ( − ∇∇TC w

d (

r )

  • r=0

)−1 γw = 2dπd/2 Γ(M) Γ(M − d/2) |L|d/2

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-38
SLIDE 38

Anisotropic and inhomogeneous implicit-diffusion kernels

A class of anisotropic and inhomogeneous correlation functions from the Matérn family is (Paciorek and Schervish 2006) C(x, x′) = α ( x, x′) 21−M+d/2 Γ(M−d/2) r M−d/2KM−d/2( r ) where

  • r =

√ (x − x′)T (A(x) + A(x′) 2 )

−1

(x − x′) and α ( x, x′) = |A(x) |1/4 |A ( x′) |1/4

  • 1

2 ( A(x) + A ( x′))

  • −1/2

. These are the approximate kernels of the implicit form of the anisotropic diffusion operator when the aspect tensors vary slowly and smoothly in space.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-39
SLIDE 39

Inhomogeneous implicit-diffusion kernels

Example: SOAR function vs implicit-diffusion kernel with M = 2 when L = L(x).

1 20 40 60 80 100 0.2 0.4 0.6 0.8 1 Correlation Location 3.5 4 4.5 Length scale fM ¯ fM

  • fM

1 20 40 60 80 100 0.2 0.4 0.6 0.8 1 Correlation Location 3.5 4 4.5 Length scale fM ¯ fM

  • fM

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-40
SLIDE 40

Inhomogeneous implicit-diffusion kernels

Example: SOAR function vs implicit-diffusion kernel with M = 2 when L = L(x).

1 20 40 60 80 100 0.2 0.4 0.6 0.8 1 Correlation Location 3.5 4 4.5 Length scale fM ¯ fM

  • fM

1 20 40 60 80 100 0.2 0.4 0.6 0.8 1 Correlation Location 2.5 3 3.5 4 4.5 5 5.5 Length scale fM ¯ fM

  • fM

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-41
SLIDE 41

Estimating the correlation Hessian tensor from ensemble statistics: I

Assume that the covariance function of a random field ϵ is twice differentiable and that the associated correlation function C(x, x′) = C( r ) is homogeneous. If E[ · ] denotes the expectation operator then it can be shown (e.g. Belo Pereira and Berre 2006) that H(x) = − ∇∇TC( r )

  • r=0

= E [ ∇˜ ϵ(x) (∇˜ ϵ(x))T] − ∇σ(x) (∇σ(x))T (σ(x))2 where ˜ ϵ(x) = ϵ(x) − E[ϵ(x)] and (σ(x))2 = E [ (˜ ϵ(x)) 2] .

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-42
SLIDE 42

Estimating the correlation Hessian tensor from ensemble statistics: II

Now assume the availability of a sample of l = 1, . . . , Ne ensemble perturbations εl, or some other proxy for background error. Apply the inverse of the linearized balance operator to εl: K−1εl = ϵl. The sample estimate of the correlation Hessian tensor is

  • H(x)= ∇˜

ϵ(x) (∇˜ ϵ(x))T − ∇ σ(x) (∇ σ(x))T ( σ(x))2 where ∇˜ ϵ(x) (∇˜ ϵ(x))T = 1 Ne − 1

Ne

l=1

∇˜ ϵl(x) (∇˜ ϵl(x))T , ( σ(x))2 = (˜ ϵ(x))2 = 1 Ne − 1

Ne

l=1

(˜ ϵl(x))2 . This equation will be a good approximation of the Hessian tensor when the correlation function is locally homogeneous.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-43
SLIDE 43

Estimating the correlation Hessian tensor from ensemble statistics: III

From the local estimate of H(x), we can invert it to obtain D(x), and specify the rescaled implicit diffusion tensor according to L(x) = 1 2M − d − 2 D(x) The number of elements to estimate is 3N in 2D (or 6N in 3D) for the tensor plus N for the variances, where N is the number of grid points. This is much smaller than the (N2 + N)/2 independent elements required to estimate the full covariance matrix, so sampling errors will be much smaller (in practice, Ne ∼ O(10 − 100)). Local spatial filtering can be used to increase the effective sample size (Berre and Desroziers 2010).

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-44
SLIDE 44

Examples from an idealized numerical experiment

Generate a set of random vectors ϵl, l = 1, . . . , Ne, such that ϵ ∼ N(0, B⋆) where B⋆ is the ‘true’ covariance matrix. B⋆ is Gaussian and constructed using a 2D explicit diffusion operator. The ‘true’ variances are spatially varying with a cosine dependence on x = (x, y). The ‘true’ anisotropic tensor of the diffusion operator is formulated as D⋆(x) = R D(x) R−1 where R is a constant rotation matrix and D(x) is a diagonal tensor. The elements of D(x) are spatially varying with a cosine dependence

  • n x.

The objective here is to try to reconstruct the tensor (and variances)

  • f B⋆ given the ‘ensemble’ perturbations ϵl.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-45
SLIDE 45

Accuracy of the Hessian tensor elements versus ensemble size

10 20 30 40 50 60 70 80 90 100 0.01 0.02 0.03 0.04 0.05 0.06 Members Rmse 10 20 30 40 50 60 70 80 90 100 0.01 0.02 0.03 0.04 0.05 0.06 Members Rmse

Hxx rmse Hxy rmse

—— H —— H + local avg —— H —— H + local avg

  • Filtering using a simple local averaging operator is beneficial

when the ensemble size is small.

  • Optimal filtering is important with larger ensembles.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-46
SLIDE 46

Sample correlations: estimated versus truth

Truth Estimated with Ne = 100 Estimated - Truth

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-47
SLIDE 47

Sample correlations: estimated versus truth

Truth Estimated with Ne = 10 and local averaging Estimated - Truth

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-48
SLIDE 48

Outline

. .

1

Data assimilation in oceanography . .

2

Variational data assimilation . .

3

Characteristics of the background-error covariance matrix . .

4

Correlation modelling with a diffusion operator. Part 1: isotropy, boundary conditions, solution algorithm . .

5

Correlation modelling with a diffusion operator. Part 2: anisotropy, inhomogeneity, ensemble estimation . .

6

Concluding remarks

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-49
SLIDE 49

Concluding remarks

There is an increasing interest in using ensembles to improve the estimation of the background-error covariances in VarDA. A diffusion-based model can be used to synthesize relevant correlation information contained in a small ensemble.

▶ Possible to handle anisotropy, inhomogeneity and complex boundaries.

The choice of explicit or implicit diffusion solver depends on the desired correlation function (Gaussian or Matérn) as well as computational issues.

▶ Implicit schemes are more robust with general tensors, but require

efficient solvers (CG, multigrid,...).

Other applications of the diffusion operator:

▶ Grid-point covariance localization in hybrid Ensemble Var. ▶ Spatial filtering of ensemble-estimated variance and tensor elements. Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-50
SLIDE 50

References I

Balmaseda, M. A., K. Mogensen, F. Molteni and A. T. Weaver, 2010: The NEMOVAR-COMBINE ocean re-analysis. COMBINE Technical report No. 1. Available at http://www.combine-project.eu/Technical-Reports.1688.0.html. Belo Pereira, M. and L. Berre, 2006: The use of an ensemble approach to study the background error covariances in a global NWP model. Mon. Weather Rev., 134, 2466–2489. Berre, L. and G. Desroziers, 2010: Filtering of background-error variances and correlations by local spatial averaging: a review. Mon. Weather Rev., 138, 3693–3720. Guttorp, P. and T. Gneiting, 2006: Miscellanea studies in the history of probability an statistics XLIX: On the Matérn correlation family. Biometrika, 93, 989–995. Mirouze, I. and A. T. Weaver, 2010: Representation of correlation functions in variational assimilation using an implicit diffusion operator. Q. J. R. Meteorol. Soc., 136, 1421–1443. Paciorek, C. J. and M. J. Schervish, 2006: Spatial modelling using a new class of nonstationary covariance functions. Environmetrics, 17, 483–506.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011

slide-51
SLIDE 51

References II

Purser, R. J., Wu, W. S., Parrish, D. F. and N. M. Roberts, 2003. Numerical aspects of the application of recursive filters to variational statistical analysis. Part I: spatially homogeneous and isotropic Gaussian covariances. Mon. Weather Rev., 131, 1524–1535. Weaver, A. T. and P. Courtier, 2001: Correlation modelling on the sphere using a generalized diffusion equation. Q. J. R. Meteorol. Soc., 131, 3605–3625. Weaver, A. T. and S. Ricci, 2004: Constructing a background-error correlation model using generalized diffusion operators. In Proceedings of the ECMWF Seminar Series on “Recent developments in atmospheric and ocean data assimilation”, ECMWF, Reading, U. K., 8–12 September 2003, pp. 327–340. Weaver, A. T. and I. Mirouze, 2011: On the diffusion equation and its application to isotropic and anisotropic correlation modelling. In Revision.

Large-Scale Inverse Problems and Applications in the Earth Sciences, Linz, Austria, 24-28 October 2011