Statistical Inversion for Basal Parameters for the Antarctic Ice - - PowerPoint PPT Presentation

statistical inversion for basal parameters for the
SMART_READER_LITE
LIVE PREVIEW

Statistical Inversion for Basal Parameters for the Antarctic Ice - - PowerPoint PPT Presentation

Statistical Inversion for Basal Parameters for the Antarctic Ice Sheet Tobin Isaac, Nomi Petra, Georg Stadler, Omar Ghattas 1 Institute for Computational Engineering & Sciences The University of Texas at Austin SIAM UQ 2014 Savannah,


slide-1
SLIDE 1

Statistical Inversion for Basal Parameters for the Antarctic Ice Sheet

Tobin Isaac, Noémi Petra, Georg Stadler, Omar Ghattas1

Institute for Computational Engineering & Sciences The University of Texas at Austin

SIAM UQ 2014 Savannah, Georgia April 3, 2014

1Departments of Geological Sciences and Mechanical Engineering Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 1 / 28

slide-2
SLIDE 2

1

Forward Model: Ice Sheet Dynamics

2

Bayesian Inversion via methods of Constrained Optimization

3

Example: Pine Island Glacier

4

Example: Antarctic Ice Sheet These slides can be found at www.ices.utexas.edu/~tisaac/slides.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 2 / 28

slide-3
SLIDE 3

Motivation

Observed surface flow velocity [Rignot et al., 2011] Antarctic ice sheet inversion for the basal friction parameter field using InSAR surface velocity measurements

IPCC Fourth Assessment Report [Meehl et al., 2007]

[T]he uncertainty in the projections of the land ice contributions [to sea level rise] is dominated by the various uncertainties in the land ice models themselves ... rather than in the temperature projections.’’

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 3 / 28

slide-4
SLIDE 4

Equations

Balance of linear momentum, mass, and energy −∇ · [η(θ, u) ˙ ε − Ip] = ρg, [˙ ε = 1

2(∇u + ∇uT)]

∇ · u = 0, ρc ∂θ ∂t + u · ∇θ

  • − ∇ · (K∇θ) = 2 η tr(˙

ε2) Constitutive relations η(θ, u) =

  • A0 exp
  • − Q

− 1

n

˙ ε

1−n 2n

II

[˙ εII = 1

2tr(˙

ε2)] Basal boundary conditions

T(σn) + β(x)Tu = 0

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 4 / 28

slide-5
SLIDE 5

Discretization

High-order finite element discretization: Qp × Qp−2

3 1 4 2

complex geometry high-order accurate LBB stable element-wise mass conservative inf-sup constant independent of h and weakly dependent on p inf-sup constant unaffected by high aspect ratio ice sheet geometry amenable to scalable parallel AMR schemes better efficiency than low-order (increased flops per memop), if there is an efficient solver.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 5 / 28

slide-6
SLIDE 6

Solution

Nonlinear solver is inexact Newton-Krylov method: Newton’s method: linearized Stokes equations have 4th-order tensor coefficient, as do true adjoint equations. Inexact Krylov tolerances [Eisenstat and Walker, 1996]: minimizes Krylov iterations in Newton steps not believed to be in asymptotic regime. Linear solver: trilinear submesh preconditioner for high-order elements [Brown, 2010] Smoothed Aggregation AMG: tolerates heterogeneities/anisotropies of viscous operator better than geometric multigrid ASM(i)/ILU(j) smoother for high-aspect-ratio geometries

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 6 / 28

slide-7
SLIDE 7

Antarctic simulation: scaling

Newton-Krylov Solver for Antarctic Ice Sheet Stokes Equations

50 100 150 200

Krylov iterations

10−13 10−10 10−7 10−4 10−1

rk/r0

p = 3 [18] p = 4 [23] p = 5 [23]

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 7 / 28

slide-8
SLIDE 8

Antarctic simulation: scaling

3 6 9 12 15 10−14 10−12 10−10 10−8 10−6 10−4 10−2 100

Newton iterations (Krylov iterations labeled) ||rk||/||r0|| Newton-Krylov Solver for Antarctic Ice Sheet Stokes Equations

bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC bC

28M dofs, 1024 cores 122M dofs, 4096 cores 122M dofs, 16384 cores

bC bC bC

30 4 7 14 21 54 8 3 8 82 104 33 5 7 17 33 38 6 5 13 75 143 35 5 7 16 40 42 4 3 3 4 4 93 196 356

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 7 / 28

slide-9
SLIDE 9

Inventory

We have Scalable iterative solution of nonlinear forward problem Scalable iterative solution of Newton linearization

Scalable iterative solution of adjoint equations Time to solution is more-or-less linear in the tolerance Algebraic multilevel representation of linear operator

We do not currently have Full multilevel approximation scheme of the nonlinear problem

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 8 / 28

slide-10
SLIDE 10

Bayesian-inversion-in-a-slide

Let f be the map from the parameter m (controlling the basal coefficient β) to the observations through the forward problem: f (m) = B(u), A(u, m) = 0 We assume additive Gaussian noise in the measurements: d = f (m) + e, e ∼ N(0, Γ

noise)

Thus:

πlike(d|m) = exp

  • − 1

2(f (m) − d)TΓ−1

noise(f (m) − d)

  • If the prior is Gaussian with mean mprior and covariance Γ

prior, then we obtain for

the posterior pdf:

πpost(m) ∝ exp

  • − 1

2 f (m) − d 2

Γ−1

noise − 1

2 m − mprior 2

Γ−1

prior

  • Isaac, Petra, Stadler, Ghattas (ICES, UT Austin)

Inversion for Ice Sheet Parameters SIAM UQ 2014 9 / 28

slide-11
SLIDE 11

Posterior statistic 0: MAP point

The maximum a posteriori point is mMAP

def

= arg max

m

πpost(m) = arg min

m

− log πpost(m) [

def

= J(m)] = arg min

m,u 1 2 B(u) − d 2

Γ−1

noise + 1

2 m − mprior 2

Γ−1

prior

such that A(u, m) = 0. This is classical PDE-constrained optimization: 1

2Γ−1

prior characterizes the

regularization.

Γ−1

prior is (should be) given a priori by expert knowledge, not chosen

a posteriori when the solution has desirable regularity (although if ‘‘expert prior knowledge’’ is ‘‘I know it when I see it’’, there is little difference)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 10 / 28

slide-12
SLIDE 12

Scalable MAP point computation via RSINK

The KKT first-order necessary conditions (J′(m) = 0) of a solution m are

A(u, m) = 0

[state],

AT

u(u, m)p = B(u) − d

[adjoint],

AT

m(u, m)p + Γ−1

prior m = 0

[gradient], where u and p are state and adjoint variables. We solve this using Reduced Space Inexact Newton-Krylov: Reduced space: search for a minimum of J(m); given mk, uk and pk satisfy state and adjoint equations. Inexact Newton-Krylov: like inexact Newton-Krylov used to solve state equations

J′′(mk)δk = −J′(mk)

mk+1 = mk + αδk, where α is determined by line search.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 11 / 28

slide-13
SLIDE 13

Necessary components of scalable, efficient RSINK implementation

Scalable, efficient cost evaluation ⇔ Scalable, efficient nonlinear forward solver Scalable, efficient gradient computation Scalable, efficient (Gauss-)Newton Hessian application

H(m)

def

= J′′(m).

The Hessian, or an approximation, is useful for UQ: please attend Noémi’s talk in MS7 three days ago for more details.

Scalable, efficient (Gauss-)Newton Hessian preconditioner

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 12 / 28

slide-14
SLIDE 14

Adjoint-methods-in-a-slide

If uk and pk solve state and adjoint equations (uk computed from mk, pk computed form uk and mk), then

J′(m) = pT

kAm(uk, mk) + mT kΓ−1

prior

All terms in equations have finite element expressions (not differentiating matrices).

Marginal cost of J′(m), given J(m): one linear PDE solve.

Linearizing the first-order KKT system of equations results in set of PDEs for computing Hessian action

Computing H(mk) ˜ m costs two (‘‘incremental’’) linear PDE solves with the same operators as Newton step / adjoint. Another opportunity to exploit inexactness: loose tolerance on incrementals approximates H(mk), but quadratic/superlinear convergence of inexact Newton only requires accurate Hessian near the solution [Hicken and Alonso, 2013].

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 13 / 28

slide-15
SLIDE 15

Prior preconditioning the Hessian

We precondition the full Hessian H = H

misfit + Γ−1 prior with Γ prior, so the

preconditioned system is

H

misfit Γ prior + I ∼ Γ

1/2

prior H misfit Γ

1/2

prior + I.

The justification from PDE-constrained optimization: The precision Γ−1

prior is a differential operator.

The prior Γ

prior is compact (low effective rank), so H misfit Γ prior is compact.

Identity + compact ⇒ good (h-independent) Krylov method convergence. The implication for statistical inversion: Covariance Γ

post of Gaussianized posterior distribution is H(mMAP)−1.

The effective numerical rank of H

misfit Γ prior is the dimension of the subspace

  • f parameters informed by the data.

The more information the data contains, the less compact is H

misfit Γ prior Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 14 / 28

slide-16
SLIDE 16

Posterior statistic 1: MAP covariance

To compute an estimate of Γ

post, we compute a low-rank approximate

eigendecomposition of Γ

1/2

prior H misfitΓ

1/2

prior . There are two main approaches:

Lanczos methods, randomized SVD [Halko et al., 2011], combinations (e.g. block Lanczos). Note that in both cases, it is not necessary to have Γ

1/2

prior , as one can use

Lanczos on H

misfit preconditioned by Γ prior, or

a randomized Γ

prior-orthogonal SVD of H misfit.

Note that randomized methods are inherently more parallel than Lanczos methods.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 15 / 28

slide-17
SLIDE 17

Sampling the Gaussianized posterior distribution

However we do our eigendecomposition, we end up with an approximation

  • f Γ

post of the form

Γ

post ≈ Γ prior − WrDrW T

r ,

where W T

r Γ−1

prior Wr = Ir.

How to generate a sample z ∼ N(0, Γ

post)? Given x ∼ N(0, Γ prior),

z = (I + Wr((Ir − Dr)1/2 − Ir)W T

r Γ−1

prior )x

Depending on its form, sampling Γ

prior may itself be non-trivial:

requires access to some symmetric factorization Γ

prior = GGT (such as Γ1/2 prior )

  • r a good approximation thereof.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 16 / 28

slide-18
SLIDE 18

Inventory

Scalable, efficient nonlinear forward and adjoint solvers buy us

cost and gradient evaluation, and Hessian application.

Assuming the information gained from the data is low rank:

a good approximation of Γ

prior buys us fast, h-independent solution of systems

involving the reduced Hessian H(m); drawing samples from N(0, Γ

post) is as easy as drawing samples from

N(0, Γ

prior). Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 17 / 28

slide-19
SLIDE 19

Bayesian inference for the basal friction field: InSAR data (Pine Island region)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 18 / 28

slide-20
SLIDE 20

Bayesian inference for the basal friction field: InSAR data (Pine Island region)

Left: InSAR-based Antarctica ice surface velocity observations Right: Inferred basal friction field (MAP point)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 19 / 28

slide-21
SLIDE 21

Bayesian inference for the basal friction field: InSAR data (Pine Island region)

Left: Recovered ice surface velocity observations Right: Inferred basal friction field (MAP point)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 19 / 28

slide-22
SLIDE 22

Bayesian inference for the basal friction field: InSAR data (Pine Island region)

Left: Prior pointwise standard deviation Right: Posterior pointwise standard deviation

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 20 / 28

slide-23
SLIDE 23

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-24
SLIDE 24

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-25
SLIDE 25

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-26
SLIDE 26

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-27
SLIDE 27

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-28
SLIDE 28

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-29
SLIDE 29

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-30
SLIDE 30

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-31
SLIDE 31

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-32
SLIDE 32

Samples from posterior distribution

Left: Inferred basal friction field (MAP point) Right: Sample from the posterior pdf

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 21 / 28

slide-33
SLIDE 33

Antarctic synthetic inversion for β (deterministic)

Setup & performance

# state parameters: 4,085,841 # inversion parameters: 409,545 # elements: 99,984 # of cores: 1024 inexact Newton-CG noise level used to synthesize data: 1%

Stampede, Texas Advanced Computing Center (TACC)

reduction in norm of gradient: 10−6

# of Newton iterations: 18

average # of CG iterations per Newton iteration: 85 total # of (linearized) Stokes solves: 2600 (mostly in application

  • f Hessian)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 22 / 28

slide-34
SLIDE 34

Antarctic ice sheet inversion for basal friction field: InSAR data

Left: InSAR-based Antarctica ice surface velocity observations Right: Inferred basal friction field (MAP point)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 23 / 28

slide-35
SLIDE 35

Antarctic ice sheet inversion for basal friction field: InSAR data

Left: Recovered ice surface velocity observations Right: Inferred basal friction field (MAP point)

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 23 / 28

slide-36
SLIDE 36

Antarctic ice sheet inversion for basal friction field: InSAR data

Prior preconditioned spectra

100 200 300 400 500 10

−2

10 10

2

10

4

10

6

10

8

number eigenvalue 1000 2000 3000 4000 5000 10

−2

10 10

2

10

4

10

6

10

8

number eigenvalue

Left: prior preconditioned spectrum of H

misfit for Pine Island Glacier

Right: prior preconditioned spectrum of H

misfit for Antarctica

Condition number of HΓ

prior is O(108)! Possible ways forward:

Review trust in Γ

noise and Γ prior

Take into account model error Better preconditioning than just Γ

prior Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 24 / 28

slide-37
SLIDE 37

Summary

Inversion for ice sheet parameters typifies many statistical inverse problems from geoscience applications:

large scale, difficult nonlinear forward problem, complex domain.

Scalable, efficient solvers such as inexact Newton-Krylov apply not just to the forward problem but to related adjoint problems as well. Methods of PDE-constrained optimization, particularly those exploiting adjoint-based gradient and Hessian computations, are the basis of scalable calculation of MAP points and posterior covariances. In applying our techniques to real data, we are testing the limits of the low-rank assumption: the information gain in Bayesian inversion has a profound effect on the solvability of systems involving the Hessian.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 25 / 28

slide-38
SLIDE 38

Thank you! Questions?

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 26 / 28

slide-39
SLIDE 39

References I

Jed Brown. Efficient nonlinear solvers for nodal high-order finite elements in 3D. Journal of Scientific Computing, 45(1-3):48--63, 2010. doi: 10.1007/s10915-010-9396-8.

  • S. C. Eisenstat and H. F. Walker. Choosing the forcing terms in an inexact Newton
  • method. SIAM Journal on Scientific Computing, 17:16--32, 1996.

Nathan Halko, Per-Gunnar Martinsson, and Joel A. Tropp. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix

  • decompositions. SIAM Review, 53(2):217--288, 2011.

Jason E Hicken and Juan J Alonso. Comparison of reduced-and full-space algorithms for pde-constrained optimization. In 51st AIAA Aerospace Sciences Meeting, no. AIAA--2013--1043, Grapevine, Texas, United States, 2013.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 27 / 28

slide-40
SLIDE 40

References II

  • G. A. Meehl, T. F. Stocker, W. D. Collins, A. T. Friedlingstein, A. T. Gaye, J. M.

Gregory, A. Kitoh, R. Knutti, J. M. Murphy, A. Noda, S. C. B. Raper, I. G. Watterson, A. J. Weaver, and Z.-C. Zhao. Global climate projections. In Climate Change 2007: The Physical Science Basis. Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, pages 747--845. Cambridge University Press, 2007. E Rignot, J Mouginot, and B Scheuchl. Ice flow of the antarctic ice sheet. Science, 333(6048):1427--1430, 2011.

Isaac, Petra, Stadler, Ghattas (ICES, UT Austin) Inversion for Ice Sheet Parameters SIAM UQ 2014 28 / 28