Probabilistic UQ using PDEs with Random Data: A Case Study Oliver - - PowerPoint PPT Presentation
Probabilistic UQ using PDEs with Random Data: A Case Study Oliver - - PowerPoint PPT Presentation
Institut fr Numerische Mathematik und Optimierung Probabilistic UQ using PDEs with Random Data: A Case Study Oliver Ernst December 12, 2011 RICAM Special Semester on Multiscale Simulation & Analysis in Energy and the Environment
Collaborators, Acknowledgments
Ingolf Busch, Björn Sprungk TU Freiberg, Institute of Numerical Analysis and Optimization Gerald van den Boogaart TU Freiberg, Institute of Stochastics
- K. Andrew Cliffe
U of Nottingham, School of Mathematical Sciences Elisabeth Ullmann U of Bath, Department of Mathematical Sciences
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 1
Uncertainty Quantification (UQ)
Goal: Account for the fact that numerical simulations in science and engineering are typically based on data which is uncertain. Enabling factors: mature, sophisticated numerical software, highly parallel computing hardware, growing abundance of data. Here: uncertainty modeled as probabilistic. (There are other ways.) Leads to elliptic BVP with random data. Case study: groundwater flow at WIPP Compare 3 different UQ methods for obtaining quantity of interest (contaminant particle travel time). Emphasis on probabilistic model.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 2
Uncertainty Quantification (UQ)
Goal: Account for the fact that numerical simulations in science and engineering are typically based on data which is uncertain. Enabling factors: mature, sophisticated numerical software, highly parallel computing hardware, growing abundance of data. Here: uncertainty modeled as probabilistic. (There are other ways.) Leads to elliptic BVP with random data. Case study: groundwater flow at WIPP Compare 3 different UQ methods for obtaining quantity of interest (contaminant particle travel time). Emphasis on probabilistic model.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 2
Uncertainty Quantification (UQ)
Goal: Account for the fact that numerical simulations in science and engineering are typically based on data which is uncertain. Enabling factors: mature, sophisticated numerical software, highly parallel computing hardware, growing abundance of data. Here: uncertainty modeled as probabilistic. (There are other ways.) Leads to elliptic BVP with random data. Case study: groundwater flow at WIPP Compare 3 different UQ methods for obtaining quantity of interest (contaminant particle travel time). Emphasis on probabilistic model.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 2
Uncertainty Quantification (UQ)
Goal: Account for the fact that numerical simulations in science and engineering are typically based on data which is uncertain. Enabling factors: mature, sophisticated numerical software, highly parallel computing hardware, growing abundance of data. Here: uncertainty modeled as probabilistic. (There are other ways.) Leads to elliptic BVP with random data. Case study: groundwater flow at WIPP Compare 3 different UQ methods for obtaining quantity of interest (contaminant particle travel time). Emphasis on probabilistic model.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 2
Outline
WIPP Groundwater Flow Problem Parametrization of Random Fields Solution methods for PDEs with Random Data Numerical Results Concluding Remarks
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 3
Outline
WIPP Groundwater Flow Problem Parametrization of Random Fields Solution methods for PDEs with Random Data Numerical Results Concluding Remarks
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 4
WIPP - Waste Isolation Pilot Plant
US DOE repository for radioactive waste situated in New Mexico. Fully operational since 1999. Extensive site characterization and performance assessment since 1976; certification/recertification by US EPA (every 5 years). Large amount of publicly available data.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 5
WIPP Groundwater Flow Problem
Repository located at a depth of 655m within bedded evaporites, primarily halite (salt). The most transmissive rock in the region is the Culebra Dolomite. In the event of an accidental breach, Culebra would be the principal pathway for transport of radionuclides away from the repository.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 6
WIPP Scenario
Uncertainty Quantification Problem
Release of radionuclides by means of a borehole drilled into the repository. Transport of radionuclides by groundwater through the Culebra Dolomite. Travel time from release point in the repository to the boundary of the region is an important quantity. Flow is two-dimensional to a good approximation. UQ Problem: permeablilty properties of subsurface available only at few observation points. Approach: Model lack of information stochastically, propagate random input data to travel time.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 7
Groundwater Flow Model
Stationary Darcy flow q = −K∇p q : Darcy flux K : hydraulic conductivity p : hydraulic head mass conservation ∇· u = 0 u : pore velocity q = φu φ : porosity transmissivity T = Kb b : aquifer thickness particle transport ˙ x(t) = −T(x) bφ ∇p(x) x : particle position x(0) = x0 x0 : release location Quantity of interest: particle travel time to reach WIPP boundary (actually, its log).
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 8
PDE with Random Coefficient
Primal form of Darcy equations: ∇·[T(x)∇p(x)] = 0, x ∈ D, p = p0 along ∂D. Model T as a random field (RF) T = T(x, ω), ω ∈ Ω, x ∈ D with respect to underlying probability space (Ω, A, P). Assumptions: T has finite mean and covariance T(x) = E [T(x, ·)] , x ∈ D, CovT (x, y) = E
- T(x, ·) − T(x)
T(y, ·) − T(y)
- ,
x, y ∈ D. T is lognormal, i.e., Z(x, ω) := log T(x, ω) is a Gaussian RF. CovZ is stationary, isotropic, and of Matérn type.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 9
WIPP Data
6.05 6.1 6.15 6.2 x 10
5
3.57 3.575 3.58 3.585 3.59 3.595 x 10
6
UTM Easting UTM Northing above median below median WIPP site boundary
transmissivity measurements at 38 test wells head measurements, used to
- btain boundary data via
statistical interpolation (kriging) constant layer thickness of b = 8m constant porosity of φ = 0.16 SANDIA Nat. Labs reports
[Caufman et al., 1990] [La Venue et al., 1990]
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 10
Outline
WIPP Groundwater Flow Problem Parametrization of Random Fields Solution methods for PDEs with Random Data Numerical Results Concluding Remarks
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 11
Parametrization of Random Fields
Merge transmissivity data with statistical model
Given: Measurements of transmissivity Assumption: Matérn covariance structure (contains parameters) Regression model Z(x) = k
i=1 βifi(x) for mean (incorporate
geological information) Procedure: (1) {fi}k
i=1 yield point estimates of parameters in Matérn covariance
function via restricted maximum likelihood estimation (RML). (2) Condition RF Z = log T on transmissivity measurements. (for Gauss RF coincides with kriging, BLUP, leads to low-rank modification of covariance operator.) (3) Mean of conditioned field is kriging interpolant. (4) Approximate log T by truncated Karhunen-Loève expansion.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 12
Parametrization of Random Fields
Covariance operator
Covariance function of RF Z ∈ L∞(D) ⊗ L2
P(Ω)
c(x, y) = CovZ(x, y) := E
- Z(x, ·) − Z(x)
- Z(y, ·) − Z(y)
- symmetric in x, y ∈ D, positive semidefinite, and continuous on D × D
if continuous along ‘diagonal’ {(x, x) : x ∈ D}. The covariance operator C = CZ : L2(D) → L2(D), (Cu)(x) =
- D
u(y) c(x, y) dy is therefore selfadjoint, compact, nonnegative. Its eigenvalues {λm}m∈N form a nonincreasing sequence accumulating at most at 0.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 13
Parametrization of Random Fields
Karhunen-Loève expansion
Exists sequence of RV {ξm}m∈N ⊂ L2
P(Ω),
E [ξm] = 0, E [ξkξm] = δk,m, such that expansion in eigenfunctions {Zm}m∈N Z(x, ω) = Z(x) +
∞
- m=1
- λm Zm(x) ξm(ω)
converges in L∞(D) ⊗ L2
P(Ω).
[Karhunen, 1947], [Loève, 1948]
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 14
Parametrization of Random Fields
Karhunen-Loève expansion and variance
For normalized eigenfunctions Zm(x), VarZ(x) := c(x, x) =
∞
- m=1
λmZm(x)2, Total variance:
- D
VarZ(x) dx =
∞
- m=1
λm (Zm, Zm)D
- =1
= trace C. For constant variance (e.g., stationary RF), VarZ ≡ σ2 > 0,
- m
λm = |D| σ2. Interpretation: M first covariance eigenmodes form best rank-M approximation to C in sense of retaining maximal amount of variance.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 15
Parametrization of Random Fields
Truncated KL expansion
Truncate KL expansion after M leading terms: Z(M)(x, ω) = Z(x) +
M
- m=1
- λm Zm(x) ξm(ω).
Truncation error E
- Z − Z(M)2
L2(D)
- =
∞
- m=M+1
λm. Choose M to retain sufficient fraction δ ∈ (0, 1) of total variance, i.e., E
- Z − Z(M)2
L2(D)
- E
- Z2
L2(D)
- =
∞
m=M+1 λm
∞
m=1 λm
= 1 −
M
m=1 λm
|D|σ2 < δ.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 16
Matèrn Covariance Functions
c(x, y) = cθ(r) = σ2 2ν−1 Γ(ν)
- 2√ν r
ρ
ν
Kν
- 2√ν r
ρ
- ,
r = x − y2 Kν : modified Bessel function of order ν Parameters θ = (σ2, ρ, ν) σ2 : variance ρ : correlation length ν : smoothness parameter
Special cases: ν = 1
2 :
c(r) = σ2 exp(− √ 2r/ρ) exponential covariance ν = 1 : c(r) = σ2
2r ρ
- K1
- 2r
ρ
- Bessel covariance
ν → ∞ : c(r) = σ2 exp(−r2/ρ2) Gaussian covariance
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 17
Matèrn Covariance Functions
c(x, y) = cθ(r) = σ2 2ν−1 Γ(ν)
- 2√ν r
ρ
ν
Kν
- 2√ν r
ρ
- ,
r = x − y2 Kν : modified Bessel function of order ν Parameters θ = (σ2, ρ, ν) σ2 : variance ρ : correlation length ν : smoothness parameter
Special cases: ν = 1
2 :
c(r) = σ2 exp(− √ 2r/ρ) exponential covariance ν = 1 : c(r) = σ2
2r ρ
- K1
- 2r
ρ
- Bessel covariance
ν → ∞ : c(r) = σ2 exp(−r2/ρ2) Gaussian covariance
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 17
Matérn Covariance Functions
1 2 3 4 5 0.2 0.4 0.6 0.8 1 r c(r) ν=1/2 ν=1 ν=2 ν=3 Gauss
ρ = 1
1 2 3 4 5 0.2 0.4 0.6 0.8 1 r c(r) ν=1/2 ν=1 ν=2 ν=3 Gauss
ρ = 3
Smoothness of realizations: RF Z(·, ω) is s times mean-square differentiable if and only if ν > s.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 18
Matérn Covariance Functions
Eigenvalue asymptotics
Eigenvalue Decay: the smoother the kernel, the faster {λm}m∈N → 0. More precisely: if D ⊂ Rd, then if the kernel function c is piecewise Hs : λm ≤ c1m−s/d piecewise smooth : λm ≤ c2m−s for any s > 0 piecewise analytic : λm ≤ c3 exp(−c4m1/d) for suitable constants c1, c2, c3, c4. Note: piecewise smoothness of kernel also leads to bounds on derivatives of eigenfunctions am in L∞(D). Proven e.g. in [Schwab & Todor, 2006], [Todor, 2006]
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 19
Matérn Covariance Functions
Eigenvalue preasymptotics
Before asymptotic decay (determined by the smoothness ν of the kernel) sets in, there is a preasymptotic plateau determined by the correlation length parameter ρ.
10 10
1
10
2
10
−6
10
−4
10
−2
10
m λm
ν=1/2, ρ=1 ν=1/2, ρ=1/10 ν=1/2, ρ=1/50 ν=1, ρ=1 ν=1, ρ=1/10 ν=1, ρ=1/50
Eigenvalue decay, Matérn covariance kernel, D = [−1, 1].
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 20
The Covariance Eigenvalue Problem
Numerical solution
Piecewise constant Galerkin discretization on triangles/tetrahedra. Results in generalized matrix eigenvalue problem Cx = λMx, where M can be chosen diagonal but C is generally dense. Construction/storage of and multiplication with Galerkin matrix via hierarchical matrix technique. Near-field Galerkin blocks using adapted quadrature. Thick-restart Lanczos method for dominant eigenpairs of given
- rder, extended to block variant for multiple eigenpairs.
Conditioning of covariance on measured data via low-rank correction.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 21
The Covariance Eigenvalue Problem
Performance
Bessel covariance (ν = 1) on D = [−1, 1]2, ρ = 1/4. Compute leading 300 eigenpairs on sequence of uniformly refined triangular grids, resulting in N = 104 to N = 3 014 656 DOF. Used Krylov space of dimension 500.
10
1
10
2
10
3
10
4
10
5
10
6
10
7
10
−2
10 10
2
10
4
10
6
10
8
N time matrix generation [s]
Bessel kernel Gaussian kernel N log N 10
1
10
2
10
3
10
4
10
5
10
6
10
7
10
−2
10 10
2
10
4
10
6
10
8
N time eigenvalue solve [s]
Bessel kernel Gaussian kernel N
hierarchical matrix generation eigenvalue calculation
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 22
WIPP KL Modes
Conditioned on 38 transmissivity observations unconditioned, m = 1, 2, 9, 16 conditioned, m = 1, 2, 9, 16
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 23
WIPP KL Truncation
Portion of captured variance
20 40 60 80 100 120 140 160 180 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
- No. retained KL modes (M)
Portion of total variance ()
unconditioned conditioned
RML: ˆ σ2 = 3.57, ˆ ρ = 9865, ˆ ν = 0.58.
M δ 5 77.16 % 10 81.28 % 15 83.81 % 50 90.14 % 1000 99.11 %
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 24
Restricted Maximum Likelihood Estimation
Goal: estimate covariance parameters θ = (σ2, ρ, ν) from observations ζ = {ζj}n
j=1 of Z = log T without introducing bias from estimation of
regression parameters β in model for mean Z(x) =
k
- i=1
βifi(x) Random vector Z(ω) ∈ Rn at measurement points {xj}n
j=1.
Here Z = Fβ, [F]i,j = fj(xi). Once covariance parameters θ known obtain β by minimizing weighted least squares functional ζ − Fβ2
C −1
θ
:= (ζ − Fβ)⊤C −1
θ (ζ − Fβ)
[Cθ]i,j = cθ(xi, xj), i, j = 1, . . . , n.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 25
Restricted Maximum Likelihood Estimation
Bias from estimation of β avoided by maximum likelihood estimation of transformed random vector Z ′ := PZ, P : Rn → range(F)⊥ orthogonal projection. Then, regardless of β, E
Z ′ = E
- P(Fβ +
Z)
- = 0.
Now maximize likelihood function: joint probability density of Z ′ as function of θ to obtain RML estimate for θ.
[Patterson & Thompson, 1971], [Kitanidis, 1983]
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 26
Restricted Maximum Likelihood Estimation
Regression functions for WIPP
For x = (x1, x2) ∈ D used Z(x) = β1 + β2x1 + β3x2 + β4d(x) + β5i(x) d(x) : thickness of Culebra overburden at x, i(x) : indicator function for geological zone 1
/&
- Distinct geological zones at WIPP (Source: WIPP CRA 2009)
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 27
Restricted Maximum Likelihood Estimation
RML results for different regression functions
constant: σ2 = 3.56, ρ = 9 900, ν = 0.59 linear: σ2 = 1.09, ρ = 2 100, ν = 1.09 linear, overburden: σ2 = 1.15, ρ = 2 200, ν = 1.07 linear, overburden, zone: σ2 = 0.72, ρ = 1 500, ν = 1.22
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 28
Kriging
Best unbiased linear prediction
Georges Matheron, 1970s: “le krigeage”, “statistical interpolation” Given mean zero random field Z = Z(x, ω) with known covariance structure, observed at n spatial locations Z(xj) = ζj, j = 1, . . . , n. Seek “linear predictor” ˆ Z(x, ω) := m(x)⊤Z(ω), m(x) =
m1(x) . . . mn(x)
,
Z(ω) =
Z(x1, ω) . . . Z(xn, ω)
which is unbiased, i.e. E
ˆ
Z(x, ·)
- = E [Z(x, ·)] for all x ∈ D, and
minimizes the kriging error E
Z(x, ·) − m(x)⊤Z(·) 2
for each x ∈ D.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 29
Kriging
Best unbiased linear prediction
This results in ˆ Z(x, ω) = c(x)⊤C −1ζ, where c(x) =
c(x, x1) . . . c(x, xn)
,
[C]i,j = c(xj, xn), ζ =
ζ1 . . . ζn
.
Conditioned covariance function is then the covariance function of the residual field R = Z − ˆ Z: cR(x, y) = c(x)⊤C −1c(y).
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 30
Kriging
Variants: simple kriging: Z ≡ 0.
- rdinary kriging: constant
unknown mean universal kriging: non-constant mean obtained by linear regression (used here).
Right: Kriging contours of WIPP trans- missivity data.
6.05 6.1 6.15 6.2 x 10
5
3.57 3.575 3.58 3.585 3.59 3.595 x 10
6
UTM Easting UTM Northing 6.5 6 5.5 5 4.5 4 3.5 Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 31
Outline
WIPP Groundwater Flow Problem Parametrization of Random Fields Solution methods for PDEs with Random Data Numerical Results Concluding Remarks
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 32
Solution methods for PDEs with Random Data
Deterministic Parametric Representation
Parametrize input RF by vector of independent RV {ξm}M
m=1 =: ξ.
If ξm has density ρm and Ξm := ξm(Ω), then (Doob-Dynkin lemma) L2
P(Ω) ≃ L2 ρ(ΞM),
where ΞM := ×M
m=1Ξm, ρ = M
- m=1
ρm. Z(x, ω) ← Z(x, ξ), p(x, ω) ← p(x, ξ), . . . BVP becomes purely deterministic with (possibly) high-dimensional parameter space: ∇·[T(x, ξ)∇p(x, ξ)] = 0, x ∈ D, ρ - a.e., p(x, ξ) = p0(x), x ∈ ∂D, ρ - a.e., where log T(x, ξ) = Z(x) +
M
- m=1
- λm Zm(x) ξm.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 33
Travel-time computations
Generate sufficiently large ensemble of log-travel times s(ξ) = log t(ξ) Compute empirical CDF to quantify uncertainty in travel time. Three sampling methods: (1) Monte Carlo (MC) sampling of RV ξ → s(ξ). (NMC solutions of PDE) (2) Stochastic collocation (SC) → RF representation of velocity field uNSC(x, ξ), use this to sample s(ξ). (NSC solutions of PDE) (3) Gaussian process emulator: NDP MC samples of s(ξ) used to calibrate surrogate of mapping ξ → s(ξ), use this surrogate to sample s(ξ). (NDP solutions of PDE)
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 34
Stochastic Collocation
For an elliptic BVP with M-dimensional random parameter ξ ∈ ΞM in variational form: find u(ξ) ∈ X such that a(u(ξ), v) = ℓ(v) ∀v ∈ X, (1) stochastic collocation constructs a polynomial approximation uN(ξ) from the solutions u(ξj) at N collocation points {ξj}N
j=1 ⊂ ΞN.
Variants: (1) Full tensor product on ΞM with multivariate Lagrange interpolation (curse of dimensionality). (2) Smolyak sparse grids based on zeros of Hermite polynomials with associated interpolant (used here). (3) Elliptic case: [Tatang & al., 1997], [Mathelin & Husseini, 2003], [Xiu &
Hesthaven, 2005], [Babuška & al., 2007] [Schwab & al., 2009–]
(4) Approximation and convergence result extended to lognormal coefficient: [E. & Sprung, 2011], [Charrier, 2010]
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 35
Stochastic Collocation
Full tensor vs. Smolyak grids
−10 −8 −6 −4 −2 2 4 6 8 10 −10 −8 −6 −4 −2 2 4 6 8 10
#Nodes = 1089
−10 −8 −6 −4 −2 2 4 6 8 10 −10 −8 −6 −4 −2 2 4 6 8 10
#Nodes = 301
Full-Tensor and Smolyak sparse grids of level q = 5 for Gauss-Hermite abscissas (ni = 2i−1 + 1)
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 36
Gaussian Process Emulators
An emulator is a statistical approximation to the output of a computer code y = f(x). Basic idea: (1) Represent the code, f(·) as a Gaussian stochastic process f(x, ω) =
n
- i=1
βihi(x) + Z(x, ω). (2) Run model for sample of design inputs x and observe outputs y. (3) Condition GP on observed outputs y. (4) Emulator provides a distribution function for the output of the computer code. (5) Use emulator as a surrogate for computer model when performing MC analysis.
[Kennedy & O’Hagan, 2001], [Stone, 2011]
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 37
WIPP FE Discretization
mixed FE discretization, lowest order RT elements for q, pcw. constants for p. fixed mesh, 7124 triangles, respects WIPP boundary flow divergence-free ⇒ discrete fluxes pcw. constant, makes particle trajectory calculation trivial.
0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 x 10
4
1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 x 10
4
x y
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 38
Outline
WIPP Groundwater Flow Problem Parametrization of Random Fields Solution methods for PDEs with Random Data Numerical Results Concluding Remarks
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 39
Numerical Results
Computational Schemes
(1) Random transmissivity field T(x, ω) from data (2) KL expansion (3) Solve PDE with random diffusion coefficient to obtain realizations
- f flux field q = q(x, ω)
(4) MC + empirical CDF for log travel time. Step (3) performed by straight MC stochastic collocation Gaussian process emulator
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 40
Numerical Results
Collocation error
Error measures using MC reference calculation with NMC = 20, 000. expected value of collocation error for head E
- ph − ph,refL2(D)
1/2
This is the norm in L2(D) ⊗ L2
P(Ω).
norm of expected value of collocation error for head E [p − ph,ref] L2(D) error in variance Var(ph) − Var(ph,ref)L2(D). Same for flux field and log travel time.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 41
Stochastic Collocation Errors vs. MC solution
10 10
2
10
4
10
4
10
3
10
2
- rel. error
head 10 10
2
10
4
10
5
10
4
10
3
- rel. error
head mean 10 10
2
10
4
10
2
10
1
10
- rel. error
head variance 10 10
2
10
4
10
0.5
10
0.3
10
0.1
- rel. error
flux 10 10
2
10
4
10
2
10
1
10
- rel. error
flux mean 10 10
2
10
4
10
- rel. error
flux variance 10 10
2
10
4
10
3
10
2
10
1
- rel. error
log tt # nodes 10 10
2
10
4
10
5
10
- rel. error
log tt mean # nodes 10 10
2
10
4
10
3
10
2
10
1
10 #nodes
- rel. error
log tt variance SC SC2 SC SC2 SC SC2 SC SC2 SC SC2 SC SC2 SC SC2 SC SC2 SC SC2
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 42
Travel Time CDFs
M = 5
4 4.5 5 5.5 6 6.5 7 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 log10 t ECDF of log travel time, 5 KL modes, 20 000 samples MC SC2 L=2 SC L=2 GE 100
4.35 4.4 4.45 4.5 4.55 4.6 4.65 0.02 0.04 0.06 0.08 0.1 log10 t zoom on early times Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 43
Travel Time CDFs
M = 10
4 4.5 5 5.5 6 6.5 7 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 log10 t ECDF of log travel time 10 KL modes, 20 000 samples MC SC2 L=2 SC L=2 GE 200
4.35 4.4 4.45 4.5 4.55 4.6 4.65 4.7 0.02 0.04 0.06 0.08 0.1 0.12 log10 t
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 44
Kolmogorov-Smirnov Test
M = 5
Statistical test to determine whether the empirical CDFs were drawn from the same distribution as pure MC with 2K resp. 20K samples. Significance level: α = 0.05.
Surrogate PDE solves KS-test (2K) KS-test (20K) GE 50
- ✗
100
- ✗
SC L=0 1 ✗ ✗ SC L=1 11
- ✗
SC L=2 71
- SC L=3
351
- SC2 L=0
1 ✗ ✗ SC2 L=1 11 ✗ ✗ SC2 L=2 61
- ✗
SC2 L=3 241
- Oliver Ernst (TU Freiberg)
UQ via PDEs with Random Data: A Case Study Linz, December 2011 45
Kolmogorov-Smirnov Test
M = 15 Surrogate PDE solves KS-test (2K) KS-test (20K) GE 150 ✗ ✗ 300 ✗ ✗ SC L=0 1 ✗ ✗ SC L=1 31 ✗ ✗ SC L=2 511 ✗
- SC L=3
5951
- SC2 L=0
1 ✗ ✗ SC2 L=1 31 ✗ ✗ SC2 L=2 481
- ✗
SC2 L=3 5021
- Oliver Ernst (TU Freiberg)
UQ via PDEs with Random Data: A Case Study Linz, December 2011 46
Neglected Variance
4 4.5 5 5.5 6 6.5 7 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 M = 5 M = 10 M = 15 M = 50 M = 1000
Empirical CDF of log t based on KL approximations of different length.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 47
Outline
WIPP Groundwater Flow Problem Parametrization of Random Fields Solution methods for PDEs with Random Data Numerical Results Concluding Remarks
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 48
Concluding Remarks
Summary: 3 methods for PDEs with random data for WIPP UQ scenario. Emphasized geostatistical model from data. Collocation convergence analysis for lognormal case. Collocation and emulators show potential for outperforming MC. Covering most of the variance requires more powerful methods. In progress: Anisotropic and sparse tensor collocation methods MC error quantification (Berry-Esseen bounds) Outlook: Bayesian inversion of transmissivity field using SC approximation as surrogate.
Oliver Ernst (TU Freiberg) UQ via PDEs with Random Data: A Case Study Linz, December 2011 49