Inverse Modeling with the aid of Surrogate Models
Dongxiao Zhang, Qinzhuo Liao, Haibin Chang College of Engineering Peking (Beijing) University dxz@pku.edu.cn The 2017 EnKF Workshop
Inverse Modeling with the aid of Surrogate Models Dongxiao Zhang, - - PowerPoint PPT Presentation
Inverse Modeling with the aid of Surrogate Models Dongxiao Zhang, Qinzhuo Liao, Haibin Chang College of Engineering Peking (Beijing) University dxz@pku.edu.cn The 2017 EnKF Workshop Inverse modeling Estimate parameters from physical
Dongxiao Zhang, Qinzhuo Liao, Haibin Chang College of Engineering Peking (Beijing) University dxz@pku.edu.cn The 2017 EnKF Workshop
Inverse modeling
2
input parameter: θ model: f s = f(θ) + e Input
Output
Model
Stochastic approach
− Use Monte Carlo simulations to construct a Markov chain − Computationally expensive: repeated evaluations of the forward model
− Can generate a large number of samples at low cost − Posterior error depends on forward solution error
3
( ) ( ) ( | ) ( | ) ( ) s f e p p s p s p s
( ): prior density ( | ): posterior density ( | ):likelihood function ( ): normalization factor p p s p s p s
which has a finite (random) dimensionality.
where
1 2
where ( , ,..., )T
N
ξ
ˆ ( ; , ) ( ) ( ) ( , ) ( ) ( )
P P
L u x w p d g x w p d
ˆ( , ) , where trial function space u x V V
( ) , where test (weighting) function space w W W
( ) probability density function of ( ) p ξ
Spanos, 1991]:
Sarma et al., 2005; Li and Zhang, 2007, 2009]:
Xiu and Hesthaven, 2005; Chang and Zhang, 2009]:
1 1
( ) , ( )
M M i i i i
V span W span
1 1
( ) , ( )
M M i i i i
V span W span
1 1
( ) , ( )
M M i i i i
V span L W span
1
where { ( )} lagrange interpolation basis
M i i
L
1
where ( )
M i i
fields
with finite dimensions
Lagrange interpolation basis
Stochastic collocation method (SCM)
7
1
1
1 ( , ) 1 ... is univariate interpolation
N
q i i i q N i q
N q N U U q i U
1 1 1
is a set of nodes in
( ) ( ) ( ) ( ) , ( ) ,1 ,
M N i i M i i i M j i i j ij j i j j i
M N f f L L L i j M
Xiu and Hesthaven (2005) Xiu and Hesthaven (2005) Chang & Zhang (2009) Lin & Tartakovsky (2009)
1 2 3
1 2 3
1 2 3
1 2 3
Each dimension: knots dimension:
N
m N M m For N>1, preserving interpolation property of N=1 with a small number
2nd order PCM: 28 representations, = 4.0, Y
2 = 1.0
x Head, h
MC: 1000 realizations = 4.0, Y
2 = 1.0
x Head, h
PCM/SCM:
(collocation points)
(representations) MCS:
(realizations)
(realizations)
Stochastic collocation method
− Non-physical realizations/Gibbs oscillation − Inaccurate statistical moments and probability density functions
10
Zhang et al. (2010) Lin & Tartakovsky (2009)
Stochastic collocation method
− When: advection dominated (Pe = 100) low regularity − Why: physical space random space
− Unit mass instantaneously released at x = 0, t = 0 − Input parameter: conductivity k = exp(0.3θ), θ ~ N(0,1) − Output response: concentration c at x = 0.3, t = 1
11
Transformed stochastic collocation method
− Approximate s as a function of θ at fixed x and t
− Approximate x as a function of θ for a given s at fixed t − Approximate t as a function of θ for a given s at fixed x
12
( , ; ) s t x
( , ; ) s t x ( , ; ) t s x
Liao & Zhang (WRR, 2016)
1D example
− Input parameter: conductivity k = exp(0.3θ), θ ~ N(0,1), − Output response: concentration c at x = 0.3, t = 1
13
1D example
14
true observation c = 0.842
true parameter θ = 0.2
1D example
15
2
2 1
surrogate model: ( ) ( ) ( ) error: ( ) ( ) ( ) 0, ( ) poster Kullback-Leibler diverg ior: ( ) ( | ) : || ( )log 0, ( ) ence
M M i i M M L i M M M
s f L s s s s p d M p s D d M
Marzouk & Xiu (2009)
2D example
16
2D example
17
2D example
− Dimension-adaptive: automatically select important dimensions − Further reduce the number of collocation points
18
Klimke (2006) Liao et al. (JCP, 2016)
2D example
− MCMC with 105 model runs as a reference − ATSCM with 67 model runs is more accurate than SCM with 6017 model runs
19
2D example
− Black: MCMC, red: SCM, blue: ATSCM
20
Inverse modeling
21
EnKF
the data
effort (restart required)
Iterative ES
all the data
effort (no restart, iteration)
non-linear problems
ES
all the data
effort (no restart)
( ) ( | ) ( | ) ( )
p p p p m d m m d d
1 1
1 1 ( ) ( ( ) ) ( ( ) ) ( ) ( ). 2 2
T
pr T pr D M
J g C g C
m m d m d m m m m
Iterative ensemble smoother
22
1 1 1 1 1 1
(1 ) ( )
T l l l M l D l pr T
M l l D l
C G C G C G C g
m m m m m d
1 1 1 1
1 (1 ) 1 (1 ) ( ) .
l l T T pr M M l l D l M l l M M l l T T
M l l D l M l l
C C G C G C G G C C C G C G C G g d m m m m m
1, , 1 1 , , , , , 1 , , , ,
1 (1 ) 1 (1 ) ( ) , 1,..., ,
l j l j T T pr M M l j l D l j M l j l j M M l j j l T T
M l j l D l j M l j l j j e
C C G C G C G G C C C G C G C G g j N m m m m m d
Size: Nm x Nm Size: Nd x Nd
Can be further approximated
Iterative ensemble smoother
23
1, , 1 1 , 1 ,
1 (1 ) 1 (1 ) ( ) , 1,..., ,
l l l l l l
l j l j T T pr M M l l D l M l l M M l j j l T T
M l l D l M l l j j e
C C G C G C G G C C C G C G C G g j N m m m m m d
1, , 1 1 , 1 ,
1 (1 ) 1 (1 ) ( ) , 1,..., .
l l l l l l l l l l l
l j l j pr M M D l D D D D M M l j j l
M D l D D D l j j e
C C C C C C C C C g j N m m m m m d
Change with iteration Mean sensitivity Can be approximated by Can be approximated by
l l
M D
C
l l
D D
C
Surrogate model based iterative ES
arhunen nen-Loe Loeve ve Expan ansio ion (KLE) :
1 2
: { , ,..., }
N T
m
1
( , ) ( ) ( )
N n n n n
Y f
x x
n n
10 20 30 40 0.00 0.10 0.20
x1 x2
2 4 6 8 10 2 4 6 8 10 1.5 1 0.5(c) n=10 x1 x2
2 4 6 8 10 2 4 6 8 10 1.5 1 0.5(b) n=4 x1 x2
2 4 6 8 10 2 4 6 8 10 1.5 1 0.5(d) n=20 x1 x2
2 4 6 8 10 2 4 6 8 10 1.3 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 0.4(a) n=1
Surrogate model based iterative ES
25
1 2
: { , ,..., }
N T
m
1 1, , , 1 ,
1 (1 ) 1 (1 ) ( ) , 1,..., .
l l l l l l l l l l l
pr l j l j D l D D D D l j j l
D l D D D l j j e
C C C C C C C C g j N d
1 1, , , 1 ,
1 (1 ) 1 (1 ) ( ) , 1,..., .
l l l l l l l l l l l
surr surr surr pr l j l j D l D D D D l j j l surr surr surr
D l D D D l j j e
C C C C C C C C j N
d d
Can be obtained from surrogate
Case study: single-phase flow
no flow (two lateral boundaries).
injecting well at block (28, 28).
start from day 0.2, at every 0.6 day, up to 5 days
26
( , ) ( ) ( , ) = ( , )
s
h t S K h t q t t x x x x
200 400 600 800 200 400 600 800 Y (m) X (m)
Uncertain random field
27
ln ( ) ln(m / day), K x
1 2 1 2 1 1 2 2
2 2 2 ln , , ln
( , ) exp , / 0.4, / 0.2,
i i j j K i j i j K x y x x y y
x x y y C L L x x
2 ln ( )
0.75,
K
x
150 300 450 600 750 150 300 450 600 750
X Y
0.5 1.0 1.5 2.0
1
, , , 1 1 1
1 ,
d
i sim i surr N N j j surr i sim j i d j
d d e N N d
, , , 1 1
1 ,
e d
i obs i update N N j j d i obs j i e d j
d d e N N d
2 , , 1 1
1 .
e m
N N i ref i update m j j i e m
e m m N N
Surrogate error: data match error: Parameter estimation error:
Results
28 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.5 1.0 1.5 2.0 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.5 1.0 1.5 2.0
level 2-Smolyak
Traditional iterative ES
29
150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.5 1.0 1.5 2.0 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.5 1.0 1.5 2.0 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.5 1.0 1.5 2.0 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.5 1.0 1.5 2.0
Ne=20 Ne=60 Ne=100 Ne=1000
Standard deviation comparison
30
150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.15 0.22 0.29 0.36 0.43 0.49 0.56 0.63 0.70 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.09 0.17 0.26 0.35 0.44 0.53 0.61 0.70 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.15 0.22 0.29 0.36 0.42 0.49 0.56 0.63 0.70 150 300 450 600 750 150 300 450 600 750
X (m) Y (m)
0.09 0.17 0.26 0.35 0.44 0.53 0.61 0.70
iteration 1 iteration 2 iteration 1 iteration 2
Case study: Multi-phase flow
Water-oil two-phase system:
producers at the corners.
every 30 day, up to 510 days
31
( ) ( ) , ,
ri i i i i i
k k S p g z q i w o t x x
100 200 300 400 100 200 300 400 Y (m) X (m)
Uncertain random field
32
ln ( ) 4 ln(mD), k x
2 ln ( )
0.16,
k
x
1 2 1 2 1 1 2 2
2 ln , , ln
( , ) exp , / 0.4, / 0.4.
i i j j k i j i j k x y x x y y
x x y y C L L x x
100 200 300 400 100 200 300 400
X (m) Y (m)
3.0 3.3 3.5 3.8 4.1 4.4 4.7 4.9 5.2
Results
33
100 200 300 400 100 200 300 400
X (m) Y (m)
3.0 3.3 3.5 3.8 4.1 4.4 4.7 4.9 5.2 100 200 300 400 100 200 300 400
X (m) Y (m)
0.12 0.16 0.19 0.23 0.26 0.30 0.33 0.37 0.40 100 200 300 400 100 200 300 400
X (m) Y (m)
3.0 3.3 3.5 3.8 4.1 4.4 4.7 4.9 5.2 100 200 300 400 100 200 300 400
X (m) Y (m)
3.0 3.3 3.5 3.8 4.1 4.4 4.7 4.9 5.2
Ne=20 ( ) Ne=100 ( ) Mean standard deviation
0.27
m
e 0.445
m
e 0.268
m
e
Data match of water cut from tTPCM
34
2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 1 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 2 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 3 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 4 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 1 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 2 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 3 Observation step 2 4 6 8 10 12 14 16 0.0 0.2 0.4 0.6 0.8 1.0 WCT of Producer 4 Observation step
Initial ensemble updated ensemble
Conclusions
aid of surrogate models − Use surrogate model to approximate the forward solution − Apply transformation to address the low-regularity − Select the important dimensions adaptively to reduce the points
− Fast convergence of the surrogate solution to the exact forward solution − Fast convergence of the surrogate posterior to the true posterior
35
36
References
for flow in spatially correlated random fields. Communications in Computational Physics, 6(3), 509-535.
subsurface flow data assimilation. Advances in Water Resources. Doi:10.1016/j.advwatres.2016.12.001
problems: 3. transform by time. Water Resources Research,52(3), 7911–7928.
method on nested sparse grids for multiphase flow in randomly heterogeneous porous
method on sparse grids for three-dimensional flow and solute transport in randomly heterogeneous porous media, Adv. Water Resour., 32(5), 712–722.
in inverse problems, Commun. Comput. Phys., 6(4), 826–847.
Citeseer.
equations with random inputs, SIAM J. Sci. Comput., 27(3), 1118-1139.
approaches to risk assessment, Stoch. Environ. Res. Risk. Assess. 24, 971–984.
37
38
Backup: transformed EnKF (TEnKF)
Liao, Q., & Zhang, D. (2015). Data assimilation for strongly nonlinear problems by transformed ensemble kalman filter. SPE Journal, 20(1), 202-221.