Using Bayesian Model Probability with ensemble methods to quantify - - PowerPoint PPT Presentation

using bayesian model probability with ensemble methods to
SMART_READER_LITE
LIVE PREVIEW

Using Bayesian Model Probability with ensemble methods to quantify - - PowerPoint PPT Presentation

Using Bayesian Model Probability with ensemble methods to quantify uncertainty in reservoir modelling with multiple prior scenarios Sigurd Ivar Aanonsen, NORCE Energy Svenn Tveit, NORCE Energy Mathias Alerini, Equinor ASA EnKF Workshop, Voss,


slide-1
SLIDE 1

Using Bayesian Model Probability with ensemble methods to quantify uncertainty in reservoir modelling with multiple prior scenarios

Sigurd Ivar Aanonsen, NORCE Energy Svenn Tveit, NORCE Energy Mathias Alerini, Equinor ASA EnKF Workshop, Voss, June 3-5, 2019

slide-2
SLIDE 2

Outline

◮ Motivation ◮ Introduction to Bayesian model averaging and selection

◮ Bayesian Model Average (BMA) ◮ Bayesian Model Probability (BMP) ◮ Model Likelihood/Model Evidence (BME) ◮ Bayes Factor (BF)

◮ Calculating BME/BMP

◮ Challenges

◮ Examples ◮ Summary and conclusions

slide-3
SLIDE 3

Motivation

◮ Uncertainty quantification in history matching is typically based on a

single prior-model scenario. However, often, several alternative models or scenarios are viable a priori.

◮ Geological scenarios, flow scenarios, alternative seismic

interpretations, etc.

slide-4
SLIDE 4

Motivation

◮ Uncertainty quantification in history matching is typically based on a

single prior-model scenario. However, often, several alternative models or scenarios are viable a priori.

◮ Geological scenarios, flow scenarios, alternative seismic

interpretations, etc.

◮ Ensemble-based data assimilation methods, like EnKF or ES, does

not handle alternative prior models or scenarios. Handling complex model uncertanty is challenging.

slide-5
SLIDE 5

Motivation

◮ Uncertainty quantification in history matching is typically based on a

single prior-model scenario. However, often, several alternative models or scenarios are viable a priori.

◮ Geological scenarios, flow scenarios, alternative seismic

interpretations, etc.

◮ Ensemble-based data assimilation methods, like EnKF or ES, does

not handle alternative prior models or scenarios. Handling complex model uncertanty is challenging.

◮ Bayesian theory for models provides a framework for this:

◮ “Total” uncertainty through Bayesian Model Averaging (BMA). ◮ Selecting models or scenarios based on comparing Bayesian Model

Probabilities (BMP’s), i.e., probability for a given model or scenario to be correct given the data.

slide-6
SLIDE 6

Motivation, cont’d

◮ Bayesian model averaging and model probability rely on the calculation of

Bayesian Model Evidence (BME) or Bayes Factors (BF), which have a long history within a number of fields for model comparison and model selection.

slide-7
SLIDE 7

Motivation, cont’d

◮ Bayesian model averaging and model probability rely on the calculation of

Bayesian Model Evidence (BME) or Bayes Factors (BF), which have a long history within a number of fields for model comparison and model selection.

◮ Few applications to petroleum industry/reservoir modelling (Park et al.,

2013, Elsheikh et al., 2014, Hong et al, 2018).

slide-8
SLIDE 8

Motivation, cont’d

◮ Bayesian model averaging and model probability rely on the calculation of

Bayesian Model Evidence (BME) or Bayes Factors (BF), which have a long history within a number of fields for model comparison and model selection.

◮ Few applications to petroleum industry/reservoir modelling (Park et al.,

2013, Elsheikh et al., 2014, Hong et al, 2018).

◮ Recently, it has been shown that, for weakly nonlinear models, Bayesian

model selection can be efficiently coupled with ensemble-based data assimilation methods (Carrassi et al., 2017).

slide-9
SLIDE 9

Motivation, cont’d

◮ Bayesian model averaging and model probability rely on the calculation of

Bayesian Model Evidence (BME) or Bayes Factors (BF), which have a long history within a number of fields for model comparison and model selection.

◮ Few applications to petroleum industry/reservoir modelling (Park et al.,

2013, Elsheikh et al., 2014, Hong et al, 2018).

◮ Recently, it has been shown that, for weakly nonlinear models, Bayesian

model selection can be efficiently coupled with ensemble-based data assimilation methods (Carrassi et al., 2017).

◮ The methodology can be applied as a “simple” post-processing after having

applied standard ensemble-based data assimilation methods to the various scenarios.

slide-10
SLIDE 10

Motivation, cont’d

◮ Bayesian model averaging and model probability rely on the calculation of

Bayesian Model Evidence (BME) or Bayes Factors (BF), which have a long history within a number of fields for model comparison and model selection.

◮ Few applications to petroleum industry/reservoir modelling (Park et al.,

2013, Elsheikh et al., 2014, Hong et al, 2018).

◮ Recently, it has been shown that, for weakly nonlinear models, Bayesian

model selection can be efficiently coupled with ensemble-based data assimilation methods (Carrassi et al., 2017).

◮ The methodology can be applied as a “simple” post-processing after having

applied standard ensemble-based data assimilation methods to the various scenarios.

◮ However, the use of these methods are disputed. The calculations may be

very challenging with respect to e.g. stability, and further investigations of the applicability to reservoir modeling and updating are necessary.

slide-11
SLIDE 11

BMA/BMP/BME/BF

◮ Bayesian Model Average (BMA):

P (∆|D) =

  • k

P(∆|D, Mk)P(Mk|D)

slide-12
SLIDE 12

BMA/BMP/BME/BF

◮ Bayesian Model Average (BMA):

P (∆|D) =

  • k

P(∆|D, Mk)P(Mk|D)

◮ Bayesian (posterior) Model Probability (BMP):

P (Mk|D) = P(D|Mk)Ppri(Mk)

  • j P(D|Mj)Ppri(Mj) =

1

  • j

P(D|Mj) P(D|Mk) Ppri(Mj) Ppri(Mk)

slide-13
SLIDE 13

BMA/BMP/BME/BF

◮ Bayesian Model Average (BMA):

P (∆|D) =

  • k

P(∆|D, Mk)P(Mk|D)

◮ Bayesian (posterior) Model Probability (BMP):

P (Mk|D) = P(D|Mk)Ppri(Mk)

  • j P(D|Mj)Ppri(Mj) =

1

  • j

P(D|Mj) P(D|Mk) Ppri(Mj) Ppri(Mk) ◮ Model Likelihood/Model Evidence (BME):

P (D|Mk) =

  • P(D|θ, Mk)P(θ|Mk) dθ

Denominator in the “normal” Bayes formula for model k.

slide-14
SLIDE 14

BMA/BMP/BME/BF

◮ Bayesian Model Average (BMA):

P (∆|D) =

  • k

P(∆|D, Mk)P(Mk|D)

◮ Bayesian (posterior) Model Probability (BMP):

P (Mk|D) = P(D|Mk)Ppri(Mk)

  • j P(D|Mj)Ppri(Mj) =

1

  • j

P(D|Mj) P(D|Mk) Ppri(Mj) Ppri(Mk) ◮ Model Likelihood/Model Evidence (BME):

P (D|Mk) =

  • P(D|θ, Mk)P(θ|Mk) dθ

Denominator in the “normal” Bayes formula for model k.

◮ Bayes factor (BF):

BFj-k = P(D|Mj) P(D|Mk)

slide-15
SLIDE 15

Some alternatives for calculating BME

  • 1. Gauss-linear approximation
slide-16
SLIDE 16

Some alternatives for calculating BME

  • 1. Gauss-linear approximation

P (D|Mk) = N(Gkθk, Ck) = ((2π)nd det Ck)−1/2 exp

  • −1

2(D − Gkθk)TC −1

k

(D − Gkθk)

  • Ck

= CD + GkCθkG T

k

P (D|Mj) P (D|Mk) = det Ck det Cj 1/2 exp

  • − 1

2

  • (D − Gjθj)TC −1

j

(D − Gjθj) − ( D − Gkθk)TC −1

k

(D − Gkθk)

  • where θk is prior mean and Ck is prior covariance matrix.

Utilizing the ensemble representation of the pdf’s, the calculations may be performed in a space of dimension equal to the ensemble size.

slide-17
SLIDE 17

Some alternatives for calculating BME

  • 1. Gauss-linear approximation
  • 2. “Inverted Bayes”, i.e.,

P (D|Mk) = P(D|θ, Mk)P(θ|Mk) P(θ|D, Mk) using e.g. posterior mean or MAP estimate for θ.

slide-18
SLIDE 18

Some alternatives for calculating BME

  • 1. Gauss-linear approximation
  • 2. “Inverted Bayes”, i.e.,

P (D|Mk) = P(D|θ, Mk)P(θ|Mk) P(θ|D, Mk) using e.g. posterior mean or MAP estimate for θ.

slide-19
SLIDE 19

Alternatives for calculating BME

  • 1. Gauss-linear approximation
  • 2. “Inverted Bayes”, i.e.,

P (D|Mk) = P(D|θ, Mk)P(θ|Mk) P(θ|Mk, D) , (1)

using e.g. posterior mean or MAP estimate for θ.

  • 3. Importance sampling with posterior ensemble as importance sampler,

i.e., averaging Eq. (1) over posterior ensemble.

slide-20
SLIDE 20

Alternatives for calculating BME

  • 1. Gauss-linear approximation
  • 2. “Inverted Bayes”, i.e.,

P (D|Mk) = P(D|θ, Mk)P(θ|Mk) P(θ|Mk, D) , (1)

using e.g. posterior mean or MAP estimate for θ.

  • 3. Importance sampling with posterior ensemble as importance sampler,

i.e., averaging Eq. (1) over posterior ensemble.

  • 4. Harmonic average of likelihoods over posterior ensemble

(approximation to the importance sampling above): P (D|Mk) ≈

  • 1

Ne

Ne

  • i=1

1 P(D|θi, Mk) −1 , (2)

slide-21
SLIDE 21

Alternatives for calculating BME

  • 1. Gauss-linear approximation
  • 2. “Inverted Bayes”, i.e.,

P (D|Mk) = P(D|θ, Mk)P(θ|Mk) P(θ|Mk, D) , (1)

using e.g. posterior mean or MAP estimate for θ.

  • 3. Importance sampling with posterior ensemble as importance sampler,

i.e., averaging Eq. (1) over posterior ensemble.

  • 4. Harmonic average of likelihoods over posterior ensemble

(approximation to the importance sampling above): P (D|Mk) ≈

  • 1

Ne

Ne

  • i=1

1 P(D|θi, Mk) −1 , (2)

  • 5. Eq. 2 with only one realization, i.e., simply the likelihood function

(with posterior mean, e.g.).

slide-22
SLIDE 22

Challenges

◮ Stability issues: BMP very sensitive to “everything” (amount of data,

data mismatch, prior and data covariance matrices, quality of posterior, degree of nonlinearity, ...).

slide-23
SLIDE 23

Challenges

◮ Stability issues: BMP very sensitive to “everything” (amount of data,

data mismatch, prior and data covariance matrices, quality of posterior, degree of nonlinearity, ...).

◮ Gauss-linear: Nonlinear forward model (BMP mainly depends on

prior properties).

slide-24
SLIDE 24

Challenges

◮ Stability issues: BMP very sensitive to “everything” (amount of data,

data mismatch, prior and data covariance matrices, quality of posterior, degree of nonlinearity, ...).

◮ Gauss-linear: Nonlinear forward model (BMP mainly depends on

prior properties).

◮ Inverted bayes (single-value or average): Requires calculating 3

probabilities, which may be small away from the mean.

slide-25
SLIDE 25

Challenges

◮ Stability issues: BMP very sensitive to “everything” (amount of data,

data mismatch, prior and data covariance matrices, quality of posterior, degree of nonlinearity, ...).

◮ Gauss-linear: Nonlinear forward model (BMP mainly depends on

prior properties).

◮ Inverted bayes (single-value or average): Requires calculating 3

probabilities, which may be small away from the mean.

◮ Harmonic average: Requires large ensemble size; very sensitive to

individual ensemble members with small likelihood values (unstable).

slide-26
SLIDE 26

Challenges

◮ Stability issues: BMP very sensitive to “everything” (amount of data,

data mismatch, prior and data covariance matrices, quality of posterior, degree of nonlinearity, ...).

◮ Gauss-linear: Nonlinear forward model (BMP mainly depends on

prior properties).

◮ Inverted bayes (single-value or average): Requires calculating 3

probabilities, which may be small away from the mean.

◮ Harmonic average: Requires large ensemble size; very sensitive to

individual ensemble members with small likelihood values (unstable).

◮ Likelihood function: Does not take into account prior uncertainty.

slide-27
SLIDE 27

Some approaches to handle these challenges

◮ Multidimensional scaling and kernel density estimation of likelihood

(Park et al., 2013).

◮ Localization (Metref et al., 2018). ◮ Multilevel methods (Hoel et al., 2016, Fossum et al., 2019)

slide-28
SLIDE 28

Examples

From S.I. Aanonsen, S. Tveit and M. Alerini: Using Bayesian Model Probability for Ranking Different Prior Scenarios in Reservoir History Matching SPEJ, 2019

slide-29
SLIDE 29

Example 1

Measurements of one, single parameter Bimodal prior

6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability density Prior Prior-1 Prior-2 6 4 2 2 4 6 x 0.00 0.05 0.10 0.15 0.20 0.25 0.30 Probability density Data

Prior: Mean = -3.0 / 2.0 Data: Mean = 0.0, Std = 2.0

slide-30
SLIDE 30

Example 1

Bayesian average vs “Total EnKF”

6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability density Prior Prior-1 Prior-2 6 4 2 2 4 6 x 0.00 0.05 0.10 0.15 0.20 0.25 0.30 Probability density Data 6 4 2 2 4 6 x 0.0 0.1 0.2 0.3 0.4 0.5 0.6 Probability density Bayesian average Analytical EnKF_Post-1 EnKF_Post-2 6 4 2 2 4 6 x 0.00 0.05 0.10 0.15 0.20 0.25 0.30 Probability density Total EnKF

slide-31
SLIDE 31

Example 1

Bayesian average with repeated measurements

6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability Density

Update 1 P2 = 64%

Analytical EnKF_Post-1 EnKF_Post-2 6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability Density

Update 2 P2 = 75%

Analytical EnKF_Post-1 EnKF_Post-2 6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability Density

Update 3 P2 = 82%

Analytical EnKF_Post-1 EnKF_Post-2 6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability Density

Update 4 P2 = 88%

Analytical EnKF_Post-1 EnKF_Post-2 6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability Density

Update 5 P2 = 91%

Analytical EnKF_Post-1 EnKF_Post-2 6 4 2 2 4 6 x 0.0 0.2 0.4 0.6 0.8 1.0 Probability Density

Update 6 P2 = 93%

Analytical EnKF_Post-1 EnKF_Post-2

slide-32
SLIDE 32

Example 2

Estimating top reservoir surface from 4D seismic data Synthetic model inspired by a real North Sea oil field

Seismic interpretations

500 1000 1500 2000 x 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

GI-0 GI-1 OP-0 OP-1 OP-2

Top interpretation 1 Top interpretation 2 Top interpretation 3 True top and base

Prior realizations

5 10 15 20 25 30 35 40 Grid cell 1000 1020 1040 1060 1080 Depth model 1 model 2 model 3

GI-0 GI-1 OP-0 OP-1 OP-2

Gas injection into undersaturated oil reservoir. 2D cross section Data: Gas-cap thickness No of parameters: 40 No of ensemble members: 100

slide-33
SLIDE 33

Example 2

Results Ensemble Smoother. Data at 100 and 200 days

Model 1

5 10 15 20 25 30 35 40 Cell no 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

Data variance 9m2 Model 2

5 10 15 20 25 30 35 40 Cell no 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

Data variance 9m2 Model 3

5 10 15 20 25 30 35 40 Cell no 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

Data variance 9m2 Model 1

5 10 15 20 25 30 35 40 Cell no 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

Data variance 360m2 Model 2

5 10 15 20 25 30 35 40 Cell no 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

Data variance 360m2 Model 3

5 10 15 20 25 30 35 40 Cell no 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 Depth

Data variance 360m2 Pink: Prior ensemble. Green: Posterior ensemble. Black: True model.

slide-34
SLIDE 34

Example 2

Predicted GOC at 200 days. Data at 100 and 200 days

Model 1

5 10 15 20 25 30 35 40 grid cell 1000 1010 1020 1030 1040 1050 1060 1070 1080 GOC

Simulated and measured data time 200

Data variance 9m2 Model 2

5 10 15 20 25 30 35 40 grid cell 1000 1010 1020 1030 1040 1050 1060 1070 1080 GOC

Simulated and measured data time 200

Data variance 9m2 Model 3

5 10 15 20 25 30 35 40 grid cell 1000 1010 1020 1030 1040 1050 1060 1070 1080 GOC

Simulated and measured data time 200

Data variance 9m2 Model 1

5 10 15 20 25 30 35 40 grid cell 1000 1010 1020 1030 1040 1050 1060 1070 1080 GOC

Data variance 360m2 Model 2

5 10 15 20 25 30 35 40 grid cell 1000 1010 1020 1030 1040 1050 1060 1070 1080 GOC

Data variance 360m2 Model 3

5 10 15 20 25 30 35 40 grid cell 1000 1010 1020 1030 1040 1050 1060 1070 1080 GOC

Data variance 360m2 Pink: Predicted from prior. Green: Predicted from posterior. Red: True model.

slide-35
SLIDE 35

Example 2

Predicted gas thickness at 200 days. Data at 100 and 200 days

Model 1

5 10 15 20 25 30 35 40 grid cell 10 20 30 40 50 Gas thickness

Simulated and measured data time 200

Data variance 9m2 Model 2

5 10 15 20 25 30 35 40 grid cell 10 20 30 40 50 Gas thickness

Simulated and measured data time 200

Data variance 9m2 Model 3

5 10 15 20 25 30 35 40 grid cell 5 10 15 20 25 30 35 40 45 Gas thickness

Simulated and measured data time 200

Data variance 9m2 Model 1

5 10 15 20 25 30 35 40 grid cell 10 20 30 40 50 Gas thickness

Data variance 360m2 Model 2

5 10 15 20 25 30 35 40 grid cell 10 20 30 40 50 Gas thickness

Data variance 360m2 Model 3

5 10 15 20 25 30 35 40 grid cell 5 10 15 20 25 30 35 40 45 Gas thickness

Data variance 360m2 Pink: Predicted from prior. Green: Predicted from posterior. Black: Data (true model).

slide-36
SLIDE 36

Example 2

Model probability — effect of data variance

Data Model Gauss- Inverted Importance Harmonic variance Linear Bayes∗) sampling average 9m2 1 1.4E-39 1.6E-62 1.4E-22 7.9E-24 2 2.6E-15 1.4E-28 3.0E-12 6.3E-27 3 1.0 1.0 1.0 1.0 360m2 1 0.04 0.03 0.03 0.04 2 0.15 0.11 0.15 0.11 3 0.81 0.86 0.82 0.85

*) With θ = posterior mean All pdf’s are assumed Gaussian. Mean and covariances for prior and posterior from respective ensembles.

slide-37
SLIDE 37

Example 2

Model probability — effect of the amount of data

Data Model Model probability 1 0.25 One seismic survey at 100 days 2 0.33 3 0.42 1 0.03 Two seismic surveys at 100 and 200 days 2 0.15 3 0.82 1 0.004 Ten seismic surveys at 100, 200,..., 1000 days 2 0.01 3 0.98 Data variance: 360m2 Method: Importance sampling

slide-38
SLIDE 38

Example 2

Model probability (%) — 10 independent runs

Mean Std Model 1 3.96 0.54 Model 2 18.3 2.28 Model 3 77.8 2.64 .

10 independent prior ensembles Data: 2 surveys at 100 and 200 days Data variance: 360m2 Method: Importance sampling

slide-39
SLIDE 39

Summary and conclusions

◮ There is a need for methodology to take into account several alternative

prior models in history matching and uncertainty analysis.

slide-40
SLIDE 40

Summary and conclusions

◮ There is a need for methodology to take into account several alternative

prior models in history matching and uncertainty analysis.

◮ Bayesian methods for models provide a consistent way to handle this:

◮ “Total” uncertainy analysis (BMA) ◮ Posterior model probability (BMP) ◮ Model selection (BME/BF)

slide-41
SLIDE 41

Summary and conclusions

◮ There is a need for methodology to take into account several alternative

prior models in history matching and uncertainty analysis.

◮ Bayesian methods for models provide a consistent way to handle this:

◮ “Total” uncertainy analysis (BMA) ◮ Posterior model probability (BMP) ◮ Model selection (BME/BF)

◮ Calculating BME and BMP is challenging:

◮ BME very sensitive to “everything”. ◮ Approximations, including data reduction, may be required, and the results

may be very sensitive to these approximations.

slide-42
SLIDE 42

Summary and conclusions

◮ There is a need for methodology to take into account several alternative

prior models in history matching and uncertainty analysis.

◮ Bayesian methods for models provide a consistent way to handle this:

◮ “Total” uncertainy analysis (BMA) ◮ Posterior model probability (BMP) ◮ Model selection (BME/BF)

◮ Calculating BME and BMP is challenging:

◮ BME very sensitive to “everything”. ◮ Approximations, including data reduction, may be required, and the results

may be very sensitive to these approximations.

◮ More research is needed to understand the methodology and its

applicability to reservoir modelling.

slide-43
SLIDE 43

Acknowledgments

Equinor ASA Petromaks2 project DIGIRES with sponsors