Ensemble transform filters for geophysical data assimilation - PowerPoint PPT Presentation

Ensemble transform filters for geophysical data assimilation Sebastian Reich in collaboration with Georg Gottwald (University of Sydney), Kay Bergemann, Seoleun Shin (Uni Potsdam), Eugenia Kalnay, Kayo Ide, Javier Amezcua (U of Maryland)

1. Data assimilation (DA) Text Nature empirical evidence/ theory & hypotheses/ Text Text scientific modeling experiments mathematical/ statistical models Text Text computational models Bayesian data assimilation Text Text state/parameter estimation, model selection, prediction

2. A simplified mathematical problem statement 1) Mathematical model x = f ( x ) ˙ 2) Random initial conditions x (0), i.e. x (0) ∼ ρ 0 for some given probability density function (PDF) ρ 0 ( x ). 3) Observations/measurements y ∈ R k , H ∈ R k × n , y ( t j ) = Hx ( t j ) + η j , k < n , at discrete times t j > 0, j = 1 , . . . , J , subject to random measurement errors: η j ∼ N( 0 , R ) .

Liouville (forecast): x = ( x 1 , x 2 ) , π post ( x , t j ) → π prior ( x , t j +1 ) Bayes (assimilation): y = x 1 + η, π prior ( x , t j +1 ) → π post ( x , t j +1 ) (figures courtesy Chris Jones)

3. Ensemble propagation and particle filters Particle/Monte-Carlo methods. Given a collection X ( t ) = [ x 1 ( t ) | x 2 ( t ) | · · · | x m ( t )] ∈ R n × m of m independent solutions of x = f ( x ) ˙ with initial conditions x i (0) ∼ ρ 0 ( x ). The empirical measure m � ρ em ( x , t ) = α i δ ( x − x i ( t )) i =1 is an exact (weak) solution to Liouville’s equation ( δ ( · ) Dirac’s delta function, α i > 0 (constant) weights).

Data assimilation for particle approximation at t j . Bayes’ formula: m ρ ( x , t + � j ) = α i δ ( x − x i ( t j )) . i =1 with new weights α i ⇒ π ( y ( t j ) | x i ( t j )) × α i . Problem: Particles are still sampled from the prior and not from the posterior PDF. The mismatch is compensated for by non- uniform weight factors (importance sampling). Near mutual sin- gularity of measures leads to degeneracy of weights.

4. Assimilation as a continuous deformation of probability I Dynamics and assimilation are not easily compatible in a numer- ical sense. We have to shift focus somehow. Here is our basic line of attack: We view the change of measure induced by Bayes’ theorem as an (optimal) transportation problem. See Crisan and Xiong (2010) for a related approach in the con- text of continuous time filtering.

Bayes (assimilation): π prior ( x , t j +1 ) → π post ( x , t j +1 ) Task: Shuffle blue prior into red posterior: x ’ = T( x ) s.t. π post ( T ( x )) | DT ( x ) | = π pr ( x ) .

Negative log-likelihood: L = ( Hx − y ( t j )) T R − 1 ( Hx − y ( t j )) / 2 . Bayes’ theorem: N π post ( x ) = π ( x | y ( t j )) ∝ π pr ( x ) e − L = π pr ( x ) e − L/N � n =1 Limit N → ∞ , Taylor expansion in ∆ s = 1 /N : ∂ρ ∂s = − ρ ( L − E ρ [ L ]) , s ∈ [0 , 1], ρ ( x , 0) = π pr ( x ).

“Kushner-Stratonovich” equation for intermittent data assimilation (Reich, 2011): ∂ρ δ ǫ ( t − t j ) ρ ( L − ¯ � ∂t = −∇ x ( ρf ) − L ) j ≥ 1 with δ ǫ ( · ) is a (compact) mollifier to the Dirac delta function with support [ t j − ǫ, t j + ǫ ] and ¯ L = E ρ [ L ] . If Ψ( x ) is a smooth observable, then d ¯ δ ǫ ( t − t j )(Ψ − ¯ Ψ)( L − ¯ � Ψ = ∇ x Ψ · f − L ) dt j ≥ 1 x ) T . One can in particular set ρ = ρ em , Ψ = x or Ψ = ( x − ¯ x )( x − ¯

4. Assimilation as a continuous deformation of probability II For demonstration purposes, consider a single degree of freedom: We got to transport particles distributed according to the heap on the left (prior) such that they form the heap on the right (posterior). Direct approach: Monge-Ampere, nonlinear elliptic PDF.

Goal: Find vector field g ( x ; ρ ) ∈ L 2 (d ρ, R n ) such that � L − ¯ ∇ x · ( ρg ) = ρ L � . The vector field g is not unique. Find minimizer with respect to the norm (see Otto, 2001, for an application in gradient flow dynamics): 2 ∂ρ d ρ v T M v : ∂ρ � � �� = inf ∂s + ∇ x · ( ρv ) = 0 . � � ∂s v ∈ L 2 (d ρ, R n ) � � ρ Gradient flow: g ( x ; ρ ) = M − 1 ∇ x ψ ( x ; ρ ) .

probability density functions error in posterior probability density functions 0.4 prior EGMF, M=2000 likelihood EnKF, M=2000 0.35 posterior 0.15 RHF, M=2000 0.3 0.1 0.25 0.05 0.2 0 0.15 − 0.05 0.1 − 0.1 0.05 0 − 10 − 5 0 5 10 − 8 − 6 − 4 − 2 0 2 4 6 8 position position particle dynamics under EGMF assimilation step particle dynamics under EnKF assimilation step 8 8 6 6 4 4 particle positions particle positions 2 2 0 0 − 2 − 2 − 4 − 4 − 6 − 6 − 8 − 8 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 ficticious time s ficticious time s

Formulation of a complete dynamics-assimilation step (Reich, 2010): x = f ( x ) + δ ǫ ( t − t j ) M − 1 ∇ x ψ ( x ; ρ ) , ˙ (1) ρ t = −∇ x · ( ρ ˙ x ) (2) closed by the elliptic PDE ∇ x · ( ρ M − 1 ∇ x ψ ) = ρ � L − ¯ � . (3) L for t ∈ [ t j − ǫ, t j + ǫ ]. This system constitutes an example of a Vlasov-McKean system (infinite-dimensional interacting particle system).

Algorithmic summary of data assimilation step: 1) Use the ensemble of solutions x i ( t ), i = 1 , . . . , m , to define a statistical model ˜ ρ ( x ; { x i ( t ) } ). 2) Solve ρ M − 1 ∇ x ψ ) = ˜ � � ∇ x · (˜ ρ L − E ˜ ρ [ L ] (4) numerically or by quadrature for the potential ψ . 3) Propagate particles under vector field g = M − 1 ∇ ψ (gradient flow) .

Choices for ˜ ρ include (a) Gaussian PDF parametrized by the ensemble mean and covariance matrix (ensemble Kalman filter (EnKF)) (b) Gaussian mixture models/ Gaussian kernel density estima- tors. Resulting elliptic PDE can be solved analytically! (c) Empirical PDF and weak form of McKean-Vlasov; e.g. find vector field such that ensemble mean and covariance matrix are consistently propagated (different from EnKF!) The second and third approach give rise to new filters (Reich 2011/2012).

6. Numerics I: Langevin dynamics under bistable potential Second-order Langevin dynamics under a one-dimensional double- well potential V ( q ): √ d v = − V ′ ( q )d t − γv d t + 2 σ d w ( t ) , d q = v d t, w ( t ) denotes standard Brownian motion, potential V ( q ) given by potential energy 5 4 3 2 1 0 − 1 − 10 − 5 0 5 10 q variable

Assimilate velocities v only (!); posterior PDFs in positions q : posterior probability density function posterior probability density function 0.5 0.5 0.45 0.45 0.4 0.4 0.35 0.35 0.3 0.3 0.25 0.25 0.2 0.2 0.15 0.15 0.1 0.1 0.05 0.05 0 0 � 10 � 5 0 5 10 � 10 � 5 0 5 10 position position

Results from EGMF ( M = 50 ensemble members): continuous ensemble Gaussian mixture filter 10 truth 5 position 0 − 5 − 10 0 2000 4000 6000 8000 10000 10 filtered 5 position 0 − 5 − 10 0 2000 4000 6000 8000 10000 time RMSE = 1 . 9148 for EGMF compared to 2.331 for the EnKF. A bi-Gaussian is used in 25% of the assimilation steps.

7. First and second-order filter consistency Bayes’ theorem implies Ψ = E π pr [Ψ e − L ] ¯ E π pr [ e − L ] for the posterior expectation value of an observable Ψ. Set π pr = ρ em (ensemble), Ψ = x (ensemble mean) or Ψ = x ) T (ensemble covariance matrix). ( x − ¯ x )( x − ¯ The ensemble Kalman filter (EnKF) is not consistent with respect to the Bayesian posterior mean and covariance matrix. Example for classic moment closure problem: d Ψ = − (Ψ − ¯ ¯ Ψ)( L − ¯ L ) . ds

Simple example: x ∈ R , obs y = x + η , prior with mean ¯ x = 0 and variance P : Ψ = x 2 → ¯ Ψ = P Then d − 1 2( x 2 − P )( x − y ) 2 = dsP 1 x 4 − P 2 − 2 x 3 y � � = 2 For Gaussians: x 4 = 3 P 2 , x 3 = 0 (EnKF closure).

Appropriate corrections proposed by Lei & Bickel (2011) (NLEAF) and Reich (2011) (MEnKF). Requires computing the square root of covariance matrix. Basic idea: x p x p and covari- i are the proposal ensemble members with mean ¯ P p . These can be obtained from a standard EnKF. ance matrix ¯ The Bayesian posterior mean and covariance are denoted by ¯ x and ¯ P , respectively. The analysed ensemble members are now given by P p ) − 1 / 2 ( x p x a P 1 / 2 ( ¯ x p ) . x + ¯ i = ¯ i − ¯

Lorenz-63 model: Mean RMSE over 2000 assimilation cycles with t j = 0 . 05 j for the Lorenz-63 model using five different filter algorithms and three different ensemble sizes. FD (filter divergence) is being used whenever the mean RMSE exceeds the observation error R = 4. EnKF MEnKF1 MEnKF2 NLEAF1 NLEAF2 M = 10 0.4405 1.2045 FD FD FD M = 40 0.3004 0.3140 0.2510 0.3772 FD M = 400 0.3272 0.3262 0.2375 0.3037 0.2336

Continuous & consistent update: d ds ¯ x = − L ∆ x (ensemble mean) , and ds ∆ x = − 1 L )∆ x − 1 d 2( L − ¯ 2 L ∆ x (ensemble deviations) with ∆ x = x − ¯ x . Ensemble filter equations for assimilation at t j : dtx i = f ( x i ) − 1 d � � ( L i − ¯ 2 δ ǫ ( t − t j ) L )∆ x i + L ∆ x . EnKF closure seems preferable. Work in progress ...

Ensemble transform filters for geophysical data assimilation - PowerPoint PPT Presentation

Ensemble transform filters for geophysical data assimilation Sebastian Reich in collaboration with Georg Gottwald (University of Sydney), Kay Bergemann, Seoleun Shin (Uni Potsdam), Eugenia Kalnay, Kayo Ide, Javier Amezcua (U of Maryland) 1.

Linear ensemble transform filters: A unified perspective on ensemble Kalman and particle filters

Overview of Discrete-Time Filters First-order filters Ideal filters Practical filters

Overview of Discrete-Time Filters Discrete-Time Filters Overview First-order filters N M

Topic 10: The Z Transform o Introduction to Z Transform o Relationship to the Fourier transform o

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

AngularJS Unit Testing AngularJS Filters and Services with Karma & Jasmine Filters

Boosting (ensemble) Module 4 - Ensemble classifiers - Objectives module 4: boosting (ensemble

SMART GOVERNMENT INVOICING: INVOICE PROCESSING PLATFORM LEAD. TRANSFORM. DELIVER LEAD. TRANSFORM.

Convergence of ensemble Kalman filters in the large ensemble limit and infinite dimension Jan

Finite Impulse Response (FIR) Digital Filters Digital filters are rapidly replacing classic

Practical Analog Filters Overview Types of practical filters Filter specifications

HLT MET Noise Filters in Run2011B Alex Mott Caltech Review of Noise Filters HBHE noise

Advanced Geophysical Classification Projects September 13, 2016 1 Advanced Geophysical

Topic 4: Continuous-Time Fourier Transform (CTFT) o Introduction to Fourier Transform o Fourier

Geophysical Surveillance Methods of Active Volcanoes Michael Wack 8.6.2004 Michael Wack

Geophysical methods for EGS investigation: Geophysical methods for EGS investigation: an overview

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at

Stochastic particle methods in Bayesian statistical learning P. Del Moral (INRIA team ALEA) INRIA

Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain

Ergodicity and Accuracy in Optimal Particle Filters David Kelly and Andrew Stuart Courant

Big Data - Lecture 3 Supervised classification S. Gadat Toulouse, Novembre 2014 S. Gadat Big

Rao-Blackwellized Monte Carlo Data Association for Multiple Target Tracking Simo Srkk,

Plan Dynamic Linear Models Bayesian Analysis of Dynamic Linear Models in R The R package dlm

1 - However, I would like to go beyond the specific problem of direct illumination and really

Ensemble transform filters for geophysical data assimilation - PowerPoint PPT Presentation

Ensemble transform filters for geophysical data assimilation Sebastian Reich in collaboration with Georg Gottwald (University of Sydney), Kay Bergemann, Seoleun Shin (Uni Potsdam), Eugenia Kalnay, Kayo Ide, Javier Amezcua (U of Maryland) 1.

Linear ensemble transform filters: A unified perspective on ensemble Kalman and particle filters

Overview of Discrete-Time Filters First-order filters Ideal filters Practical filters

Overview of Discrete-Time Filters Discrete-Time Filters Overview First-order filters N M

Topic 10: The Z Transform o Introduction to Z Transform o Relationship to the Fourier transform o

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

AngularJS Unit Testing AngularJS Filters and Services with Karma &amp; Jasmine Filters

Boosting (ensemble) Module 4 - Ensemble classifiers - Objectives module 4: boosting (ensemble

SMART GOVERNMENT INVOICING: INVOICE PROCESSING PLATFORM LEAD. TRANSFORM. DELIVER LEAD. TRANSFORM.

Convergence of ensemble Kalman filters in the large ensemble limit and infinite dimension Jan

Finite Impulse Response (FIR) Digital Filters Digital filters are rapidly replacing classic

Practical Analog Filters Overview Types of practical filters Filter specifications

HLT MET Noise Filters in Run2011B Alex Mott Caltech Review of Noise Filters HBHE noise

Advanced Geophysical Classification Projects September 13, 2016 1 Advanced Geophysical

Topic 4: Continuous-Time Fourier Transform (CTFT) o Introduction to Fourier Transform o Fourier

Geophysical Surveillance Methods of Active Volcanoes Michael Wack 8.6.2004 Michael Wack

Geophysical methods for EGS investigation: Geophysical methods for EGS investigation: an overview

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at

Stochastic particle methods in Bayesian statistical learning P. Del Moral (INRIA team ALEA) INRIA

Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain

Ergodicity and Accuracy in Optimal Particle Filters David Kelly and Andrew Stuart Courant

Big Data - Lecture 3 Supervised classification S. Gadat Toulouse, Novembre 2014 S. Gadat Big

Rao-Blackwellized Monte Carlo Data Association for Multiple Target Tracking Simo Srkk,

Plan Dynamic Linear Models Bayesian Analysis of Dynamic Linear Models in R The R package dlm

1 - However, I would like to go beyond the specific problem of direct illumination and really

AngularJS Unit Testing AngularJS Filters and Services with Karma & Jasmine Filters