SLIDE 1 Convergence of ensemble Kalman filters in the large ensemble limit and infinite dimension
Jan Mandel
Department of Mathematical and Statistical Sciences University of Colorado Denver Based on joint work with Loren Cobb, Evan Kwiatkowski, and Kody Law
Supported by NSF grant DMS-1216481.
Colorado School of Mines, Golden, CO, April 24, 2015
SLIDE 2 Data assimilation - Analysis cycle
- Goal: Inject data into a running model
forecast → (analysis step, data) → analysis → (advance in time) → forecast → (analysis step, data) → analysis → ...
- Kalman filter used already in Apollo 11 navigation, now in GPS,
computer vision, weather forecasting, remote sensing,...
SLIDE 3 From least squares to Bayes theorem
Inverse problem Hu ≈ d with background knowledge u ≈ u^f:

|u − u^f|^2_{Q^{-1}} + |d − Hu|^2_{R^{-1}} → min over u

u^f = forecast, what we think the state should be; d = data; H = observation operator; Hu = what the data should be given the state u. Equivalently,

e^{−(1/2)|u − u^f|^2_{Q^{-1}} − (1/2)|d − Hu|^2_{R^{-1}}} → max over u

analysis (posterior) density ∝ e^{−(1/2)|u − u^f|^2_{Q^{-1}}} (forecast/prior density) · e^{−(1/2)|d − Hu|^2_{R^{-1}}} (data likelihood)

This is the Bayes theorem for probability densities; ∝ means proportional. The analysis density is p^a(u) ∝ e^{−(1/2)|u − u^a|^2_{(Q^a)^{-1}}}, with mean and covariance

u^a = u^f + K(d − Hu^f), Q^a = (I − KH)Q, K = K(Q) = QH^T(HQH^T + R)^{-1} is the Kalman gain
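These formulas translate directly into a few lines of linear algebra; a minimal NumPy sketch (the dimensions and matrices below are illustrative, not from the talk):

```python
import numpy as np

def kalman_analysis(uf, Q, d, H, R):
    """Analysis mean and covariance: ua = uf + K(d - H uf), Qa = (I - K H) Q."""
    K = Q @ H.T @ np.linalg.inv(H @ Q @ H.T + R)   # Kalman gain K = Q H^T (H Q H^T + R)^{-1}
    ua = uf + K @ (d - H @ uf)
    Qa = (np.eye(len(uf)) - K @ H) @ Q
    return ua, Qa

# small random example
rng = np.random.default_rng(0)
n, m = 4, 2
uf = rng.standard_normal(n)
A = rng.standard_normal((n, n))
Q = A @ A.T + n * np.eye(n)        # SPD forecast covariance
H = rng.standard_normal((m, n))
R = np.eye(m)
d = rng.standard_normal(m)
ua, Qa = kalman_analysis(uf, Q, d, H, R)
```

One can verify numerically that ua satisfies the optimality condition of the least-squares problem above and that Qa equals the inverse of the posterior precision Q^{-1} + H^T R^{-1} H.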
SLIDE 4 Kalman filter (KF)
- Add advancing in time: the model is an affine operator u → Mu + b;
the mean is advanced as Mu + b and the covariance as MQM^T.
- Hard to advance the covariance matrix when the model is nonlinear.
- Need tangent and adjoint model operators.
- Can’t maintain the covariance matrix for large problems, such as
from discretizations of PDEs.
SLIDE 5 Ensemble forecasting to the rescue
- Ensemble weather forecast: Independent randomized simulations. If
it rains in 30% of them, then say that “the chance of rain is 30%”.
- Ensemble Kalman filter (EnKF, Evensen 1994): replace the
covariance in KF by the sample covariance of the ensemble, then apply the KF formula for the mean to each ensemble member independently.
- But the analysis covariance came out wrong: too small, even if the
forecast covariance were exact.
- Fix: randomize also the data, by sampling from the data error
distribution (Burgers, Evensen, van Leeuwen, 1998).
- Then all is good and we should get the right ensembles...
at least in the Gaussian case, and up to the approximation by the sample covariance.
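The randomized analysis step described above can be sketched in a few lines of NumPy (names and the 1D sanity check are illustrative, not from the talk):

```python
import numpy as np

def enkf_analysis(X, d, H, R, rng):
    """Perturbed-observations EnKF analysis step.
    X: (n, N) forecast ensemble; d: (m,) data; H: (m, n); R: (m, m) data error covariance."""
    N = X.shape[1]
    Xc = X - X.mean(axis=1, keepdims=True)
    QN = Xc @ Xc.T / (N - 1)                        # sample covariance of the ensemble
    K = QN @ H.T @ np.linalg.inv(H @ QN @ H.T + R)  # Kalman gain with QN in place of Q
    E = rng.multivariate_normal(np.zeros(len(d)), R, size=N).T  # data error samples
    D = d[:, None] + E                              # randomized data, one copy per member
    return X + K @ (D - H @ X)                      # analysis ensemble

# 1D sanity check: forecast N(0, 1), H = 1, R = 1, d = 1 -> analysis approx N(0.5, 0.5)
rng = np.random.default_rng(1)
Y = enkf_analysis(rng.standard_normal((1, 20000)), np.array([1.0]), np.eye(1), np.eye(1), rng)
```

For a large ensemble, the analysis ensemble mean and variance match the exact Kalman filter values, as the lemma on the next slide predicts.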
SLIDE 6 The EnKF with data randomization is statistically exact if the covariance is exact
- Lemma. Let U^f ∼ N(u^f, Q^f), D = d + E, E ∼ N(0, R), and
U^a = U^f + K(D − HU^f), K = Q^f H^T(HQ^f H^T + R)^{-1}. Then U^a ∼ N(u^a, Q^a), i.e., U^a has the correct analysis distribution from the Bayes theorem and the Kalman filter.
- Proof. Computation in Burgers, Evensen, van Leeuwen, 1998.
- Corollary. If the forecast ensemble [U^f_1, ..., U^f_N] is a sample from the
Gaussian forecast distribution, and the exact covariance is used, then the analysis ensemble [U^a_1, ..., U^a_N] is a sample from the analysis distribution.
SLIDE 7 EnKF properties
- The model is needed only as a black box.
- The ensemble members interact only through the ensemble mean and
covariance.
- The EnKF was derived for Gaussian distributions but the algorithm
does not depend on this.
- So it is often used for nonlinear models and non-Gaussian
distributions anyway.
- Folklore: “high-dimensional problems require large ensembles” a.k.a.
“curse of dimensionality”. Not necessarily so.
- Curse of dimensionality: slow convergence in high dimension. With
arbitrary probability distributions, sure. But probability distributions in practice are not arbitrary: the state is a discretization of a random function. The smoother the functions, the faster the EnKF converges.
SLIDE 8 Convergence of EnKF in the large ensemble limit
- Laws of large numbers guarantee that the EnKF gives correct
results for large ensembles in the Gaussian case: Le Gland et al. (2011), Mandel et al. (2011).
- In general, the EnKF converges to a mean-field limit (Le Gland et al.
2011).
- Mean-field approximation: the effect of all other particles on any
one particle is replaced by a single averaged effect.
- Mean-field limit: large number of particles; the influence of each particle
becomes negligible.
For convergence in high state space dimension, we look at infinite dimension first, just like in numerical analysis, where we study the PDE first and only then its numerical approximation.
SLIDE 9
Tools: Random elements on a separable Hilbert space
The mean of a random element X on a Hilbert space V:

E(X) ∈ V, ⟨v, E(X)⟩ = E(⟨v, X⟩) ∀v ∈ V

Similarly for a random bounded operator A on V:

E(A) ∈ [V], ⟨v, E(A)u⟩ = E(⟨v, Au⟩) ∀u, v ∈ V

The tensor product of vectors, xy^T for x, y ∈ V, becomes x ⊗ y ∈ [V]:

(x ⊗ y)v = x⟨y, v⟩ ∀v ∈ V

The covariance of a random element X becomes

C = Cov(X) = E((X − E(X)) ⊗ (X − E(X))) = E(X ⊗ X) − E(X) ⊗ E(X)

The covariance must be a compact operator with finite trace, Tr C = Σ_{n=1}^∞ λ_n < ∞ (λ_n its eigenvalues).
SLIDE 10 Tools: Karhunen-Loève expansion and random functions
Suppose X ∈ L^2(Ω, V), i.e., E(|X|^2) < ∞. Then C = Cov(X) is compact and self-adjoint, so it has a complete orthonormal system {e_n} of eigenvectors, Ce_n = λ_n e_n, with eigenvalues λ_n ≥ 0, and

X = E(X) + Σ_{n=1}^∞ λ_n^{1/2} ξ_n e_n, E(ξ_n) = 0,

where the series converges a.s. in V and the ξ_n are uncorrelated random variables, ⟨ξ_m, ξ_n⟩_{L^2(Ω)} = E(ξ_m ξ_n) = δ_{mn}.

Examples: X = (λ_1^{1/2} ξ_1, λ_2^{1/2} ξ_2, ...), or

X(x) = Σ_{n=1}^∞ λ_n^{1/2} ξ_n cos(nx) in L^2(0, π), with ξ_n ∼ N(0, 1) independent, λ_n = n^{-α}, α > 1.
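The second example can be sampled by truncating the series; a sketch (the truncation level, grid, and seed are arbitrary choices):

```python
import numpy as np

def kl_sample(x, n_terms, alpha, rng):
    """One draw of X(x) = sum_n lambda_n^{1/2} xi_n cos(n x) with lambda_n = n^{-alpha},
    truncated at n_terms terms; xi_n ~ N(0, 1) i.i.d."""
    n = np.arange(1, n_terms + 1)
    xi = rng.standard_normal(n_terms)
    return (n ** (-alpha / 2.0) * xi) @ np.cos(np.outer(n, x))

rng = np.random.default_rng(0)
x = np.linspace(0, np.pi, 500)
sample = kl_sample(x, n_terms=50, alpha=2.0, rng=rng)

# Monte Carlo check: E|X|^2_{L^2(0,pi)} = (pi/2) * sum of lambda_n (by orthogonality)
dx = x[1] - x[0]
mean_sq_norm = np.mean([np.sum(kl_sample(x, 50, 2.0, rng) ** 2) * dx for _ in range(2000)])
```

Larger α gives faster eigenvalue decay and hence smoother sample paths.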
SLIDE 11 Curse of dimensionality? Not for probability measures!
[Figure: log-log plot of filter error vs. model size, for model sizes 10^1 to 10^3; EnKF, N = 10, m = 25.]
Constant covariance eigenvalues λ_n = 1 and the inverse law λ_n = 1/n do not give probability measures in the limit, because Σ_{n=1}^∞ λ_n = ∞.
The inverse square law λ_n = 1/n^2 gives a probability measure, because Σ_{n=1}^∞ λ_n < ∞.
m = 25 uniformly sampled data points from a 1D state, N = 10 ensemble members. From the thesis Beezley (2009), Fig. 4.7.
SLIDE 12 Bayes theorem in infinite dimension
- Forecast and analysis are probability distributions, with densities p^f, p^a.
- Bayes theorem: p^a(u) = (1/C) p(d|u) p^f(u), C = ∫ p(d|u) p^f(u) du
- Write in terms of measures μ^a, μ^f with densities p^a, p^f:
  ∫_A p^a(u) du = (1/C) ∫_A p(d|u) p^f(u) du, i.e.,
  μ^a(A) = (1/C) ∫_A p(d|u) dμ^f(u) ∀A μ^f-measurable, C = ∫ p(d|u) dμ^f(u)
  ⇔ Radon-Nikodym derivative p(d|u) = C dμ^a/dμ^f
- But how do we know that C = ∫_U p(d|u) dμ^f(u) > 0?
SLIDE 13 Gaussian case
- Data likelihood: d = Hu + ε, ε ∼ N(0, R), p(d|u) = const e^{-(1/2)|Hu−d|^2_{R^{-1}}}
- μ^f is a Gaussian measure on U; the data d ∈ V
- The state space U and the data space V are separable Hilbert spaces
- p(d|·) ∈ L^1(U, μ^f) ⇒ μ^a is absolutely continuous w.r.t. μ^f; the
Feldman-Hájek theorem ⇒ μ^a and μ^f are equivalent
- Difficulties arise when the data are infinite-dimensional...
SLIDE 14 Infinite-dimensional data, Gaussian measure error bad
- Example: μ^f = N(0, Q), H = I ⇔ all of the state observed, d = 0, R = Q
- p(d|u) = const e^{-(1/2)|u|^2_{R^{-1}}} = const e^{-(1/2)⟨R^{-1/2}u, R^{-1/2}u⟩}
- p(d|u) > 0 if u ∈ R^{1/2}(U) = dom(R^{-1/2})
- p(d|u) = e^{-∞} = 0 if u ∉ R^{1/2}(U) = Q^{1/2}(U)
- Q^{1/2}(U) is the Cameron-Martin space of the measure N(0, Q)
- But μ = N(0, Q) ⇒ μ(Q^{1/2}(U)) = 0. Thus ∫_U p(d|u) dμ^f(u) = 0.
- Also seen from the Feldman-Hájek theorem: μ^a and μ^f are not
equivalent.
SLIDE 15 Infinite-dimensional data, white noise error good
- All is good when the data are finite-dimensional and R is not singular.
- More generally, when the data covariance R is bounded below:
  ⟨u, Ru⟩ ≥ α|u|^2 ∀u, α > 0 ⇒ |u|^2_{R^{-1}} < ∞ ∀u ⇒ e^{-(1/2)|Hu−d|^2_{R^{-1}}} > 0 ∀u
  ⇒ ∫ p(d|u) dμ^f(u) = const ∫ e^{-(1/2)|Hu−d|^2_{R^{-1}}} dμ^f(u) > 0
- But if V is not finite-dimensional, then such a data error distribution
N(0, R) is not a probability measure on V: the covariance of a probability measure has finite trace, Σ_{n=1}^∞ λ_n < ∞, while the spectrum σ(R) ≥ α > 0.
- Yet the Bayes theorem μ^a(A) = (1/C) ∫_A p(d|u) dμ^f(u) is fine.
SLIDE 16 Randomized data ensemble Kalman filter
Initial ensemble: [Y^0_1, Y^0_2, ..., Y^0_N] i.i.d., from the Gaussian background.

Advance in time by the model: X^m_k = F(Y^{m-1}_k)

Analysis step: a nonlinear map [X^m_1, X^m_2, ..., X^m_N] → [Y^m_1, Y^m_2, ..., Y^m_N], by solving the inverse problem HY ≈ d^m:

Y^m_k = X^m_k + K(Q^m_N)(D^m_k − HX^m_k), Q^m_N = C_N([X^m_1, ..., X^m_N]), D^m_k ∼ N(d^m, R)
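Put together, one assimilation cycle alternates the model advance with the randomized analysis. A toy sketch with a 2D linear rotation standing in for the model F (all parameters below are invented for illustration):

```python
import numpy as np

# Toy randomized-data EnKF cycle: 2D rotating state, first component observed.
rng = np.random.default_rng(2)
n, m_obs, N = 2, 1, 500
theta = 0.1
M = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])   # stand-in for the model F
H = np.array([[1.0, 0.0]])                        # observation operator
R = 0.25 * np.eye(m_obs)                          # data error covariance

truth = np.array([1.0, 0.0])
Y = rng.standard_normal((n, N))                   # initial ensemble (Gaussian background)

for step in range(20):
    truth = M @ truth                             # advance the true state
    X = M @ Y                                     # advance each member: X^m_k = F(Y^{m-1}_k)
    d = H @ truth + 0.5 * rng.standard_normal(m_obs)        # noisy data, noise ~ N(0, R)
    Xc = X - X.mean(axis=1, keepdims=True)
    QN = Xc @ Xc.T / (N - 1)                      # sample covariance Q^m_N
    K = QN @ H.T @ np.linalg.inv(H @ QN @ H.T + R)  # Kalman gain K(Q^m_N)
    D = d[:, None] + 0.5 * rng.standard_normal((m_obs, N))  # randomized data D^m_k ~ N(d^m, R)
    Y = X + K @ (D - H @ X)                       # analysis ensemble Y^m_k

err = np.linalg.norm(Y.mean(axis=1) - truth)      # ensemble mean should track the truth
```

Although only the first component is observed, the rotation makes the whole state observable over several cycles, so the ensemble mean approaches the truth.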
SLIDE 17 Randomized data EnKF and white data noise
Y^m_k = X^m_k + K(Q^m_N)(D^m_k − HX^m_k), Q^m_N = C_N([X^m_1, ..., X^m_N]), D^m_k ∼ N(d^m, R)

If the data space V is infinite-dimensional and R ≥ αI, α > 0, then N(d^m, R) is only finitely additive on the algebra of cylindrical sets, not a σ-additive measure on the σ-algebra of Borel sets of V. The D^m_k are only weak random variables (every linear functional of them is a random variable).

But Tr Q < ∞ ⇒ Q^{1/2} · (bounded operator) · D^m_k has a σ-additive distribution and is a random element ⇒

K(Q^m_N)D^m_k = Q^m_N H^T(HQ^m_N H^T + R)^{-1} D^m_k

has a σ-additive distribution and is a random element. Back to standard Kolmogorov probability theory.
SLIDE 18
Tools: Lp law of large numbers on a Hilbert space
Define L^p(Ω, V) = {X : E(|X|^p) < ∞}, equipped with the norm ‖X‖_p = E(|X|^p)^{1/p}. The L^p law of large numbers for the sample mean E_N = (1/N) Σ_{i=1}^N X_i:

Let X_k ∈ L^p(Ω, V) be i.i.d.; then ‖E_N − E(X_1)‖_p ≤ (C_p/√N) ‖X_1‖_p.

But L^p laws of large numbers do not generally hold on Banach spaces (in fact, they define Rademacher type p spaces), and the space [V] of all bounded linear operators on a Hilbert space is not a Hilbert space. Worse, any Banach space can be embedded in [V] for some Hilbert space V. So, if something does not hold on Banach spaces, it won't be true for bounded operators on Hilbert spaces either.
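The 1/√N rate in the bound above is easy to observe numerically; a sketch with V = R^50 standing in for the Hilbert space (sizes and seed are arbitrary):

```python
import numpy as np

# Empirical check of the 1/sqrt(N) rate for the sample mean, V = R^50.
rng = np.random.default_rng(0)
dim = 50

def sample_mean_error(N, reps=200):
    """Average norm |E_N - E(X_1)| over reps trials, X_i ~ N(0, I) so E(X_1) = 0."""
    errs = []
    for _ in range(reps):
        X = rng.standard_normal((N, dim))
        errs.append(np.linalg.norm(X.mean(axis=0)))
    return np.mean(errs)

ratio = sample_mean_error(10) / sample_mean_error(1000)
# N grows by a factor of 100, so the error ratio should be near sqrt(100) = 10
```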
SLIDE 19 Tools: Lp law of large numbers for the sample covariance
Let X_i ∈ L^p(Ω, V) be i.i.d. and

C_N([X_1, ..., X_N]) = (1/N) Σ_{i=1}^N X_i ⊗ X_i − ((1/N) Σ_{i=1}^N X_i) ⊗ ((1/N) Σ_{i=1}^N X_i)

But [V] is not a Hilbert space. Use L^2(Ω, V_HS) with V_HS the space of Hilbert-Schmidt operators,

|A|^2_HS = Σ_{n=1}^∞ ⟨Ae_n, Ae_n⟩ < ∞, ⟨A, B⟩_HS = Σ_{n=1}^∞ ⟨Ae_n, Be_n⟩,

where {e_n} is any complete orthonormal sequence in V. In finite dimension, the Hilbert-Schmidt norm becomes the Frobenius norm |A|^2_HS = Σ_{i,j} |a_{ij}|^2.
SLIDE 20
Tools: Lp law of large numbers for the sample covariance
Let X_i ∈ L^{2p}(Ω, V) be i.i.d. and V a separable Hilbert space. Then, from the L^p law of large numbers in the Hilbert space V_HS of Hilbert-Schmidt operators on V,

‖C_N([X_1, ..., X_N]) − Cov(X_1)‖_p ≤ ‖ |C_N([X_1, ..., X_N]) − Cov(X_1)|_HS ‖_p ≤ (const(p)/√N) ‖X_1‖^2_{2p},

where the constant depends on p only. Note that since V is separable, V_HS is also separable.
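The same rate for the sample covariance in the Hilbert-Schmidt (Frobenius) norm can be checked empirically; a sketch with decaying eigenvalues as in the earlier examples (the dimension and sample counts are arbitrary):

```python
import numpy as np

# Empirical 1/sqrt(N) rate for the sample covariance in the Hilbert-Schmidt norm.
rng = np.random.default_rng(0)
dim = 20
lam = 1.0 / np.arange(1, dim + 1) ** 2     # eigenvalues lambda_n = 1/n^2
C = np.diag(lam)                           # true covariance

def cov_error(N, reps=100):
    """Average |C_N - Cov(X_1)|_HS over reps trials."""
    errs = []
    for _ in range(reps):
        X = rng.standard_normal((N, dim)) * np.sqrt(lam)   # samples with covariance C
        Xc = X - X.mean(axis=0)
        CN = Xc.T @ Xc / (N - 1)
        errs.append(np.linalg.norm(CN - C, ord='fro'))     # HS norm = Frobenius norm
    return np.mean(errs)

ratio = cov_error(100) / cov_error(10000)  # N grows 100x, so the ratio should be near 10
```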
SLIDE 21 Convergence of the EnKF to mean-field limit
- Le Gland et al. (2011): the analysis step as a nonlinear transformation of
probability measures.
- Nonlinear transformation of the ensemble as a vector of exchangeable
random variables, [X_1, X_2, ..., X_N] → [Y_1, Y_2, ..., Y_N]
- L^p continuity of the model. Drop the superscript m.
- Mean-field filter:
  Y^{Mean field}_k = X^{Mean field}_k + K(Q)(D_k − HX^{Mean field}_k), Q = Cov(X_1)
- Randomized filter:
  Y^{Randomized}_k = X^{Randomized}_k + K(Q_N)(D_k − HX^{Randomized}_k), Q_N = C_N([X^{Randomized}_1, ..., X^{Randomized}_N])
SLIDE 22 Convergence of the EnKF to mean-field limit (cont)
- Subtract, and use the continuity of the Kalman gain:
  ‖K(Q) − K(Q_N)‖_p ≤ const ‖Q − Q_N‖_{2p}
- L^p law of large numbers for C_N([X^{Mean field}_1, ..., X^{Mean field}_N])
- ‖X^m_k‖_p ≤ const(m) for all m, from ‖(HQH* + R)^{-1}‖ ≤ 1/α by R ≥ αI
- Induction over m: X^{m,Randomized}_1 → X^{m,Mean field}_1 in all L^p, 1 ≤ p < ∞
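The conclusion of the induction, convergence of the randomized filter to the mean-field filter as N grows, can be observed in a scalar example (one analysis step; all numbers are illustrative):

```python
import numpy as np

# Scalar illustration: the randomized EnKF analysis mean approaches the
# mean-field (exact Kalman) analysis mean as the ensemble size N grows.
rng = np.random.default_rng(0)
Q, R, H, uf, d = 2.0, 1.0, 1.0, 0.0, 1.0
K_exact = Q * H / (H * Q * H + R)          # exact gain K(Q)
ua_exact = uf + K_exact * (d - H * uf)     # mean-field analysis mean

def enkf_mean_error(N, reps=200):
    """Average |EnKF analysis mean - mean-field analysis mean| over reps trials."""
    errs = []
    for _ in range(reps):
        X = uf + np.sqrt(Q) * rng.standard_normal(N)   # forecast ensemble
        QN = X.var(ddof=1)                             # sample covariance Q_N
        K = QN * H / (H * QN * H + R)                  # gain K(Q_N)
        D = d + np.sqrt(R) * rng.standard_normal(N)    # randomized data
        Y = X + K * (D - H * X)                        # analysis ensemble
        errs.append(abs(Y.mean() - ua_exact))
    return np.mean(errs)

ratio = enkf_mean_error(100) / enkf_mean_error(10000)  # expect roughly sqrt(100) = 10
```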
SLIDE 23 Conclusion
- With a bit of care, a convergence proof in finite dimension carries over
to an infinite-dimensional separable Hilbert space.
- White noise data error is good: it stabilizes the method.
- But the white noise distribution is not a standard σ-additive probability
measure.
- Convergence in the nonlinear, non-Gaussian case to a mean-field limit.
- The square root EnKF has no randomization; convergence in the Gaussian
case: Kwiatkowski and M. 2015.
- Computational experiments confirm that the EnKF converges uniformly
for high-dimensional distributions that approximate a Gaussian measure
on a Hilbert space (J. Beezley, Ph.D. thesis, 2009). The EnKF for
distributions with slowly decaying eigenvalues of the covariance converges very slowly and requires large ensembles.
SLIDE 24
References
[1] A. V. Balakrishnan. Applied Functional Analysis. Springer-Verlag, New York, 1976.
[2] Jonathan D. Beezley. High-Dimensional Data Assimilation and Morphing Ensemble Kalman Filters with Applications in Wildfire Modeling. PhD thesis, University of Colorado Denver, 2009.
[3] Gerrit Burgers, Peter Jan van Leeuwen, and Geir Evensen. Analysis scheme in the ensemble Kalman filter. Monthly Weather Review, 126:1719–1724, 1998.
[4] Giuseppe Da Prato. An Introduction to Infinite-Dimensional Analysis. Springer-Verlag, Berlin, 2006.
[5] Geir Evensen. Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte Carlo methods to forecast error statistics. Journal of Geophysical Research, 99(C5):10143–10162, 1994.
[6] Evan Kwiatkowski and Jan Mandel. Convergence of the square root ensemble Kalman filter in the large ensemble limit. SIAM/ASA Journal on Uncertainty Quantification, 3(1):1–17, 2015.
SLIDE 25
[7] Kody J. H. Law, Hamidou Tembine, and Raul Tempone. Deterministic methods for filtering, part I: Mean-field ensemble Kalman filtering. arXiv:1409.0628, 2014.
[8] F. Le Gland, V. Monbet, and V.-D. Tran. Large sample asymptotics for the ensemble Kalman filter. In Dan Crisan and Boris Rozovskii, editors, The Oxford Handbook of Nonlinear Filtering, pages 598–631. Oxford University Press, 2011.
[9] Michel Ledoux and Michel Talagrand. Probability in Banach Spaces. Ergebnisse der Mathematik und ihrer Grenzgebiete (3), Vol. 23. Springer-Verlag, Berlin, 1991.
[10] Jan Mandel, Loren Cobb, and Jonathan D. Beezley. On the convergence of the ensemble Kalman filter. Applications of Mathematics, 56:533–541, 2011.
[11] A. M. Stuart. Inverse problems: a Bayesian perspective. Acta Numerica, 19:451–559, 2010.