Robust method for EnKF in the presence of observation - PowerPoint PPT Presentation

Robust method for EnKF in the presence of observation outliers/Multivariate localization methods for EnKF Mikyoung Jun Department of Statistics Texas A&M University May 19, 2015 M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 1 / 32

Setup ◮ y t ∈ R p : observations at time t ◮ x t ∈ R n : unobservable state at time t ◮ Nonlinear system eq. x t = M ( x t − 1 ) + e t ◮ Observation eq. y t = H t x t + ǫ t ◮ System error e t ∼ N n ( 0 , Q t ) ◮ Observation error ǫ t ∼ N p ( 0 , R t ) ◮ System and observation errors are uncorrelated. ◮ M , H t , Q t , R t assumed to be known. M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 2 / 32

Definitions ◮ M -member background ensemble x b = { x b ( k ) : k = 1 , . . . , M } ∈ R n × M x b = 1 � M ◮ background mean ¯ k = 1 x b ( k ) M ◮ ensemble-based estimate of the background error covariance P b = k = 1 X b ( k ) [ X b ( k ) ] T , where X b ( k ) = x b ( k ) − ¯ � M 1 x b M − 1 x a = ¯ x b + K ( y − H ¯ x b ) ◮ analysis mean ¯ ◮ analysis covariance P a = ( I − KH ) P b ◮ Kalman gain K = P b H T ( HP b H + R ) − 1 M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 3 / 32

Part 1: Multivariate Localization for EnKF Outline Part 1: Multivariate Localization for EnKF Part 2: Robust EnKF in the presence of observation outliers M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 4 / 32

Part 1: Multivariate Localization for EnKF Motivation Joint work with Soojin Roh , Istvan Szunyogh , and Marc Genton ◮ Localization: Schur (elementwise) product of P b and a localization matrix from a compactly supported correlation function ρ ( · ) ◮ In statistics, localization function is called “taper” or compactly supported covariance function; needs to be positive definite ◮ For multivariate state variables, current practice is to apply the same localization function to each “block” of P b ◮ Does it not matter? ( K = P b H T ( HP b H + R ) − 1 ) ◮ Kang et al. (2011, JGR) zeroes out covariances between physically unrelated variables (not about positive-definiteness) ◮ M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 5 / 32

Part 1: Multivariate Localization for EnKF Motivation ◮ Problem of rank deficiency: � L � L ◮ Localization matrix L L ◮ Problem is more serious when P b ij ’s are “significantly” non-zero ◮ We need ρ ( · ) = { ρ ij ( · ) } i , j = 1 ,..., N : matrix-valued correlation (positive definite) function, N : number of state variables – multivariate version of Gaspari-Cohn functions? ◮ In statistics literature, not many known such “valid” ρ (parametric) functions are available yet M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 6 / 32

Part 1: Multivariate Localization for EnKF One simple idea ◮ Use ρ ij ( · ) = β ij · ρ ( · ) with | β ij | < 1, | β ji | < 1, and β ii = β jj = 1. � 1 � β is positive-definite and of full rank for any β with | β | < 1. ◮ β 1 ◮ For ρ , use any localization functions in Gaspari and Cohn (1999). ◮ Example: 8 − 1 4 ( | d | / c ) 5 + 1 2 ( d / c ) 4 + 5 8 ( | d | / c ) 3 − 5 3 ( d / c ) 2 + 1 , 0 ≤| d |≤ c ; > < ρ ( d ; c ) = (1) 12 ( | d | / c ) 5 − 1 1 2 ( d / c ) 4 + 5 8 ( | d | / c ) 3 + 5 3 ( d / c ) 2 − 5 ( | d | / c )+ 4 − 2 3 c / | d | , c ≤| d |≤ 2 c ; > 0 , : 2 c ≤| d | and β 11 = β 22 = 1, 0 ≤ β ij ≤ 1. ◮ This multivariate localization function is separable in the sense that � 1 � β multivariate component (in the above example, ) β 1 and localization function (in the above example, ρ ) are factored M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 7 / 32

Part 1: Multivariate Localization for EnKF Another idea ◮ Use one of a few multivariate compactly supported correlation functions available in statistics literature. ◮ e.g. Bivariate Askey function (Porcu et al. 2012) � ν + µ ij � 1 − d ρ ij ( d ; ν, c ) = β ij , c + ◮ c > 0, µ 12 = µ 21 ≥ 1 2 ( µ 11 + µ 22 ) , ν ≥ [ 1 2 s ] + 2, and s is space dimension. � Γ( 1 + µ 12 ) Γ( 1 + ν + µ 11 )Γ( 1 + ν + µ 22 ) ◮ | β ij | ≤ , β ii = β jj = 1 Γ( 1 + ν + µ 12 ) Γ( 1 + µ 11 )Γ( 1 + µ 22 ) ◮ | β ij | ≤ 1 if µ 11 = µ 22 . M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 8 / 32

Part 1: Multivariate Localization for EnKF Gaspari−Cohn ● 0.8 Askey ( ν =1) covariance Askey ( ν =2) Askey ( ν =3) ● 0.4 0.0 0 10 20 30 40 50 distance M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 9 / 32

Part 1: Multivariate Localization for EnKF Experiment with bivariate Lorenz Model ◮ X k and Y j , k are equally spaced on a latitude circle ( j = 1 , . . . , J and k = 1 , . . . , K ). ◮ With boundary conditions X k ± K = X K , Y j , k ± K = Y j , k , Y j − J , k = Y j , k − 1 , and Y j + J , k = Y j , k + 1 , J dX k � = − X k − 1 ( X k − 2 − X k + 1 ) − X k − ( ha / b ) Y j , k + F , dt j = 1 dY j , k = − abY j + 1 , k ( Y j + 2 , k − Y j − 1 , k ) − aY j , k + ( ha / b ) X k dt M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 10 / 32

Part 1: Multivariate Localization for EnKF Experiment details ◮ True model states generated by a long time step integration of the model ◮ Initialize ensembles by adding Gaussian noise to the true state ◮ We discard first 3000 time steps ◮ Simulated observations are generated by adding mean zero noise (variance 0.02 for X and 0.005 for Y ) to the truth ◮ 20 ensemble members are used (we tested 40 ensembles as well) ◮ Covariance inflation of 1.015 ◮ RMSE calculated using last 1000 time steps and we repeat 50 times to produce boxplots M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 11 / 32

Part 1: Multivariate Localization for EnKF Bivariate Lorenz Model ◮ 36 variables of X , 360 variables of Y , a = 10, b = 10, h = 2 6 X Y 4 state 2 X Y 0 0 50 100 150 200 250 300 350 variable locations longitudinal profiles M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 12 / 32

Part 1: Multivariate Localization for EnKF Experiment set up ◮ Two scenarios for observation 1 Observe 20 % of X and 90 % of Y at locations where X is not observed. 2 Fully observe X and Y ◮ Four localization schemes S1 No localization S2 No localization and let P b 12 = P b 21 = 0. S3 localize P b 11 and P b 22 , but let P b 12 = P b 21 = 0. S4 localize P b 11 , P b 22 , P b 12 , P b 21 M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 13 / 32

Part 1: Multivariate Localization for EnKF Localization (S4) 1 Gaspari-Cohn function: ρ ij ( d ; c ) = β ij ρ ( d ; c ) , i , j = 1 , 2, where  − 1 4 ( | d | / c ) 5 + 1 2 ( d / c ) 4 + 5 8 ( | d | / c ) 3 − 5 3 ( d / c ) 2 + 1 , 0 ≤| d |≤ c ;   ρ ( d ; c ) = 3 c / | d | , 12 ( | d | / c ) 5 − 1 1 2 ( d / c ) 4 + 5 8 ( | d | / c ) 3 + 5 3 ( d / c ) 2 − 5 ( | d | / c )+ 4 − 2 c ≤| d |≤ 2 c ;  0 ,  2 c ≤| d | and β 11 = β 22 = 1, 0 ≤ β ij ≤ 1. (support=2 c ) 2 Bivariate Askey function � ν + µ ij � 1 − | d | ρ ij ( d ; c ) = β ij , i , j = 1 , 2 c + with µ 11 = 0, µ 22 = 2, µ ij = 1, ν = 3, and β 11 = β 22 = 1, 0 ≤ β ij ≤ 0 . 7. (support= c ) M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 14 / 32

Part 1: Multivariate Localization for EnKF Results for X in scenario 1 support 50 support 70 4 4 Gaspari−Cohn Askey 3 3 RMSE RMSE ● 2 2 ● ● ● ● ● ● ● 1 1 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 ● 0 S1 S2 S3 S4 S4 S4 S4 S4 S4 S1 S2 S3 S4 S4 S4 S4 S4 S4 5e −3 1e −2 0.1 5e −3 1e −2 0.1 0.4 0.7 1 0.4 0.7 1 support 100 support 160 4 4 3 3 RMSE RMSE ● 2 ● ● 2 ● ● ● ● ● ● ● ● ● ● ● ● 1 ● 1 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 0 S1 S2 S3 S4 S4 S4 S4 S4 S4 S1 S2 S3 S4 S4 S4 S4 S4 S4 5e −3 1e −2 0.1 5e −3 1e −2 0.1 0.4 0.7 1 0.4 0.7 1 M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 15 / 32

Part 1: Multivariate Localization for EnKF Results for X in scenario 1 ◮ S1 vs S2: ignoring cross-covariance is better than not doing localization ◮ S3 is worse than S1 ◮ S4 with β = 0 . 01 performs the best regardless of localization radius ◮ Askey seems to be better than the Gaspari-Cohn function M. Jun (TAMU) Multivariate Localization/Robust Methods for EnKF May 19 16 / 32

Robust method for EnKF in the presence of observation - PowerPoint PPT Presentation

Robust method for EnKF in the presence of observation outliers/Multivariate localization methods for EnKF Mikyoung Jun Department of Statistics Texas A&M University May 19, 2015 M. Jun (TAMU) Multivariate Localization/Robust Methods for

Real-data Mesoscale Applications of EnKF and Towards coupling EnKF with 4DVAR Fuqing Zhang

Morphing and wavelet EnKF data assimilation Jan Mandel Based on joint work with J. D. Beezley, L.

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

Qatar observation stations Qatar observation stations, Instruments and calibrations By By

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

EnKF FAQ (Ensemble Kalman filter Frequently asked questions) pdf x Patrick N. Raanes,

Adaptive covariance inflation in the EnKF by Gaussian scale mixtures pdf x Patrick N. Raanes,

EnKF and filter divergence David Kelly Andrew Stuart Kody Law Courant Institute New York

An adaptive plurigaussian truncation scheme for geological uncertainty quantification using EnKF.

Toward Assimilation of Crowdsourcing Data using the EnKF William Lahoz and Philipp Schneider

EnKF and filter divergence David Kelly Andrew Stuart Kody Law Courant Institute New York

Comparison of 4DVAR and EnKF state estimates and forecasts in the Gulf of Mexico Ganesh

What do we know about EnKF? David Kelly Kody Law Andrew Stuart Andrew Majda Xin Tong Courant

Presence Presence Presence When we wake up in the morning we may automatically leave our

Evaluation of quality circles Documentary method Evaluation, observation, communication

Short Course in Supervised Learning Robust Optimization and Machine Learning Robust Supervised

EASM 2014 empirical evidence about the awareness and meaning of Olympism. For example,

Effort and achievement of 15-year-olds in PISA 2015 across EU member states Opportunity versus

Bayesian Analysis of Multivariate Normal Models when Dimensions are Absent Robert Zeithammer

V ALUE AT R ISK ( VaR ) Let X be a random variable representing loss, F its distribution function

A Municipal Bond Case Study Material drawn from USAID, Developing Sustainable and Inclusive Urban

MONADNOCK BROADBAND GROUP December 2, 2019 Presented by: Tammy J. St. Gelais, Executive Director

Sustainability Bond for Dutch Municipalities November 2018 Disclaim er THE SLIDES USED IN THIS

VAN | 2014 Municipal Bonds MAYOR DAN SUL L I 2014 MOA BONDS Re q ue st: $32.7 Millio n

Sambuz

Useful Links

Newsletter

Mail Us