Predictor-corrector ensemble filters for high-dimensional nonlinear systems and sparse data, and regularized ensemble Kalman filter for a wildfire model



1. Predictor-corrector ensemble filters for high-dimensional nonlinear systems and sparse data, and regularized ensemble Kalman filter for a wildfire model. Jan Mandel, University of Colorado at Denver and Health Sciences Center; NCAR MMM and IMAGe. The DDDAS wildfire team: Lynn Bennethum, Jonathan Beezley, Janice Coen, Craig Douglas, Leo Franca, Minjeong Kim, Bob Kremens, Deng Li, Guan Qin, Tony Vodacek. Supported by National Science Foundation Grant CNS-0325314 and an NCAR ASP Faculty Fellowship.

2. The challenge: data assimilation into highly nonlinear wildfire models
• Simulation code is a black box ⇒ ensemble filters
• Highly non-Gaussian ⇒ predictor-corrector ensemble filters (Jan Mandel and Jonathan Beezley, CCM Report 232, 2006)
• Need to avoid nonphysical solutions ⇒ regularized EnKF (C. J. Johns and J. Mandel, CCM Report 221, 2005; Env. Ecol. Stat., to appear)
• Efficient parallel EnKF for large data (Jan Mandel, CCM Report 231, 2006)
• Data assimilation into a one-step reaction-diffusion fire model (Jan Mandel, Lynn S. Bennethum, Jonathan Beezley, Janice L. Coen, Craig C. Douglas, Leopoldo P. Franca, Minjeong Kim, and Anthony Vodacek, CCM Report 233, 2006)
http://www-math.cudenver.edu/ccm/reports

3. Data Assimilation: Sequential Statistical Estimation. The state of the model is the probability density $p(u)$ of the system state $u$. Analysis cycle:

$p_{a,t_{k-1}} \xrightarrow{\text{advance}} p_{f,t_k} \xrightarrow{\text{Bayes}} p_{a,t_k}$

where $p_{f,t_k}$ is the "forecast" probability density at time $t_k$ and $p_{a,t_k}$ is the "analysis" probability density at time $t_k$. The model state is advanced in time by the model. At analysis time, new data is incorporated by the use of Bayes' theorem:

$p_{a,t_k}(u) \propto p(d \mid u)\, p_{f,t_k}(u)$

Here $\propto$ means proportional; densities are always scaled so that $\int p \, d\nu = 1$, where $\nu$ is some underlying measure (Lebesgue in finite dimension). $p(d \mid u)$ is the data likelihood, i.e. the probability density that the data value is $d$ if the system state is $u$.
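A standard worked instance of this update (textbook material, not from the slides): for a scalar state with Gaussian forecast density $p_f(u) = N(u;\, m, \sigma^2)$ and Gaussian data likelihood $p(d \mid u) = N(d;\, u, \tau^2)$, Bayes' theorem yields a Gaussian analysis density,

$p_a(u) \propto e^{-(u-m)^2/2\sigma^2}\, e^{-(d-u)^2/2\tau^2} = N\!\left(u;\ \frac{\tau^2 m + \sigma^2 d}{\sigma^2 + \tau^2},\ \frac{\sigma^2 \tau^2}{\sigma^2 + \tau^2}\right),$

which is exactly the update that Kalman-type filters reproduce in the linear Gaussian case.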

4. Data Likelihood. The data likelihood is the probability density that the data value is $d$ if the system state is $u$. It can be obtained from the data error probability density $p_\varepsilon$ (assumed known) and the observation function $h(u)$ = what the data would be if there were no errors and the system state is $u$:

$P(d - h(u) \in A) = \int_A p_\varepsilon \, d\nu \;\Longrightarrow\; p(d \mid u) = p_\varepsilon(d - h(u))$

This formula tacitly assumes that the data error is additive and does not depend on $d$ and $u$. All data must be accompanied by an error estimate (= error probability distribution), otherwise the data is meaningless.
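A minimal sketch of this likelihood under the stated assumptions (additive Gaussian data error with known covariance); the names `data_likelihood`, `h`, and `err_cov` are illustrative, not from the presentation's code:

```python
import numpy as np

def data_likelihood(d, u, h, err_cov):
    """p(d | u) = p_eps(d - h(u)) for additive Gaussian data error.

    h is the observation function, err_cov the data error covariance
    (the error estimate that must accompany the data).
    """
    r = d - h(u)                              # data minus what the data "should" be
    m = len(r)
    quad = r @ np.linalg.solve(err_cov, r)    # r^T C^{-1} r
    norm = np.sqrt((2.0 * np.pi) ** m * np.linalg.det(err_cov))
    return np.exp(-0.5 * quad) / norm
```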

5. Ensemble Approach. The model state (= probability density of the system state) is represented by a weighted ensemble $(u_k, w_k)$, $k = 1, \ldots, N$, where the $u_k$ are system state vectors and the $w_k$ are positive weights with $\sum_{k=1}^N w_k = 1$. Simple case: all weights are the same, $w_k = 1/N$, and $\{u_k\}$ is a sample from the model state $p$, $u_k \sim p$ (values of independent, identically distributed random variables with density $p$).

6. Weighted ensemble. How can a probability density be represented by the weights of an ensemble? The ensemble members are a sample from some probability density, $u_k \sim p_\pi$. We want the weighted ensemble to represent another probability density $p$. For a given function $f$, consider the Monte Carlo approximation of the integral

$\int_\Omega f p \, d\nu = \int_\Omega f\, \frac{p}{p_\pi}\, p_\pi \, d\nu \approx \sum_{k=1}^N w_k f(u_k), \quad \text{with the weights } w_k = \frac{1}{N}\, \frac{p(u_k)}{p_\pi(u_k)}.$

So, given a sample $\{u_k\}$ from $p_\pi$:

$u_k \sim p_\pi, \quad w_k \propto \frac{p(u_k)}{p_\pi(u_k)} \;\Longrightarrow\; (u_k, w_k) \sim p.$
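A minimal numerical sketch of this importance-sampling identity; the target and proposal densities below are illustrative choices, not from the slides:

```python
import numpy as np

def normal_pdf(x, mean, sd):
    return np.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))

rng = np.random.default_rng(0)
N = 100_000

# Sample from the proposal p_pi = N(0, 2), but target p = N(1, 1).
u = rng.normal(0.0, 2.0, N)                     # u_k ~ p_pi
w = normal_pdf(u, 1.0, 1.0) / normal_pdf(u, 0.0, 2.0)
w /= w.sum()                                    # w_k ∝ p(u_k) / p_pi(u_k)

# The weighted ensemble now represents p: E_p[f(u)] ≈ sum_k w_k f(u_k).
print(np.sum(w * u))                            # ≈ 1.0, the mean of p
```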

7. Sequential Importance Sampling (SIS), a.k.a. Sequential Monte Carlo (SMC), a.k.a. Particle Filters (PF). Given a forecast ensemble $(u^f_k, w^f_k) \sim p^f$ and a sample $\{u^a_k\}$ from a proposal density $p_\pi$, apply Bayes' theorem:

$p^a(u) = p(u \mid d) \propto p(d \mid u)\, p^f(u) \;\Longrightarrow\; w^a_k \propto p(d \mid u^a_k)\, \frac{p^f(u^a_k)}{p_\pi(u^a_k)}, \quad (u^a_k, w^a_k) \sim p^a.$

SIS chooses $u^a_k = u^f_k$ (it does not change the ensemble) and only updates the weights: since $u^f_k \sim p_\pi$ with $w^f_k \propto p^f(u^f_k)/p_\pi(u^f_k)$ already,

$w^a_k \propto p(d \mid u^f_k)\, w^f_k.$
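A minimal sketch of this weight update, assuming the likelihoods $p(d \mid u^f_k)$ have already been evaluated (e.g. by a routine like the `data_likelihood` sketch above; both names are illustrative):

```python
import numpy as np

def sis_update(w_f, likelihoods):
    """SIS analysis step: ensemble members unchanged,
    w^a_k ∝ p(d | u^f_k) w^f_k, normalized to sum to 1."""
    w_a = w_f * likelihoods
    return w_a / w_a.sum()
```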

8. Sequential Importance Sampling with Resampling (SIR). Trouble with SIS:
1. data likelihoods can be very small, so that
2. one or a few of the analysis weights dominate, and
3. the analysis ensemble represents the analysis density poorly.
Standard solution: SIR (Gordon 1993, ...):
1. resample by choosing $u^a_k$ with probability $\propto w^a_k$ and set all weights equal (a sketch of this step follows below);
2. rely on the stochastic behavior of the model to recover spread.
But there is still trouble:
1. huge ensembles (thousands) are needed, because the analysis distribution is effectively approximated only by those ensemble members that have large weights, while the vast majority of the weights is infinitesimal;
2. if the model is not stochastic, artificial perturbations are needed to recover ensemble spread.
Solution: predictor-corrector filters (new). Place the analysis ensemble so that the weights are all reasonably large.
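A minimal sketch of the resampling step (multinomial resampling, the simplest variant; systematic resampling is a common lower-variance alternative):

```python
import numpy as np

def resample(ensemble, w_a, rng=None):
    """SIR resampling: draw u^a_k with probability ∝ w^a_k,
    then set all weights equal to 1/N.

    ensemble: array of shape (N, state_dim); w_a: normalized weights.
    """
    rng = rng or np.random.default_rng()
    N = len(w_a)
    idx = rng.choice(N, size=N, p=w_a)        # multinomial resampling
    return ensemble[idx], np.full(N, 1.0 / N)
```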

9. Predictor-Corrector Filters (new). Given a forecast ensemble $(u^f_\ell, w^f_\ell) \sim p^f$ and a proposal ensemble $(u^a_k) \sim p_\pi$ obtained by a predictor, apply Bayes' theorem:

$p^a(u) = p(u \mid d) \propto p(d \mid u)\, p^f(u) \;\Longrightarrow\; w^a_k \propto p(d \mid u^a_k)\, \frac{p^f(u^a_k)}{p_\pi(u^a_k)}, \quad (u^a_k, w^a_k) \sim p^a.$

Estimate the ratio of the densities from $(u^f_\ell, w^f_\ell)$ to get the analysis weights (the corrector), as sketched after this slide:

$w^a_k \propto p(d \mid u^a_k)\, e_k, \qquad e_k \approx \frac{p^f(u^a_k)}{p_\pi(u^a_k)}.$

Trouble:
1. density estimates in high dimension are intractable;
2. one needs to estimate far away from, and outside of, the span of the sample.
Solution:
1. the probability densities are not arbitrary: they come from probability measures on spaces of smooth functions, with low effective dimension;
2. use nonparametric estimation that depends only on the concept of distance.
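A minimal sketch of the corrector step, assuming the likelihoods and the density-ratio estimates $e_k$ are computed separately (the estimator for $e_k$ is the subject of slide 11; the function name is illustrative):

```python
import numpy as np

def corrector_weights(likelihoods, ratios):
    """Corrector: w^a_k ∝ p(d | u^a_k) e_k.

    likelihoods[k] = p(d | u^a_k) evaluated at the proposal members;
    ratios[k] = e_k ≈ p^f(u^a_k) / p_pi(u^a_k).
    """
    w_a = likelihoods * ratios
    return w_a / w_a.sum()
```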

10. Nonparametric Density Estimation. Let $\{u_\ell : \ell = 1, \ldots, N\} \sim p$ and let $B_h(y) = \{x : \|x - y\| < h\}$ be the ball with radius $h$ and center $y$. The density estimate

$p_{N,h}(y) = \frac{\text{number of } u_\ell \in B_h(y)}{N \cdot \text{measure of } B_h(y)} \to p(y)$ in probability

if $p$ is continuous at $y$ and either $h = h(N)$ is the distance from $y$ to the $k(N)$-th nearest $u_\ell$, with

$k(N) \to \infty, \quad k(N)/N \to 0, \quad N \to \infty$

(Loftsgaarden-Quesenberry (1965), in $\mathbb{R}^n$ with Lebesgue measure), or

$h = h(N) \to 0, \quad N \cdot \text{measure of } B_{h(N)}(y) \to \infty, \quad N \to \infty$

(Rosenblatt (1956) in $\mathbb{R}^n$, Dabo-Niang (2004) in Banach spaces).
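A minimal sketch of the nearest-neighbor variant in $\mathbb{R}^n$ with Lebesgue measure (the function name is illustrative; ties on the ball boundary are ignored for simplicity):

```python
import numpy as np
from math import gamma, pi

def knn_density(y, sample, k):
    """Loftsgaarden-Quesenberry-style k-NN estimate: h is the distance
    from y to the k-th nearest sample point, so k points fall in
    B_h(y), whose Lebesgue volume is the ball-volume formula below."""
    N, n = sample.shape
    h = np.sort(np.linalg.norm(sample - y, axis=1))[k - 1]
    ball_volume = pi ** (n / 2) / gamma(n / 2 + 1) * h ** n
    return k / (N * ball_volume)
```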

11. Nonparametric Estimation of Importance Weights. Recall the analysis weights: $w^a_k \propto p(d \mid u^a_k)\, e_k$, where $e_k \approx p^f(u^a_k)/p_\pi(u^a_k)$. Generalize the density estimate to a weighted ensemble:

$(u_k, w_k) \sim p: \quad \frac{\sum_{\ell:\, \|u_\ell - x\| < h} w_\ell}{\text{measure of } B_h(x)} \approx p(x).$

Recall the forecast ensemble $(u^f_k, w^f_k)_{k=1}^N$ and the proposal ensemble $(u^a_k)_{k=1}^N$. Substitute, and the (generally unknown) measure of $B_h(x)$ falls out:

$\frac{p^f(u^a_k)}{p_\pi(u^a_k)} \approx \frac{\sum_{\ell:\, \|u^f_\ell - u^a_k\| < h} w^f_\ell}{\frac{1}{N}\, \#\{\ell : \|u^a_\ell - u^a_k\| < h\}}.$

It remains to choose the norm $\|\cdot\|$ in the estimate. The bandwidth $h$ is chosen as on the previous slide.
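A minimal sketch of this ratio estimator, with the Euclidean norm standing in for the (yet to be chosen) norm $\|\cdot\|$; all names are illustrative:

```python
import numpy as np

def density_ratio(u_a_k, ens_f, w_f, ens_a, h):
    """e_k ≈ p^f(u^a_k) / p_pi(u^a_k): sum of forecast weights in the
    ball B_h around u^a_k over the fraction of proposal members in the
    same ball; the unknown measure of B_h cancels between the two.

    ens_f, ens_a: (N, state_dim) forecast and proposal ensembles;
    w_f: forecast weights. u^a_k itself lies in the ball, so the
    denominator is at least 1/N.
    """
    in_f = np.linalg.norm(ens_f - u_a_k, axis=1) < h
    in_a = np.linalg.norm(ens_a - u_a_k, axis=1) < h
    return w_f[in_f].sum() / np.mean(in_a)
```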

12. Choosing the norm in the density ratio estimate. We need performance independent of dimension ⇒ work in infinite dimension: the model state is a probability distribution on a space of functions. The densities $p^f$, $p^a$, $p_\pi$ need to exist w.r.t. some underlying measure $\nu$. The ratio of densities $p^f/p_\pi$ does not depend on $\nu$ as long as the densities exist and are nonzero, but the estimate does. The measure $\nu$ is linked to the norm via the measure of the ball, $\nu(B_h(x))$; we need $\nu(B_h(x)) > 0$ and $\lim_{h \to 0} \nu(B_h(x)) = 0$. The density of the probability distribution $\varphi$ of the system state w.r.t. $\nu$ exists $\iff \forall A: \nu(A) = 0 \Rightarrow \varphi(A) = 0$ (Radon-Nikodym theorem).

13. Choosing the norm in the density ratio estimate (translation). The norm is linked to the underlying measure via the measure of small balls, which needs to have reasonable properties. The probability for a function in the state space to fall into a ball should be positive, with limit zero as the balls shrink. The probability for a function in the state space to fall into a set of $\nu$-measure zero should be zero. In infinite dimension this is far from automatic, and it restricts the choice of the norm in the density estimate! Lebesgue measure in infinite dimension does not exist ⇒ use a Gaussian measure. At least the initial ensemble should come from a probability distribution that has a density w.r.t. the measure $\nu$.

14. Smooth random fields. The initial ensemble is constructed by perturbing the initial condition by smooth random fields; this gives the underlying measure $\nu$ and the associated norm. A random smooth function (random field) from some space $V$:

$u(x) = \sum_{n=1}^\infty \lambda_n v_n e_n(x), \quad v_n \sim N(0, 1),$

where $\{e_n\}$ is a Fourier basis of $V$; 1D example: $e_n(x) = \sin(nx)$. The coefficients $\lambda_n > 0$, $\lambda_n \to 0$, determine the smoothness: faster decay of $\lambda_n$ ⇒ smoother $u$. Well known in geosciences as "random fields by FFT" (Evensen 1994, ...). $\lambda_n$ and $e_n$ are eigenvalues and eigenvectors of the covariance of the random field. Theory of Gaussian measures: one must assume $\sum_{n=1}^\infty \lambda_n^2 < \infty$.
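A minimal sketch of drawing one such random field on a 1D grid, with the illustrative choice $\lambda_n = n^{-\text{decay}}$ (any decay exponent above $1/2$ keeps $\sum \lambda_n^2 < \infty$; faster decay gives a smoother field):

```python
import numpy as np

def smooth_random_field(x, n_modes=100, decay=2.0, rng=None):
    """One draw of u(x) = sum_n lambda_n v_n sin(n x), v_n ~ N(0,1),
    truncated to n_modes terms on the grid points x."""
    rng = rng or np.random.default_rng()
    n = np.arange(1, n_modes + 1)
    lam = n ** (-decay)                        # lambda_n: spectral decay
    v = rng.standard_normal(n_modes)           # v_n ~ N(0, 1)
    return (lam * v) @ np.sin(np.outer(n, x))  # sum over the Fourier modes

# Example: one smooth perturbation on a grid over (0, pi).
field = smooth_random_field(np.linspace(0.0, np.pi, 200))
```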
