Sequential Monte Carlo Methods for State and and Parameter - PDF document

Sequential Monte Carlo Methods for State and and Parameter Estimation (with application to ocean biogeochemistry) Michael Dowd Dept of Mathematics and Statistics Dalhousie University Halifax, N.S., Canada BIRS Data Assimilation Workshop, February 3-8, 2008 Outline 1. Motivation: observations and dynamic models for ocean biogeochemistry 2. The state space model for nonlinear and nonGaussian systems: filtering and smoothing 3. Sequential Monte Carlo approaches: resampling/bootstrap and MCMC 4. Static parameter estimation for stochastic dynamics: likelihood and state augmentation

Statistical Estimation for Nonlinear NonGaussian Dynamic Systems Approaches … • Statistical emulators/ computer experiments for studying large scale dynamical model and perhaps DA. • Functional data analysis applied to estimating differential equations • Hierarchical Bayes and Markov Chain Monte Carlo • Sequential Monte Carlo approaches* Data

Ocean Biogeochemical Time Series • Ocean Observatory at Lunenburg, Canada obs � gamma( � t , � ) P t obs � lognormal( � t , f ( � t )) N t Long Term Ocean Time - Space Series Bermuda Atlantic Time Series: • 15 years, monthly cruises • measure depth profiles of biogeochemical variables • use CTD and bottle samples Temperature Chlorophyll

Dynamic Models A Physical - Biogeochemical Model dP { } � P � � P P � g P ; k P { } IZ + n P dt = g N ; k N dZ { } IZ � � Z Z + n Z dt = � g P ; k P dN { } IZ � g N ; k N { } � P + �� Z Z + n N dt = � D + � g P ; k P Physics dD { } IZ + (1 � � ) � Z Z + n D dt = � � D + � P P + (1 � � � � ) g P ; k P � � � X � t � � � X Sample Output � = SMS ( X ) � � K � z � t � z � � S � t � � � � S � � � � � � u u � � � � � = � � = F S K f v F K � � t � u � � � z � t � z � t z z � � � � � v � t � � � v � T � t � � � T � + f u = F � � = F � � � z K t K � � v � t � T � z � z � z Biogeochemistry where turbulence sub-model computes K t and K t ’ from u , v , T and S

Incorporating Stochasticity (stochastic photosynthetic parameter + system noise) O-D ODE based PZND model • Frequent transitioning across bifurcation • aperiodic/episodic • dynamical dependencies maintained • note (high freq) forcing versus (low freq) response Describe ensemble properties as a distribution The estimation problem for system state and parameters …

The State Space Model dynamics equation measurement equation = y h x e ( , ) t t t t x t = f ( x t � 1 , � , n t ) y t = h ( x t , � , � t ) (measurement equation) or or y t : measurements h t : obs operator y t � p ( y t | x t , � ) x t � p ( x t | x t � 1 , � ) e t : measurement error • Given Y � = ( � y 1 ,...., � y � � ) � want to jointly estimate the state x t and static parameters � and � General Case (Nonlinear Stochastic Dynamics): Filtering and Smoothing for State Estimation* Filtering: � p ( x t | Y t , � , � ) � p ( y t | x t , � ) p ( x t | x t � 1 , � ) p ( x t � 1 | Y t � 1 , � , � ) dx t � 1 for t=1, …, T , given p(x 0 ) Smoothing: T T p ( x 1: T | Y T , � , � ) = p ( x 0 ) � p ( x t | x t � 1 , � ) � p ( y t | x t , � ) t = 1 t = 1 � nonlinear, non-Gaussian case can be treated with sampling based solutions (via sequential MC methods) * treat parameter estimation later on …

Sequential Monte Carlo Approaches 1. Stochastic dynamic prediction : numerical integration of stochastic dynamic system (generate forecast ensemble) = f ( x t � 1| t � 1 i = 1,...., n ( i ) ( i ) ( i ) , � ), x t | t � 1 , n t { } � p ( x t | Y t � 1 , � , � ) ( i ) x t | t � 1 � Ensemble must cover the part of state space with non- negligible values of the predictive density 2. Bayesian blending of measurements and numerical � model predictions (e.g. resampling, MCMC). Sequential Bayesian Monte Carlo { } � p ( x t | Y t � 1 , � , � ) { } � p ( x t | Y t , � , � ) ( i ) ( i ) x t | t � 1 x t | t ( i ) = w t � 1 ( i ) p ( y t | x t | t � 1 ( i ) ), i = 1,...., n (a) SIR - compute: w t { } � x t | t { } � p ( x t | Y t ) ( i ) , w t ( i ) ( i ) - weighted resample of x t | t � 1 or (b) Sequential Metropolis Hastings MCMC (c) Resample - Move (SIR/MCMC) (d) Ensemble/unscented Kalman filter (approximate) + …

Sequential Metropolis-Hastings { } � p ( x t � 1 | Y t � 1 , � , � ), i = 1,...., n ( i ) Basic Idea: Given x t � 1| t � 1 1. Generate candidate from predictive density: loop over k � p ( x t | Y t � 1 , � , � ) = f ( x t � 1| t � 1 * * * * , � ) x t | t � 1 via: x t | t � 1 , n t 2. Evaluate acceptance probability � � , � , � ) * � = min 1, p ( y t | x t | t � 1 * ( k ) � , choose x t | t � 1 or x t | t � ( k ) , � , � ) � � p ( y t | x t | t { } � p ( x t | Y t , � , � ) ( i ) x t | t sample from target: • Flexible and configurable, e.g adaptive ensemble • EFFICIENT PROPOSALS ARE KEY, e.g prior, or from EnKF? Filter State Estimates Time series of median and percentiles Distributions

Example SIR Results from Physical Biogeochemical Model Comparison of SMC Methods : Convergence of Distributions p ( x t | y 1: t )log p ( x t | y 1: t ) � <K-L divergence> = p ( x t | y 1: t ) dx t � Figure : Convergence to “exact” solution for different SMC methods

Convergence of Moments (M-H MCMC) Parameter Estimation via Likelihood The likelihood arising from the state space model* is T T � � � L ( � | Y T ) = p ( Y T | � ) = p ( y t | Y t � 1 , � ) = p ( y t | x t , � ) p ( x t | Y t � 1 , � ) dx t t = 1 t = 1 From sequential MC filter we can compute predictive density ( i ) } ~ p ( x t | Y t � 1 , � ) { x t | t � 1 and so compute the likelihood as ( ) T L ( � | Y T ) � 1 � ( i ) , � n � p ( y t | x t | t � 1 i = 1 n t = 1 *assume � is given, and suppress the explicit dependence)

Distributions for Parameters Likelihood Posterior: using prior info Parameter identifiability issues priors ‘focus’ the likelihood (sample based) likelihood surface is ‘rough’ � challenge for optimizers (stochastic gradients) Parameter Estimation via State Augmentation � � x t = x t � � � Idea : Append state to include parameters, � t � � � t = � t � 1 Specify p( � 0 ) and allow parameter to evolve as Choose � T as estimate for parameter For practical implementation with finite sample, we must { � 0 ( i ) } ~ p ( � ) (1) Specify initial ensemble (including dependence structure between the parameters) { � t ( i ) } (2) At each t , introduce smoothed bootstrap of (with dispersion correction) to generate diversity in parameter ensemble, while maintaining distributional properties. Easy and seems to work in practice, little theoretical guidance on convergence. Does not seem to work well with EnKF

State Augmentation Example Parameter values from Trace plot of one realization 2000 realizations State Reconstruction by Smoother • Fixed interval smoother using optimal parameters. • Uses forward M-H MCMC filter • Smoother realizations provided by backwards sweep smoother algorithm of Godsill et al (2004)

Remarks and Outstanding Issues on Fully Bayesian DA via Sequential Monte Carlo 1. Sequential MC approaches allow for state and parameter estimation in nonlinear nonGaussian dynamic systems. Wide variety available (bootstrap or MCMC) and easy to implement, but computationally …. 2. Static parameter estimation in SDEs outstanding statistical issue. Likelihood (via predictive density). State augmentation (via filter density). EM algorithm (via smoother density) 3. Effective stochastic simulation (integration) and specification of model errors a key feature. 4. Adaptation for (large dimension) dynamical systems! � need small ensembles (100-1000) to represent large dimensional state space. Efficient proposal distribution is paramount , e.g use information flow via dynamics. 5. Methods for computationally efficient smoothing also needed. 6. Information based metrics for assessing improvements and comparing approaches

Sequential Monte Carlo Methods for State and and Parameter - PDF document

Sequential Monte Carlo Methods for State and and Parameter Estimation (with application to ocean biogeochemistry) Michael Dowd Dept of Mathematics and Statistics Dalhousie University Halifax, N.S., Canada BIRS Data Assimilation Workshop,

Monte Carlo Methods Guojin Chen Christopher Cprek Chris Rambicure Monte Carlo Methods 1.

Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P .

Chapter 5: Monte Carlo Methods Monte Carlo methods are learning methods Experience

Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include:

BROCHURE 2019 TETRA JUICES DEL MONTE DEL MONTE 6 x 1L GOLD PINEAPPLE 6 x 1L 6 x 1L 6 x 1L

Monte Carlo Estimation 7 January 2019 OSU CSE 1 Monte Carlo Methods Class of computational

Monte Carlo Methods for physically based Volume rendering Monte Carlo Methods for physically based

Monte Carlo methods for volumetric light transport Monte Carlo methods for volumetric light

Monte Carlo Methods An introduction to Monte Carlo (MC) methods How to use MC methods

Sequential Monte Carlo Methods Click to edit Master text styles Click to edit Master text

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Sequential Monte Carlo Dr. Jarad Niemi STAT 615 - Iowa State University October 20, 2017 Jarad

4. THE MONTE CARLO METHOD 4.1 I ntroduction This chapter is aimed at describing the Monte Carlo

Draft Introduction to (randomized) quasi-Monte Carlo Pierre LEcuyer MCQMC Conference,

Monte Carlo Localization Ximing Yu March 24, 2009 Ximing Yu Monte Carlo Localization 1

Monte Carlo Control CMPUT 366: Intelligent Systems S&B 5.3-5.5, 5.7 Lecture Outline 1.

Machine learning techniques in predicting uncertainty of environmental models Dimitri Solomatine

Portable Monte Carlo Transport Performance Evaluation in the PATMOS Prototype Tao CHANG 1

An Agent-Based Boom-Bust Business Cycle Model with Search-for-Yield and Heterogeneous

Kevin McLaughlin Outline Advance of Fab technologies and the evolution of raw materials for

Statistical Thermodynamics of Polymers with a Biophysics Emphasis Continued development of

Aim Provide a strategic overview of how simulation can enhance individual training scheduling

What I will Show You Today (in 10 Minutes!) PLS has no advantage at small sample size Not

Adoption of Curricular and Instructional Materials Montgomery County Board of Education January

Sequential Monte Carlo Methods for State and and Parameter - PDF document

Sequential Monte Carlo Methods for State and and Parameter Estimation (with application to ocean biogeochemistry) Michael Dowd Dept of Mathematics and Statistics Dalhousie University Halifax, N.S., Canada BIRS Data Assimilation Workshop,

Monte Carlo Methods Guojin Chen Christopher Cprek Chris Rambicure Monte Carlo Methods 1.

Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P .

Chapter 5: Monte Carlo Methods Monte Carlo methods are learning methods Experience

Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include:

BROCHURE 2019 TETRA JUICES DEL MONTE DEL MONTE 6 x 1L GOLD PINEAPPLE 6 x 1L 6 x 1L 6 x 1L

Monte Carlo Estimation 7 January 2019 OSU CSE 1 Monte Carlo Methods Class of computational

Monte Carlo Methods for physically based Volume rendering Monte Carlo Methods for physically based

Monte Carlo methods for volumetric light transport Monte Carlo methods for volumetric light

Monte Carlo Methods An introduction to Monte Carlo (MC) methods How to use MC methods

Sequential Monte Carlo Methods Click to edit Master text styles Click to edit Master text

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Sequential Monte Carlo Dr. Jarad Niemi STAT 615 - Iowa State University October 20, 2017 Jarad

4. THE MONTE CARLO METHOD 4.1 I ntroduction This chapter is aimed at describing the Monte Carlo

Draft Introduction to (randomized) quasi-Monte Carlo Pierre LEcuyer MCQMC Conference,

Monte Carlo Localization Ximing Yu March 24, 2009 Ximing Yu Monte Carlo Localization 1

Monte Carlo Control CMPUT 366: Intelligent Systems S&amp;B 5.3-5.5, 5.7 Lecture Outline 1.

Machine learning techniques in predicting uncertainty of environmental models Dimitri Solomatine

Portable Monte Carlo Transport Performance Evaluation in the PATMOS Prototype Tao CHANG 1

An Agent-Based Boom-Bust Business Cycle Model with Search-for-Yield and Heterogeneous

Kevin McLaughlin Outline Advance of Fab technologies and the evolution of raw materials for

Statistical Thermodynamics of Polymers with a Biophysics Emphasis Continued development of

Aim Provide a strategic overview of how simulation can enhance individual training scheduling

What I will Show You Today (in 10 Minutes!) PLS has no advantage at small sample size Not

Adoption of Curricular and Instructional Materials Montgomery County Board of Education January

Monte Carlo Control CMPUT 366: Intelligent Systems S&B 5.3-5.5, 5.7 Lecture Outline 1.