Sampling low-dimensional Markovian dynamics for learning certified reduced models from data

Wayne Isaac Tan Uy and Benjamin Peherstorfer
Courant Institute of Mathematical Sciences, New York University
February 2020
Learning dynamical-system models from data
[Schematic: PDE → data → low-dim. model, with the question of error control]
Learn low-dimensional model from data of dynamical system
- Interpretable
- System & control theory
- Fast predictions
- Guarantees for finite data
Recovering reduced models from data

[Schematic: PDE → data → low-dim. model; our approach: error control that is pre-asymptotically guaranteed]

Learn reduced model from trajectories of high-dim. system
- Recover reduced models from data exactly and pre-asymptotically
- Then build on rich theory of model reduction to establish error control
Intro: Polynomial nonlinear terms

Models with polynomial nonlinear terms

    d/dt x(t; µ) = f(x(t; µ), u(t); µ) = Σ_{i=1}^ℓ A_i(µ) x^i(t; µ) + B(µ) u(t)

- Polynomial degree ℓ ∈ N
- Kronecker product x^i(t; µ) = ⊗_{j=1}^i x(t; µ)
- Operators A_i(µ) ∈ R^{N×N^i} for i = 1, …, ℓ
- Input operator B(µ) ∈ R^{N×p}
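To make the notation concrete, here is a minimal NumPy sketch (helper names are illustrative, not from the talk) of evaluating the polynomial right-hand side via Kronecker powers:

    import numpy as np

    def kron_power(x, i):
        # Kronecker power x^i = x ⊗ ... ⊗ x with i factors; a vector of length N^i
        xi = x
        for _ in range(i - 1):
            xi = np.kron(xi, x)
        return xi

    def f_poly(x, u, A_ops, B):
        # Evaluate f(x, u) = sum_{i=1}^ell A_i x^i + B u with A_i in R^{N x N^i}
        out = B @ u
        for i, Ai in enumerate(A_ops, start=1):
            out = out + Ai @ kron_power(x, i)
        return out

The explicit Kronecker powers grow as N^i, which is one reason to work with reduced operators acting on n-dimensional states instead.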
Lifting and transformations
- Lift general nonlinear systems to quadratic-bilinear ones [Gu, 2011], [Benner, Breiten, 2015], [Benner, Goyal, Gugercin, 2018], [Kramer, Willcox, 2019], [Swischuk, Kramer, Huang, Willcox, 2019], [Qian, Kramer, P., Willcox, 2019]
- Koopman lifts nonlinear systems to infinite-dimensional linear systems [Rowley et al., 2009], [Schmid, 2010]
Intro: Beyond polynomial terms (nonintrusive)

[Figure-only slide]
Intro: Parametrized systems

Consider time-invariant system with polynomial nonlinear terms

    d/dt x(t; µ) = f(x(t; µ), u(t); µ) = Σ_{i=1}^ℓ A_i(µ) x^i(t; µ) + B(µ) u(t)

which, after time discretization, reads

    x_{k+1} = f(x_k, u_k) = Σ_{i=1}^ℓ A_i x_k^i + B u_k,   k = 0, …, K − 1

Parameters
- Infer models f̂(·, ·; µ_1), …, f̂(·, ·; µ_M) at parameters µ_1, …, µ_M ∈ D
- For new µ ∈ D, interpolate the operators of f̂(µ_1), …, f̂(µ_M) [Amsallem et al., 2008], [Degroote et al., 2010] (see the sketch below)

Trajectories

    X = [x_1, …, x_K] ∈ R^{N×K},   U = [u_1, …, u_K] ∈ R^{p×K}
Intro: Classical (intrusive) model reduction

Given full model f, construct reduced f̃ via projection

1. Construct n-dim. basis V = [v_1, …, v_n] ∈ R^{N×n}
   - Proper orthogonal decomposition (POD)
   - Interpolatory model reduction
   - Reduced basis method (RBM), …

2. Project full-model operators A_1, …, A_ℓ, B onto reduced space, e.g.,

       Ã_i = V^T A_i (V ⊗ ··· ⊗ V) ∈ R^{n×n^i},   B̃ = V^T B ∈ R^{n×p}

3. Construct reduced model

       x̃_{k+1} = f̃(x̃_k, u_k) = Σ_{i=1}^ℓ Ã_i x̃_k^i + B̃ u_k,   k = 0, …, K − 1

   with n ≪ N and ‖V x̃_k − x_k‖ small in an appropriate norm

[Rozza, Huynh, Patera, 2007], [Benner, Gugercin, Willcox, 2015]
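A minimal sketch of steps 1 and 2, assuming an orthonormal POD basis computed from the SVD of the snapshot matrix (function names are illustrative):

    import numpy as np
    from functools import reduce

    def pod_basis(X, n):
        # POD basis: the n leading left singular vectors of the snapshot matrix X (N x K)
        U, _, _ = np.linalg.svd(X, full_matrices=False)
        return U[:, :n]

    def project_operators(A_ops, B, V):
        # Intrusive Galerkin projection: A~_i = V^T A_i (V ⊗ ... ⊗ V), B~ = V^T B
        A_red = []
        for i, Ai in enumerate(A_ops, start=1):
            Vi = reduce(np.kron, [V] * i)   # V ⊗ ... ⊗ V with i factors
            A_red.append(V.T @ Ai @ Vi)
        return A_red, V.T @ B

Note that the projection step needs the full-model operators A_i and B explicitly; that is what makes this construction intrusive and what the data-driven approach below avoids.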
Our approach: Learn reduced models from data

Sample (gray-box) high-dimensional system with inputs U = [u_0, …, u_{K−1}] to obtain trajectory X = [x_0, x_1, …, x_K]

Learn model f̂ from data U and X:

    x̂_{k+1} = f̂(x̂_k, u_k) = Σ_{i=1}^ℓ Â_i x̂_k^i + B̂ u_k,   k = 0, …, K − 1

[Schematic: gray-box dynamical system E x_{k+1} = A x_k + B u_k, y_k = C x_k, mapping initial condition and inputs to a state trajectory]
Intro: Literature overview

System identification [Ljung, 1987], [Viberg, 1995], [Kramer, Gugercin, 2016], …

Learning in frequency domain [Antoulas, Anderson, 1986], [Lefteriu, Antoulas, 2010], [Antoulas, 2016], [Gustavsen, Semlyen, 1999], [Drmac, Gugercin, Beattie, 2015], [Antoulas, Gosea, Ionita, 2016], [Gosea, Antoulas, 2018], [Benner, Goyal, Van Dooren, 2019], …

Learning from time-domain data (output and state trajectories)
- Time series analysis, (V)AR models [Box et al., 2015], [Aicher et al., 2018, 2019], …
- Learning models with dynamic mode decomposition [Schmid et al., 2008], [Rowley et al., 2009], [Proctor, Brunton, Kutz, 2016], [Benner, Himpe, Mitchell, 2018], …
- Sparse identification [Brunton, Proctor, Kutz, 2016], [Schaeffer et al., 2017, 2018], …
- Deep networks [Raissi, Perdikaris, Karniadakis, 2017ab], [Qin, Wu, Xiu, 2019], …
- Bounds for LTI systems [Campi et al., 2002], [Vidyasagar et al., 2008], …

Correction and data-driven closure modeling
- Closure modeling [Chorin, Stinis, 2006], [Oliver, Moser, 2011], [Parish, Duraisamy, 2015], [Iliescu et al., 2018, 2019], …
- Higher-order dynamic mode decomposition [Le Clainche, Vega, 2017], [Champion et al., 2018]
Outline
- Introduction and motivation
- Operator inference for learning low-dimensional models
- Sampling Markovian data for recovering reduced models
- Rigorous and pre-asymptotic error estimators
- Learning time delays to go beyond Markovian models
- Conclusions
OpInf: Fitting low-dim model to trajectories

1. Construct POD (PCA) basis of dimension n ≪ N

       V = [v_1, …, v_n] ∈ R^{N×n}

2. Project state trajectory onto the reduced space

       X̆ = V^T X = [x̆_1, …, x̆_K] ∈ R^{n×K}

3. Find operators Â_1, …, Â_ℓ, B̂ such that

       x̆_{k+1} ≈ Σ_{i=1}^ℓ Â_i x̆_k^i + B̂ u_k,   k = 0, …, K − 1

   by minimizing the residual in the Euclidean norm

       min_{Â_1,…,Â_ℓ,B̂} Σ_{k=0}^{K−1} ‖ x̆_{k+1} − Σ_{i=1}^ℓ Â_i x̆_k^i − B̂ u_k ‖_2²

[P., Willcox, Data-driven operator inference for nonintrusive projection-based model reduction. Computer Methods in Applied Mechanics and Engineering, 306:196-215, 2016]
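The minimization is a linear least-squares problem in the operator entries. A minimal NumPy sketch (illustrative only; regularization and exploiting redundancy in the Kronecker powers are omitted):

    import numpy as np

    def kron_power(x, i):
        # Kronecker power x^i = x ⊗ ... ⊗ x with i factors
        xi = x
        for _ in range(i - 1):
            xi = np.kron(xi, x)
        return xi

    def operator_inference(Xb, U, ell):
        # Fit A_1, ..., A_ell, B to projected states Xb = [x_0, ..., x_K] (n x K+1)
        # and inputs U = [u_0, ..., u_{K-1}] (p x K) by linear least squares
        n, K = Xb.shape[0], U.shape[1]
        D = np.vstack(
            [np.column_stack([kron_power(Xb[:, k], i) for k in range(K)])
             for i in range(1, ell + 1)] + [U])        # columns [x_k; x_k^2; ...; u_k]
        Y = Xb[:, 1:K + 1]                             # next states x_1, ..., x_K
        O = np.linalg.lstsq(D.T, Y.T, rcond=None)[0].T # O = [A_1, ..., A_ell, B]
        ops, col = [], 0
        for i in range(1, ell + 1):                    # split stacked solution
            ops.append(O[:, col:col + n**i]); col += n**i
        return ops, O[:, col:]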
OpInf: Learning from projected trajectory

Fitting model to projected states
- We fit the model to the projected trajectory X̆ = V^T X
- Would need X̃ = [x̃_1, …, x̃_K] because

      Σ_{k=0}^{K−1} ‖ x̃_{k+1} − Σ_{i=1}^ℓ Ã_i x̃_k^i − B̃ u_k ‖_2² = 0

- However, trajectory X̃ is unavailable

[Plot: 2-norm of states over time step k for the projected trajectory, intrusive model reduction, and OpInf (w/out re-projection)]

Thus, ‖f̂ − f̃‖ being small critically depends on ‖X̆ − X̃‖ being small
- Increase dimension n of the reduced space to decrease ‖X̆ − X̃‖ ⇒ increases degrees of freedom in OpInf ⇒ ill-conditioned
- Decrease dimension n to keep the number of degrees of freedom low ⇒ difference ‖X̆ − X̃‖ increases
OpInf: Closure of linear system

Consider autonomous linear system

    x_{k+1} = A x_k,   x_0 ∈ R^N,   k = 0, …, K − 1

- Split R^N into V = span(V) and V⊥ = span(V⊥), so that R^N = V ⊕ V⊥
- Split state into x_k = V x∥_k + V⊥ x⊥_k with coordinates x∥_k = V^T x_k and x⊥_k = V⊥^T x_k

Represent system as

    x∥_{k+1} = A11 x∥_k + A12 x⊥_k
    x⊥_{k+1} = A21 x∥_k + A22 x⊥_k

with operators

    A11 = V^T A V (= Ã),   A12 = V^T A V⊥,   A21 = V⊥^T A V,   A22 = V⊥^T A V⊥

[Givon, Kupferman, Stuart, 2004], [Chorin, Stinis, 2006], [Parish, Duraisamy, 2017]
OpInf: Closure term as a non-Markovian term

Projected trajectory X̆ mixes dynamics in V and V⊥:

    V^T x_{k+1} = x̆_{k+1} = x∥_{k+1} = A11 x∥_k + A12 x⊥_k

Mori-Zwanzig formalism gives [Givon, Kupferman, Stuart, 2004], [Chorin, Stinis, 2006]

    V^T x_{k+1} = x∥_{k+1} = A11 x∥_k + Σ_{j=0}^{k−1} A12 A22^{k−j−1} A21 x∥_j + A12 A22^k x⊥_0

Non-Markovian (memory) term models unobserved dynamics

[Plot: norm of closure term over time step]
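A minimal sketch of evaluating this representation directly (useful for checking the formula on small examples; the matrix powers are formed explicitly, which is only sensible at toy scale):

    import numpy as np

    def mz_next(A11, A12, A21, A22, xpar_hist, xperp0):
        # Evaluate xpar_{k+1} = A11 xpar_k
        #   + sum_{j=0}^{k-1} A12 A22^{k-j-1} A21 xpar_j   (memory)
        #   + A12 A22^k xperp_0                            (noise)
        # from the history xpar_hist = [xpar_0, ..., xpar_k]
        k = len(xpar_hist) - 1
        nxt = A11 @ xpar_hist[k]
        for j in range(k):  # non-Markovian memory over all past reduced states
            nxt = nxt + A12 @ np.linalg.matrix_power(A22, k - j - 1) @ (A21 @ xpar_hist[j])
        return nxt + A12 @ np.linalg.matrix_power(A22, k) @ xperp0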
Outline
- Introduction and motivation
- Operator inference for learning low-dimensional models
- Sampling Markovian data for recovering reduced models
- Rigorous and pre-asymptotic error estimators
- Learning time delays to go beyond Markovian models
- Conclusions
ReProj: Handling non-Markovian dynamics

Ignore non-Markovian dynamics?
- They have significant impact on model accuracy (much more than in classical model reduction?)
- Guarantees on models?

Fit models with different forms to capture non-Markovian dynamics?
- Length of memory (support of kernel) typically unknown
- Time-delay embeddings increase the dimension of the reduced states, which is what we want to reduce
- Model reduction (theory) mostly considers Markovian reduced models

Our approach: Control length of memory when sampling trajectories
- Set length of memory to 0 for sampling Markovian dynamics
- Increase length of memory in a controlled way (lag is known)
- Modify the sampling scheme, instead of the learning step
- Emphasizes importance of generating the "right" data
ReProj: Avoiding closure

Mori-Zwanzig formalism explains the projected trajectory as

    V^T x_{k+1} = x∥_{k+1} = A11 x∥_k [reduced model] + Σ_{j=0}^{k−1} A12 A22^{k−j−1} A21 x∥_j [memory] + A12 A22^k x⊥_0 [noise]

Sample Markovian dynamics by setting memory and noise to 0
- Set x_0 ∈ V, then x⊥_0 = 0 and the noise term vanishes
- Take a single time step (k = 0), then the memory term is empty

Sample trajectory by re-projecting the state of the previous time step onto V

Establishes "independence"
ReProj: Sampling with re-projection

Data sampling: Cancel non-Markovian terms via re-projection
1. Project initial condition x_0 onto V: x̄_0 = V^T x_0
2. Query high-dim. system for a single time step with V x̄_0: x_1 = f(V x̄_0, u_0)
3. Re-project to obtain x̄_1 = V^T x_1
4. Query high-dim. system with re-projected initial condition V x̄_1: x_2 = f(V x̄_1, u_1)
5. Repeat until end of time-stepping loop (code sketch below)

Obtain trajectories

    X̄ = [x̄_0, …, x̄_{K−1}],   Ȳ = [x̄_1, …, x̄_K],   U = [u_0, …, u_{K−1}]

[P., Sampling low-dimensional Markovian dynamics for pre-asymptotically recovering reduced models from data with operator inference. arXiv:1908.11233, 2019]
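A minimal sketch of the sampling loop, assuming access to a queryable time stepper step(x, u) -> x_next (a hypothetical interface; see the definition of queryable systems below):

    import numpy as np

    def sample_with_reprojection(step, V, x0, U):
        # Re-projection sampling: take a single high-dim. time step from V xbar,
        # project the result back onto span(V), and repeat
        K = U.shape[1]
        xbar = V.T @ x0                       # 1. project initial condition
        Xbar, Ybar = [], []
        for k in range(K):
            Xbar.append(xbar)
            x_next = step(V @ xbar, U[:, k])  # 2./4. single step of the full system
            xbar = V.T @ x_next               # 3. re-project
            Ybar.append(xbar)
        return np.column_stack(Xbar), np.column_stack(Ybar)

Each iteration queries the high-dimensional system for one time step only, so no non-Markovian memory can accumulate between samples.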
ReProj: Operator inference with re-projection

Operator inference with re-projected trajectories

    min_{Â_1,…,Â_ℓ,B̂} ‖ Ȳ − Σ_{i=1}^ℓ Â_i X̄^i − B̂ U ‖_F²

Theorem (Simplified). Consider a time-discrete system with polynomial nonlinear terms of maximal degree ℓ and linear input. If K ≥ Σ_{i=1}^ℓ n^i + 2 and the matrix [X̄, U, X̄², …, X̄^ℓ] has full rank, then X̄ − X̃ = 0 and thus f̂ = f̃ in the sense

    ‖Â_1 − Ã_1‖_F = ··· = ‖Â_ℓ − Ã_ℓ‖_F = ‖B̃ − B̂‖_F = 0

- Pre-asymptotic guarantees, in contrast to learning from projected data
- Re-projection is a nonintrusive operation
- Requires querying high-dim. system twice
- Initial conditions remain "physically meaningful"
- Provides a means to find the model form

[P., Sampling low-dimensional Markovian dynamics for pre-asymptotically recovering reduced models from data with operator inference. arXiv:1908.11233, 2019]
ReProj: Queryable systems

Definition: Queryable systems [Uy, P., 2020]. A dynamical system is queryable if the trajectory X = [x_1, …, x_K] with K ≥ 1 can be computed for initial condition x_0 ∈ V and feasible input trajectory U = [u_1, …, u_K].

Details about how trajectories are computed are unnecessary
- Discretization (FEM, FD, FV, etc.)
- Time-stepping scheme
- Time-step size
- In particular, neither explicit nor implicit access to operators required

Insufficient to have only data available
- Need to query system at re-projected states
- Similar requirement as for active learning

[Schematic: gray-box dynamical system E x_{k+1} = A x_k + B u_k, y_k = C x_k, mapping initial condition and inputs to a state trajectory]
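In code, queryability amounts to nothing more than an interface like the following sketch (hypothetical; any solver wrapped behind such a single-step method qualifies, regardless of its internals):

    from typing import Protocol
    import numpy as np

    class QueryableSystem(Protocol):
        # All that re-projection sampling needs: advance a given state by one
        # time step. Discretization, time-stepping scheme, and operators stay
        # hidden inside the implementation.
        def step(self, x: np.ndarray, u: np.ndarray) -> np.ndarray: ...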
ReProj: Burgers’: Burgers’ example

Viscous Burgers’ equation

    ∂/∂t x(ω, t; µ) + x(ω, t; µ) ∂/∂ω x(ω, t; µ) − µ ∂²/∂ω² x(ω, t; µ) = 0

- Spatial, time, and parameter domain: ω ∈ [0, 1], t ∈ [0, 1], µ ∈ [0.1, 1]
- Dirichlet boundary conditions: x(0, t; µ) = −x(1, t; µ) = u(t)
- Discretize with forward Euler
- Time-step size is δt = 10⁻⁴

[Animation: state over the spatial domain at time steps 1000, 3000, 5000, 7000, and 9000]

Operator inference
- Training data are 2 trajectories with random inputs
- Infer operators for 10 equidistant parameters in [0.1, 1]
- Interpolate inferred operators at 7 test parameters and predict
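For orientation, a rough sketch of one possible forward-Euler discretization of this equation (central differences on a uniform mesh, with the Dirichlet values entering as padding; not necessarily the exact scheme used in the talk):

    import numpy as np

    def burgers_step(x, u, mu, dt, h):
        # One forward-Euler step of viscous Burgers' with x(0) = u, x(1) = -u
        xp = np.concatenate(([u], x, [-u]))                    # pad with boundary values
        conv = xp[1:-1] * (xp[2:] - xp[:-2]) / (2 * h)         # x * dx/dw, central diff.
        diff = mu * (xp[2:] - 2 * xp[1:-1] + xp[:-2]) / h**2   # mu * d2x/dw2
        return x + dt * (diff - conv)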
ReProj: Burgers’: Operator inference

[Plot: avg rel error of states vs. dimension n for intrusive model reduction, OpInf without re-projection, and OpInf with re-projection]

Error of reduced models at test data
- Inferring operators from projected data fails in this example
- Recover reduced model from re-projected data
ReProj: Burgers’: Recovery

[Plot: difference between state trajectories vs. dimension n, with and without re-projection]

The difference between state trajectories
- Model from intrusive model reduction is the same as OpInf with re-projection
- Model learned from state trajectories without re-projection differs
ReProj: Chafee: Chafee-Infante example

Chafee-Infante equation

    ∂/∂t x(ω, t) + x³(ω, t) − ∂²/∂ω² x(ω, t) − x(ω, t) = 0

- Boundary conditions as in [Benner et al., 2018]
- Spatial domain ω ∈ [0, 1]
- Time domain t ∈ [0, 10]
- Forward Euler with δt = 10⁻⁴
- Cubic nonlinear term

[Plot: output over time]

Operator inference
- Infer operators from a single trajectory corresponding to random inputs
- Test inferred model on oscillatory input
ReProj: Chafee: Recovery

[Plot: test error vs. dimension n for intrusive model reduction, OpInf without re-projection, and OpInf with re-projection]

Error of reduced models on test parameters
- Projected data leads to an unstable inferred model
- Inference from data with re-projection shows more stable behavior
Outline
- Introduction and motivation
- Operator inference for learning low-dimensional models
- Sampling Markovian data for recovering reduced models
- Rigorous and pre-asymptotic error estimators
- Learning time delays to go beyond Markovian models
- Conclusions

[Schematic: PDE → data → low-dim. model; our approach: pre-asymptotically guaranteed error control]
ErrEst: Error estimation for learned models

Assumptions*: Symmetric, asymptotically stable linear system
- If not symmetric, then need to assume ‖A_1‖ ≤ 1 (for now…)
- Derive reduced model with operator inference and re-projection
- Requires full residual of reduced-model states in training phase

Error estimation based on [Haasdonk, Ohlberger, 2009]
- Residual at time step k

      r_k = A_1 V x̂_k + B u_k − V x̂_{k+1}

- Bound on state error if initial condition in span{V}

      ‖x_k − V x̂_k‖_2 ≤ C_1 Σ_{i=1}^{k−1} ‖r_i‖_2

- Offline/online splitting of computing the residual norm ‖r_k‖_2:

      ‖r_k‖_2² = x̂_k^T M_1 x̂_k + u_k^T M_2 u_k + x̂_{k+1}^T V^T V x̂_{k+1} + 2 u_k^T M_3 x̂_k − 2 x̂_{k+1}^T Â_1 x̂_k − 2 x̂_{k+1}^T B̂ u_k

  with M_1 = V^T A_1^T A_1 V, M_2 = B^T B, M_3 = B^T A_1 V
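A minimal sketch of the online evaluation, assuming V has orthonormal columns so that x̂_{k+1}^T V^T V x̂_{k+1} = ‖x̂_{k+1}‖²:

    import numpy as np

    def residual_norm_sq(M1, M2, M3, A1_hat, B_hat, xk, xk1, uk):
        # ||r_k||^2 from precomputed low-dim. quantities M1 = V^T A1^T A1 V,
        # M2 = B^T B, M3 = B^T A1 V; no high-dim. operation occurs online
        return (xk @ M1 @ xk + uk @ M2 @ uk + xk1 @ xk1
                + 2 * uk @ (M3 @ xk) - 2 * xk1 @ (A1_hat @ xk) - 2 * xk1 @ (B_hat @ uk))

    def error_bound(C1, res_norms):
        # Running state-error bound C1 * sum_i ||r_i||_2 over time steps
        return C1 * np.cumsum(res_norms)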
ErrEst: Learning error operators from data

From [Haasdonk, Ohlberger, 2009], the residual norm ‖r_k‖_2² depends on the full-model operators only through the low-dimensional quantities M_1 = V^T A_1^T A_1 V, M_2 = B^T B, and M_3 = B^T A_1 V (previous slide)

Query system at training inputs to compute residual trajectories

    R = [r_1, r_2, …, r_K]

Learn quantities M_1, M_2, M_3 via operator inference
- Fit error operators M_1, M_2, M_3 to residual trajectories
- Bound constant C_1 and constants for output error

Obtain certified reduced models from data alone

[Uy, P., Pre-asymptotic error bounds for low-dimensional models learned from systems governed by linear parabolic partial differential equations with control inputs, in preparation, 2020]
ErrEst: Convection-diffusion in a pipe

Governed by parabolic PDE

    ∂x/∂t = ∆x − (1, 1) · ∇x   in Ω
    x = 0   on Γ \ {E_i}_{i=1}^5
    ∇x · n = g_i(t)   on E_i

- Discretize with finite elements
- Degrees of freedom N = 1121
- Forward Euler method with δt = 10⁻⁵
- End time is T = 0.5

Input signals
- Training signal is sinusoidal
- Test signal is an exponentially decaying sinusoid with different frequency than training

[Plot: input signals over time]
ErrEst: Recovering reduced models from data

[Plot: avg rel L2 error of states vs. basis dimension for intrusive model reduction and OpInf with re-projection]

Recover reduced models from data
- Error averaged over time
- Recover reduced model up to numerical errors
ErrEst: Error bounds

[Plot: relative averaged state error over time vs. basis dimension; OpInf error, OpInf bound, and intrusive bound]

Learn certified reduced model from data alone
- Train with sinusoidal and test with exponential input
- Infer quantities from residual of full model (offline/training)
- Estimate error for test inputs
Outline
- Introduction and motivation
- Operator inference for learning low-dimensional models
- Sampling Markovian data for recovering reduced models
- Rigorous and pre-asymptotic error estimators
- Learning time delays to go beyond Markovian models
- Conclusions

[Schematic: intrusive path (project high-dim. model, assemble high-dim. operators) vs. nonintrusive path (infer from high-dim. trajectories) to Markovian and non-Markovian reduced models]
NonM: Non-Markovian reduced models

[Schematic: constructing a non-Markovian reduced model by inference from high-dim. trajectories, in contrast to projecting the high-dim. model]

Learning non-Markovian low-dim. models in model reduction
- (Full model is non-Markovian [Schulze, Unger, Beattie, Gugercin, 2018])
- Closure error is high and needs to be corrected (steep gradients, shocks)
- Only partially observed state trajectory available
NonM: Learning non-Markovian reduced models

With re-projection, exactly learn Markovian reduced model

    x̃_{k+1} = Σ_{i=1}^ℓ Ã_i x̃_k^i + B̃ u_k

However, we lose the dynamics modeled by non-Markovian terms

    x̆_{k+1} = Σ_{i=1}^ℓ Ã_i x̆_k^i + B̃ u_k + Σ_{i=1}^{k−1} ∆_i(x̆_{k−1}, …, x̆_{k−i+1}, u_k, …, u_{k−i+1}) + 0

Learn unresolved dynamics via approximate non-Markovian terms

    x̂_{k+1} = Σ_{i=1}^ℓ Â_i x̂_k^i + B̂ u_k + Σ_{i=1}^{k−1} ∆̂_i^(θ_i)(x̂_{k−1}, …, x̂_{k−i+1}, u_k, …, u_{k−i+1})

- Parametrization θ_i ∈ Θ for i = 0, …, K − 1
- Non-Markovian models extensively used in statistics but less so in MOR
NonM: Sampling with stage-wise re-projection

Learning model operators and non-Markovian terms at the same time ⇒ dynamics mixed, same issues as learning from projected states

Build on re-projection to learn non-Markovian terms stage-wise (code sketch below)
- Sample trajectories of length r + 1 with re-projection

      X̄^(0), …, X̄^(K−1) ∈ R^{n×(r+1)}

- Infer Markovian reduced model f̂_1 from one-step trajectories

      X̄_1^(i) = [x̄_0^(i), x̄_1^(i)],   i = 0, …, K − 1

- Simulate f̂_1 to obtain

      X̂_2^(i) = [x̂_0^(i), x̂_1^(i), x̂_2^(i)],   i = 0, …, K − 1

- Fit parameter θ_1 of non-Markovian term ∆̂_1^(θ_1) to the difference

      min_{θ_1∈Θ} Σ_{i=0}^{K−1} ‖ x̄_2^(i) − x̂_2^(i) − ∆̂_1^(θ_1)(x̄_0^(i), u_i) ‖_2²

- Repeat this r times to learn f̂_r with lag r
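A minimal sketch of one such stage (lag 1), assuming the linear delay parametrization ∆̂_1(x, u) = D_1 x + E_1 u introduced on the next slide:

    import numpy as np

    def fit_delay_stage(Xbar2, Xhat2, Xbar0, U):
        # Fit theta_1 = [D_1, E_1] of the linear delay term D_1 x + E_1 u to the
        # mismatch xbar_2^(i) - xhat_2^(i); columns of the arguments are indexed by i
        R = Xbar2 - Xhat2                    # residual the delay term should explain
        D = np.vstack([Xbar0, U])            # regressors [xbar_0^(i); u_i]
        Theta = np.linalg.lstsq(D.T, R.T, rcond=None)[0].T
        n = Xbar0.shape[0]
        return Theta[:, :n], Theta[:, n:]    # D_1, E_1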
NonM: Learning non-Markovian terms

Parametrization of non-Markovian terms
- Set θ_i = [D_i, E_i] with D_i ∈ R^{n×n} and E_i ∈ R^{n×p}
- Non-Markovian term is

      ∆̂_i^(θ_i)(x̂_{k−1}, …, x̂_{k−i+1}, u_k, …, u_{k−i+1}) = D_i x̂_{k−i+1} + E_i u_{k−i+1}

- Other parametrizations with higher-order terms and neural networks

Choosing maximal lag
- Assumption (observation) is that the non-Markovian term of the system has small support
- Need to go back in time only a few steps
- Lag r can be chosen small

[Plot: norm of non-Markovian term over time steps]
NonM: Learning from partially observed states

Partially observed state trajectories
- Unknown selection operator S ∈ {0, 1}^{N_s×N} with N_s < N and z_k = S x_k
- Learn models from trajectory Z = [z_0, …, z_{K−1}] instead of X = [x_0, …, x_{K−1}]
- Apply POD (PCA) to Z to find basis matrix V of subspace V of R^{N_s}

[Schematic: high-dimensional states x_{i−1}, x_i, x_{i+1} and partially observed states z_{i−1}, z_i, z_{i+1}]

Non-Markovian terms to compensate unobserved state components
- Mori-Zwanzig formalism applies
- Non-Markovian terms compensate unobserved components
NonM: Burgers’: Burgers’ example

Viscous Burgers’ equation

    ∂/∂t x(ω, t; µ) + x(ω, t; µ) ∂/∂ω x(ω, t; µ) − µ ∂²/∂ω² x(ω, t; µ) = 0

- Spatial, time, and parameter domain: ω ∈ [0, 1], t ∈ [0, 1], µ ∈ [0.1, 1]
- Dirichlet boundary conditions: x(0, t; µ) = −x(1, t; µ) = u(t)
- Discretize with forward Euler
- Time-step size is δt = 10⁻⁴

[Animation: state over the spatial domain at time steps 1000, 3000, 5000, 7000, and 9000]

Operator inference
- Training data are 2 trajectories with random inputs
- Infer operators for 10 equidistant parameters in [0.1, 1]
- Interpolate inferred operators at 7 test parameters and predict
NonM: Burgers’: Partial observations

[Plot: avg rel L2 error of states vs. #delays for intrusive model reduction, projection, and the inferred model]

Observe only about 50% of all state components
- Linear time-delay terms with stage-wise re-projection
- Reduces error of inferred model by more than one order of magnitude
NonM: Burgers’: Shock formation
(a) ground truth (full model) (b) intrusive model reduction Modify coefficients of Burgers’ equation to obtain solution with shock
- Solutions with shocks are challenging to reduce with model reduction
- Here, reduced model from intrusive model reduction has oscillatory error
39 / 41
NonM: Burgers’: Capturing shock position

[Plots: shock position over time and error in shock position vs. dimension of reduced model, for intrusive model reduction and OpInf with 0, 4, and 8 delays]

Learn time-delay terms stage-wise with re-projection
- Learn linear time-delay corrections
- In this example, a time delay of order 4 is sufficient to capture the shock
- Higher-order time-delay terms learned in, e.g., [Pan, Duraisamy, 2018]
Conclusions

[Schematics: PDE → data → low-dim. model with pre-asymptotically guaranteed error control; intrusive vs. nonintrusive construction of Markovian and non-Markovian reduced models]

Learning dynamical-system models from data with error guarantees
- Operator inference exactly recovers reduced models from data
- Generating the right data is key to learning reduced models in our case
- Pre-asymptotic guarantees (finite data) under certain conditions
- Going beyond reduced models by learning non-Markovian corrections

References: https://cims.nyu.edu/~pehersto
- Uy, P., Pre-asymptotic error bounds for low-dimensional models learned from systems governed by linear parabolic partial differential equations with control inputs, in preparation, 2020.
- P., Sampling low-dimensional Markovian dynamics for pre-asymptotically recovering reduced models from data with operator inference. arXiv:1908.11233, 2019.
- P., Willcox, Data-driven operator inference for nonintrusive projection-based model reduction. Computer Methods in Applied Mechanics and Engineering, 306:196-215, 2016.