Data-driven reduced model construction with the time-domain Loewner framework and operator inference
Benjamin Peherstorfer Courant Institute of Mathematical Sciences New York University in collaboration with Serkan Gugercin and Karen Willcox
1 / 37
Data-driven reduced model construction with the time-domain Loewner - - PowerPoint PPT Presentation
Data-driven reduced model construction with the time-domain Loewner framework and operator inference Benjamin Peherstorfer Courant Institute of Mathematical Sciences New York University in collaboration with Serkan Gugercin and Karen Willcox
1 / 37
2 / 37
◮ Approximate QoI of high-fidelity model ◮ Often significantly reduced costs
◮ Drop nonlinear terms, average ◮ Early-stopping schemes
◮ Learn input-QoI map induced by hi-fi model ◮ SVMs, Gaussian processes, neural networks
◮ Approximate state xk in low-dim space ◮ Project E, A, B, C onto low-dim space ◮ POD, interpolatory model reduction, RBM RN x1 x2 xK
3 / 37
◮ Approximate QoI of high-fidelity model ◮ Often significantly reduced costs
◮ Drop nonlinear terms, average ◮ Early-stopping schemes
◮ Learn input-QoI map induced by hi-fi model ◮ SVMs, Gaussian processes, neural networks
◮ Approximate state xk in low-dim space ◮ Project E, A, B, C onto low-dim space ◮ POD, interpolatory model reduction, RBM RN x1 x2 xK
3 / 37
◮ Time-discrete single-input-single-output (SISO) LTI system ◮ System matrices E, A ∈ RN×N, B ∈ RN×1, C ∈ R1×N ◮ Input uk and output yk at time step tk, k ∈ N ◮ State xk at time step tk, k ∈ N ◮ Asymptotically stable
◮ High-dimensional state xk makes computing QoI expensive ◮ Repeated QoI computations can become prohibitively expensive
◮ Uncertainty propagation, statistical inference, control, . . . 4 / 37
◮ Proper orthogonal decomposition (POD) ◮ Interpolatory model reduction ◮ Reduced basis method (RBM) ◮ ... RN x1 x2 xK
N×N
N×N
N×p
q×N
q×n
5 / 37
◮ Proper orthogonal decomposition (POD) ◮ Interpolatory model reduction ◮ Reduced basis method (RBM) ◮ ... RN x1 x2 xK
N×N
N×N
N×p
q×N
q×n
5 / 37
◮ Operators E, A, B, C unavailable ◮ Time-step model Σ with inputs to obtain outputs and states
6 / 37
◮ Operators E, A, B, C unavailable ◮ Time-step model Σ with inputs to obtain outputs and states
6 / 37
◮ Operators E, A, B, C unavailable ◮ Time-step model Σ with inputs to obtain outputs and states
6 / 37
◮ System identification [Ljung, 1987], [Viberg, 1995], [Qin, 2006], . . . ◮ Eigensystem realization [Kung, 1978], [Mendel, 1981], [Juang, 1985], [Kramer & Gugercin,
2016], . . .
◮ Finite impulse response estimation [Rabiner et al., 1978], [Mendel, 1991],
[Abed-Meraim, 1997, 2010], . . .
◮ Loewner framework [Antoulas, Anderson, 1986], [Lefteriu, Antoulas, 2010], [Mayo, Antoulas,
2007], [Beattie, Gugercin, 2012], [Ionita, Antoulas, 2012], . . .
◮ Vector fitting [Drmac, Gugercin, Beattie, 2015a], [Drmac, Gugercin, Beattie, 2015b], . . .
◮ Learning models with dynamic mode decomposition [Tu et al., 2013],
[Proctor, Brunton, Kutz, 2016], [Brunton, Brunton, Proctor, Kutz, 2016], . . .
◮ Learning models [Chung, Chung, 2014], [Xie, Mohebujjaman, Rebholz, Iliescu, 2017], . . . ◮ Sparse identification [Brunton, Proctor, Kutz, 2016], . . .
◮ Gaussian process regression, support vector machines, . . .
7 / 37
8 / 37
|z|=1
9 / 37
10 / 37
◮ No access to E, A, B, C required ◮ Requires transfer function values (frequency-response data)
[Antoulas, Anderson, 1986] [Lefteriu, Antoulas, 2010] [Mayo, Antoulas, 2007] 11 / 37
◮ Given inputs u = [u0, u1, . . . , uK−1]T ∈ CK ◮ Compute outputs y = [y0, y1, . . . , yK−1]T ∈ CK via time stepping ◮ Transfer function H unavailable (E, A, B, C, xk unavailable as well)
◮ Given are interpolation points z1, . . . , zm ◮ Perform single time-domain simulation of Σ to steady state ◮ Derive approximate ˆ
◮ Construct ˆ
full model
classical Loewner model
time-domain Loewner model
12 / 37
2πj K i ,
K−1
i ,
r
i ,
k
k
r
i
r
i k
i
13 / 37
r
i k
i
r
i ,
∞
◮ Problem-dependent rate of decay of error |H(z) − Hk(z)| ◮ Slow decay of error if many time steps to reach steady state
14 / 37
r
i ,
ˆ H1,..., ˆ Hr ∈C K−1
r
i
Fourier component
◮ Controls the time step from which on Hk(qi) sufficiently accurate ◮ Asymptotic analysis confirms that kmin is problem-dependent ◮ If kmin ≤ K − r, then system overdetermined
15 / 37
◮ Restricted by non-zero Fourier coefficients of input ◮ Number of time steps K determines frequency range
16 / 37
1e-14 1e-12 1e-10 1e-08 1e-06 1e-04 1e-02 1e+00 0.01 0.1 1 relative error spectral radius ρ freq ω1 = 0.12566 freq ω2 = 0.37699 freq ω3 = 1.00531 freq ω4 = 2.51327 freq ω5 = 6.15752 1e-14 1e-12 1e-10 1e-08 1e-06 1e-04 1e-02 1e+00 0.01 0.1 1 relative error spectral radius ρ kmin = ⌊3/4K⌋ kmin = ⌊1/2K⌋ kmin = ⌊1/4K⌋ (a) dependence on ρ, K = 50 (b) dependence on kmin, K = 50 ◮ Synthetic example where we can control ρ ◮ Relative error of approximate transfer function values
◮ A large spectral radius leads to larger error for fixed K ◮ Large kmin avoids early, inaccurate transfer function approximations ◮ Setting kmin too large, leads to ill-conditioned least-squares problem
17 / 37
1e-14 1e-12 1e-10 1e-08 1e-06 1e-04 1e-02 1e+00 0.01 0.1 1 relative error spectral radius ρ freq ω1 = 0.12566 freq ω2 = 0.37699 freq ω3 = 1.00531 freq ω4 = 2.51327 freq ω5 = 6.15752 1e-14 1e-12 1e-10 1e-08 1e-06 1e-04 1e-02 1e+00 0.01 0.1 1 relative error spectral radius ρ kmin = ⌊3/4K⌋ kmin = ⌊1/2K⌋ kmin = ⌊1/4K⌋ (a) dependence on ρ, K = 100 (b) dependence on kmin, K = 100 ◮ Synthetic example where we can control ρ ◮ Relative error of approximate transfer function values
◮ A large spectral radius leads to larger error for fixed K ◮ Large kmin avoids early, inaccurate transfer function approximations ◮ Setting kmin too large, leads to ill-conditioned least-squares problem
17 / 37
◮ Order of system is N = 598 ◮ Discretize with 4th-order scheme ◮ Time step size δt = 10−1 and K = 103
◮ Dimension of reduced model n = 5 ◮ Set kmin = ⌊1/4K⌋ ◮ Select m logarithmic interpolation pts
◮ Input uk at time tk, k = 0, . . . , K − 1
m
il ◮ Simulate full model Σ once
1e+01 1e+02 1e+03 1e+04 1e-02 1e-01 1e+00 magnitude frequency ω full model
(a) magnitude
0e+00 1e-02 1e-01 1e+00 phase frequency ω full model
(b) phase
http://slicot.org/20-site/126-benchmark-examples-for-model-reduction 18 / 37
1e+01 1e+02 1e+03 1e+04 1e-02 1e-01 1e+00 magnitude frequency ω [rad/s] full model classical Loewner time Loewner −3 −2.5 −2 −1.5 −1 −0.5 1e-02 1e-01 1e+00 phase frequency ω [rad/s] full model classical Loewner time Loewner (a) magnitude (b) phase
H− ˜ HH2 HH2 H− ˆ HH2 HH2 ˜ H− ˆ HH2 HH2 H− ˜ HH∞ HH∞ H− ˆ HH∞ HH∞ ˜ H− ˆ HH∞ HH∞
◮ Construct time-domain Loewner from single trajectory ◮ Magnitude of transfer function matched well; slight difference in
◮ Time-domain (& classical) Loewner model are asymptotically stable
19 / 37
◮ Order of system is N = 1006 ◮ Discretize in time with implicit Euler ◮ Time step size δt = 10−4 ◮ Number of time steps K = 106
◮ Dimension of reduced model n = 10 ◮ Set kmin = ⌊1/4K⌋ ◮ Select m logarithmic interpolation pts ◮ Construct input as in Eady example ◮ Simulate full model Σ once
1e-01 1e+00 1e+01 1e+02 1e-04 1e-03 1e-02 1e-01 1e+00 magnitude frequency ω full model
(a) magnitude
0e+00 1e+00 2e+00 3e+00 1e-04 1e-03 1e-02 1e-01 1e+00 phase frequency ω full model
(b) phase
[Penzl, 2006], [Ionita, 2013] 20 / 37
1e-01 1e+00 1e+01 1e+02 1e-04 1e-03 1e-02 1e-01 1e+00 magnitude frequency ω [rad/s] full model classical Loewner time Loewner −3 −2 −1 1 2 3 1e-04 1e-03 1e-02 1e-01 1e+00 phase frequency ω [rad/s] full model classical Loewner time Loewner (a) magnitude (b) phase ◮ Number of interpolation points m = 64 ◮ Test points logarithmically distributed in range [10−4, 1] ◮ Time-domain Loewner matches classical Loewner model well
21 / 37
−0.04 −0.03 −0.02 −0.01 0.01 0.02 0.03 0.04 0.98 0.99 1 imaginary part real part classical L. time L. 0.940 0.950 0.960 0.970 0.980 0.990 1.000 1.010 1 2 3 4 5 6 7 8 9 10 magnitude of eigenvalue index of eigenvalue classical Loewner time Loewner (a) eigenvalues (b) magnitude of eigenvalues
H− ˜ HH2 HH2 H− ˆ HH2 HH2 ˜ H− ˆ HH2 HH2 H− ˜ HH∞ HH∞ H− ˆ HH∞ HH∞ ˜ H− ˆ HH∞ HH∞
◮ Time-domain Loewner model matches poles of classical Loewner ◮ Time-domain (& classical) Loewner model are asymptotically stable
22 / 37
◮ Full 3D finite-element model of beam ◮ Force applied at tip of beam ◮ Implicit Euler, δt = 10−4, K = 106
◮ Dimension of reduced model n = 8 ◮ Select m = 150 interpolation points ◮ Same kmin and input as in Eady ◮ Simulate full model Σ once force
(a) geometry of beam
[Panzer et al., 2009]
(b) beam at time step 4452 (c) beam at time step 5061
23 / 37
◮ The kmin significantly influences the condition number ◮ Conservative choice seems sufficient in practice
24 / 37
1e-07 1e-06 1e-05 1e-04 1e-03 1e-02 1e-01 1e-04 1e-03 1e-02 1e-01 1e+00 magnitude frequency ω [rad/s] full model classical Loewner time Loewner −3 −2 −1 1 2 3 1e-04 1e-03 1e-02 1e-01 1e+00 phase frequency ω [rad/s] full model classical Loewner time Loewner (a) magnitude (b) phase ◮ Time-domain Loewner model matches transfer function well ◮ Differences can be seen for high frequencies
25 / 37
1e-15 1e-14 1e-13 1e-12 1e-11 1e-10 1e-09 1e-08 1e-07 1e-06 1e-05 1e-04 1e-03 1e-02 1e-01 1e+00 abs error frequency ω [rad/s] classical Loewner time Loewner
0e+00 1e+02 2000 4000 6000 8000 10000
0e+00 1e+02 2000 4000 6000 8000 10000
time step classical Loewner
time step time Loewner (a) absolute error (b) output
H− ˜ HH2 HH2 H− ˆ HH2 HH2 ˜ H− ˆ HH2 HH2 H− ˜ HH∞ HH∞ H− ˆ HH∞ HH∞ ˜ H− ˆ HH∞ HH∞
◮ Absolute error is low for low frequencies ◮ Perform time-domain simulation of reduced model ◮ Output of time-domain Loewner matches output of classical Loewner
26 / 37
−2 −1.5 −1 −0.5 0.5 0e+00 2e+05 4e+05 6e+05 8e+05 1e+06 input time step k 1e-07 1e-06 1e-05 1e-04 1e-03 1e-02 1e-01 1e-04 1e-03 1e-02 1e-01 1e+00 magnitude frequency ω [rad/s] full model classical Loewner time Loewner (a) chirp signal (b) magnitude ◮ Extract input u from “chirp” signal (non-zero Fourier coefficients) ◮ Simulate Σ at u and construct time-domain Loewner model ◮ Time-domain Loewner shows similar behavior as for synthetic input
27 / 37
28 / 37
◮ Time-continuous multiple-input-multiple-output (MIMO) system ◮ System matrices A ∈ RN×N, B ∈ RN×p, C ∈ Rq×N ◮ Input u(t) ∈ Rp and output y(t) ∈ Rq at time t ∈ [0, ∞) ◮ State x(t) ∈ RN at time t ∈ [0, ∞) ◮ Nonlinear term with matrix F ∈ RN×N2
◮ State trajectory
T
◮ Outputs Y =
◮ Inputs U =
29 / 37
full-model trajectories reduced space construct reduced model project full model full-model
assemble full-model trajectories reduced space construct infer
30 / 37
31 / 37
ˆ A,ˆ F, ˆ B K
2 ◮ Transform into n independent least-squares problem ◮ Can be solved efficiently with standard solvers
ˆ C K
2
◮ Need sufficient data ◮ Need that V spans RN for n → N
[Tu et al., 2013], [Chung, Chung, 2014], [Proctor et al., 2016], [Xie, Mohebujjaman, Rebholz, Iliescu, 2017] 32 / 37
◮ Spatial domain ω ∈ [0, 1] and time domain [0, 1] ◮ Parameter µ in parameter domain D = [0.1, 1] ◮ Dirichlet boundary conditions x(0, t; µ) = −x(1, t; µ) = u(t)
33 / 37
1e-06 1e-05 1e-04 1e-03 1e-02 1e-01 1e+00 2 4 6 8 10 12 14 avg error of states dimension n intrusive inference-based 1e-06 1e-05 1e-04 1e-03 1e-02 1e-01 1e+00 2 4 6 8 10 12 14 avg error of states dimension n intrusive inference-based (a) training parameters (b) test parameters ◮ Inferred from µ1, . . . , µ10 and tested on µ11, . . . , µ20 ◮ Error measured with respect to full model ◮ Inferred operators lead to similar error as intrusive operators ◮ For n > 10, inference problem becomes ill-conditioned
34 / 37
◮ Nonlinear convection-diffusion-reaction ◮ Third-order nonlinear term ◮ Damköhler µ ∈ D controls behavior ◮ Implicit Euler, δt = 10−4, K = 5 × 105
◮ Reduced model dimension n = 5 ◮ Inference for parameters µ1, . . . , µ15 ∈ D ◮ Error on parameters µ16, . . . , µ23 ∈ D 1.06 1.08 1.1 1.12 1.14 1.16 1.18 1.2 10 20 30 40 50 temperature time [s] Damköhler µ = 0.163 (a) steady-state 1.06 1.08 1.1 1.12 1.14 1.16 1.18 1.2 10 20 30 40 50 temperature time [s] Damköhler µ = 0.1655 (b) LCO [Heinemann & Poore, 1981], [Zhou, 2012]
35 / 37
0.004 0.008 0.012 0.016 0.02 0.024 0.163 0.164 0.165 LCO amplitude Damköhler number µ intrusive inference-based 0.004 0.008 0.012 0.016 0.02 0.024 0.163 0.164 0.165 LCO amplitude Damköhler number µ intrusive inference-based (a) training parameters (b) test parameters ◮ Inference-based model enters LCO at same µ as intrusive model ◮ Similar behavior of inference-based and intrusive reduced model
36 / 37
1e-07 1e-06 1e-05 1e-04 1e-03 1e-02 1e-01 1e-04 1e-03 1e-02 1e-01 1e+00 magnitude frequency ω [rad/s] full model classical Loewner time Loewner ◮ Nonintrusive construction of reduced model directly from data ◮ Can be seen as data-driven model construction ◮ More details in
37 / 37