krylov subspace methods for perron frobenius operators in
play

Krylov subspace methods for Perron-Frobenius operators in RKHS Yuka - PowerPoint PPT Presentation

Krylov subspace methods for Perron-Frobenius operators in RKHS Yuka Hashimoto Takashi Nodera NTT Network Technology Laboratories / Graduate School of Science and Technology, Keio University Faculty of Science and Technology, Keio


  1. Krylov subspace methods for Perron-Frobenius operators in RKHS Yuka Hashimoto ∗ Takashi Nodera † ∗ NTT Network Technology Laboratories / Graduate School of Science and Technology, Keio University † Faculty of Science and Technology, Keio University 2020/2/11

  2. Abstract • We consider numerical estimations of Perron-Frobenius (P-F) operators in RKHS. • A P-F operator is a linear operator which describes the time evolution of a dynamical system. • Recently, using P-F operators for time-series data analysis have been actively researched. • We investigate theoretical analyses of Krylov subspace methods for estimating P-F operators. Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 2 / 17

  3. Contents 1. Background 2. Existing Krylov subspace methods for Perron-Frobenius operators 3. Difference from classical settings 4. New analyses of the Krylov subspace methods 5. Numerical experiments 6. Conclusion Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 3 / 17

  4. Dynamical systems with random noise ( Ω, F ) : A measurable space, ( X , B ) : A Borel measurable and locally compact Hausdorff vector space, X t , ξ t : random variables from Ω to X , { ξ t } : An i.i.d. stochastic process corresponds to the random noise in X ( ξ t is also independent of X t ), h : X → X (nonlinear in general) Dynamical system with random noise X t +1 = h ( X t ) + ξ t , (1) P : A probability measure on Ω , X t Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 4 / 17

  5. Dynamical systems with random noise ( Ω, F ) : A measurable space, ( X , B ) : A Borel measurable and locally compact Hausdorff vector space, X t , ξ t : random variables from Ω to X , { ξ t } : An i.i.d. stochastic process corresponds to the random noise in X ( ξ t is also independent of X t ), h : X → X (nonlinear in general) Dynamical system with random noise X t +1 = h ( X t ) + ξ t , (1) X t Transform X t ∗ P , P : A probability measure on Ω , where X t ∗ P ( B ) = P ( X t − 1 ( B )) for B ∈ B : The push forward measure Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 4 / 17

  6. Dynamical systems with random noise ( Ω, F ) : A measurable space, ( X , B ) : A Borel measurable and locally compact Hausdorff vector space, X t , ξ t : random variables from Ω to X , { ξ t } : An i.i.d. stochastic process corresponds to the random noise in X ( ξ t is also independent of X t ), h : X → X (nonlinear in general) Dynamical system with random noise X t +1 = h ( X t ) + ξ t , (1) X t Transform X t ∗ P , P : A probability measure on Ω , where X t ∗ P ( B ) = P ( X t − 1 ( B )) for B ∈ B : The push forward measure X t +1 ∗ P = β t ∗ ( X t ∗ P ⊗ P ) , (2) where β t ( x, ω ) = h ( x ) + ξ t ( ω ) linear Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 4 / 17

  7. RKHS and kernel mean embedding To define an inner product between measures, we employ the theory of RKHSs and kernel mean embeddings. k : A positive definite kernel, ϕ ( x ) = k ( x, · ) : The feature map H k = { ∑ m − 1 t =0 c t ϕ ( x t ) | m ∈ N , x t ∈ X , c t ∈ C } : The RKHS M ( X ) : The space of all the finite signed Borel measures on X Φ : M ( X ) → H k µ �→ ∫ x ∈X ϕ ( x ) dµ ( x ) : The kernel mean embedding Feature map ϕ x ϕ ( x ) µ Φ( µ ) Kernel mean embedding Φ X (Usually a finite dimensional sp.) H k (An infinite dimensional Hilbert sp.) ∫ ∫ ⟨ Φ( µ ) , Φ( ν ) ⟩ = x ∈X k ( x, y ) dµ ( x ) dν ( y ) y ∈X : The inner product between Φ( µ ) and Φ( ν ) Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 5 / 17

  8. Perron-Frobenius operators in RKHSs Linear relation between X t ∗ P and X t +1 ∗ P X t +1 ∗ P = β t ∗ ( X t ∗ P ⊗ P ) (2) Definition 1 (Perron Frobenius operator in RKHS) An operator K : Φ( M ( X )) → H k is called a Perron-Frobenius operator in H k if it satisfies K Φ( µ ) := Φ( β t ∗ ( µ ⊗ P )) , (3) for µ ∈ M ( X ) . It can be shown that: • K is well-defined with some mild conditions of k . • K is linear. • K does not depend on time t . Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 6 / 17

  9. Construction of a Krylov subspace of the P-F operator { x 0 , x 1 , . . . , x T } ⊆ X : observed time-series data t,N := 1 /N ∑ N − 1 µ S i =0 δ x t + iS ( t = 0 , . . . , m ) : empirical measures (We will drop superscript S for simplicity) δ x : Dirc measure of x ∈ X Assumptions 1. µ t,N converge to a finite Borel measure µ t weakly as N → ∞ for t = 0 , . . . m ∑ N − 1 2. lim N →∞ 1 ∫ ω ∈ Ω f ( h ( x t + iS ) + ξ t ( ω )) dP ( ω ) (Space average) i =0 N ∑ N − 1 = lim N →∞ 1 i =0 f ( h ( x t + iS ) + ξ t + iS ( η )) (Time average) N a.s. η ∈ Ω If K is bounded, K Φ( µ t ) = Φ( µ t +1 ) ( t = 0 , . . . , m − 1) (4) Span { Φ( µ 0 ) , . . . , Φ( µ m − 1 ) } = Span { Φ( µ 0 ) , K Φ( µ 0 ) , . . . , K m − 1 Φ( µ 0 ) } : Krylov subspace of K and Φ( µ 0 ) , denoted as K m ( K, Φ( µ 0 )) Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 7 / 17

  10. Numerical estimation for the P-F operator 1 K m ( K, Φ( µ 0 )) = Span { Φ( µ 0 ) , . . . , Φ( µ m − 1 ) } · · · constructed only with observed data [Φ( µ 0 ) , . . . , Φ( µ m − 1 )] = Q m R m : QR decomposition Proposition 1 (Numerical estimation of K ) Let K m := Q ∗ m KQ m , the operator projected onto K m ( K, Φ( µ S 0 )) . Then, K m is represented only with observed data as: K m = Q ∗ [Φ( µ 0 ) , . . . , Φ( µ m − 1 )] R − 1 (5) m For application, we want to know the time evolution of the dynamical Observable system at some time t > T → Estimate Kv for v = ϕ ( x t ) at time t ( T : The number of observables in the time-series data) Arnoldi approximation of Kv Kv ≈ Q T K m Q ∗ T v (6) 1 Hashimoto et al., arXiv:1909.03634v3, 2019. Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 8 / 17

  11. Unboundedness of the P-F operator In fact, the P-F operators can be unbounded 2 . In this case, the Arnoldi approximation K m does not converge to K as m → ∞ . K (Unbounded) Transform ( γI − K ) − 1 (Bounded), where γ / ∈ Λ( K ) u t,N = ∑ t ( t ) ( − 1) i γ t − i Φ( µ i,N ) , lim N →∞ u t,N = u t i =0 i ( γI − K ) − 1 u t +1 = u t (7) K m (( γI − K ) − 1 , u S m ) = Span { u 1 , . . . , u m } [ u S 1 , . . . , u S m ] = Q m R m : QR decomposition L m := Q ∗ T ( γI − K ) − 1 Q T = Q ∗ T [ u 1 , . . . , u m ] R − 1 m Shift-invert Arnoldi approximation of Kv K m := γ I − L − 1 Kv ≈ Q T K m Q ∗ T v, (8) m 2 Ikeda, Ishikawa and Sawano, arXiv:1911.11992, 2019. Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 9 / 17

  12. Difference from classical settings Our setting with P-F operators is different from classical ones in numerical linear algebra. Although the analyses for classical settings have been actively investigated, those for P-F operators have not been investigated. The Classical setting with a linear operator A Our setting with a P-F operator K (A typical example of A • Data driven approach : Laplace operator) • K is not given , instead, • Model driven approach observed data are given • Operators are given • Kv for a vector v have to be • Av for a vector v can be estimated by data computed Data Data Model Estimate Model Behavior Extract of the model New analyses for the Krylov subspace methods for P-F operators are needed Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 10 / 17

  13. Residual of a Krylov approximation Krylov approximations for classical settings have a strong connection with their residuals. → We also investigate a connection of the Krylov approximations of P-F operators with their residuals. Kv ≈ u m → ∥ v − K − 1 u m ∥ : residual of u m The steps of our analysis : 1. Find a minimizer of the residual 2. Derive the relation between the residual of the Arnoldi approximation and that of the minimizer Theorem 1 (Minimizer of the Residual) u m := Q m +1 K m +1 Q ∗ m +1 Q m Q ∗ Let ˜ m v . Then, ˜ u m minimizes the residual in K m ( K, Φ( µ 1 )) , that is: ∥ v − K − 1 u ∥ . u m = ˜ arg min (9) u ∈K m ( K, Φ( µ 1 )) Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 11 / 17

  14. Residual of a Krylov approximation Theorem 1 (Minimizer of the Residual) u m := Q m +1 K m +1 Q ∗ m +1 Q m Q ∗ Let ˜ m v . Then, ˜ u m minimizes the residual in K m ( K, Φ( µ 1 )) , that is: ∥ v − K − 1 u ∥ . u m = ˜ arg min (9) u ∈K m ( K, Φ( µ 1 )) Q m Q ∗ Proof. m v =: p m − 1 ( K )Φ( µ 0 ) ∈ K m ( K, Φ( µ 0 )) = Span { Φ( µ 0 ) , K Φ( µ 0 ) , . . . , K m − 1 Φ( µ 0 ) } Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 12 / 17

  15. Residual of a Krylov approximation Theorem 1 (Minimizer of the Residual) u m := Q m +1 K m +1 Q ∗ m +1 Q m Q ∗ Let ˜ m v . Then, ˜ u m minimizes the residual in K m ( K, Φ( µ 1 )) , that is: ∥ v − K − 1 u ∥ . u m = ˜ arg min (9) u ∈K m ( K, Φ( µ 1 )) Q m Q ∗ Proof. m v =: p m − 1 ( K )Φ( µ 0 ) Projection onto K m ( K, Φ( µ 0 )) ∥ v − u ∥ = Q m Q ∗ arg min m v u ∈K m ( K, Φ( µ 0 )) (10) Krylov subspace methods for P-F operators in RKHSs Y. H. and T. N. 12 / 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend