a geometric approach
play

a geometric approach * * ACC09 paper plus other stuff Andrea Censi - PowerPoint PPT Presentation

Kalman filtering with intermittent observations: a geometric approach * * ACC09 paper plus other stuff Andrea Censi Ph.D. student, Control & Dynamical Systems, California Institute of Technology Advisor: Richard Murray Linear/Gaussian


  1. Kalman filtering with intermittent observations: a geometric approach * * ACC’09 paper plus other stuff Andrea Censi Ph.D. student, Control & Dynamical Systems, California Institute of Technology Advisor: Richard Murray

  2. Linear/Gaussian estimation Consider the discrete-time linear dynamical system x ( k + 1 ) = A x ( k ) + B ω ( k ) , y ( k ) = C x ( k ) + v ( k ) , with ω ( k ) and v ( k ) white Gaussian sequences with covariance matrix equal to identity. ■ Let Q � BB ∗ and I � C ∗ C . Then the posterior covariance matrix of the error P ( k ) = cov ( ˆ x ( k ) − x ( k ) | y ( 1 ) , . . . , y ( k )) evolves according to the following deterministic map: � − 1 � ( APA ∗ + Q ) − 1 + I g : P �→ (a Riccati iteration “in compact form” for the lazy researcher) ∞ = lim n → ∞ g n ( P ( 0 )) ■ If ( A , B ) controllable and ( A , C ) detectable, P 2 / 17

  3. ... with intermittent observations ■ [Sinopoli’04] One can model packet drops as follows: y ′ ( k ) = γ ( k ) y ( k ) γ ( k ) ∼ Bernoulli, with probability γ ■ Evolution of P ( k ) as an Iterated Function System [Barnsley] : ◆ execute g if the packet arrives ◆ execute h if the packet does not arrive � � − 1 ( APA ∗ + Q ) − 1 + I �→ p g = γ g : P , APA ∗ + Q , �→ p h = 1 − γ h : P ■ The iteration of P ( k ) is not deterministic: rather than P ∞ , we must speak of the stationary distribution of P . ■ What is the behavior as a function of the arrival probability γ ? 3 / 17

  4. Three contributions 1. Existence of the stationary distribution: ■ Literature: Stationary distribution exists if γ > γ s [Kar’09]. ■ Contribution : If A nonsingular, it always exists. 2. Mean of the stationary distribution: ■ Literature: ◆ E { P } exists if and only if γ > γ c [Sinopoli’04], ◆ γ c not precisely characterized yet. [Mo’08, Plarre’09,...]. ■ Contribution : The intrinsic Riemannian mean always exists. 3. CDF (performance bounds): ■ Literature: Upper and lower bounds on P ( { P ≤ M } ) . [Epstein’05] ■ Contribution: if a certain non-overlapping condition holds, then: ◆ p ( P ) has a fractal support. ◆ P ( { P ≤ M } ) can be found in closed form. 4 / 17

  5. A really useful metric Let P be the set of positive definite matrices of order n . Define: � � 1/2 n log 2 ( λ i ( P − 1 ∑ d ( P 1 , P 2 ) = 1 P 2 )) i = 1 1. ( P , d ) is a complete metric space, with the usual topology. 2. d is invariant to conjugacy. For any invertible matrix A : d ( AP 1 A ∗ , AP 2 A ∗ ) = d ( P 1 , P 2 ) 3. d is invariant to inversion: d ( P − 1 1 , P − 1 2 ) = d ( P 1 , P 2 ) 4. For any two matrices P 1 , P 2 in P , and for any Q ≥ 0, α d ( P 1 + Q , P 2 + Q ) ≤ α + β d ( P 1 , P 2 ) where α = max { λ max ( P 1 ) , λ max ( P 2 ) } and β = λ min ( Q ) . 5 / 17

  6. Contraction properties of g and h ■ The maps g , h are compositions of: conjugations, addition, inversion. � � − 1 ( APA ∗ + Q ) − 1 + I �→ g : P APA ∗ + Q �→ h : P ■ Therefore, they are non-expansive in this metric: (also h !) d ( g ( P 1 ) , g ( P 2 )) ≤ d ( P 1 , P 2 ) d ( h ( P 1 ) , h ( P 2 )) ≤ d ( P 1 , P 2 ) ■ [Bougerol ’93]: If A nonsingular, ( A , C ) observable, ( A , B ) controllable, g n is a strict contraction : d ( g n ( P 1 ) , g n ( P 2 )) ≤ ρ d ( P 1 , P 2 ) , ρ < 1 6 / 17

  7. Existence of stationary distribution ■ Lemma [Barnsley’88]. An iterated function system { f i } where all f i are non-expansive, and at least one is a strict contraction , admits a unique attractive stationary distribution (convergence in distribution). ■ Proposition: A stationary distribution for the covariance always exists if A nonsingular, ( A , C ) observable, ( A , B ) controllable. Proof: After n steps, there are 2 n combinations of g and h . n � �� � hhh · · · h non-expansive ghh · · · h non-expansive . . . hgg · · · g non-expansive ggg · · · g strict contraction Therefore the system satisfies the average-contractivity condition. 7 / 17

  8. Existence of Riemannian mean(s) ■ Does it not bother you that E {·} is not invariant to change of coordinates? √ P } 2 � = E { P − 1 } − 1 E { P } � = E { Qualitative/quantitative results should be invariant of parametrizations. ■ The manifold of positive definite matrices is not flat ; we must be careful. ■ The expectation can be generalized to a Riemannian manifolds M with distance d as follows: � � d 2 ( X , y ) M { X } � arg inf y ∈ M E ■ In our case, different distances give different “critical probabilities”: covariances std devs information Riemannian √ √ � 1 � || P − 1 1 − P − 1 ∑ log 2 ( λ i ( P − 1 2 d = || P 1 − P 2 || F || P 1 − P 2 || F 2 || F 1 P 2 ) γ d c = γ c < γ c none none 8 / 17

  9. Which is more natural? covariances std devs information Riemannian √ √ � 1 � || P − 1 1 − P − 1 ∑ log 2 ( λ i ( P − 1 2 d = || P 1 − P 2 || F || P 1 − P 2 || F 2 || F 1 P 2 ) γ d c = γ c < γ c none none ■ Covariances: Corresponds to the average squared error E {|| e || 2 } . ■ Standard deviations ◆ Corresponds to the average error E {|| e ||} (physical meaning). ■ Information matrices ◆ P − 1 is the natural parametrization for Gaussians. ■ Riemannian distance ◆ Information Geometry interpretation: the natural distance from the Fisher Information Metric on the manifodl of Gaussian distributions. ◆ d ( P 1 , P 2 ) can be linked to the probability of distinguishing the two distributions from the samples [Amari’00]. 9 / 17

  10. Finally the fractals 10 / 17

  11. Cantor set and Cantor function ■ Instructions: take a segment, remove the middle third, repeat. ↑ Cantor Set The CDF is the Cantor Function → ■ The Cantor set is a “fractal”: ◆ it is totally disconnected ◆ it is self-similar ◆ it has non-integer dimension ■ The Cantor function is a singular function (“devil’s staircase”) ◆ continuous ◆ differentiable almost everywhere, with derivative 0 11 / 17

  12. Cantor set ■ Proposition : For some value of the parameters, the distribution of P − 1 is exactly the (scaled/translated) Cantor set. √ ■ Black plot: nominal system A = 1/ 3, Q = 0, I = 1 and packet dropping governed by a Markov chain with transition matrix T = [ α , 1 − α ; 1 − β , β ] , α = β = 0.5. Varying the parameters of the Markov Chain: Varying Q : α = 0.5, β = 0.5; α = 0.3, β = 0.3; α = 0.1, Q = 0; Q = 1; Q = 2; Q = 3. β = 0.1; α = 0.9, β = 0.9. 12 / 17

  13. Fractal properties ■ Proposition: If h and g have disjoint range, then: ◆ the stationary distribution is homeomorphic to the Cantor set, ◆ it is a compact, totally disconnected set ◆ the cumulative distribution function is a singular function ■ If h and g have non-disjoint range, then it might be still fractal. ◆ This is an open problem in number theory. ■ Proposition: If h and g satisfy the stronger condition: h ( P ∞ ) ≥ g ( 0 ) then one can find a closed form solution for P ( { P ≤ M } ) = "ugly" formula involving the "digit representation" of M ◆ This implies that C is invertible (strong condition). ◆ This is also valid for Markov Chain driving the packet drops. 13 / 17

  14. Conclusions Three contributions: ■ The stationary distribution always exists. ■ The intrinsic Riemannian mean always exists. ■ P ( { P ≤ M } ) in closed form with a (strong) non-overlapping condition. Main ideas: ■ Theory of Iterated Function System s — many ready-to-use results in books with very nice fractal illustrations [Barnsley]. ■ A useful Riemannian metric for positive definite matrices — natural Information Geometry metric for manifold of Gaussian distributions ■ Contraction properties of Riccati recursions in this metric. Open problems: ■ Case of a singular A . ■ Computing the Riemannian mean (only proved existence). ■ Computing the CDF P ( { P ≤ M } ) without non-overlapping condition. 14 / 17

  15. Metric spaces properties ■ ( P , d ) is a complete metric space : every Cauchy sequence has a limit in P . ■ Fixed Point Theorem: If f strict contraction mapping: d ( f ( x ) , f ( y )) = q < 1 sup d ( x , y ) x , y and complete metric space, then lim n → ∞ f n ( x ) = x 0 . 15 / 17

  16. Cantor set, more formally ■ Let { 0, 1 } N the Cantor space with the following metric: d ( x , y ) = 2 − k ; k first digit for which x k � = y k Note that with this metric, 0.11111 · · · � = 1.0000 . . . because d ( 0.1, 1.0 ) � = 0. ■ Any set is called “Cantor set” if it is homeomorphic to the Cantor space. 16 / 17

  17. Cantor set, more formally ■ Let { 0, 1 } N represent the arrival sequence. You can write the final covariance as: ϕ : { 0, 1 } N → P �→ arrival sequence final covariance ■ If you can prove that ϕ is invertible, then the range of ϕ is a Cantor set. ■ Non-overlapping condition: If the ranges of h and g are non-overlapping, then ϕ is invertible. ■ Morever: ϕ : { 0, 1 } N P ϕ �→ statistics of packet arrival probability distributions 17 / 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend