Numerical Linear Algebra issues in Singular Spectrum Analysis of - - PowerPoint PPT Presentation

numerical linear algebra issues in singular spectrum
SMART_READER_LITE
LIVE PREVIEW

Numerical Linear Algebra issues in Singular Spectrum Analysis of - - PowerPoint PPT Presentation

Numerical Linear Algebra issues in Singular Spectrum Analysis of time series Dario Fasino with E. Bozzo, R. Carniel University of Udine (Italy) Genova, 2ggALN12 D. Fasino (Udine) NLA issues in SSA 17/02/2012 1 / 18 Singular Spectrum


slide-1
SLIDE 1

Numerical Linear Algebra issues in Singular Spectrum Analysis of time series

Dario Fasino — with E. Bozzo, R. Carniel

University of Udine (Italy)

Genova, 2ggALN’12

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 1 / 18

slide-2
SLIDE 2

Singular Spectrum Analysis (SSA): Introduction

SSA is a quite recent technique for the analysis of experimental time series, based on the SVD of certain Hankel matrices. Let x = (x1, x2, . . . , xℓ)T a finite time series, ℓ = m + n − 1. The m × n Hankel matrix Xm,n =      x1 x2 · · · xn x2 x3 · · · xn+1 . . . . . . . . . . . . xm xm+1 · · · xℓ      is known as trajectory matrix, Xm,n = Tm,n(x).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 2 / 18

slide-3
SLIDE 3

Singular Spectrum Analysis (SSA): Introduction

SSA is a quite recent technique for the analysis of experimental time series, based on the SVD of certain Hankel matrices. Let z ∈ C. Then rank      1 · · · zn−1 z · · · zn . . . . . . . . . zm−1 · · · zℓ−2      = 1. Let xi = pk(i), a k-degree algebraic poly. Then rank    x1 · · · xn . . . . . . . . . xm · · · xℓ    = k. Time series made up by trigonometric, algebraic, exponential terms have small rank trajectory matrices.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 2 / 18

slide-4
SLIDE 4

Singular Spectrum Analysis (SSA): Introduction

SSA idea

Use SVD of trajectory matrices to decompose time series into constant terms, trends, oscillatory components, noise. . . Given x = (x1, . . . , xℓ) and I1, . . . , Ik a partition of {1, . . . , n} do:

1

Build up trajectory matrix: X = Tm,n(x).

2

Compute SVD X = UΣV T; singular triples: (ui, σi, vi).

3

Group triples: X (k) =

i∈Ik σiuivT i .

Note:

k X (k) = X.

4

Hankelization (diagonal averaging): H(k) = H(X (k)). Note:

k H(k) = X.

5

Extract components: x(k) = T −1

m,n(H(k)).

Note:

k x(k) = x.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 3 / 18

slide-5
SLIDE 5

Example

Figure: SSA of a mixture of time series. ℓ = 200, n = 10, . . . , 30. Above: individual time series and respective SVs. Below: composite time series and respective SVs.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 4 / 18

slide-6
SLIDE 6

Example

Figure: SSA of a mixture of time series. ℓ = 200, n = 30. Left: first original component (blue) and its reconstruction (red). Right: second original component (blue) and its reconstruction (red).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 5 / 18

slide-7
SLIDE 7

References

  • N. Golyandina, V. Nekrutkin, A. Zhigljavsky.

Analysis of time series structure. SSA and related techniques. Chapman & Hall/CRC, 2001.

  • R. Carniel et al.

On the singular values decoupling in the Singular Spectrum Analysis of volcanic tremor at Stromboli.

  • Nat. Hazards Earth Syst. Sci., 6 (2006), 903–909.
  • E. Bozzo, R. Carniel, D. F

. Relationship between SSA and Fourier analysis: Theory and application to the monitoring of volcanic activity.

  • Comp. Math. Appl. 60 (2010), 812–820.
  • V. Busoni.

Risultati di tipo perturbativo nell’analisi dello spettro singolare di serie temporali. Tesi di Laurea in Matematica, Universit` a di Udine, 2011.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 6 / 18

slide-8
SLIDE 8

A motivating problem (and our answer)

April 5, 2003: a major paroxism destroyed a seismic station

  • n the Stromboli volcano.

SSA analysis of the sismogram suggests the presence of a consistent preparatory phase. What information is conveyed in the SVs of (not so large) trajectory matrices coming from chaotic time series?

Figure: SSA of a volcanic tremor

sismogram. Singular values of 7200 trajectory matrices X3000,10 (smoothed plot; x-axis in hours)

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 7 / 18

slide-9
SLIDE 9

A motivating problem (and our answer)

Our result: The behaviour of SVs mirrors qualitative modifications in the power spectrum of the time series.

Figure: SSA of a volcanic tremor

sismogram. Singular values of 7200 trajectory matrices X3000,10 (smoothed plot; x-axis in hours)

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 7 / 18

slide-10
SLIDE 10

Stationary time series

Definition

The infinite time series x = (x1, x2, . . .) is called stationary if ∀i, j 0 lim

m→∞

1 m

m

  • k=1

xi+kxj+k = R(i − j), where R : Z → R is the covariance function of x. Equivalently, the covariance matrix Tn = lim

m→∞

1 mX T

m,nXm,n

exists for all n (and is a Toeplitz matrix).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 8 / 18

slide-11
SLIDE 11

Stationary time series

Definition

The infinite time series x = (x1, x2, . . .) is called stationary if ∀i, j 0 lim

m→∞

1 m

m

  • k=1

xi+kxj+k = R(i − j), where R : Z → R is the covariance function of x. By Herglotz theorem, x is a stationary time series iff there exists a (unique, bounded) nondecreasing function µ(t) on I = [0, 2π] such that R(k) = 1 2π

  • I

ei2πkt dµ(t), k ∈ Z.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 8 / 18

slide-12
SLIDE 12

Stationary time series

Definition

The infinite time series x = (x1, x2, . . .) is called stationary if ∀i, j 0 lim

m→∞

1 m

m

  • k=1

xi+kxj+k = R(i − j), where R : Z → R is the covariance function of x. A special case: x is called aperiodic or chaotic whenever R(k) = 1 2π

  • I

ei2πktf(t) dt, k ∈ Z. The function f ∈ L1(I), f 0, is the spectral density of x and the symbol of the Toeplitz matrix sequence {Tn}.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 8 / 18

slide-13
SLIDE 13

Asymptotic distributions

Definition

Two triangular sequences {ξ(n)

i

}i=1...n and {ζ(n)

i

}i=1...n, with n ∈ N, are equally distributed (or asymptotically equidistributed), ξ(n)

i

∼ ζ(n)

i

if, for all continuous functions F having bounded support, lim

n→∞

1 n

n

  • i=1
  • F
  • ξ(n)

i

  • − F
  • ζ(n)

i

  • = 0.

Theorem

Let {Tn} be a sequence of Toeplitz matrices whose symbol f ∈ L1(I) is Riemann-integrable. Then λi(Tn) ∼ f(2πi/n).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 9 / 18

slide-14
SLIDE 14

Asymptotic distributions

Definition

Two triangular sequences {ξ(n)

i

}i=1...n and {ζ(n)

i

}i=1...n, with n ∈ N, are equally distributed (or asymptotically equidistributed), ξ(n)

i

∼ ζ(n)

i

if, for all continuous functions F having bounded support, lim

n→∞

1 n

n

  • i=1
  • F
  • ξ(n)

i

  • − F
  • ζ(n)

i

  • = 0.

Theorem [Tyrtyshnikov ’96]

Let An, Bn be m(n) × n matrices, with m(n) ≥ n. If lim

n→∞

1 nAn − Bn2

F = 0

= ⇒ σi(An) ∼ σi(Bn).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 9 / 18

slide-15
SLIDE 15

SSA and Fourier analysis

Let x = (x1, x2 . . .) be a stationary time series,

  • Xm,n =

1 √mXm,n

  • Xm,n =

1 √m          x1 x2 · · · xn x2 . . . . . . . . . . . . . . . . . . x1 . . . xm · · · . . . xm x1 · · · xn−1          .

Lemma

For any integer sequence m(n) such that n/m(n) → 0 as n → ∞, the singular values of Xm(n),n and Xm(n),n are equally distributed.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 10 / 18

slide-16
SLIDE 16

SSA and Fourier analysis

Indeed: 1 n Xm(n),n − Xm(n),n2

F

= 1 m(n)

n

  • k=1

n − k n (xk − xm(n)+k)2 < 2 m(n)

n

  • k=1

x2

k + x2 m(n)+k

= 2n m(n)

→0

1 n

n

  • k=1

x2

k

  • →R(0)

+ 1 n

n

  • k=1

x2

m(n)+k

  • →R(0)
  • → 0.

Note: x stationary ⇒ any trailing subsequence is stationary.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 11 / 18

slide-17
SLIDE 17

SSA and Fourier analysis

For any fixed window length m, let x = (x1, . . . , xm)T, and let ˆ x = (ˆ x0, . . . , ˆ xm−1)T = Fmx be its Fourier transform. Let Λm = Diag(1, ei2π/m, . . . , ei2(m−1)π/m) and Pm = F H

mΛmFm.

The matrix Tm,n = X T

m,n

Xm,n is Toeplitz, with symbol fm,n(z) = 1 m

n−1

  • k=−n+1

xTP−k

m x eikz = 1

m

n−1

  • k=−n+1

ˆ xHΛ−k

m ˆ

x eikz. Due to the stationarity assumption, limm→∞ Tm,n = Tn, hence for “fastly growing” m(n) we have limn→∞ 1

nTm(n),n − Tn2 F = 0.

σi( Xm(n),n)2 ∼ σi( Xm(n),n)2 = λi(Tm(n),n) ∼ λi(Tn) ∼ f(2πi/n).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 12 / 18

slide-18
SLIDE 18

SSA and Fourier analysis

For any fixed window length m, let x = (x1, . . . , xm)T, and let ˆ x = (ˆ x0, . . . , ˆ xm−1)T = Fmx be its Fourier transform. Let Λm = Diag(1, ei2π/m, . . . , ei2(m−1)π/m) and Pm = F H

mΛmFm.

The matrix Tm,n = X T

m,n

Xm,n is Toeplitz, with symbol fm,n(z) = 1 m

n−1

  • k=−n+1

xTP−k

m x eikz = 1

m

n−1

  • k=−n+1

ˆ xHΛ−k

m ˆ

x eikz. Moreover, for any fixed m, n and for j = 1, . . . , n fm,n 2πj n

  • = 1

m ˆ xH

  • n−1
  • k=−n+1

eik2πj/nΛ−k

m

  • ˆ

x = 1 m

m−1

  • i=0

|ˆ xi|2ηi,j where ηi,j = n−1

k=−n+1 eik2π(j/n−i/m).

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 12 / 18

slide-19
SLIDE 19

SSA and Fourier analysis

Figure: Plots of the coefficients ηi,j = Dn(2π(j/n − i/m)). Here m = 1000.

Dn(θ) =

n−1

  • k=−n+1

eikθ is the nth Dirichlet kernel.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 13 / 18

slide-20
SLIDE 20

SSA and Fourier analysis

Theorem

If x is a stationary time series (+ assumptions on integrability of f and growth of m(n) as n → ∞) then the singular values of Xm(n),n and {f 1/2(2πj/n)}j=1...n are equally distributed. Each SV of Xm,n depends essentially from a portion of the power spectrum |ˆ x0|2, . . . , |ˆ xm−1|2 whose width is ≈ m/n. Actually, we can replace f 1/2(2πj/n) with the power bins ϕ(n)

j

=  1 L

⌊jL⌋−1

  • i=⌊(j−1)L⌋

|ˆ xi|2  

1/2

L = m(n) 2n j = 1, . . . , n.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 14 / 18

slide-21
SLIDE 21

Numerical example: Pseudorandom time series

Figure: SSA of 10 pseudorandom time series with spectral density function f(z) = |1 − 2 cos(z) + 2 cos(2z)|. Left: singular values of trajectory matrices X1000,20 Center: power bins ϕ(20)

1

, . . . , ϕ(20)

20

Right: power bins ϕ(20)

i

in decreasing order.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 15 / 18

slide-22
SLIDE 22

Numerical example: Stromboli tremor analysis

Figure: Analysis of a volcanic tremor signal. Left: singular values of trajectory matrices X3000,10. Right: power bins ϕ(10)

i

in decreasing order.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 16 / 18

slide-23
SLIDE 23

Separability

Figure: SSA of two time series and their sum.

In general, the SVs of Tm,n(x(1)) and Tm,n(x(2)) cannot be recovered from those of Tm,n(x(1) + x(2)) — biorthogonality needed.

Problem

To what extent one can recover individual components without biorthogonality?

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 17 / 18

slide-24
SLIDE 24

Separability

Theorem

Let A, B ∈ Rm×n with SVDs A = UAΣAV T

A and B = UBΣBV T B .

Suppose rank(A|B) = rank(A + B). Moreover, let εL = UT

A UB,

εR = V T

A VB.

Then, 1 η σi(A + B) σi(A|B) η, η =

  • (1 + εL)(1 + εR)

(1 − εL)(1 − εR). Note: εL A+B+ATB and εR A+B+ABT.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 18 / 18

slide-25
SLIDE 25

Separability

Theorem

Let A, B ∈ Rm×n with SVDs A = UAΣAV T

A and B = UBΣBV T B .

Suppose rank(A|B) = rank(A + B). Moreover, let εL = UT

A UB,

εR = V T

A VB.

Then, 1 η σi(A + B) σi(A|B) η, η =

  • (1 + εL)(1 + εR)

(1 − εL)(1 − εR). Thank you.

  • D. Fasino (Udine)

NLA issues in SSA 17/02/2012 18 / 18