Data-driven window width adaption adaption for robust for robust - - PowerPoint PPT Presentation

data driven window width adaption
SMART_READER_LITE
LIVE PREVIEW

Data-driven window width adaption adaption for robust for robust - - PowerPoint PPT Presentation

Data-driven window width Data-driven window width adaption adaption for robust for robust online moving window regression online moving window regression Matthias Matthias Borowski Borowski Fakultt Statistik, TU Dortmund COMPSTAT


slide-1
SLIDE 1

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Data-driven window width adaption for robust online moving window regression

Matthias Borowski

Fakultät Statistik, TU Dortmund

COMPSTAT 2010, Paris

1

slide-2
SLIDE 2

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Situation

Online-monitoring time series: systolic blood pressure

t xt

1 50 100 150 200 250 300 350 400 450 500 80 100 120 140 160 180

General assumption: Xt = µt + εt + ηt data = signal + noise +

  • utliers

2

slide-3
SLIDE 3

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Online filtering by Repeated Median (RM) regression

t

  • 20

40 60 80 100 −2 2 4 6 8 10 12

  • data

local linear regression µ ^t

RM regression (Siegel, 1982):

Slope ˆ βt = med

i ∈{1,...,n}

  • med

j=i, j ∈{1,...,n}

xt−n+i − xt−n+j i − j

  • Level

ˆ µt = med

i ∈{1,...,n}

  • xt−n+i − ˆ

βt · (i − n)

  • 3
slide-4
SLIDE 4

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Online filtering by Repeated Median (RM) regression

t

  • 20

40 60 80 100 −2 2 4 6 8 10 12

  • data

local linear regression µ ^t

RM regression (Siegel, 1982):

Slope ˆ βt = med

i ∈{1,...,n}

  • med

j=i, j ∈{1,...,n}

xt−n+i − xt−n+j i − j

  • Level

ˆ µt = med

i ∈{1,...,n}

  • xt−n+i − ˆ

βt · (i − n)

  • 4
slide-5
SLIDE 5

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Online filtering by Repeated Median (RM) regression

t

  • 20

40 60 80 100 −2 2 4 6 8 10 12

  • data

local linear regression µ ^t

RM regression (Siegel, 1982):

Slope ˆ βt = med

i ∈{1,...,n}

  • med

j=i, j ∈{1,...,n}

xt−n+i − xt−n+j i − j

  • Level

ˆ µt = med

i ∈{1,...,n}

  • xt−n+i − ˆ

βt · (i − n)

  • 5
slide-6
SLIDE 6

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Online filtering by Repeated Median (RM) regression

t

  • 20

40 60 80 100 −2 2 4 6 8 10 12

  • data

local linear regression µ ^t

RM regression (Siegel, 1982):

Slope ˆ βt = med

i ∈{1,...,n}

  • med

j=i, j ∈{1,...,n}

xt−n+i − xt−n+j i − j

  • Level

ˆ µt = med

i ∈{1,...,n}

  • xt−n+i − ˆ

βt · (i − n)

  • 6
slide-7
SLIDE 7

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Online filtering by Repeated Median (RM) regression

t

  • 20

40 60 80 100 −2 2 4 6 8 10 12

  • data

local linear regression µ ^t

RM regression (Siegel, 1982):

Slope ˆ βt = med

i ∈{1,...,n}

  • med

j=i, j ∈{1,...,n}

xt−n+i − xt−n+j i − j

  • Level

ˆ µt = med

i ∈{1,...,n}

  • xt−n+i − ˆ

βt · (i − n)

  • 7
slide-8
SLIDE 8

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Online filtering by Repeated Median (RM) regression

t

  • 20

40 60 80 100 −2 2 4 6 8 10 12

  • data

local linear regression µ ^t

RM regression (Siegel, 1982):

Slope ˆ βt = med

i ∈{1,...,n}

  • med

j=i, j ∈{1,...,n}

xt−n+i − xt−n+j i − j

  • Level

ˆ µt = med

i ∈{1,...,n}

  • xt−n+i − ˆ

βt · (i − n)

  • 8
slide-9
SLIDE 9

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Repeated Median (RM) regression

Trade-off problem for window width (ww): Large ww ⇒ smooth signal extraction Small ww ⇒ exact signal extraction Approach: data-driven ww selection for RM At each time point t: Test: Is window width n adequate for RM fit? Yes: estimate signal No: decrease n

9

slide-10
SLIDE 10

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Methods

ADORE – ADaptive Online REpeated Median

(Schettlinger et al., 2010)

SCARM – Slope Comparing Adaptive Repeated Median

(Borowski, 2010)

10

slide-11
SLIDE 11

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

ADORE test

H0: ww n adequate, if # neg. residuals ≈ # pos. residuals in right sub-window

t xt

1 25 50 75 100 −2 −1 1 2 3 4

  • RM fit

20 pos. residuals 10 neg. residuals

11

slide-12
SLIDE 12

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

SCARM test

H0: ww n adequate, iff

  • ˆ

βleft − ˆ βright

  • Var(ˆ

βleft − ˆ βright) is small

t xt

1 25 50 75 100 −2 −1 1 2 3 4

RM fit left sub−window RM fit right sub−window

12

slide-13
SLIDE 13

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

SCARM test

H0: ww n adequate, iff

  • ˆ

βleft − ˆ βright

  • Var(ˆ

βleft − ˆ βright) is small Sophisticated estimation of Var(ˆ βleft − ˆ βright) to increase power: Use that RM slope is unbiased (symm. noise) and regression equivariant Estimate noise variability by regression-free scale estimator

(Gelper et al., 2009)

13

slide-14
SLIDE 14

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Power comparison ADORE vs. SCARM

Standard normal noise Significance level 0.01 Several sizes of level shifts and trend changes

1 2 3 4 5 6 0.0 0.2 0.4 0.6 0.8 1.0

Level shifts

Shift height Power SCARM ADORE 0.1 0.2 0.3 0.4 0.5 0.0 0.2 0.4 0.6 0.8 1.0

Trend changes

Trend slope Power SCARM ADORE 14

slide-15
SLIDE 15

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

Outlook

ww adaption based on

Shift detection (Fried, Gather, 2007) Trend detection (Fried, Imhoff, 2004)

Further comparisons:

Other types of noise Effect of outliers

SCARM in R package robfilter

(Fried, Schettlinger, Borowski, 2010)

15

slide-16
SLIDE 16

Data-driven window width adaption for robust

  • nline

moving window regression Matthias Borowski

References

Borowski, M. (2010): Window width adaption for robust moving window regression in online-monitoring time series. Discussion Paper, to appear. Fried, R., Gather, U. (2007): On rank tests for shift detection in time series. CSDA 52, 221-233. Fried, R., Imhoff, M. (2004): On the online detection of monotonic trends in time series. Biometrical Journal 46, 90-102. Fried, R., Schettlinger, K., Borowski, M. (2010): robfilter: Robust Time Series Filters. R package version 2.6.1 Siegel, A. F. (1982): Robust regression using repeated medians. Biometrika 69(1), 242-244. Gelper, S., Schettlinger, K., Croux, C., Gather, U. (2009): Robust online scale estimation in time series: a regression-free approach. J. Stat. Plann.

  • Inf. 139, 335-349.

Schettlinger, K., Fried, R., Gather, U. (2010): Real time signal processing by adaptive repeated median filters. Int. J. Adapt. Control Signal Process. 24, 346-362.

16

slide-17
SLIDE 17

Estimation of Var(D)

Var(D) = Var

  • ˆ

βleft − ˆ βright

= Var

  • ˆ

βleft + Var

  • ˆ

βright

*under H0 for i.i.d. symmetric noise with zero median

Var

  • ˆ

β

  • depends on ww n and on noise variance σ2:

Var

  • ˆ

β

  • =: V (n, σ) = V (n, 1) · σ2

approximations ˆ V (n, 1) =: vn for n = 5, . . . , 300 by simulation estimate noise variance σ2 on residuals? ⇒ small power! better: obtain ˆ σ2 by regression-free Q scale estimator

(Gelper et al., 2009)

slide-18
SLIDE 18

Estimation of Var(D) = Var

  • ˆ

βleft + Var

  • ˆ

βright

  • Var(D) = vℓ · Q(xleft

t

)2 + vr · Q(xright

t

)2 → if |d∗| =

  • d
  • vℓ · Q(xleft

t

)2 + vr · Q(xright

t

)2

  • is ’too large’: reject H0

Critical values for D∗? D∗ roughly N(0, 1) distributed for several noise distributions simulation: quantiles of the emp. distribution of d∗ for N(0, 1) noise

slide-19
SLIDE 19

Application: Blocks data

SCARM arguments: r = 20, nmin = 40, nmax = 300, α = 0.001

200 400 600 800 1000 −6 −4 −2 2 4 6 Blocks Data and SCARM signal extraction t

data signal signal extraction

200 400 600 800 1000 50 100 150 200 250 window widths t

slide-20
SLIDE 20

Application: Real data

SCARM arguments: r = 40, nmin = 80, nmax = 300, α = 0.001

100 200 300 400 500 100 120 140 160 Real data and SCARM signal extraction t

data signal extraction

100 200 300 400 500 40 60 80 100 120 140 160 180 window widths t