modified segmentation methods of quasi stationary time
play

Modified segmentation methods of quasi-stationary time series Irina - PowerPoint PPT Presentation

Modified segmentation methods of quasi-stationary time series Irina Roslyakova Department of Scale Bridging Thermodynamic and Kinetic Simulation ICAMS (Interdisciplinary Centre for Advanced Materials Simulation) Ruhr-Universitt Bochum, UHW


  1. Modified segmentation methods of quasi-stationary time series Irina Roslyakova Department of Scale Bridging Thermodynamic and Kinetic Simulation ICAMS (Interdisciplinary Centre for Advanced Materials Simulation) Ruhr-Universität Bochum, UHW 10/1022 Stiepeler Str. 129, 44801 Bochum Tel.: +49 234 32 22449, Mobil: +49 (163) 196 3327 E-mail: irina.roslyakova@rub.de

  2. Contents � Problem importance � Method of Pedro Bernaola-Galván et al. � Modified segmentation algorithm � Comparison: Modified segmentation algorithm vs. R-Package “ strucchange” � Conclusion This work is part of my master thesis performed in BASF SE, Germany 2

  3. Problem importance � Many time series are a sequence of stationary intervals � The statistical analysis of such quasi-stationary processes requires a division of the measurements into different stationary time segments. � The segmentation of quasi-stationary time series is a tedious computational problem � For large samples statistical methods will be too time consuming. Heuristics have to be applied. � Possible application � Medicine � Chemical production processes � Internet traffic fluctuations 3

  4. Method of Pedro Bernaola-Galván et al. Statistic t for comparison mean value of two samples is calculated for every position of sliding point p from start of interval left right − m m = x x t ( p ) , S is the pooled variance i D S D till the end of intervall Point p with maximal difference in mean value is DOI: 10.1103/PhysRevLett.87.168105 checked by using modified T-Test η { } ( ) ( ) ≈ 1- δν δ , P t I max 2 ν ν + [ /( t )] max = − v N 2 I ( a,b ) is the incomplete beta function where is the number of degrees of freedom, x Significance of cutting point p is checked under condition that minimal length of subintervals have to be not less that some used defined value left ≥ l l ( ) ( ) 0 p c = ≥ P = p t P t 0 . 95 we cut the series at point and max 0 max right ≥ l l 0 4

  5. Method of Pedro Bernaola-Galván et al. Analysis of the vapor consumption in a chemical production Observation Vapor consumption window: Method is sensitive but not robust against 30 days significant outliers near boundaries 30 days Vapor consumption 85 days no segmentation outliers 85 days Time 5

  6. Modified segmentation algorithm � Bernaola-Galván’s method is sensitive but not robust against significant outliers near boundaries Vapor consumption Observation window: 85 days Time � Modification to make algorithm robust against outliers near boundaries The length of right subinterval is less than minimal allowed value l 0 Pedro Bernaola-Galván Modified method 6

  7. Modified segmentation algorithm Analysis of the vapor consumption in a chemical production Vapor consumption Observation window: Segmentation result are independent of observation 30 days window and outliers. 30 days Vapor consumption 85 days outliers Time index 85 days Time 7

  8. Segmentation Steps in Comparison Pedro Bernaola-Galván Modified method 1 1 2 2 3 4 3 4 5 5 6 6 8

  9. Modified Segmentation: Steps Vapor consumption Vapor consumption Observation window: t t t 1 2 1 30 days 30 days 30 days Time Time Vapor consumption Vapor consumption t t t t t t t 2 3 1 2 3 1 4 Time Time 9

  10. Modified segmentation vs. " strucchange" R-Package strucchange Vapor consumption, t/h (+) breaks in time series with trends 0.175 can also be recognized (-) slower (49.08 sec) 0.165 0 100 200 300 400 500 600 700 Time index Modified segmentation Time index Vapor concsumption, t/h 0.175 (+) much faster (0.68 sec) (-) only for breaks in stationary data 0.165 0 100 200 300 400 500 600 700 Time index 10

  11. Conclusion � Heuristic method proposed by Pedro Bernaola-Galván et al. was implemented in R and analyzed � Analysis of original segmentation algorithms showed that this method in not robust to outliers near to bounds � Original algorithms was modified and show good resistant to influence of outliers near bounds � Algorithm is limited to stationary segments. � The modified segmentation method was compared with functions breakpoints from R-package strucchange and its computational efficiency was shown 11

  12. Literature � Bernaola-Galván, Pedro; Ivanov, Plamen Ch.; Amaral, Luís A. Nunes; Stanley, H. Eugene: Scale Invariance in the Nonstationarity of Human Heart Rate. In: Physical review letters (2001), Volume 87, number 16 (abgerufen am 13. August 2009). http://polymer.bu.edu/hes/articles/bias01.pdf � Fukuda, Kensuke; Stanley, H. Eugene; Amaral, Luı´s A. Nunes: Heuristic segmentation of a nonstationary time series. In: Physical review letters 69 (2004). � Zeileis, Achim; Leisch, Friedrich; Hansen, Bruce; Hornik, Kurt; Kleiber, Christian: Package „strucchange“, 2009 (Version 1.3-7 ) � Zeileis, Achim; Leisch, Friedrich; Hornik, Kurt; Kleiber: strucchange: An R Package for Testing for Structural Change in Linear Regression Models (Version 1.3-7 ) http://cran.r- project.org/web/packages/strucchange/vignettes/strucchange-intro.pdf 12

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend