PATTERN DISCOVERY IN TIME SERIES A SURVEY
DENIS KHRYASHCHEV, GRADUATE CENTER, CUNY, OCTOBER 2018
A SURVEY DENIS KHRYASHCHEV, GRADUATE CENTER, CUNY, OCTOBER 2018 - - PowerPoint PPT Presentation
PATTERN DISCOVERY IN TIME SERIES A SURVEY DENIS KHRYASHCHEV, GRADUATE CENTER, CUNY, OCTOBER 2018 MOTIVATION Often datasets represent processes that take place over long periods of time. Their outputs are measured at regular time intervals
DENIS KHRYASHCHEV, GRADUATE CENTER, CUNY, OCTOBER 2018
Data sources: 1. https://s3.amazonaws.com/tripdata/index.html,
CitiBike Fisher river
[1] Petrosian, Arthur. "Kolmogorov complexity of finite sequences and recognition of different preictal EEG patterns." Computer-based medical systems, 1995., Proceedings of the Eighth IEEE Symposium on. IEEE, 1995.
interictal 1. voltage, µV preictal 1. voltage, µV seizure 1. voltage, µV
Image source: http://web.mta.info/mta/news/books/docs/Ridership_Trends_FINAL_Jul2018.pdf
[2] Zhao, Kai, et al. "Predicting taxi demand at high spatial resolution: approaching the limit of predictability." Big Data (Big Data), 2016 IEEE International Conference on. IEEE, 2016.
Image source: Denis Khryashchev’s summer internship at Simulmedia (Jun – Aug 2018).
Video source: Denis Khryashchev’s summer internship at Simulmedia (Jun – Aug 2018).
𝑂, 𝑌𝑢 ∈ ℜ
𝑘 = 𝑌𝑢 𝑗 𝑘 = 𝑌𝑗, … , 𝑌 𝑘 , 𝑗 ≤ 𝑘
Computer Science, The University of Auckland, New Zealand, Tech. Rep., 1995.
ℎ1+1, 𝑌ℎ1+1 ℎ2
𝑛
𝐼(𝑌) 𝐼 𝑌
1 1−𝛽 log2 σ𝑙 𝑞𝑙 𝛽
𝜐→∞ lim 𝜗→∞ lim 𝑒→∞
𝑗1,…𝑗𝑒
𝑂
𝑂
𝑘 log2 𝑄 𝑌𝑗 𝑘
𝑘 is the probability of the subsequence 𝑌𝑗 𝑘. 𝐼𝑢 𝑌 is 𝑃 2𝑂 complex.
1 𝑂 σ𝑢 𝑡𝑢 ′ −1
′ is the shortest subsequence starting at period 𝑢 observed for the 1st time.
IEEE Transactions on Information Theory 44.3 (1998): 1319-1327.
# 𝑢|0≤𝑢≤𝑈−𝑜, 𝑢𝑧𝑞𝑓 𝑌𝑢+1,…,𝑌𝑢+𝑜 =𝜌 𝑈−𝑜+1
2 5 log 2 5 − 1 5 log 1 5 ≈ 1.52 .
(2002): 174102.
Electronics, 1949.
𝑌−𝑔 𝑌
𝑈 𝑌−𝑔 𝑌
𝑌𝑡−𝑔 𝑌𝑡
𝑈 𝑌𝑡−𝑔 𝑌𝑡
Electronics, 1949.
𝑊𝑏𝑠 𝛽+𝛾𝑌𝑢 𝑊𝑏𝑠 𝑌𝑢+1
𝑂−1 𝑌𝑢𝑓−𝑗2𝜌𝑙𝑢/𝑂
𝑂−1 𝑌𝑢𝑥(𝑢 − 𝜐)𝑓−𝑗2𝜌𝑙𝑢/𝑂
(1977): 1558-1564.
𝑂−1 𝑠 𝑦𝑦𝑓−𝑗2𝜌𝑙𝑢/𝑂
𝑦𝑦 is its autocorrelation function. In
IEEE International Conference on. Vol. 3. IEEE, 1998.
2 𝑂 σ𝑢=0 𝑂−1 𝑌𝑢 cos 2𝜌𝑔𝑢 + 2 𝑂 σ𝑢=0 𝑂−1 𝑌𝑢 sin 2𝜌𝑔𝑢
𝑔
1 2 ≤ 𝑔 ≤ 1 2 . The test assumes that 𝑌 = ς + 𝑏 with ς being the real
𝑞 to be a set of all 𝑞-periodic sequences.
𝑡 𝑘 = ቊ1 if 𝑘 − 𝑡 mod 𝑞 = 0
𝑞 = σ𝑡=0 𝑞−1 𝛽𝑡𝜀𝑡 𝑞 , 𝛽𝑡 = 1 𝑂 σ𝑢=0 𝑂−1 𝑌𝑡+𝑢𝑞
𝑌−𝔽 𝑌 𝜏𝑌
𝑍−𝔽 𝑍 𝜏𝑍
𝑔, 𝜍 𝑔 𝑌 , 𝑍
Ausgleichsrechnung." ZAMM‐Journal of Applied Mathematics and Mechanics/Zeitschrift für Angewandte Mathematik und Mechanik 21.6 (1941): 364-379.
𝑂−𝜐, 𝑌𝜐 𝑂 > 𝜍 𝑌1 𝑂−𝜐, 𝑌𝜐 𝑂 for every lag 𝜐.
𝑙
𝑙, 𝑍 1, … , 𝑍 𝑂 is a collection of independent identically distributed
𝑗 < ∞.
44.1/2 (1957): 289-292.
𝑞𝑗𝑘 𝑞(𝑗.) 𝑞(.𝑘).
𝑔, ∈ ℱ 𝑔 𝑌 , 𝑍
𝑦𝑦 𝑌1 , 𝑠 𝑦𝑦 𝑌2 , 𝑠 𝑦𝑦(𝑌3)} instead
𝑦𝑦 𝑌 = 𝜍 𝑌, 𝑀𝜐𝑌 |2 ≤ 𝜐 ≤ 𝑂 − 1 , 𝑀𝑙𝑌𝑢 = 𝑌𝑢−𝜐
𝑌𝑗,𝑢
1 𝑌, 𝑍 = 1−𝜍 𝑌,𝑍 1+𝜍 𝑌,𝑍 𝛾
2 𝑌, 𝑍 = 2 1 − 𝜍 𝑌, 𝑍
99-109.
260.
2
𝜀𝑌𝑙 𝜀𝑢𝑙 − 𝜀𝑍𝑙 𝜀𝑢𝑙 2
𝑂 − 𝑌1 𝑂−1
Intelligent Data Analysis. Springer, Berlin, Heidelberg, 2003.
2011 SIAM international conference on data mining. Society for Industrial and Applied Mathematics, 2011.
2 .
Dimensionality Data and Its Applications. 2003.
Temporal Data Mining. 2001.
𝑌, … , 𝜄𝑙 𝑌} and 𝜄𝑍 = {𝜄1 𝑍, … , 𝜄𝑙 𝑍}. Setting the null
𝑢 = 𝐷 𝑇𝑙𝛾|𝑌1
𝑢
𝐷 𝑇𝑙 𝑌1 𝑢
𝑢) stands for the number of times 𝑇𝑙 is followed with 𝛾.
𝑢 = 𝑄 𝑌𝑢+𝑙+1 = 𝛾|𝑌𝑙+1 𝑢+𝑙
𝑢
𝑢+𝑙 = 𝑇1, … , 𝑇𝑙 = 𝑇 we predict
𝑢 = 𝐷(𝑇𝛾|𝑌1
𝑢)
𝐷 𝑇 𝑌1 𝑢
𝑞−1, 𝑌2 𝑞, … , 𝑌𝑂−𝑞+1 𝑂−1
𝑂
1 𝑂−𝑞+1 σ𝑗=1 𝑂−𝑞+1 max 0, 1 − 𝑧𝑗 𝑥𝑈𝑦𝑗 − 𝑐
2
𝑈𝜚 𝑥1 𝑈𝑦𝑗 + 𝑐1 + 𝑐2
(2005): 1160-1187.
𝑞
𝑟
Psychometrika 52.3 (1987): 345-370.
𝑗=1 𝐽
𝑗𝑗 𝑢
MAX(𝑞,𝑟)
𝑟
𝐽
prediction." IEEE transactions on neural networks 5.2 (1994): 240-254.
1 𝑗1 + ℎ𝑢−1𝑋 2 𝑗1 , 𝑗2 = 𝜏 𝑐𝑗2 + 𝑌𝑢𝑋 1 𝑗2 + ℎ𝑢−1𝑋 2 𝑗2
1 𝑔 + ℎ𝑢−1𝑋 2 𝑔
1 𝑝 + ℎ𝑢−1𝑋 2 𝑝
prediction." IEEE transactions on neural networks 5.2 (1994): 240-254.