CS6220: DATA MINING TECHNIQUES
Instructor: Yizhou Sun
yzsun@ccs.neu.edu November 30, 2015
CS6220: DATA MINING TECHNIQUES Mining Time Series Data Instructor: - - PowerPoint PPT Presentation
CS6220: DATA MINING TECHNIQUES Mining Time Series Data Instructor: Yizhou Sun yzsun@ccs.neu.edu November 30, 2015 Announcement No class next week and see you on Dec. 14. The final report and presentation guideline is going to be
yzsun@ccs.neu.edu November 30, 2015
2
3
Matrix Data Text Data Set Data Sequence Data Time Series Graph & Network Images Classification
Decision Tree; Naรฏve Bayes; Logistic Regression SVM; kNN HMM Label Propagation* Neural Network
Clustering
K-means; hierarchical clustering; DBSCAN; Mixture Models; kernel k-means* PLSA SCAN*; Spectral Clustering*
Frequent Pattern Mining
Apriori; FP-growth GSP; PrefixSpan
Prediction
Linear Regression Autoregression
Similarity Search
DTW P-PageRank
Ranking
PageRank
4
5
6
7
8
1, ๐ 2, โฆ , ๐๐
๐ข: ๐ข โ ๐ , ๐ฅโ๐๐ ๐ ๐ ๐๐ก ๐ขโ๐ ๐๐๐๐๐ฆ ๐ก๐๐ข
9
10
direction in which a time series is moving over a long interval
about a trend line or curve
follow during corresponding months of successive years.
11
12
๐ข) = ln(๐ ๐ข) โ ln(๐ ๐ขโ1)
13
14
15
Autocovariance
๐๐๐ค(๐
๐ข,๐๐ขโ๐)
๐ค๐๐ (๐
๐ข)
๐ข, ๐ ๐ขโ๐) is calculated as:
16
๐
๐ข
๐
๐ขโ๐
๐ง๐+1 ๐ง1 ๐ง๐+2 ๐ง2 โฎ โฎ ๐ง๐โ1 ๐ง๐โ๐โ1 ๐ง๐ ๐ง๐โ๐
17
๐๐ = ๐. ๐๐, very high: Last quarterโs inflation rate contains much information about this quarterโs inflation rate
18
19
20
21
23
24
25 VanEck International Fund Fidelity Selective Precious Metal and Mineral Fund
Two similar mutual funds in the different fund group
26
27
โฒ = ๐๐โ๐(๐ท) ๐(๐ท)
28
29
30
31
32
33
34
35
+ ๐(๐ฆ๐, ๐ง๐)
36
Time complexity: O(MN)
37
38
the frequency domain
same as their Euclidean distance in the frequency domain
39
40
41
42
43
44
domain is the same as their distance in the frequency domain
45
๏ญ ๏ฝ ๏ญ ๏ฝ
1 2 1 2
n f f n t t
3 2 2
๏ฝ ๏ฝ
f n t
46
47