SLIDE 38 State-of-the- Art in Data Stream Mining (Part I) Jo˜ ao Gama Outline Motivation Data Streams
Basic Methods
Change Detection
Predictive Learning
Clustering Data Streams Predictive Models from Data Streams
Decision Trees Neural Networks
Exponential Histograms: Example
Time 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Element 1 1 1 1 1 1 1 1 1 1 1 1 Window length=10 Relative Error=0.5 Merge if 3 buckets of the same size: |1/0.5|/2/2 Time Buckets Total Last T1 11 1 1 T2 11, 12 2 1 T3 11, 12, 13 3 1 (merge) 22, 13 3 1 T4 22, 13, 14 3 2 . . . T11 44, 28, 210, 111 9 4 T12 44, 28, 210, 111, 112 10 4 T13 44, 410, 212, 113 11 4 T14 44, 410, 212, 113, 114 12 4 (Removing out-of-date) T15 410, 212, 113, 114 8 4