Scalable Frequent Sequence Mining With Flexible Subsequence Constraints
Alexander Renz-Wieland 1, Matthias Bertsch 2, Rainer Gemulla 2
1 Technische Universität Berlin
2 Universität Mannheim
ICDE 2019, Macau, China, April 11th, 2019
Figure: example product hierarchy with nodes Canon5D, Nikon5100, DSLR Camera, Tripod, Photography, ...
1. 3-grams
2. 3-, 4-, and 5-grams
3. skip 3-grams with gap 1
4. all subsequences
5. subsequences of length 3–5
6. subsequences with a bounded gap of 0–3
7. serial episodes of length 3, window 5
8. generalized 5-grams
9. subsequences matching the regex [a|b]c*d
10. ...
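Several of the constraints listed above (n-grams, skip n-grams, length ranges, bounded gaps) can be expressed as a length range plus a maximum gap. The following is a minimal sketch of that idea, not the D-SEQ or D-CAND algorithm from the talk: it enumerates candidates naively on a single machine and counts how many input sequences generate each one. The function names, the toy database, and the reading of "gap" as the number of skipped items between consecutive picked items are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def constrained_subsequences(seq, min_len, max_len, max_gap):
    """Return the set of subsequences of `seq` whose length lies in
    [min_len, max_len] and whose consecutive items are separated by at
    most `max_gap` skipped items in `seq` (assumed gap definition)."""
    found = set()
    for length in range(min_len, max_len + 1):
        # Exhaustive enumeration of position tuples: exponential, sketch only.
        for positions in combinations(range(len(seq)), length):
            gaps_ok = all(
                later - earlier - 1 <= max_gap
                for earlier, later in zip(positions, positions[1:])
            )
            if gaps_ok:
                found.add(tuple(seq[p] for p in positions))
    return found


def mine_frequent(sequences, min_support, min_len, max_len, max_gap):
    """Count, per candidate, the number of input sequences that generate
    it, and keep the candidates that reach `min_support`."""
    support = Counter()
    for seq in sequences:
        # A set per sequence, so each sequence counts at most once.
        support.update(constrained_subsequences(seq, min_len, max_len, max_gap))
    return {s: c for s, c in support.items() if c >= min_support}


# Hypothetical toy database, not taken from the slides.
db = [list("abcd"), list("acd"), list("abd")]

# 3-grams: length exactly 3, no skipped items.
three_grams = mine_frequent(db, 1, 3, 3, max_gap=0)

# Skip 3-grams with gap 1: length exactly 3, at most one skipped item.
skip_three_grams = mine_frequent(db, 2, 3, 3, max_gap=1)
```

With a minimum support of 2, only the skip 3-grams (a, b, d) and (a, c, d) remain in this toy database. Constraints that involve item hierarchies or regular expressions (items 8 and 9) do not fit this two-parameter form and are not covered by the sketch.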
Figure: total time (in seconds; axis ticks 1, 10, 100, 1000) by subsequence constraint for Naïve, SemiNaïve, D-SEQ, and D-CAND.
Constraints in the first plot: N1(10), N2(100), N3(10), N4(1k), N5(1k).
Constraints in the second plot: A1(500), A2(100), A3(100), A4(100).
Two entries are marked n/a (OOM).
To appear in Transactions on Database Systems, 2019.