The DESQ Framework for Declarative and Scalable Frequent Sequence Mining
Kaustubh Beedkar¹, Rainer Gemulla², Alexander Renz-Wieland¹
¹Technische Universität Berlin
²Universität Mannheim
INFORMATIK 19, Kassel, September 24th, 2019
DESQ: Declarative and Scalable Frequent Sequence Mining 2/23
[Figure: total time in seconds (axis ticks at 10, 100, 1000) for the settings σ, γ, λ = 100,0,3; 100,0,5; 100,1,5; 100,2,5; 1K,0,5(+H). Three bars are labeled >12Hr.]
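[Figure: total time in seconds (axis ticks at 10, 100, 1000, 10000) per pattern expression (σ) for Naive+cFST, DESQ-COUNT, and DESQ-DFS. Two series of bar labels survive, listed in pattern-expression order:]

Expression (σ)  N1(10)  N2(100)  N3(10)  N4(1K)  N5(1K)  A1(500)  A2(100)  A3(100)  A4(100)
First series    1.03    9.38     2.02    54.55   89.8    4876     445      11892    3894
Second series   1.03    7.5      1.84    48.75   75.98   1478     416      5840     909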
Item-based partitioning
[Diagram: partitions labeled a, b, and n, each processed by its own FSM (frequent sequence mining) task.]
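The idea behind item-based partitioning can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the DESQ implementation: candidates are restricted to contiguous subsequences of bounded length, items are ranked by descending frequency, and each pattern is assigned to the partition of its least frequent item (its "pivot"). The function names `ngrams`, `mine_global`, and `mine_partitioned` and the toy database are hypothetical.

```python
from collections import Counter, defaultdict


def ngrams(seq, max_len):
    """Distinct contiguous subsequences of seq with length 1..max_len."""
    return {tuple(seq[i:i + n])
            for n in range(1, max_len + 1)
            for i in range(len(seq) - n + 1)}


def mine_global(db, sigma, max_len):
    """Reference miner: count every candidate over the whole database."""
    support = Counter(p for seq in db for p in ngrams(seq, max_len))
    return {p: c for p, c in support.items() if c >= sigma}


def mine_partitioned(db, sigma, max_len):
    """Item-based partitioning: one independent mining task per item."""
    # Assumption: rank items by descending document frequency (ties broken
    # alphabetically). The pivot of a pattern is its highest-ranked item.
    freq = Counter(item for seq in db for item in set(seq))
    order = sorted(freq, key=lambda item: (-freq[item], item))
    rank = {item: r for r, item in enumerate(order)}

    # "Map" step: send each sequence to the partition of every item it contains.
    partitions = defaultdict(list)
    for seq in db:
        for item in set(seq):
            partitions[item].append(seq)

    # "Reduce" step: mine each partition on its own, reporting only the
    # patterns whose pivot equals the partition's item.
    result = {}
    for pivot, seqs in partitions.items():
        support = Counter(
            p for seq in seqs for p in ngrams(seq, max_len)
            if max(p, key=rank.__getitem__) == pivot)
        result.update({p: c for p, c in support.items() if c >= sigma})
    return result


# Hypothetical toy database over the items a, b, c, d.
db = [["a", "c", "b"], ["a", "c", "c", "b"], ["a", "c", "d", "b"], ["c", "b"]]
patterns = mine_partitioned(db, sigma=2, max_len=2)
```

Every sequence that contains a pattern also contains the pattern's pivot item, so the partition of that item sees all supporting sequences and the local count equals the global support. Because each pattern has exactly one pivot, no pattern is reported twice, and the partitions can be mined in parallel without communication. This sketch ships whole sequences to each partition; a practical system would shrink them first.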
[Diagram: example sequences built from the singleton sets {a}, {b}, {c}, and {d}.]
[Figure: total time (in minutes) versus number of executors (2, 4, 8).]
[Figure: total time (in minutes) versus number of executors and share of the data: 2 (25%), 4 (50%), 6 (75%), 8 (100%).]