SLIDE 12
- 1. Memory and DFS improvement. How to Store and
restore databases in the DFSearch ? => reversible vectors making use of trailing techniques.
- 2. Support Count Improvement. How to compute
support efficiently? Visit only the last position of each symbol after start position.
- 3. Sequence visited Improvement. Visit a sequence only
if current start position is less than last position of prefix [weakness 3]
- 4. Pruning Improvement. Remove infrequent item from
- nly Di+1 domains of Pi+1. [weakness 1]
Improvements of Literature (2/4)
A C A B A A A C B B B B
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
startPos
✦ startPos=5
- How to compute items support in SDB?
Often many repeating symbols: cache for each symbol its 'last position' and only iterate over those (O(m) vs O(n))
★ Identify items which exist in each sequence of SDB and
increase items support value.
Aoga et al., An Efficient Algorithm for SPM with CP, ECML PKDD 2016
9
★ Last Position List = [(B,16),(C,12),(A,11),(D,2),(E,0)] A C A B A A A C B B B B
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
startPos
✦ startPos=5
- How to compute items support in SDB?
Often many repeating symbols: cache for each symbol its 'last position' and only iterate over those (O(m) vs O(n))
★ Identify items which exist in each sequence of SDB and
increase items support value.
A C B
Aoga et al., An Efficient Algorithm for SPM with CP, ECML PKDD 2016
9
★ Last Position List = [(B,16),(C,12),(A,11),(D,2),(E,0)] A C A B A A A C B B B B
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
startPos
✦ startPos=5
- How to compute items support in SDB?
Often many repeating symbols: cache for each symbol its 'last position' and only iterate over those (O(m) vs O(n))
★ Identify items which exist in each sequence of SDB and
increase items support value.
A C B
Aoga et al., An Efficient Algorithm for SPM with CP, ECML PKDD 2016
9