SLIDE 5 Monotone Constraints
Example
Assume Itemset ABC should be
generated by from AB and AC but since ACB is not generated whereas
Assume Itemset ABC is correctly
generated by from AB and AC but since ACB is incorrectly pruned whereas
( ) . C S C S ≡ ∈
apriori
generate
( ) , C AB false =
( ) C ABC true =
( ) . C S A S ≡ ∈
apriori
generate
( ) , C AB false = ( ) C ABC true =
The generation step in Apriori must be complete: i.e., it must not miss any itemset satisfying C The pruning step (Phase 1) must be correct, i.e., it must not prune an itemset that verify C The generation step and pruning step need to be modified in order to include monotone constraints
Monotone Constraints
Some definition in modified generation
procedure
Negative border: If denotes an anti-monotone
constraint, is the collection of the minimal itemsets that do not satisfy
- denotes a monotone constraint, it is the
negation of , so equals to
am
C
am
C
Bd
am
C
m
C
am
C
'am C ¬
m
C
Monotone Constraints
Generation procedure
- and B is a 1-itemset
- A,B
- Assume
and
,
If k<ms, If k=ms, If k>ms,
This generation procedure is complete and ensures that
every candidate itemset verifies ( )
1(
) { ,
k
generate L A B where = U
k
A L ∈
2(
) { ,
k
generate L A B where = U
k
L ∈ '
am am
C C C = Λ¬
'
| |
C am
S Bd
ms Max S
∈
=
1 '
( )
am
m C
generate L Bd Items = ∩ 1 k ≥
1 1 '
( ) ( ) ( )
am
m k k k C
generate L generate L Bd Items + = ∪ ∩
1
( ) ( )
m k k
generate L generate L =
2
( ) ( )
m k k
generate L generate L = } }
monotone constraint
monotone constraint
collection of the minimal itemsets that do not satisfy
am
C
am
C
Bd
m
C
'am C ¬
m
generate
'am C ¬ We do not need to verify the monotone constraint after this generation procedure
m
C
Monotone Constraints
Pruning procedure
For all and for all such that |S’|=k
do if and then delete S from
m
prune
1 g k
S C + ∈
' S S ⊂
'
k
S L ∉ ( ')
m
C S true =
1 g k
C +
m
prune
The algorithm is correct because it does not prune any itemset that verify .Its completeness means that if an itemset is not pruned then every proper subset of that itemset verify .
'
am am
C C C = Λ¬
am
C