SLIDE 3 8
The Algorithm (refine model 1)
Cluster N patterns in k clusters User ranks k patterns Refine model Re-rank all N patterns N=aN
*** main contribution of the paper
– How to model the users knowledge? – So far we have only ranked k out of N patterns…
– Difference between observed frequency and expected frequency fo(P) and fe(P) – Observed from input – Expected calculated from the model of the users knowledge fe(P) = M(P,θ) – If fo(P) and fe(P) are different the pattern is interesting
– if the user ranks Pi as more interesting than Pj : R[fo(Pi),fe(Pi)] > R[fo(Pj),fe(Pj)] – Log-linear model R[fo(P),fe(P)] = log fo(P) - log fe(P) – This is a constraint on the model optimization
9
Will have k constraints
The Algorithm (refine model 2)
Cluster N patterns in k clusters User ranks k patterns Refine model Re-rank all N patterns N=aN
– Say we have a pattern (P) in a data set of s items, fe(P) is: – Recall ordering of patterns by user as a constraint: – Define a weight vector and new representation of the constraint above:
1
log ( )
s e j j
f P u u
=
= +∑
1 1 2 2
log ( ) log ( ) log ( ) log ( )
f P f P f P f P − > −
1 2
( ) ( )
T T
w v P w v P >
1
( ) [log ( ), ,..., ]
v P f P x x =
1
[ , , ,..., ]
s
w c u u u = − − −
R[fo(Pi),fe(Pi)] > R[fo(Pj),fe(Pj)]
10
The Algorithm(Re-rank all N patters)
Cluster N patterns in k clusters User ranks k patterns Refine model Re-rank all N patterns N=aN 1 2
( ) ( )
T T
w v P w v P >
1
( ) [log ( ), ,..., ]
v P f P x x =
1
[ , , ,..., ]
s
w c u u u = − − −
Modified from www.nasa.com
SVM Black Box
– Can now rank ALL N patterns with interesting measure: R[fo(Pi),fe(Pi)] > R[fo(Pj),fe(Pj)] R[f (P),f (P)] = K[v(P),w]
11
The Algorithm (Reduce N)
- Reduce number of patterns
– Discard some patterns N=aN – a is specified by the user – Will reduce the number of patterns to present to user at end – Stop when reached the max number of iterations also specified by the user
END OF ALGORITHM ☺
– Not presented – Identical formulation to log-linear but assign a users belief probability to each transaction
Cluster N patterns in k clusters User ranks k patterns Refine model Re-rank all N patterns N=aN 1 2
( ) ( )
T T
w v P w v P >
1
1 ( ) [ ( ),..., ( )] ( )
m
x P x P f P =
1
[ ,..., ]
m
w p p =
m = number of transactions xk(P) = 1 if the transaction k contains P