Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval
Sargur Srihari University at Buffalo The State University of New York
1
Reductionist View: A Priori Algorithm and Vector-Space Text - - PowerPoint PPT Presentation
Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval Sargur Srihari University at Buffalo The State University of New York 1 A Priori Algorithm for Association Rule Learning Association rule is a representation for local
1
2
3
4
5
6
7
8
10
11
Ai1 =1
12
– where θ is an itemset pattern – and φ is an itemset pattern consisting of a single conjunct
– Given an itemset pattern θ – its frequency fr(θ) is the number of cases in the data that satisfy θ
– Conditional probability that φ is true given that θ is true
– Given a frequency threshold s, all itemset patterns that are frequent
c(θ ⇒ ϕ) = fr(θ ∧ϕ) fr(θ)
13 Basket\Item A1 A2 A3 A4 A5 t1 1 t2 1 1 1 1 t3 1 1 1 t4 1 t4 1 1 1 t5 1 1 1 t6 1 1 1 t7 1 1 1 t8 1 1 t9 1 1 t10 1 1 1
14
15
e.g., ps =0.1 want only rules that cover at least 10% of the data
e.g., pa =0.9 want only rules that are 90% accurate
16
18
19
– Algorithm performs another linear scan of the database to determine which of these sets are in fact frequent
– Cardinality of largest frequent set is quite small (relative to n) for large support values
20
21
22
23
24
25
26
27