1 introduction co occurrences frequent item tree
play

1 Introduction Co-Occurrences Frequent Item Tree Association rule - PowerPoint PPT Presentation

1 Introduction Co-Occurrences Frequent Item Tree Association rule mining FP Growth Ying Xu COFI tree mining yx2@cs.ualberta.ca (COFI-tree Mining: A New Approach to Pattern Growth with Reduced Candidacy Generation Hajj,


  1. 1 Introduction Co-Occurrences Frequent Item Tree � Association rule mining � FP Growth Ying Xu 徐莹 � COFI tree mining yx2@cs.ualberta.ca (COFI-tree Mining: A New Approach to Pattern Growth with Reduced Candidacy Generation Hajj, Zaiane) 2 1 2 FP-tree 2 FP-tree � Frequent item header that contains item TID Items TID Items names and pointer to the first node in FP tree. T1 A G D C B T10 C F G R T2 B C H E D T11 A D B H I � Prefix tree T3 B D E A M T12 D E B K L � Each node contains the item name, T4 C E F A N T13 M D C G frequency and pointer to another node of the T5 A B N T14 C F same kind. T6 A C G T15 B D E F I T7 A C H I G T16 J E B A D T8 L E F K B T17 A K E F C 3 4 T9 F M N T18 C D L B A

  2. 2 FP-tree 2 FP-tree Root � Min-support>4 � Mining A : 11 B : 4 C : 3 Header Table C: 4 B: 6 Item Frequenc y head A 11 C: 1 D : 1 C: 2 D : 2 B 10 D: 1 C 10 D: 3 E: 2 D 9 E : 2 D: 2 E: 1 F: 2 E: 1 E 8 F: 1 F: 1 E: 2 F 7 F : 1 F : 1 5 6 2 FP-tree 3. COFI-tree � Drawback: � Prunning � global frequent/local non-frequent property: memory space usage the itemset that is global frequent but not local frequent with respect to the item A of the A-COFI- tree It is an anti-monotone property 7 8

  3. 3. COFI-tree 3. COFI-tree Root � Frequent item header that contains items � FP-tree A : 11 B : 4 C : 3 names which are frequent with respect to the Header Table C: 4 B: 6 specific item ascending ordered by global Item Frequenc y head A 11 C: 1 frequency. D : 1 C: 2 D : 2 B 10 D: 1 � Prefix tree C 10 D: 3 E: 2 D 9 E : 2 D: 2 E: 1 F: 2 E: 1 � Each node contains the item name, E 8 F: 1 F: 2 E: 2 F 7 frequency, participation counter and pointer F : 1 F : 1 to another node of the same kind. 9 10 3. COFI-tree 3. COFI-tree Root F (7 0) A : 11 B : 4 C : 3 E 4 Header Table C: 4 B: 6 D 2 Item Frequenc y head C: 1 A 11 D : 1 C 4 C: 2 D : 2 B 10 D: 1 C 10 D: 3 E: 2 B 2 D 9 E : 2 D: 2 E: 1 F: 2 E: 1 E 8 F: 1 A 3 F: 1 E: 2 F 7 F : 1 F : 1 11 12

  4. 3. COFI-tree 3. COFI-tree � E-COFI-tree � Mining E(8 0) (Support>4) E(8 0) Pattern E D B 5 D 5 D 5 D(5 0) B(1 0) D 5 D(5 0) B(1 0) E D 5 B 6 C 3 E B 5 B 6 E D B 5 B 6 B(5 0) B(5 0) A 4 13 14 3. COFI-tree 3. COFI-tree � Mining � Mining E(8 5 ) Pattern E(8 5) Pattern E D B 5 E B 1 D 5 D(5 5 ) B(1 0) E D 5 D(5 5) B(1 0) D 5 E D 5 E B 5 B 6 E B 6 E D B 5 B 6 E D B 5 B(5 5 ) B(5 5) 15 16

  5. 3. COFI-tree 3. COFI-tree � Mining � Mining E(8 6 ) E(8 6) Pattern Pattern E B 1 E D 0 D 5 D(5 5) B(1 1 ) D 5 D(5 5) B(1 1) E D 5 E B 6 B 6 B 6 E D B 5 B(5 5) B(5 5) 17 18 4 Algorithm 4 Algorithm � Algorithm COFI: � Algorithm COFI: Input: modified FP-Tree, a minimum support threshold 2.3.2 Items on C form a prefix of the (A)-COFI-tree. Output: Full set of frequent patterns 2.3.3 If the prefix is new then Set frequency-count = frequency of (A) Method: node and participationcount = 0 for all nodes in the path 1. A = the least frequent item on the header table of FP-Tree Else 2. While (There are still frequent items) do 2.3.4 Adjust the frequency-count of the already exist part of the path. 2.1 count the frequency of all items that share item (A) a path. 2.3.5 Adjust the pointers of the Header list if needed Frequency of all items that share the same path are the same as of 2.3.6 find the next node for item A in the FP-tree and go to 2.3.1 the frequency of the (A) items 2.4 MineCOFI-tree (A) 2.2 Remove all non-locally frequent items for the frequent list 2.5 Release (A) COFI-tree of item (A) 2.6 A = next frequent item from the header table 2.3 Create a root node for the (A)-COFI-tree with both frequency-count 3. Goto 2 and participation-count = 0 2.3.1 C is the path of locally frequent items in the path of item A to 19 20

  6. 4 Algorithm 4 Algorithm � Function: MineCOFI-tree (A) � Function: MineCOFI-tree (A) 1. nodeA = select next node //Selection of nodes starts with the node of 3. Goto 2 most globally frequent item and following its chain, then the next less 4. Based on support threshold remove non-frequent patterns frequent item with its chain, until we reach the least frequent item in the from A Candidate List. Header list of the (A)-COFI-tree 2. while there are still nodes do 2.1 D = set of nodes from nodeA to the root 2.2 F = nodeA. frequency - nodeA. participationCount 2.3 Generate all Candidate patterns X from items in D. Patterns that do not have A will be discarded. 2.4 Patterns in X that do not exist in the A-Candidate List will be added to it with frequency = F otherwise just increment their frequency with F 2.5 Increment the value of participationCount by F for all items in D 2.6 nodeA = select next node 21 22 5 Experimental Studies Questions? 23 24

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend