Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 - PowerPoint PPT Presentation

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 Prof. Dr. Johannes Fürnkranz Frederik Janssen

Overview � REP-Based Algorithms � RIPPER � Variants � Evaluation � Summary 2

REP-Based Algorithms REP I-REP / I-REP2 / I-REP* RIPPER RIPPER k Learn a Rule Set Learn a Rule Set Split Training Data (I-REP*) Learn a Rule Set Split Training Data (I-REP*) Optimize the Rule Set Learn a Rule Set Learn a Rule Get a Rule Prune the Rule Prune the Rule Set k times Optimize the Rule Set Generate Variants Check the Rule Choose One Variant * k means the number of optimization iterations Learn Rules (I-REP*) 3

RIPPER Iterative Optimization of Rule Sets RIPPER Candidate Rule Growing Phase Pruning Phase Old Rule Growing a new rule The pruning heuristic is guided to minimize the error of the single from an empty rule Learn a Rule Set rule (I-REP*) Replacement See Old Rule The pruning heuristic is guided to minimize the error of the entire Optimize the Rule Set rule set Revision Further growing See Replacement Get a Rule the given Old Rule n times Generate Variants Selection among the candidate rules based on Choose One Variant Minimum Description Length (MDL) Learn Rules (I-REP*) Old Rule Replacement Selection Criterion Best Rule * n means the number of rules in the rule set Revision 4

1 st Variant New Pruning Method Candidate Rule Abridgment RIPPER Learn a Rule Set Rule: Class = A: C_1, C_2, C_3, C_4 (I-REP*) Original Pruning Method Optimize the Rule Set R_1: Class = A: C_1, C_2, C_3 (after 1. Iteration) Get a Rule R_2: Class = A: C_1, C_2 (after 2. Iteration) Example R_3: Class = A: C_1 (after 3. Iteration) n times Generate Variants New Pruning Method Choose One Variant R_1’: Class = A: C_2, C_3, C_4 R_2’ Class = A: C_1, C_3, C_4 Learn Rules (I-REP*) R_3’: Class = A: C_1, C_2, C_4 R_4’: Class = A: C_1, C_2, C_3 (after 1. Iteration) * n means the number of rules in the rule set 5

1 st Variant Search Space 6

2 nd Variant Simplified Selection Criterion Accuracy instead of MDL RIPPER Learn a Rule Set MDL (RS’) = DL (RS’) – Potentials (RS’) (I-REP*) ∑ Potential R R i ∈ RS Potentials (RS’) = ( ' ) ' { ' } i Optimize the Rule Set Potential R ( ' ) calculates the potential of decreasing the DL of the rule sets i R if the rule is deleted ' Get a Rule i + tp tn n times Generate Variants = Accuracy R ( ) ∈ R { OldRule, Replacemen t, Revision } i + P N i Choose One Variant tp means the number of positive examples covered by the relevant rule tn means the number of negative examples that are not covered by the relevant rule Learn Rules (I-REP*) P and N mean the total number of positive and negative examples in the training set * n means the number of rules in the rule set 7

Evaluation � Data Sets 20 real data sets selected from the UCI repository � 9 data sets (type categorical) � 4 data sets (type numerical) � 7 data sets (type mixed) � Evaluation Method 10-fold stratified cross-validation � run 10 times on each data set � training set 90% � testing set 10% 8

Evaluation RIPPER (SeCoRIP) Algorithm AvgCorr. Profit � The correctness of rule sets is SeCoRIP_0 86.19 - increased (the percentage of the correctly classified examples in SeCoRIP_1 87.56 1.59% the testing set) SeCoRIP_2 87.61 0.06% � The size of rule set is decreased � The number of conditions in each SeCoRIP_3 87.53 -0.08% rule is decreased SeCoRIP_4 87.64 0.12% SeCoRIP_5 87.45 -0.21% − AvgCorr AvgCorr + = (i 1) i i ∈ Profit { 0 , 1 , 2 , 3 , 4 } + (i 1) AvgCorr i 9

Evaluation RIPPER (Convergence of SeCoRIP) Group A Group B The maximal value mainly appears at The maximal value appears at the x-axis � � ∈ the x-axis Optimizations { 1 , 2 } = Optimizations 0 These points converge to a definite point � These points converge to a definite point � The relevant data sets contain more � The relevant data sets contain only � nominal attributes than numeric ones nominal attributes 10

Evaluation RIPPER (Convergence of SeCoRIP) Group C Group D The maximal value mainly appears at � The points of the lines show a upward � ∈ ∈ the x-axis Optimizations trend at the x-axis Optimizations { 5 , 6 , 7 } { 8 , 9 , 10 } These points converge to a definite point � The signal of convergence is not � observable � The relevant data sets contain more numeric attributes than nominal ones 11

Evaluation RIPPER (Convergence of SeCoRIP) � N (nominal attributes) > N (numerical attributes) � the accuracy of the optimized rule sets often converge to a definite value with the increasing of the number of optimization iterations � the definite value here is usually not the maximum or minimum value obtained so far � N (nominal attributes) < N (numerical attributes) � The value of the correctness keeps an upward trend with the increasing of the number of optimization iterations � The signal of convergence cannot be obviously detected 12

Evaluation RIPPER (SeCoRIP) AvgCond. Algorithm AvgRules. in one Rule � The correctness of rule sets is SeCoRIP_0 8.75 1.94 increased � The size of rule set is decreased SeCoRIP_1 7.35 1.65 (the sum of all rules in the SeCoRIP_2 7.25 1.69 constructed rule sets) � The number of conditions in each SeCoRIP_3 7.40 1.73 rule is decreased (the sum of all SeCoRIP_4 7.55 1.73 conditions / the size of rule set) SeCoRIP_5 7.50 1.73 13

Evaluation 1 st Variant (SeCoRIP*) � The new pruning method will have no obvious effect on the rule sets whose rules contain too few conditions � Sometimes the constructed Abridgement is the same as the candidate rule Revision or even the original Old Rule R’: Class = A: C_1 R: Class = A: C_1, C_2 R’: Class = A: C_1, C_2 � The correctness of the rule sets can be well improved when the relevant rules normally contain more than three conditions 14

Evaluation 2 nd Variant (SeCoRIP’) ’ 15

Evaluation 2 nd Variant (SeCoRIP’) Compare to SeCoRIP: The correctness of the constructed AvgCond. � Algorithm AvgRules. in one Rule rule sets are often worse SeCoRIP_0 The difference can be reduced with 8.75 1.94 � ’ the increasing of the number of SeCoRIP_1 7.05 1.70 ’ optimization iterations SeCoRIP_2 7.00 1.72 ’ Several data sets cannot be well � SeCoRIP_3 7.25 1.74 processed ’ SeCoRIP_4 7.05 1.74 The number of rules and conditions � ’ SeCoRIP_5 can also be decreased 7.25 1.77 ’ 16

Summary � RIPPER (postprocessing phase) � The correctness of rule sets is increased � The results often converge to a definite value � Better handling the data sets which contain more numeric attributes � The number of rules and conditions is decreased � 1 st Variant (new pruning method) � Not suitable for the rule sets whose rules contain too few conditions � Taking positive effect on the rule sets whose rules contain sufficient number of conditions � 2 nd Variant (simplified selection criterion) � Remaining the features of the original version � The results are not as good as the original version � The original selection criterion MDL is not easily replaceable 17

Thank you for your attention! 18

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 - PowerPoint PPT Presentation

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 Prof. Dr. Johannes Frnkranz Frederik Janssen Overview REP-Based Algorithms RIPPER Variants Evaluation Summary 2 REP-Based Algorithms REP I-REP / I-REP2 /

Basic Techniques II: Iterative Compression Marek Cygan Institute of Informatics University of

Chapter 12: Iterative Methods ES 240: Scientific and Engineering Computation. Iterative Methods

Development Figures are from : Agile and Iterative Development: A Manager's Guide, Craig

MATH 105: Finite Mathematics 6-1: Sets Prof. Jonathan Duncan Walla Walla College Winter

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets Instructor: Shaddin Dughmi

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Sets Instructor: Shaddin Dughmi

Rule Changes - Non rule change year Review of 2017 rule changes - just the easy to forgot

Common Rule Advanced Notice of Proposed Rulemaking (ANPRM) IRB Investigator Advanced Notice

2nd RULE: You MUST TALK about BOOK CLUB. 2nd RULE: You DO NOT talk about 3rd RULE: PERSEVERE -- If

Rule #1: Have a takeaway. Rule #2: Keep It Simple. Rule #3: Repetition is Good. Rule #4: Be

Counting Rules, etc Product Rule Generalized Product Rule Division Rule Bijection

Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based

Exploiting Phase Inter-Dependencies for Faster Iterative Compiler Optimization Phase Order

Surfing: Iterative Optimization Over Incrementally Trained Deep Networks Ganlin Song, Zhou Fan,

Languages and Regular expressions Lecture 2 1 Strings, Sets of Strings, Sets of Sets of

Sets Sets A Set is an abstract data type representing an unordered Sets are unordered and

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( UW ) Craig Knoblock

Cyber@UC Meeting 42 CEH Cryptography If Youre New! Join our Slack ucyber.slack.com SIGN

MLIC: A MaxSAT-Based framework for learning interpretable classification rules Dmitry Malioutov 1

Compatibility, coherence and the running intersection property Enrique Miranda Marco Zaffalon

Lecture 3 - Passwords and Authentication CMPSC 443 - Spring 2012 Introduction Computer and

7/13/18 D IV ERSIT Y & IN C LU SIO N Jill Ripper D irector , T alent Acquisition H OW

Reasoning Analytically About Password-Cracking Software Enze Alex Liu, Amanda Nakanishi,

End result: C4.5 (Quinlan) Best-known and (probably) most widely-used learning algorithm

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 - PowerPoint PPT Presentation

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 Prof. Dr. Johannes Frnkranz Frederik Janssen Overview REP-Based Algorithms RIPPER Variants Evaluation Summary 2 REP-Based Algorithms REP I-REP / I-REP2 /

Basic Techniques II: Iterative Compression Marek Cygan Institute of Informatics University of

Chapter 12: Iterative Methods ES 240: Scientific and Engineering Computation. Iterative Methods

Development Figures are from : Agile and Iterative Development: A Manager's Guide, Craig

MATH 105: Finite Mathematics 6-1: Sets Prof. Jonathan Duncan Walla Walla College Winter

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets Instructor: Shaddin Dughmi

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Sets Instructor: Shaddin Dughmi

Rule Changes - Non rule change year Review of 2017 rule changes - just the easy to forgot

Common Rule Advanced Notice of Proposed Rulemaking (ANPRM) IRB Investigator Advanced Notice

2nd RULE: You MUST TALK about BOOK CLUB. 2nd RULE: You DO NOT talk about 3rd RULE: PERSEVERE -- If

Rule #1: Have a takeaway. Rule #2: Keep It Simple. Rule #3: Repetition is Good. Rule #4: Be

Counting Rules, etc Product Rule Generalized Product Rule Division Rule Bijection

Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based

Exploiting Phase Inter-Dependencies for Faster Iterative Compiler Optimization Phase Order

Surfing: Iterative Optimization Over Incrementally Trained Deep Networks Ganlin Song, Zhou Fan,

Languages and Regular expressions Lecture 2 1 Strings, Sets of Strings, Sets of Sets of

Sets Sets A Set is an abstract data type representing an unordered Sets are unordered and

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( UW ) Craig Knoblock

Cyber@UC Meeting 42 CEH Cryptography If Youre New! Join our Slack ucyber.slack.com SIGN

MLIC: A MaxSAT-Based framework for learning interpretable classification rules Dmitry Malioutov 1

Compatibility, coherence and the running intersection property Enrique Miranda Marco Zaffalon

Lecture 3 - Passwords and Authentication CMPSC 443 - Spring 2012 Introduction Computer and

7/13/18 D IV ERSIT Y &amp; IN C LU SIO N Jill Ripper D irector , T alent Acquisition H OW

Reasoning Analytically About Password-Cracking Software Enze Alex Liu, Amanda Nakanishi,

End result: C4.5 (Quinlan) Best-known and (probably) most widely-used learning algorithm

7/13/18 D IV ERSIT Y & IN C LU SIO N Jill Ripper D irector , T alent Acquisition H OW