A MapReduce-based architecture for rule matching in production - PowerPoint PPT Presentation

A MapReduce-based architecture for rule matching in production system Bin Cao 2010.12.1

Agenda  Introduction  Related Work  Architecture  Definition  Implementation  Experimental evaluation  Conclusion and future work

Agenda  I ntroduction  Related Work  Architecture  Definition  Implementation  Experimental evaluation  Conclusion and future work

Introduction  Business rules can improve people’s business by providing a level of agility and flexibility.  Production system (rule engine) The m echanism of a production system

Introduction  Most of the processing time is consumed by matching  The efficiency will drop with the increase of rules and facts.  Rete algorithm and its improvements  But, the limitation will not disappear because of the bounded capability of one single computer.

Introduction  MapReduce programming model  To perform Rete concurrently in different computer

Agenda  Introduction  Related W ork  Architecture  Definition  Implementation  Experimental evaluation  Conclusion and future work

Related Work  Parallel firing of rules  Toru Ishida. Parallel, Distributed and Multi-Agent Production Systems.  Parallel but not distributed  Anoop Gupta, Charles L.Forgy. Parallel OPS5 on the Encore Multimax  Parallel and distributed but no Rete used  C. Wu, L. Lai, Y. Chang. Parallelizing CLIPS-based Expert Systems by the Permutation Feature of Pattern Matching.

Architecture

Architecture  Build stage  Rules are decomposed into sub-rules  Workers compile the sub-rules into a Rete net  Map stage  Facts are passed to workers on demand  Facts will match with rules.  Reduce stage  Reduce the results generated from map stage

I f …( LHS ) Then Definition …( RHS )  Definition 1 ( Rule)  A rule, denoted R , is a tuple(LHS,RHS), where:  LHS is a finite set of conditions in a rule, called the left hand side.  RHS is a finite set of actions in a rule, called the right hand side.  Definition 2 ( Sub-rule)  Let S ∈ LHS be a sub-rule of rule R ( Definition 1 ) , iff, LHS belongs to R .

Definition  Definition 3 ( Rule base)    S S < > < > 1,1 1, n   =      A matrix could be viewed as a M  ( m n S , , )     S S  < > < > m ,1 m n , rule base, where:  m represents for the number of rules.  n represents for the maximum number of sub-rules contained in each of rules above.  S represents for sub-rule (as defined in Definition 2 ) If we denote: 1 ≤ r ≤ m: for rule ID in rule base M .  1 ≤ s ≤ n: for sub-rule ID in certain rule of rule base M .  Then, S < r, s> represents for sub-rule S identified by s in rule R identified by r in rule base M

Definition  R r = ( S < r, 1 > … S < r, s> … S < r, n> ) shows that rule R, which identified by r in rule base, contains n sub-rules S < r, s> ( 1 ≤ s ≤ n) . If the number of sub-rules in rule R r is smaller than n, we equate the rest elements in R r with null.  Definition 4 ( Firing paradigm )  Two paradigms for firing R r = ( S < r, 1 > … S < r, s> … S < r, n> ) are defined as following:  AND: .rule R r can be fired if all the elements in R r were matched simultaneously.  OR: rule R r can be fired if a group of elements in R r were matched simultaneously.

Agenda  Introduction  Related Work  Architecture  Definition  I m plem entation  Experimental evaluation  Conclusion and future work

Implementation  Build: preparations for rule m atching  Forming a rule base M : decomposing rules into sub- rules  Distributing the sub-rules to different workers  Parsing sub-rules to a Rete-net A Rete netw ork

Implementation  Map: rule m atching Function Map ( Queue facts, List index_list) { / * Filter and m atch facts w ith Rete algorithm . * / m atched_ m ap ( sub-rules_index, correspond_facts) = m atch_ fact_w ith_ Rete ( facts, index_ list) ; / * According to the form er definitions, classify and m erge the m atched_ m ap by index.* / classified_m ap ( r, m ap( s, correspond_facts) ) = classify_w ith_index ( m atched_m ap) ; / * Save the result. * / store ( classified_ m ap) ; }

Implementation  Reduce: responsible for correct transference of rule Function Reduce ( RuleI D r, List m atched_ subrule_ list) { / * Classify and m erge m atched sub-rule list by s according to Definition 3 .* / m erged_ subrule_ m ap ( s, corresponding_ subrule_ list) = m erge_ w ith_ s ( m atched_ subrule_ list) ; / * Get firing paradigm of rule Rr.* / sw itch ( get_ firing_ paradigm ( r) ) / / case AND / * Judge w hether the size of m erged_subrule_m ap and sub-rule list of rule Rr in rule base is sam e. * / if ( equal ( m erged_ subrule_ m ap) ) transfer( r) ; / / transfer the rule Rr to agenda of the m aster case OR / * Judge w hether there exists one or several group of elem ents in Rr w as or w ere m atched * / if ( exist_ group_ m atched ( r) ) transfer ( r) ; / / transfer the rule Rr to agenda of the m aster }

Agenda  Introduction  Related Work  Architecture  Definition  Implementation  Experim ental evaluation  Conclusion and future work

Experimental Evaluation  Goal: to com pare w ith sequence process Master and Reduce W orker CPU Intel Core2 Duo P8400@2.26GHz Mem ory 3.00GB HD SATA 250GB OS Windows 7 Ultimate Server Apache-tomcat-6.0.18 LAN 100Mbps Map W orker CPU Intel Core2 Duo E7400@2.8GHz Mem ory 2.00GB HD SATA 250GB OS Windows 7 Ultimate Server Apache-tomcat-6.0.18 LAN 100Mbps The test environm ent configuration

Experimental Evaluation  At the bottom of the line, the MapReduce approach gains a little bit longer duration.  Maybe the network transmission of matched result cost more time than matching process.

Experimental Evaluation  As the number of rules increased, the gap between two lines is widening. The advantage of MapReduce approach appears more and more apparently.

Experimental Evaluation  W hy the MapReduce approach does not double the perform ance?  The heavy netw ork transm ission.  Different load of each w orker.  Different com plexity of facts and rules.  …

Experimental Evaluation  Nevertheless, the general trend is obvious: MapReduce process gains a less duration than sequential process when given the same number of rules, and with the increasing number of rules the MapReduce approach shows more efficient.

Agenda  Introduction  Related Work  Architecture  Definition  Implementation  Experimental evaluation  Conclusion and future w ork

Conclusion and Future work  Analysis coming from the relevant simulations confirm the efficiency of our architecture.  To achieve better performance:  How to compress the transferring data?  How to rescue from the dead or suspended worker?  How to utilize the parallel rules firing strategies?  …

Thank You~ Bin Cao 2 0 1 0 .1 2 .1

A MapReduce-based architecture for rule matching in production - PowerPoint PPT Presentation

A MapReduce-based architecture for rule matching in production system Bin Cao 2010.12.1 Agenda Introduction Related Work Architecture Definition Implementation Experimental evaluation Conclusion and future work

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

Cutting MapReduce Cost with Spot Market Huan Liu Accenture Technology Labs Why spot market? 2

MapReduce Andrew Crotty Alex Galakatos What is MapReduce? MapReduce is a framework for:

Mrs: MapReduce for Scientific Computing in Python Andrew McNabb, Jeff Lund , and Kevin Seppi

Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Lecture 16: Overview of MapReduce MapReduce is a parallel, distributed programming model and

Hadoop Map Reduce 1 MapReduce 2-in-1 A programming paradigm A query execution engine A kind

MapReduce 320302 Databases & Web Services (P. Baumann) 1 Why MapReduce? Motivation: Large

MapReduce 340151 Big Data & Cloud Services (P. Baumann) 1 Overview MapReduce : the

COMP9313: Big Data Management MapReduce Data Structure in MapReduce Key-value pairs are the

Lecture 36: MapReduce Frameworks [Adapted from slides by John DeNero and MapReduce is a

Laboratory Session: MapReduce Algorithm Design in MapReduce Pietro Michiardi Eurecom Pietro

Matching Bipartite Matching Input Given a (undirected) graph G = ( V , E ) Input Given a bipartite

Azure MapReduce Thilina Gunarathne Salsa group, Indiana University Agenda Recap of Azure

Music Informatics Alan Smaill Feb 15 2018 Alan Smaill Music Informatics Feb 15 2018 1/1

Decision Making Marco Chiarandini Department of Mathematics & Computer Science University of

Reasoning Systems Reasoning Systems Chapter 4 Chapter 4 Dr Ahmed Rafea Rafea Dr Ahmed

CHR - a common platform for rule-based approaches Prof. Dr. Thom Fr uhwirth | June 2010 | Uni

A rule triggering system for automatic text-to-Sign translation Michael Filhol Mohamed Nassime

Explanation The system must be able to justify that its answer is correct, particularly when it

The LHCb Experiment Control System Clara Gaspar, May 2016 Control System Scope In charge of

Reasoning over Stream Data Using a Rule-Based Programming Language Joaqun Arias 1 , 2 Manuel

A MapReduce-based architecture for rule matching in production - PowerPoint PPT Presentation

A MapReduce-based architecture for rule matching in production system Bin Cao 2010.12.1 Agenda Introduction Related Work Architecture Definition Implementation Experimental evaluation Conclusion and future work

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

Cutting MapReduce Cost with Spot Market Huan Liu Accenture Technology Labs Why spot market? 2

MapReduce Andrew Crotty Alex Galakatos What is MapReduce? MapReduce is a framework for:

Mrs: MapReduce for Scientific Computing in Python Andrew McNabb, Jeff Lund , and Kevin Seppi

Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Lecture 16: Overview of MapReduce MapReduce is a parallel, distributed programming model and

Hadoop Map Reduce 1 MapReduce 2-in-1 A programming paradigm A query execution engine A kind

MapReduce 320302 Databases &amp; Web Services (P. Baumann) 1 Why MapReduce? Motivation: Large

MapReduce 340151 Big Data &amp; Cloud Services (P. Baumann) 1 Overview MapReduce : the

COMP9313: Big Data Management MapReduce Data Structure in MapReduce Key-value pairs are the

Lecture 36: MapReduce Frameworks [Adapted from slides by John DeNero and MapReduce is a

Laboratory Session: MapReduce Algorithm Design in MapReduce Pietro Michiardi Eurecom Pietro

Matching Bipartite Matching Input Given a (undirected) graph G = ( V , E ) Input Given a bipartite

Azure MapReduce Thilina Gunarathne Salsa group, Indiana University Agenda Recap of Azure

Music Informatics Alan Smaill Feb 15 2018 Alan Smaill Music Informatics Feb 15 2018 1/1

Decision Making Marco Chiarandini Department of Mathematics &amp; Computer Science University of

Reasoning Systems Reasoning Systems Chapter 4 Chapter 4 Dr Ahmed Rafea Rafea Dr Ahmed

CHR - a common platform for rule-based approaches Prof. Dr. Thom Fr uhwirth | June 2010 | Uni

A rule triggering system for automatic text-to-Sign translation Michael Filhol Mohamed Nassime

Explanation The system must be able to justify that its answer is correct, particularly when it

The LHCb Experiment Control System Clara Gaspar, May 2016 Control System Scope In charge of

Reasoning over Stream Data Using a Rule-Based Programming Language Joaqun Arias 1 , 2 Manuel

MapReduce 320302 Databases & Web Services (P. Baumann) 1 Why MapReduce? Motivation: Large

MapReduce 340151 Big Data & Cloud Services (P. Baumann) 1 Overview MapReduce : the

Decision Making Marco Chiarandini Department of Mathematics & Computer Science University of