Learning Accurate Cutset Networks by Exploiting Decomposability N. - PowerPoint PPT Presentation

Learning Accurate Cutset Networks by Exploiting Decomposability N. Di Mauro, A. Vergari, and F. Esposito Department of Computer Science, LACAM Laboratory University of Bari “Aldo Moro”, Italy 14th Conference of the Italian Association for Artificial Intelligence

G 1 X 3 Introduction G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 Tractable Probabilistic Graphical Models ◮ Probabilistic Graphical Models ◮ powerful formalism to model rich and structured domains ◮ capture independences among random variables into a graph ◮ computing exact inference in PGMs is a NP-Hard problem ◮ Tractable Probabilistic Graphical Models ◮ provide exact and efficient inference but less expressive ◮ tree-structured models, Bayesian and Markov Networks compiled into Arithmetic Circuits, and Sum-Product Networks ◮ Cutset Networks ◮ weighted probabilistic model trees ◮ OR-trees having tree-structured models as leaves ◮ non-negative weights on inner edges ◮ Inner nodes, i.e., conditioning OR nodes, are associated to random variables and outgoing branches represent conditioning on the values for those variables domains. 2 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 Cutset Networks G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 G 1 X 3 0 . 12 0 . 88 G 2 X 5 T 1 X 1 X 5 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 Given X be a set of discrete variables, a CNet is defined as follows: 1. a CLtree, with scope X , is a CNet; 2. given X i ∈ X a variable with | V al ( X i ) | = k , graphically conditioned in an OR node, a weighted disjunction of k CNets G i with same scope X \ i is a CNet, where all weights w i,j , j = 1 , . . . , k , sum up to one, and X \ i denotes the set X minus the variable X i . 3 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 Contribution G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 dCSN The dCSN algorithm ◮ avoiding decision tree heuristics ◮ choosing the best variable directly maximizing the log-likelihood ◮ complex structures penalized adopting the BIC score BIC ( �G , γ � ) = log P D ( �G , γ � ) − log M Dim ( G ) 2 ◮ Bagging in order to obtain a mixture of CNets ◮ k bootstrapped samples D i from the dataset D ◮ leading to k CNets G i ◮ resulting bagged CNet G set to a weighted sum of CNets G i k ˆ � P ( ξ : G ) = µ i P ( ξ : G i ) , i =1 where µ i = ℓ D ( �G i , γ i � ) / � k j =1 ℓ D ( �G j , γ j � ) 4 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 Cutset Networks G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 Proposition 1 (CNet log-likelihood decomposition) � � ℓ D ( �G , γ � ) = log P ( ξ [ X i ] | ξ [Pa i ]) (1) ξ ∈D i =1 ,...,n � ℓ D ( �G , γ � ) = M j log w i,j + ℓ D j ( �G j , γ G j � ) (2) j =1 ,...,k Proposition 2 (BIC decomposition) ℓ D l ( �G i , γ i � ) − ℓ D l ( �T l , θ l � ) > log M (3) 2 ◮ instead of recomputing the likelihood on the complete dataset D we can evaluate only the local improvement ◮ the decomposition of T l is independent from all other T j , j � = l being their local contributions to the global log-likelihood independent ◮ it is not significant the order we choose to decompose leaf nodes 5 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 dCSN example I G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 X 1 X 2 X 3 X 4 X 5 1 � � � 2 � � 3 � � X 1 X 5 4 � � X 3 5 � � � � X 2 X 4 6 � � � � � � 7 � � � 8 ◮ starting with a single CLTree for all variables X 1 , X 2 , X 3 , X 4 , X 5 6 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 dCSN example II G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 X 1 X 2 X 3 X 4 X 5 G 1 � � � X 5 2 � � 0 . 5 0 . 5 3 � � 4 � � X 3 X 1 X 2 X 1 5 � � � � 6 � � � X 3 X 4 X 2 X 4 � � � 7 � � � 8 ◮ checking whether there is a decomposition ◮ adding OR node on variable X 5 applied on two CLtrees with higher ll 7 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 dCSN example III G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 X 1 X 2 X 3 X 4 X 5 X 5 1 � � � 0 . 5 0 . 5 2 � � 3 � � X 3 X 1 X 3 4 � � 0 . 75 0 . 25 5 � � � � X 2 X 4 6 � � � X 1 X 1 X 2 � � � 7 � � � X 4 X 4 X 2 8 ◮ recursively apply the decomposition process ◮ adding OR node on variable X 3 applied on two CLtrees with higher ll 8 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 Experiments G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 Empirical risk for all algorithms CNet CNetP dCSN CNet-B CNetP-B dCSN-B MT MCNet NLTCS -6.11 -6.06 -6.04 -6.09 -6.02 -6.02 -6.01 -6.00 MSNBC -6.06 -6.05 -6.05 -6.06 -6.04 -6.04 -6.08 -6.04 Plants -13.24 -13.25 -13.35 -12.30 -12.38 -12.21 -12.93 -12.78 Audio -44.58 -42.05 -42.06 -42.09 -40.71 -40.17 -40.14 -39.73 Jester -61.71 -55.56 -55.30 -57.76 -53.17 -52.99 -53.06 -52.57 Netflix -65.61 -58.71 -58.57 -63.08 -57.63 -56.63 -56.71 -56.32 Accidents -30.97 -30.69 -30.17 -30.25 -30.28 -28.99 -29.69 -29.96 Retail -11.07 -10.94 -11.00 -10.99 -10.88 -10.87 -10.84 -10.82 Pumsb-star -24.65 -24.42 -23.83 -24.39 -24.19 -23.32 -23.70 -24.18 DNA -87.19 -84.93 -85.57 -90.48 -87.59 -90.66 -86.85 -85.82 Kosarek -11.04 -10.62 -11.19 -11.14 -10.97 -10.85 -10.85 -10.58 MSWeb -9.94 -9.82 -10.07 -10.07 -9.95 -9.91 -9.86 -9.79 Book -37.22 -34.69 -37.62 -37.35 -35.88 -35.62 -35.92 -33.96 EachMovie -59.19 -58.37 -58.47 -53.91 -54.22 -54.02 -54.51 -51.39 WebKB -161.16 -155.20 -162.85 -162.17 -156.79 -156.94 -157.00 -153.22 Reuters-52 -88.72 -88.55 -88.60 -85.69 -86.22 -86.89 -86.53 -86.11 BBC -262.08 -263.08 -262.08 -252.01 -251.14 -257.72 -259.96 -250.58 Ad -16.92 -16.92 -14.81 -15.94 -16.02 -13.73 -16.01 -16.68 9 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

G 1 X 3 Conclusions G 2 0 . 12 0 . 88 X 5 X 1 X 5 T 1 0 . 78 0 . 22 G 3 X 6 X 2 X 4 T 2 X 2 X 2 X 4 0 . 51 0 . 49 X 6 X 1 T 3 T 4 X 1 X 6 X 4 X 4 X 6 X 1 ◮ a new approach to learn the structure of CNets model ◮ exploiting the decomposable score and maximizing the likelihood ◮ formulating a score including the BIC criterion ◮ introducing informative priors on smoothing parameters ◮ mixtures of CNets with bagging as an alternative to EM ◮ evaluation on standard benchmarks proving the validity of our claims Future Work ◮ latent nodes such as in latent tree models ◮ (gradient) boosting Code available at http://www.di.uniba.it/~ndm/dcsn/ 10 - Learning Accurate Cutset Networks by Exploiting Decomposability, N. Di Mauro, A. Vergari, and F. Esposito

Learning Accurate Cutset Networks by Exploiting Decomposability N. - PowerPoint PPT Presentation

Learning Accurate Cutset Networks by Exploiting Decomposability N. Di Mauro, A. Vergari, and F. Esposito Department of Computer Science, LACAM Laboratory University of Bari Aldo Moro, Italy 14th Conference of the Italian Association for

Look Ma, No Latent Variables: Accurate Cutset Networks via Compilation Tahrima Rahman, Shasha

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

TAKING DATA ON FORM TAKING DATA ON FORM- -WOUND WOUND MOTORS MOTORS By : Manuel Manny

Finding Embedded Multi-Commodity Flow Submatrices in MIPs and Separation of Cutset Inequalities

ACCURATE FLOATING-POINT SUMMATION IN CUB URI VERNER Summer intern OUTLINE Who needs accurate

Di ff erentially-Private Batch Query Answering Exploiting the Workload vs. Exploiting the Data

Exploiting Private Local Exploiting Private Local Memories to Reduce the Memories to Reduce the

Exploiting carbon and nitrogen Exploiting carbon and nitrogen compounds for enhanced energy

Exploiting Extreme Processor Counts on the Cray Exploiting Extreme Processor Counts on the Cray

Visualization of Geant4 Data: Exploiting Component Visualization of Geant4 Data: Exploiting

Hacking Browser's DOM Exploiting Ajax and RIA Exploiting Ajax and RIA Shreeraj Shah

Register Packing Register Packing Exploiting Narrow- -Width Operands Width Operands Exploiting

Exploiting Level- Exploiting Level -of of- -Detail Perception Detail Perception Multiple

C3 B: Exploiting the Num erous C3 B: Exploiting the Num erous Possibilities W eb Technology

Exploiting Exploiting Back-End Back-End APIs APIs fo for Feasible easible Ontology-Based

CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search on Mobile Phones Michael

Growing Up Geek Bob Paulin @bobpaulin Sarah Johnson @johnsons531 Tim Steele @whoistimsteele Public

How to Keep Subscribers Engaged with Your Brand via Personalized Content Dian ana P Primeau au

Session Title DIANA PRIMEAU Director of Member Services CNET (CBS Interactive) Diana Primeau

Configuration management Jack Fowler / Steve Kettell LBNC Feb 20, 2018 Charge Point Provide

Do Social Networks Improve e-Commerce? A Study on Social Marketplaces 1 GAYATRI SWAMYNATHAN,

Ubiquitous and Mobile Computing CS 528: Visage: A Face Interpretation Engine for Smartphone

Satellite Edge Vaibhav Bhosale Ketan Bhardwaj Ada Gavrilovska LEO Satellites Image Credits:

Mixed Membership Markov Models for Unsupervised Conversation Modeling MICHAEL J. PAUL JOHNS