Look Ma, No Latent Variables: Accurate Cutset Networks via - - PowerPoint PPT Presentation

look ma no latent variables accurate cutset networks via
SMART_READER_LITE
LIVE PREVIEW

Look Ma, No Latent Variables: Accurate Cutset Networks via - - PowerPoint PPT Presentation

Look Ma, No Latent Variables: Accurate Cutset Networks via Compilation Tahrima Rahman, Shasha Jin, Vibhav Gogate The University of Texas at Dallas ICML 2019 What is the paper about? Probabilistic Graphical Models ( PGMs ) vs Latent Tractable


slide-1
SLIDE 1

Look Ma, No Latent Variables: Accurate Cutset Networks via Compilation

Tahrima Rahman, Shasha Jin, Vibhav Gogate

The University of Texas at Dallas

ICML 2019

slide-2
SLIDE 2

What is the paper about?

Probabilistic Graphical Models (PGMs) vs Latent Tractable Probabilistic Models (LTPMs):

Criteria Who wins? Why? Test set log-likelihood PGMs Expressive model Marginal (MAR) estimates LTPMs Reliable Exact Inference Maximum-a- posteriori (MAP) estimates No Clear Winner Both use unreliable approximate inference approaches

Tractability helps achieve better marginal predictions even though the model fit is inferior. Can we do the same for MAP inference?

slide-3
SLIDE 3

What is the paper about?

Probabilistic Graphical Models (PGMs) vs Latent Tractable Probabilistic Models (LTPMs):

Criteria Who wins? Why? Test set log-likelihood PGMs Expressive model Marginal (MAR) estimates LTPMs Reliable Exact Inference Maximum-a- posteriori (MAP) estimates No Clear Winner Both use unreliable approximate inference approaches

Tractability helps achieve better marginal predictions even though the model fit is inferior. Can we do the same for MAP inference?

◮ YES: If we use MAP-tractable cutset networks and im-

prove their fit.

slide-4
SLIDE 4

Cutset Networks

X1 X2 X3 X4 X5 X6 X4 X5 X6 X3 X4 X5 X6 X4 X2 X3 X5 X6 X5 X2 X3 X6 X6 X2 X3 0.7 0.1 0.9 0.3 0.6 0.4 0.2 0.8 0.25 0.75 T1 T2 T3 T4 T5 T6

◮ OR tree (probabilistic decision trees) with tree Bayesian net-

works at leaves

◮ MAP tractable and Interpretable unlike sum-product networks

slide-5
SLIDE 5

Improving Accuracy of Cutset Networks

Issue: When learned just from data the test set LL score of cutset networks is much smaller than latent variable models Our approach to improve accuracy: Compile cutset networks from a more accurate latent variable model.

  • 1. First learn a latent variable model M that admits tractable

posterior marginal inference

  • 2. Compute the sufficient statistics used by the classic structure

learning algorithm for cutset networks by performing marginal inference over M

slide-6
SLIDE 6

Experiments

Hypothesis: If we improve the fit (test set LL) of cutset networks, because of tractability, they will yield better MAP estimates as compared with latent variable models Experiments verify our hypothesis Dataset Before Compilation After Compilation DNA

  • 78.33
  • 74.25

Movie

  • 41.38
  • 38.14

BBC

  • 160.27
  • 158.26

Table: Conditional Log-Likelihood of the MAP Estimates

More details at the poster...