Loop Series and Bethe Variational Bounds in Attractive Graphical - - PowerPoint PPT Presentation

loop series and bethe variational bounds in attractive
SMART_READER_LITE
LIVE PREVIEW

Loop Series and Bethe Variational Bounds in Attractive Graphical - - PowerPoint PPT Presentation

Loop Series and Bethe Variational Bounds in Attractive Graphical Models Erik Sudderth Electrical Engineering & Computer Science University of California, Berkeley Martin Wainwright Joint work with Alan Willsky Loopy BP and Spatial


slide-1
SLIDE 1

Loop Series and Bethe Variational Bounds in Attractive Graphical Models

Erik Sudderth

Electrical Engineering & Computer Science University of California, Berkeley Martin Wainwright Alan Willsky

Joint work with

slide-2
SLIDE 2

Loopy BP and Spatial Priors

Dense Stereo Reconstruction (Sun et. al. 2003) Image Denoising

(Felzenszwalb & Huttenlocher 2004)

Segmentation & Object Recognition

(Verbeek & Triggs 2007)

slide-3
SLIDE 3

What do these models share?

Dense Stereo fMRI Analysis

pairwise energies are attractive to encourage spatial smoothness

Kim et. al. 2000

slide-4
SLIDE 4

Outline

Graphical Models & Belief Propagation Pairwise Markov random fields Variational methods & loopy BP Binary Markov Random Fields Attractive pairwise interactions Loop series expansion of the partition function Bounds & the Bethe Approximation Conditions under which BP provides bounds Empirical comparison to mean field bounds

slide-5
SLIDE 5

Pairwise Markov Random Fields

set of nodes representing random variables set of edges connecting pairs of nodes, inducing dependence via positive compatibility functions normalization constant or partition function

slide-6
SLIDE 6

Why the Partition Function?

  • Sensitivity of physical systems to external stimuli

Statistical Physics Hierarchical Bayesian Models

  • Marginal likelihood of observed data
  • Fundamental in hypothesis testing & model selection

Cumulant Generating Function

  • For exponential families, derivatives with respect

to parameters provide marginal statistics PROBLEM: Computing Z in general graphs is intractable

slide-7
SLIDE 7

Gibbs Variational Principle

Entropy Average Energy All Joint Distributions Negative Gibbs Free Energy

  • Mean field methods optimize bound over a

restricted family of tractable densities

  • Provide lower bounds on Z
slide-8
SLIDE 8

Belief Propagation in Trees

  • Belief propagation (BP) is a message passing

algorithm that infers this reparameterization

Exact Marginals Tree structure leads to a simplified representation

  • f the exact

variational problem Marginal Entropies Mutual Information subject to

slide-9
SLIDE 9

Bethe Approximations & Loopy BP

  • Fixed points of loopy BP also correspond to

reparameterizations of (Wainwright et. al. 2001)

Pseudo- Marginals Bethe variational approximation parameterized by pseudo- marginals which may be globally inconsistent subject to Yedidia, Freeman, & Weiss 2000

slide-10
SLIDE 10

When is Loopy BP Effective?

  • Turbo codes & low density parity check (LDPC) codes
  • For long block lengths, graph becomes locally tree-like, and

BP accurate with high probability Graphs with Long Cycles

  • Existing theory does not explain empirical effectiveness
  • We will show that the Bethe approximation lower bounds

the true partition function for a family of attractive models Graphs with Attractive Potentials?

(Gallager 1963; Richardson & Urbanke 2001)

  • If potentials are sufficiently weak, BP has a unique fixed point
  • Analyzing compatibility strength in context of graph structure

can sometimes guarantee message passing convergence Graphs with Weak Potentials

(Tatikonda & Jordan 2002; Heskes 2004; Ihler et. al. 2005; Mooij & Kappen 2005)

slide-11
SLIDE 11

Outline

Graphical Models & Belief Propagation Pairwise Markov random fields Variational methods & loopy BP Binary Markov Random Fields Attractive pairwise interactions Loop series expansion of the partition function Bounds & the Bethe Approximation Conditions under which BP provides bounds Empirical comparison to mean field bounds

slide-12
SLIDE 12

Binary Markov Random Fields

  • Nodes associated with binary variables:
  • Parameterize pseudo-marginal distributions via moments:

Boltzmann Machines, Ising Models, …

1 1

slide-13
SLIDE 13

Attractive Binary Models

  • A pairwise MRF has attractive compatibilities if all

edges satisfy the following bound:

  • Equivalent condition on reparameterized pseudo-marginals:
  • In statistical physics, such models are ferromagnetic
  • Extensive literature on correlation inequalities bounding

moments of attractive fields: GHS, FKG, GKS, …

slide-14
SLIDE 14

Bounding Partition Functions

Original MRF Reparam. MRF True Partition Function Bethe Approximation Original MRF Reparam. MRF

  • Compatibilities differ by a positive, constant multiple:
  • Focus analysis on partition function of reparameterized MRF
slide-15
SLIDE 15

Loop Series Expansions

  • True log partition function can be expressed as a series

expansion, whose first term is the Bethe approximation:

nonempty subset of the graph’s edges scalar function of degree of node in subgraph induced by

  • These loop corrections are only non-zero when

defines a generalized loop (Chertkov & Chernyak, 2006)

slide-16
SLIDE 16

Generalized Loops

  • Subgraphs in which all nodes have degree
  • All connected nodes must have degree
slide-17
SLIDE 17

Lots of Generalized Loops

slide-18
SLIDE 18

Deriving the Loop Series

  • Saddle point approximation of BP fixed point based upon

contour integration in a complex auxiliary field

  • Employ Fourier representation of binary functions, and

manipulate terms via hyperbolic trigonometric identities Two Existing Approaches

(Chertkov & Chernyak 2006)

Our Contribution: A Probabilistic Derivation

  • Simple, direct derivation from reparameterization

characterization of loopy BP fixed points

  • Exposes probabilistic interpretations for loop series terms,

and makes connections to other known invariants

slide-19
SLIDE 19

Loop Series: A Key Identity

  • For binary variables, reparameterized pairwise

compatibilities can be expressed as follows:

  • Straightforward (but tedious) to verify for
  • For attractive compatibilities, note that
slide-20
SLIDE 20

Loop Series Derivation

Expand polynomial using linearity of expectations: Expectation over factorized distribution:

slide-21
SLIDE 21

Pairwise Loop Series Expansion

degree of node in subgraph induced by

  • Depends on central pseudo-moments corresponding to

loopy BP fixed point:

  • Only generalized loops are non-zero:
slide-22
SLIDE 22

Bernoulli Central Moments

slide-23
SLIDE 23

Outline

Graphical Models & Belief Propagation Pairwise Markov random fields Variational methods & loopy BP Binary Markov Random Fields Attractive pairwise interactions Loop series expansion of the partition function Bounds & the Bethe Approximation Conditions under which BP provides bounds Empirical comparison to mean field bounds

slide-24
SLIDE 24

Bethe Bounds in Attractive Models

Theorem: For a “large family” of binary MRFs with attractive compatibilities, any BP fixed point provides a lower bound:

True Partition Function Bethe Approximation Original MRF Reparam. MRF Sufficient condition: Show that all terms in the loop series are non-negative

Conjecture: For all binary MRFs with attractive compatibilities, the Bethe approximation always provides a lower bound

slide-25
SLIDE 25

Loop Series in Attractive Models

  • When are binary pseudo-central moments non-negative?
  • Bound holds when

for all nodes for all nodes

OR

slide-26
SLIDE 26

Loop Series in Attractive Models

  • When are binary pseudo-central moments non-negative?
  • Only nodes with degrees

must agree in sign

Bound always holds for graphs with a single cycle

slide-27
SLIDE 27

Weaker Bound Conditions

Original Graph Core Graph Key Nodes Key Nodes

slide-28
SLIDE 28

Empirical Bounds: 30x30 Torus

0.2 0.4 0.6 0.8 1 −70 −60 −50 −40 −30 −20 −10 10 Edge Strength Difference from True Log Partition Belief Propagation Mean Field Exact partition function via eigenvector method of Onsager (1944)

slide-29
SLIDE 29

Empirical Bounds: 10x10 Grid

All marginals have same bias, satisfying conditions of theorem 0.2 0.4 0.6 0.8 1 −3 −2.5 −2 −1.5 −1 −0.5 0.5 Edge Strength Difference from True Log Partition Belief Propagation Mean Field

slide-30
SLIDE 30

Empirical Bounds: 10x10 Grid

Random marginals with mixed biases, so some negative loop corrections 0.2 0.4 0.6 0.8 1 −8 −6 −4 −2 2 Edge Strength Difference from True Log Partition Belief Propagation Mean Field

slide-31
SLIDE 31

Generalization: Factor Graphs

  • Generalized loops: all connected variable nodes and

factor nodes must have degree at least two

  • Probabilistic derivation via reparameterization generalizes
  • Bethe lower bound continues to hold for a higher-order

family of attractive binary compatibilities

slide-32
SLIDE 32

Conclusions

  • Simple, probabilistic derivation of the loop series expansion

associated with fixed points of loopy BP

  • Proof that the Bethe approximation lower bounds the true

partition function in many attractive binary models Belief Propagation & Partition Functions Ongoing Research

  • Generalize expansion & bounds to other model families:

higher-order discrete MRFs, Gaussian MRFs

  • Implications of results for BP dynamics in attractive models,

and stability of learning algorithms based on loopy BP