Revisiting the Limits of MAP Inference by MWSS on Perfect Graphs - PowerPoint PPT Presentation

Revisiting the Limits of MAP Inference by MWSS on Perfect Graphs Adrian Weller University of Cambridge CP 2015 Cork, Ireland Slides and full paper at http://mlg.eng.cam.ac.uk/adrian/ 1 / 21

Motivation: undirected graphical models (MRFs) • Powerful way to represent relationships across variables • Many applications including: computer vision, social network analysis, deep belief networks, protein folding... • In this talk, mostly focus on binary pairwise (Boolean binary or Ising ) models Example: Grid for computer vision (attractive) 2 / 21

Motivation: undirected graphical models Example: epinions social network (attractive and repulsive edges) Figure courtesy of N. Ruozzi 3 / 21

Motivation: undirected graphical models A fundamental problem is maximum a posteriori (MAP) inference • Find a global mode configuration with highest probability x ∗ ∈ arg max p ( x 1 , x 2 , . . . , x n ) x =( x 1 ,..., x n ) • In a graphical model, �� p ( x 1 , x 2 , . . . , x n ) ∝ exp ψ c ( x c ) c ∈ C where each c is a subset of variables, x c is a configuration of those variables, and ψ c ( x c ) ∈ Q is a potential function . • Each potential function assigns a score to each configuration of variables in its scope, higher score for higher compatibility. May be considered a ‘negative cost’ function. 4 / 21

Motivation: undirected graphical models A fundamental problem is maximum a posteriori (MAP) inference • Find a global mode configuration with highest probability x ∗ ∈ arg max � ψ c ( x c ) , all ψ c ( x c ) ∈ Q x =( x 1 ,..., x n ) c ∈ C • Equivalent to finding a minimum solution of a valued constraint satisfaction problem (VCSP) without hard constraints: x ∗ ∈ arg min x =( x 1 ,..., x n ) � c ∈ C − ψ c ( x c ) • We are interested in when is this efficient? i.e. solvable in time polynomial in the number of variables 5 / 21

Overview of the method (for models of any arity) We explore the limits of an exciting recent method (Jebara, 2009): • Reduce the problem to finding a maximum weight stable set (MWSS) in a derived weighted graph called a nand Markov random field (NMRF) • Examine how to prune the NMRF (removes nodes, simplifies the problem) • Different reparameterizations lead to pruning different nodes • This allows us to solve the original MAP inference problem efficiently if some pruned NMRF is a perfect graph 6 / 21

Background: NMRFs and reparameterizations • In the constraint community, an NMRF is equivalent to the complement of the microstructure of the dual representation egou, 1993; Larrosa and Dechter, 2000; Cooper and ˇ (J´ Zivn´ y, 2011; El Mouelhi et al., 2013) • Reparameterizations here are equivalent to considering soft arc consistency A reparameterization is a transformation of potential functions (shifts score between potentials) � � { ψ c } → { ψ ′ c } s.t. ∀ x , ψ c ( x c ) = ψ ′ c ( x c ) c ∈ C c ∈ C This clearly does not modify our MAP problem x ∗ ∈ arg max � � ψ c ( x c ) = arg max ψ ′ c ( x c ) x =( x 1 ,..., x n ) x =( x 1 ,..., x n ) c ∈ C c ∈ C but can be helpful to simplify the problem after pruning . 7 / 21

Summary of results Only a few cases were known always to admit efficient MAP inference, including: • Acyclic models (via dynamic programming) STRUCTURE • Attractive models, i.e. all edges attractive/submodular (via graph cuts or LP relaxation) LANGUAGE { ψ c } - generalizes to balanced models (no frustrated cycles ) These were previously shown to be solvable via a perfect pruned NMRF. Here we establish the following limits, which characterize precisely the power of the approach using a hybrid condition: Theorem (main result) A binary pairwise model maps efficiently to a perfect pruned NMRF for any valid potentials iff each block of the model is balanced or almost balanced . 8 / 21

Frustrated, balanced, almost balanced Each edge of a binary pairwise model may be characterized as: - attractive (pulls variables toward the same value, equivalent to ψ ij being supermodular or the cost function being submodular); or - repulsive (pushes variables apart to different values). • A frustrated cycle contains an odd number of repulsive edges. These are challenging for many methods of inference. • A balanced model contains no frustrated cycle ⇔ its variables form two partitions with all intra-edges attractive and all inter-edges repulsive. • An almost balanced model contains a variable s.t. if it is removed, the remaining model is balanced. Note all balanced models (with ≥ 1 variable) are almost balanced. 9 / 21

Examples: frustrated cycle, balanced, almost balanced Signed graph topologies of binary pairwise models, solid blue edges are attractive, dashed red edges are repulsive: x 7 x 1 x 4 x 1 x 4 x 1 x 4 x 2 x 5 x 2 x 5 x 2 x 5 x 3 x 6 x 3 x 6 x 3 x 6 frustrated cycle balanced almost balanced (odd # repulsive edges) (no frustrated cycle (added x 7 ) so forms two partitions) a balanced model may be rendered attractive by ‘flipping’ all variables in one or other partition 10 / 21

Block decomposition Figure from Wikipedia Each color indicates a different block. A graph may be repeatedly broken apart at cut vertices until what remains are the blocks (maximal 2-connected subgraphs). 11 / 21

Recap of result Theorem (main result) A binary pairwise model maps efficiently to a perfect pruned NMRF for any valid potentials iff each block of the model is almost balanced . Note a model may have Ω( n ) many blocks. Next we discuss how to construct an NMRF and why the reduction works. • We need some concepts from graph theory: ⊲ Stable sets, max weight stable sets (MWSS) ⊲ Perfect graphs 12 / 21

Stable sets, MWSS in weighted graphs A set of (weighted) nodes is stable if there are no edges between any of them 8 8 8 2 3 2 3 2 3 4 0 4 0 4 0 Stable set Max Weight Stable Set Maximal MWSS (MWSS) (MMWSS) • Finding a MWSS is NP-hard in general, but is known to be efficient for perfect graphs. 13 / 21

Perfect graphs Perfect graphs were defined in 1960 by Claude Berge • G is perfect iff χ ( H ) = ω ( H ) ∀ induced subgraphs H ≤ G • Includes many important families of graphs such as bipartite and chordal graphs • Several problems that are NP-hard in general, are solvable in polynomial time for perfect graphs: MWSS, graph coloring... • We can use many known results, including: ⊲ Strong Perfect Graph Theorem (Chudnovsky et al., 2006): G is perfect iff it contains no odd hole or antihole ⊲ Pasting any two perfect graphs on a common clique yields another perfect graph 14 / 21

Reduction to MWSS on an NMRF Recall our theme: Given a model, we construct a weighted graph NMRF. Claim: If we can solve MWSS on the NMRF, we recover a MAP solution to the original model. If the NMRF is perfect, MWSS runs in polynomial time. Idea: A MAP configuration has max x � c ψ c ( x c ) = � c max x c ψ c ( x c ) s.t. all the x c are consistent, consistency will be enforced by requiring a stable set. We construct a nand Markov random field (NMRF, Jebara, 2009; equivalent to the complement of the microstructure of the dual) N : • For each potential ψ c , instantiate a node in N for every possible configuration x c of the variables in its scope c • Give each node a weight ψ c ( x c ) then adjust • Add edges between any nodes which have inconsistent settings 15 / 21

Example: constructing an NMRF � c ψ c ( x c ) = � Idea: A MAP configuration has max x c max x c ψ c ( x c ) s.t. all x c are consistent, consistency will be enforced by requiring a stable set. v 00 v 01 12 23 v 01 v 00 12 23 v 10 v 11 12 23 v 11 v 10 ψ 12 ψ 23 12 23 x 1 x 2 x 3 v 00 v 01 v 01 24 ψ 24 ( x 2 =0 , x 4 =1 ) 24 24 ψ 24 v 10 v 11 x 4 24 24 Original model (factor graph) Derived NMRF superscripts denote configuration x c subscripts denote variable set c

Example: constructing an NMRF � c ψ c ( x c ) = � Idea: A MAP configuration has max x c max x c ψ c ( x c ) s.t. all x c are consistent, consistency will be enforced by requiring a stable set. v 00 v 01 12 23 v 01 v 00 12 23 v 10 v 11 12 23 v 11 v 10 ψ 12 ψ 23 12 23 x 1 x 2 x 3 v 00 v 01 v 01 24 ψ 24 ( x 2 =0 , x 4 =1 ) 24 24 ψ 24 − min ψ 24 ( x 2 , x 4 ) v 10 v 11 x 4 24 24 Original model (factor graph) Derived NMRF superscripts denote configuration x c subscripts denote variable set c 16 / 21

Revisiting the Limits of MAP Inference by MWSS on Perfect Graphs - PowerPoint PPT Presentation

Revisiting the Limits of MAP Inference by MWSS on Perfect Graphs Adrian Weller University of Cambridge CP 2015 Cork, Ireland Slides and full paper at http://mlg.eng.cam.ac.uk/adrian/ 1 / 21 Motivation: undirected graphical models (MRFs)

City Limits Lions Clubs City Limits Lions Clubs City Limits Lions Clubs City Limits Lions

Different Types of Limits Besides ordinary, two-sided limits, there are one-sided limits (left-

Revisiting Magnetic Field Limits in Revisiting Magnetic Field Limits in Quadrupoles Arising From

MAP Estimation with Perfect Graphs Tony Jebara July 21, 2009 Background Matchings Perfect

MAT 166 Calculus for Bus/Soc Chapter 3 Notes Limits The Deriviative David J. Gisch Limits

MAP Estimation, Message Passing and Perfect Graphs Tony Jebara November 25, 2009 Background

On minimum weight clique cover problem of claw-free perfect graphs Flavia Bonomo 1 Gianpaolo

New Water Sources Projects New Water Sources Projects of the of the Metropolitan Waterworks and

map-D map-D data refined map-D data refined map-D A GPU Database for Real-Time Big Data

Limits (the size of the pie) allocation limits minimum reliability flow of supply Limits

Medical Programs Overview Table 1. Caption Medical SNAP TANF Programs Income Limits Income

Scope & Limits of Scope & Limits of Scope & Limits of Legal Authority Legal

Abstract Data Type Map Map ADT Another fundamental abstract data type is the map (also The most

On Computational and Probabilistic Inference Rajat Mani Thomas Objectives: Revisiting Bayesian

The Perfect Sales Presentation Shook Robert L The Perfect Sales Presentation Shook Robert L The

Perfect for a New England Vacation! Perfect for a New England Vacation! Sarah Santiago

Gaze Embeddings for Zero-Shot Image Classification Nour Karessli Zeynep Akata Bernt

CSE5390 & 7390 Special Topics in Ubiquitous and Cognitive Computing Tangible User Interfaces

Medical Image Segmentation via Unsupervised Convolutional Neural Network Junyu Chen, Eric Frey

What now for housing? Unravelling the new Government Policies Thursday 15 October 2015 Agenda

Deformable Organ Contour Transfer with Deep Inverse Shape Encoding (DISE) Networks for

The consequences of sync_binlog != 1 by Jean-Franois Gagn Senior Infrastructure Engineer /

Cloud Control Design Review Team Andrew Thompson Anna Lee Project Lead, Audio Streaming

Memor ory D y Defen enses es The Elevation from Obscurity to Headlines Rajeev Balasubramonian

Revisiting the Limits of MAP Inference by MWSS on Perfect Graphs - PowerPoint PPT Presentation

Revisiting the Limits of MAP Inference by MWSS on Perfect Graphs Adrian Weller University of Cambridge CP 2015 Cork, Ireland Slides and full paper at http://mlg.eng.cam.ac.uk/adrian/ 1 / 21 Motivation: undirected graphical models (MRFs)

City Limits Lions Clubs City Limits Lions Clubs City Limits Lions Clubs City Limits Lions

Different Types of Limits Besides ordinary, two-sided limits, there are one-sided limits (left-

Revisiting Magnetic Field Limits in Revisiting Magnetic Field Limits in Quadrupoles Arising From

MAP Estimation with Perfect Graphs Tony Jebara July 21, 2009 Background Matchings Perfect

MAT 166 Calculus for Bus/Soc Chapter 3 Notes Limits The Deriviative David J. Gisch Limits

MAP Estimation, Message Passing and Perfect Graphs Tony Jebara November 25, 2009 Background

On minimum weight clique cover problem of claw-free perfect graphs Flavia Bonomo 1 Gianpaolo

New Water Sources Projects New Water Sources Projects of the of the Metropolitan Waterworks and

map-D map-D data refined map-D data refined map-D A GPU Database for Real-Time Big Data

Limits (the size of the pie) allocation limits minimum reliability flow of supply Limits

Medical Programs Overview Table 1. Caption Medical SNAP TANF Programs Income Limits Income

Scope &amp; Limits of Scope &amp; Limits of Scope &amp; Limits of Legal Authority Legal

Abstract Data Type Map Map ADT Another fundamental abstract data type is the map (also The most

On Computational and Probabilistic Inference Rajat Mani Thomas Objectives: Revisiting Bayesian

The Perfect Sales Presentation Shook Robert L The Perfect Sales Presentation Shook Robert L The

Perfect for a New England Vacation! Perfect for a New England Vacation! Sarah Santiago

Gaze Embeddings for Zero-Shot Image Classification Nour Karessli Zeynep Akata Bernt

CSE5390 &amp; 7390 Special Topics in Ubiquitous and Cognitive Computing Tangible User Interfaces

Medical Image Segmentation via Unsupervised Convolutional Neural Network Junyu Chen, Eric Frey

What now for housing? Unravelling the new Government Policies Thursday 15 October 2015 Agenda

Deformable Organ Contour Transfer with Deep Inverse Shape Encoding (DISE) Networks for

The consequences of sync_binlog != 1 by Jean-Franois Gagn Senior Infrastructure Engineer /

Cloud Control Design Review Team Andrew Thompson Anna Lee Project Lead, Audio Streaming

Memor ory D y Defen enses es The Elevation from Obscurity to Headlines Rajeev Balasubramonian

Scope & Limits of Scope & Limits of Scope & Limits of Legal Authority Legal

CSE5390 & 7390 Special Topics in Ubiquitous and Cognitive Computing Tangible User Interfaces