finding k best map solutions using lp relaxations
play

Finding k-best MAP Solutions Using LP Relaxations Amir Globerson - PowerPoint PPT Presentation

Finding k-best MAP Solutions Using LP Relaxations Amir Globerson School of Computer Science and Engineering The Hebrew University Joint Work with: Menachem Fromer (Hebrew Univ.) Prediction Problems Consider the following problem: Observe


  1. The Marginal Polytope M ( G ) � � max µ ij ( x i , x j ) θ ij ( x i , x j ) Marginal µ µ ∈ M ( G ) x i ,x j ij ∈ E Polytope

  2. The Marginal Polytope M ( G ) � � max µ ij ( x i , x j ) θ ij ( x i , x j ) Marginal µ µ ∈ M ( G ) x i ,x j ij ∈ E Polytope There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j )

  3. The Marginal Polytope M ( G ) � � max µ ij ( x i , x j ) θ ij ( x i , x j ) Marginal µ µ ∈ M ( G ) x i ,x j ij ∈ E Polytope There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) Difficult set to characterize. Easy to outer bound

  4. The Marginal Polytope M ( G ) � � max µ ij ( x i , x j ) θ ij ( x i , x j ) Marginal µ µ ∈ M ( G ) x i ,x j ij ∈ E Polytope There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) Difficult set to characterize. Easy to outer bound The vertices have integral values and correspond to assignments on x

  5. Relaxing the MAP LP � � � max θ ij ( x i , x j ) = max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ M ( G ) x x i ,x j ij ij ∈ E M ( G )

  6. Relaxing the MAP LP � � � max θ ij ( x i , x j ) = max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ M ( G ) x x i ,x j ij ij ∈ E Exact but Hard! M ( G )

  7. Relaxing the MAP LP � � � max θ ij ( x i , x j ) ≤ max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ S x ij x i ,x j ij ∈ E M ( G ) S

  8. Relaxing the MAP LP � � � max θ ij ( x i , x j ) ≤ max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ S x ij x i ,x j ij ∈ E If optimum is an integral vertex, MAP is solved M ( G ) S

  9. Relaxing the MAP LP � � � max θ ij ( x i , x j ) ≤ max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ S x ij x i ,x j ij ∈ E If optimum is an integral vertex, MAP is solved M ( G ) Possible outer bound: Pairwise consistency S

  10. Relaxing the MAP LP � � � max θ ij ( x i , x j ) ≤ max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ S x ij x i ,x j ij ∈ E If optimum is an integral vertex, MAP is solved M ( G ) Possible outer bound: Pairwise consistency S k � j � � � µ ij ( x i , x j ) = µ jk ( x j , x k ) i � x i x k

  11. Relaxing the MAP LP � � � max θ ij ( x i , x j ) ≤ max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ S x ij x i ,x j ij ∈ E If optimum is an integral vertex, MAP is solved M ( G ) Possible outer bound: Pairwise consistency S k � j � � � µ ij ( x i , x j ) = µ jk ( x j , x k ) Exact for trees i � x i x k

  12. Relaxing the MAP LP � � � max θ ij ( x i , x j ) ≤ max µ ij ( x i , x j ) θ ij ( x i , x j ) µ ∈ S x ij x i ,x j ij ∈ E If optimum is an integral vertex, MAP is solved M ( G ) Possible outer bound: Pairwise consistency S k � j � � � µ ij ( x i , x j ) = µ jk ( x j , x k ) Exact for trees i � x i x k Efficient message passing schemes for solving the resulting (dual) LP

  13. Outline LP formulation of the MAP problem LP for 2 nd best General (intractable) exact formulation Tractable formulation for tree graphs Approximations for non-tree graphs Experiments

  14. The 2 nd best problem and LP 2 nd best MAP

  15. The 2 nd best problem and LP 2 nd best MAP max f ( x ) x

  16. The 2 nd best problem and LP 2 nd best MAP x � = x (1) f ( x ) max max f ( x ) x

  17. The 2 nd best problem and LP 2 nd best MAP x � = x (1) f ( x ) max max f ( x ) x µ ∈ M ( G ) µ · θ max

  18. The 2 nd best problem and LP 2 nd best MAP x � = x (1) f ( x ) max max f ( x ) x (1) x µ ∈ M ( G, x (1) ) µ · θ µ ∈ M ( G ) µ · θ max max

  19. The 2 nd best problem and LP 2 nd best MAP x � = x (1) f ( x ) max max f ( x ) x (1) x µ ∈ M ( G, x (1) ) µ · θ µ ∈ M ( G ) µ · θ max max Approximations:

  20. The 2 nd best problem and LP 2 nd best MAP x � = x (1) f ( x ) max max f ( x ) x (1) x µ ∈ M ( G, x (1) ) µ · θ µ ∈ M ( G ) µ · θ max max Approximations:

  21. The 2 nd best problem and LP 2 nd best MAP x � = x (1) f ( x ) max max f ( x ) x (1) x µ ∈ M ( G, x (1) ) µ · θ µ ∈ M ( G ) µ · θ max max Approximations:

  22. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z )

  23. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z )

  24. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) µ

  25. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) µ There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j )

  26. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) µ There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) and:

  27. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) µ There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) and: p ( z ) = 0

  28. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) µ There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) and: p ( z ) = 0

  29. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) µ M ( G ) There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) and: p ( z ) = 0

  30. A new marginal polytope Given an assignment z , define the Assignment Excluding Marginal Polytope: M ( G, z ) M ( G, z ) z µ M ( G ) There exists a p(x) s.t. p ( x i , x j ) = µ ij ( x i , x j ) and: p ( z ) = 0

  31. LP for the 2 nd best problem The 2 nd best problem corresponds to x (1) the following LP: x � = x (1) f ( x ; θ ) = max max µ ∈ M ( G, x (1) ) µ · θ

  32. LP for the 2 nd best problem The 2 nd best problem corresponds to x (1) the following LP: x � = x (1) f ( x ; θ ) = max max µ ∈ M ( G, x (1) ) µ · θ M ( G, x (1) ) Is there a simple characterization of ?

  33. LP for the 2 nd best problem The 2 nd best problem corresponds to x (1) the following LP: x � = x (1) f ( x ; θ ) = max max µ ∈ M ( G, x (1) ) µ · θ M ( G, x (1) ) Is there a simple characterization of ? Is it plus one inequality? M ( G )

  34. LP for the 2 nd best problem The 2 nd best problem corresponds to x (1) the following LP: x � = x (1) f ( x ; θ ) = max max µ ∈ M ( G, x (1) ) µ · θ M ( G, x (1) ) Is there a simple characterization of ? Is it plus one inequality? M ( G ) If so, what inequality?

  35. Outline LP formulation of the MAP problem LP for 2 nd best General (intractable) exact formulation Tractable formulation for tree graphs Approximations for non-tree graphs Experiments

  36. Adding inequalities to M ( G ) z z

  37. Adding inequalities to M ( G ) Any valid inequality must separate z z from the other vertices

  38. Adding inequalities to M ( G ) Any valid inequality must separate z z from the other vertices How about: (Santos 91) � µ i ( z i ) ≤ n − 1 i

  39. Adding inequalities to M ( G ) Any valid inequality must separate z z from the other vertices How about: (Santos 91) � µ i ( z i ) ≤ n − 1 i RHS is n for z and or less for n − 1 other vertices

  40. Adding inequalities to M ( G ) Any valid inequality must separate z z from the other vertices How about: (Santos 91) � µ i ( z i ) ≤ n − 1 i RHS is n for z and or less for n − 1 other vertices But: Results in fractional vertices, even for trees

  41. Adding inequalities to M ( G ) Any valid inequality must separate z z from the other vertices How about: (Santos 91) � µ i ( z i ) ≤ n − 1 i RHS is n for z and or less for n − 1 other vertices But: Results in fractional vertices, even for trees

  42. Adding inequalities to M ( G ) Any valid inequality must separate z z from the other vertices How about: (Santos 91) � µ i ( z i ) ≤ n − 1 i RHS is n for z and or less for n − 1 other vertices But: Results in fractional vertices, even for trees Only an outer bound on M ( G, z )

  43. The tree case

  44. The tree case Focus on the case where G is a tree

  45. The tree case Focus on the case where G is a tree is given by pairwise consistency M ( G )

  46. The tree case Focus on the case where G is a tree is given by pairwise consistency M ( G ) Define: � � I ( µ , z ) = (1 − d i ) µ i ( z i ) + µ ij ( z i , z j ) i ij ∈ G

  47. The tree case Focus on the case where G is a tree is given by pairwise consistency M ( G ) Define: � � I ( µ , z ) = (1 − d i ) µ i ( z i ) + µ ij ( z i , z j ) i ij ∈ G Bethe: � � H ( µ ) = (1 − d i ) H i ( X i ) + H ( X i , X j ) i ij ∈ G

  48. The tree case Focus on the case where G is a tree is given by pairwise consistency M ( G ) Define: � � I ( µ , z ) = (1 − d i ) µ i ( z i ) + µ ij ( z i , z j ) i ij ∈ G

  49. The tree case Focus on the case where G is a tree is given by pairwise consistency M ( G ) Define: � � I ( µ , z ) = (1 − d i ) µ i ( z i ) + µ ij ( z i , z j ) i ij ∈ G Theorem: � I ( µ , z ) ≤ 0 � M ( G, z ) = µ | µ ∈ M ( G ) ,

  50. The tree case Focus on the case where G is a tree is given by pairwise consistency M ( G ) Define: � � I ( µ , z ) = (1 − d i ) µ i ( z i ) + µ ij ( z i , z j ) i ij ∈ G Theorem: M ( G ) z � I ( µ , z ) ≤ 0 � M ( G, z ) = µ | µ ∈ M ( G ) ,

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend