CS786 Lecture 12: May 12, 2012 - Inference as Optimization (continued)




26/06/2012 1

CS786 Lecture 12: May 12, 2012

Inference as Optimization (continued) [KF Chapter 11]

CS786 P. Poupart 2012 1

Cluster Tree Recap

  • Variable elimination:

– Induces a cluster tree
– Inference: message propagation on the cluster tree

  • Cluster tree:

– Graph is a tree (i.e., no loops)
– Node: cluster of variables
– Edge: subset of variables (a.k.a. sepset) that are common to the nodes it connects
– Satisfies the running intersection property



Cluster Tree Calibration

  • C_i: variables in the cluster at node i
    β_i: factor at node i

  • S_ij: variables in the sepset at edge (i,j)
    μ_ij: factor at edge (i,j)

  • Calibrated cluster tree

For all edges (i,j): the sepset factor is the marginal of the cluster factors

  • μ_ij(S_ij) = Σ_{C_i \ S_ij} β_i(C_i)
  • μ_ij(S_ij) = Σ_{C_j \ S_ij} β_j(C_j)


Calibration by Message Passing

  • Initialization:

– Messages: δ_{i→j} ← 1 and δ_{j→i} ← 1 ∀ edges (i,j)

– Potentials: ψ_i ← ∏ of the potentials associated with node i

  • Update messages until calibration:

δ_{i→j}(S_ij) ← Σ_{C_i \ S_ij} ψ_i ∏_{k ∈ N(i)\{j}} δ_{k→i}

  • Return:

β_i ← ψ_i ∏_{k ∈ N(i)} δ_{k→i}

μ_ij ← δ_{i→j} δ_{j→i}
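The update and return steps can be sketched on the smallest non-trivial tree: two clusters {A,B} and {B,C} joined by sepset {B}. This is a minimal illustration, assuming tabular potentials stored as NumPy arrays; the cluster layout and the numeric potential values are invented, not from the lecture.

```python
# Cluster tree calibration sketch: clusters C1 = {A,B}, C2 = {B,C}, sepset {B}.
# The 2x2 potentials are made-up positive numbers for illustration.
import numpy as np

psi1 = np.array([[0.5, 1.0],
                 [2.0, 0.5]])   # psi_1(A, B), axes (A, B)
psi2 = np.array([[1.5, 0.2],
                 [0.3, 1.0]])   # psi_2(B, C), axes (B, C)

# delta_{1->2}(B): sum out C1 \ S12 = {A}; node 1 has no neighbors besides 2
d_1to2 = psi1.sum(axis=0)
# delta_{2->1}(B): sum out C2 \ S12 = {C}
d_2to1 = psi2.sum(axis=1)

# beta_i = psi_i times the product of incoming messages;
# mu_12 = product of the two messages on the edge
beta1 = psi1 * d_2to1[np.newaxis, :]
beta2 = psi2 * d_1to2[:, np.newaxis]
mu12 = d_1to2 * d_2to1

# Calibration: both clusters yield the same marginal over the sepset {B}
assert np.allclose(beta1.sum(axis=0), mu12)
assert np.allclose(beta2.sum(axis=1), mu12)

# Normalized beta1 equals the true marginal Pr(A,B) of the joint psi1 * psi2
joint = psi1[:, :, None] * psi2[None, :, :]   # unnormalized Pr(A, B, C)
assert np.allclose(beta1 / beta1.sum(), joint.sum(axis=2) / joint.sum())
```

With only two clusters, one message in each direction calibrates the tree; on a larger tree the same updates are iterated, and one upward plus one downward pass suffices.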



Properties of Calibrated Trees

  • Normalized β_i: marginal of Pr over C_i

– i.e., β_i(C_i) / Σ_{C_i} β_i(C_i) = Pr(C_i)

  • Normalized μ_ij: marginal of Pr over S_ij

– i.e., μ_ij(S_ij) / Σ_{S_ij} μ_ij(S_ij) = Pr(S_ij)

  • The β_i's and μ_ij's can be used to simultaneously answer many marginal queries


Loopy Belief Propagation

  • Approximate inference
  • Consider a cluster graph (with loops) instead of a cluster tree:

– Scalability: clusters can be much smaller
– Approximation: a calibrated cluster graph does not necessarily yield correct marginals



Cluster Graph

  • Same as cluster tree but loops are allowed:

– Any graph structure is allowed
– Node: cluster of variables
– Edge: subset of variables (a.k.a. sepset) that are common to the nodes it connects

  • Generalized running intersection property

– Whenever variable X is in clusters C_i and C_j, there is exactly one path between C_i and C_j such that X ∈ S_e for all edges e in that path


Cluster Graph Calibration

  • Same algorithm as for cluster tree calibration
  • Disadvantages:

– Convergence is not guaranteed

  • Damping techniques may be used to encourage convergence

– When convergence is achieved, the β_i's and μ_ij's are not necessarily the correct marginals for C_i and S_ij

  • Advantages:

– The approximation is often good in practice, and inference scales linearly with the size of the graph
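Damping replaces each message with a convex blend of its old and freshly computed values rather than overwriting it. A minimal sketch, assuming messages stored as arrays; the damping factor 0.5 and the message values are invented:

```python
# Damped message update for loopy BP: blend previous and new messages.
# lam = 1.0 recovers the undamped update; smaller lam moves more cautiously.
import numpy as np

def damp(old_msg, new_msg, lam=0.5):
    # Convex combination of the old and newly computed message
    return (1.0 - lam) * np.asarray(old_msg) + lam * np.asarray(new_msg)

old = np.array([1.0, 1.0])   # message from the previous iteration
new = np.array([0.2, 1.8])   # message just computed by the update rule
print(damp(old, new))        # halfway between old and new
```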



Expectation Propagation

  • Alternative approximation for inference
  • Idea: stick with the cluster tree, but approximate the messages

  • Consequence: propagate expectations of some statistics instead of full marginals in each sepset


Example

(figure-only slide; image not captured in this extraction)


Cluster Tree with Factored Potentials

  • C_i: variables in the cluster at node i
    β_i: product of factors at node i

  • S_ij: variables in the sepset at edge (i,j)
    μ_ij: product of factors at edge (i,j)

  • Calibrated cluster tree (same as before)

For all edges (i,j):

  • μ_ij(S_ij) = Σ_{C_i \ S_ij} β_i(C_i)
  • μ_ij(S_ij) = Σ_{C_j \ S_ij} β_j(C_j)


Calibration with Factored Messages

  • Initialization:

– Messages: δ_{i→j} ← 1 and δ_{j→i} ← 1 ∀ edges (i,j)

– Potentials: ψ_i ← set of potentials associated with node i

  • Update messages until calibration:

δ_{i→j}(S_ij) ← Σ_{C_i \ S_ij} ψ_i ∏_{k ∈ N(i)\{j}} δ_{k→i}

  • Return:

β_i ← ψ_i ∏_{k ∈ N(i)} δ_{k→i}

μ_ij ← δ_{i→j} δ_{j→i}



Projection

  • Approximate a distribution by the "closest" distribution from some class of distributions.

  • Examples:

– Factorization: joint distribution → product of marginals

  • P(X_1, …, X_n) ≈ ∏_i P(X_i)

– Mixture of Gaussians → single Gaussian:

  • Σ_i w_i N(x | μ_i, σ_i²) ≈ N(x | μ, σ²)

– Mixture of Dirichlets → single Dirichlet:

  • Σ_i w_i Dir(θ | α_i) ≈ Dir(θ | α)
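For the mixture-of-Gaussians case, one standard way to pick the "closest" single Gaussian is moment matching: give the Gaussian the mixture's mean and variance. A sketch with invented mixture weights, means, and variances:

```python
# Project a two-component Gaussian mixture onto a single Gaussian by
# matching its first two moments. All parameter values are made up.
import numpy as np

w   = np.array([0.3, 0.7])    # mixture weights (sum to 1)
mu  = np.array([-1.0, 2.0])   # component means
var = np.array([0.5, 1.0])    # component variances

m = np.sum(w * mu)                      # E[x] of the mixture
v = np.sum(w * (var + mu**2)) - m**2    # Var[x] = E[x^2] - E[x]^2

print(m, v)   # mean and variance of the matched single Gaussian
```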


KL‐Divergence

  • Common distance measure for projections

  • KL-divergence (a.k.a. relative entropy) definition:

KL(p ‖ q) = Σ_x p(x) log( p(x) / q(x) )

  • Since KL(p ‖ q) ≠ KL(q ‖ p), we can also use

KL(q ‖ p) = Σ_x q(x) log( q(x) / p(x) )


Exponential Family

  • Projection by KL-divergence corresponds to matching the expectations of some statistics

  • Exponential family

θ: vector of parameters defining P_θ
f(x): vector of statistics
P_θ(x) ∝ exp( θ ⋅ f(x) )


Examples

  • Bernoulli: Pr(x) = θ^x (1 − θ)^(1−x)

f(x) = (x, 1 − x), θ' = (ln θ, ln(1 − θ))
Pr(x) = exp( θ' ⋅ f(x) ) = exp( x ln θ + (1 − x) ln(1 − θ) )

  • Gaussian: Pr(x) ∝ exp( −(x − μ)² / (2σ²) )

f(x) = (x, x²)
θ = ( μ/σ², −1/(2σ²) )
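The Bernoulli form can be verified numerically: exponentiating the natural parameters dotted with the statistics recovers the original probabilities. The test value p = 0.3 is an invented choice:

```python
# Check the Bernoulli exponential-family form: Pr(x) = exp(theta . f(x))
# with statistics f(x) = (x, 1-x) and parameters theta = (ln p, ln(1-p)).
import math

p = 0.3                                     # arbitrary test probability
theta = (math.log(p), math.log(1 - p))      # natural parameters

def pr(x):
    f = (x, 1 - x)                          # statistics vector
    return math.exp(theta[0] * f[0] + theta[1] * f[1])

assert abs(pr(1) - p) < 1e-12               # recovers Pr(x=1) = p
assert abs(pr(0) - (1 - p)) < 1e-12         # recovers Pr(x=0) = 1 - p
```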