Graphical Models Review

  1. Graphical models

  2. Review
     P[(x ∨ y ∨ ¬z) ∧ (¬y ∨ ¬u) ∧ (z ∨ w) ∧ (z ∨ u ∨ v)]
     Dynamic programming on graphs
     ‣ variable elimination example
     Graphical model = graph + model
     ‣ e.g., Bayes net: DAG + CPTs
     ‣ e.g., rusty robot
     Benefits:
     ‣ fewer parameters, faster inference
     ‣ some properties (e.g., some conditional independences) depend only on the graph
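A quick concrete check of the review example, assuming the six variables are independent fair coin flips (the slides do not state a distribution): the brute-force enumeration below gives the number that variable elimination computes without enumerating, by summing out one variable at a time.

```python
from itertools import product

# P[(x ∨ y ∨ ¬z) ∧ (¬y ∨ ¬u) ∧ (z ∨ w) ∧ (z ∨ u ∨ v)] over 2^6 equally
# likely assignments. Variable elimination reaches the same answer by
# exploiting the clause structure instead of enumerating everything.
count = 0
for x, y, z, u, w, v in product((False, True), repeat=6):
    if (x or y or not z) and (not y or not u) and (z or w) and (z or u or v):
        count += 1
print(count, "/ 64 =", count / 64)
```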

  3. Review
     Blocking [figure omitted]
     Explaining away [figure omitted]

  4. d-separation
     General graphical test: “d-separation”
     ‣ d = dependence
     X ⊥ Y | Z when there are no active paths between X and Y given Z
     ‣ activity of a path depends on the conditioning variable/set Z
     Active paths of length 3 (W ∉ conditioning set): X → W → Y, X ← W ← Y, X ← W → Y; the collider X → W ← Y is active only when W, or one of its descendants, is in the conditioning set

  5. Longer paths
     Node X is active (w.r.t. path P) if:
     ‣ X is a non-collider on P and is not in the conditioning set, or
     ‣ X is a collider on P and X, or one of its descendants, is in the conditioning set
     and inactive otherwise
     An (undirected) path is active if all of its intermediate nodes are active

  6. Algorithm: X ⊥ Y | {Z1, Z2, …}?
     For each Zi:
     ‣ mark self and ancestors by traversing parent links
     Breadth-first search starting from X:
     ‣ traverse edges only if they can be part of an active path
     ‣ use the “ancestor of shaded” marks to test activity
     ‣ prune when we visit a node for the second time from the same direction (from children or from parents)
     If we reach Y, then X and Y are dependent given {Z1, Z2, …}; otherwise, they are conditionally independent (see the sketch below)
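A minimal Python sketch of this test, in the style of the Bayes-ball / reachability algorithm; the parents-dict encoding and node names are illustrative choices, not from the slides.

```python
from collections import deque

def d_separated(parents, x, y, z):
    """True iff X ⊥ Y | Z in the DAG, i.e., no active path survives.
    parents: dict mapping each node to a list of its parents."""
    children = {n: [] for n in parents}
    for n, ps in parents.items():
        for p in ps:
            children[p].append(n)

    # Phase 1: mark z and all its ancestors (needed for the collider rule).
    anc, stack = set(), list(z)
    while stack:
        n = stack.pop()
        if n not in anc:
            anc.add(n)
            stack.extend(parents[n])

    # Phase 2: BFS over (node, direction); direction records whether we
    # entered from a child ('up') or from a parent ('down').
    frontier, visited = deque([(x, 'up')]), set()
    while frontier:
        n, d = frontier.popleft()
        if (n, d) in visited:
            continue                  # prune second visit from same side
        visited.add((n, d))
        if n == y:
            return False              # reached Y: an active path exists
        if d == 'up' and n not in z:
            for p in parents[n]:
                frontier.append((p, 'up'))
            for c in children[n]:
                frontier.append((c, 'down'))
        elif d == 'down':
            if n not in z:            # non-collider passes if unobserved
                for c in children[n]:
                    frontier.append((c, 'down'))
            if n in anc:              # collider passes if it (or a
                for p in parents[n]:  # descendant) is observed
                    frontier.append((p, 'up'))
    return True

# Rusty robot: W has parents Ra, O; Ru has parents M, W.
parents = {'M': [], 'Ra': [], 'O': [], 'W': ['Ra', 'O'], 'Ru': ['M', 'W']}
print(d_separated(parents, 'M', 'Ra', set()))    # True: collider Ru blocks
print(d_separated(parents, 'M', 'Ra', {'Ru'}))   # False: explaining away
```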

  7. Markov blanket
     Markov blanket of C = the minimal set of observations that makes C independent of the rest of the graph
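For a Bayes net the blanket is the node's parents, its children, and its children's other parents (co-parents). A small sketch, reusing the parents-dict encoding from the previous snippet:

```python
def markov_blanket(parents, node):
    """Parents, children, and co-parents of `node` in a Bayes net."""
    blanket = set(parents[node])
    for n, ps in parents.items():
        if node in ps:                # n is a child of node
            blanket.add(n)
            blanket.update(p for p in ps if p != node)  # co-parents
    return blanket

parents = {'M': [], 'Ra': [], 'O': [], 'W': ['Ra', 'O'], 'Ru': ['M', 'W']}
print(markov_blanket(parents, 'M'))   # {'Ru', 'W'}: child Ru, co-parent W
```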

  8. Learning fully-observed Bayes nets
     Fill in each CPT by counting in the training data:
     P(M) = ?   P(Ra) = ?   P(O) = ?   P(W | Ra, O) = ?   P(Ru | M, W) = ?
     Training examples (one row per example):
     M  Ra O  W  Ru
     T  F  T  T  F
     T  T  T  T  T
     F  T  T  F  F
     T  F  F  F  T
     F  F  T  F  T
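Maximum-likelihood learning here is just conditional frequency counting. A sketch, assuming the row-major reading of the table above:

```python
# Rows are fully-observed examples (M, Ra, O, W, Ru); True = T, False = F.
data = [
    (True,  False, True,  True,  False),
    (True,  True,  True,  True,  True),
    (False, True,  True,  False, False),
    (True,  False, False, False, True),
    (False, False, True,  False, True),
]
M, RA, O, W, RU = range(5)

def estimate(child, given, rows):
    """ML estimate of P(child = T | given) by counting matching rows."""
    matching = [r for r in rows if all(r[v] == val for v, val in given)]
    if not matching:
        return None                   # parent configuration never observed
    return sum(r[child] for r in matching) / len(matching)

print("P(M=T) =", estimate(M, [], data))                           # 3/5
print("P(W=T | Ra=T, O=T) =", estimate(W, [(RA, True), (O, True)], data))
print("P(Ru=T | M=T, W=T) =", estimate(RU, [(M, True), (W, True)], data))
```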

  9. Limitations of counting
     Works only when all variables are observed in all examples
     If there are hidden or latent variables, a more complicated algorithm is needed (expectation-maximization or spectral methods)
     ‣ or use a toolbox!

  10. Factor graphs
     Another common type of graphical model
     Undirected, bipartite graph instead of a DAG
     Like a Bayes net:
     ‣ can represent any distribution
     ‣ can infer conditional independences from graph structure
     ‣ but some distributions have more faithful representations in one formalism or the other

  11. Rusty robot: factor graph
     One factor per CPT: P(M), P(Ra), P(O), P(W | Ra, O), P(Ru | M, W) [figure omitted]

  12. Conventions
     Markov random field: show factors as cliques
     Don't need to show unary factors. Why?
     ‣ they can usually be collapsed into other factors
     ‣ they don't affect the structure of the dynamic programming

  13. Non-CPT factors
     Just saw: it's easy to convert a Bayes net → a factor graph
     In general, factors need not be CPTs: any nonnegative numbers are allowed
     ‣ higher number → this combination is more likely
     In general, P(A, B, …) = (1/Z) ∏ᵢ φᵢ(argsᵢ), where the normalizer is Z = Σ_{A,B,…} ∏ᵢ φᵢ(argsᵢ)
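A tiny numeric illustration of the normalizer Z; the factor tables below are invented for the example, not taken from the slides.

```python
from itertools import product

# Two nonnegative factors over binary variables A, B, C.
phi1 = {(0, 0): 1.0, (0, 1): 2.0, (1, 0): 2.0, (1, 1): 0.5}   # phi1(A, B)
phi2 = {(0, 0): 3.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): 3.0}   # phi2(B, C)

def unnormalized(a, b, c):
    return phi1[(a, b)] * phi2[(b, c)]

# Z sums the product of all factors over every joint assignment.
Z = sum(unnormalized(a, b, c) for a, b, c in product((0, 1), repeat=3))

def prob(a, b, c):
    """P(A, B, C) = (1/Z) * product of factors."""
    return unnormalized(a, b, c) / Z

print(Z, prob(0, 0, 0))   # Z = 22.0 for these numbers
```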

  14. Independence
     Just like Bayes nets, there are graphical tests for independence and conditional independence
     Simpler, though:
     ‣ cover up all observed nodes
     ‣ look for a path
     (a sketch of this test follows)
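A sketch of the test: delete (cover up) the observed variables, then look for any remaining path. Same rusty-robot structure as before; each factor is listed by the variables it touches.

```python
from collections import deque

def fg_independent(factors, x, y, observed):
    """True iff x and y are graph-separated given `observed` in the
    factor graph; factors is a list of variable tuples."""
    frontier, seen = deque([x]), {x} | set(observed)
    while frontier:
        v = frontier.popleft()
        if v == y:
            return False              # path found: cannot certify X ⊥ Y
        for f in factors:
            if v in f:
                for u in f:
                    if u not in seen:  # observed variables are covered up
                        seen.add(u)
                        frontier.append(u)
    return True

factors = [('M',), ('Ra',), ('O',), ('W', 'Ra', 'O'), ('Ru', 'M', 'W')]
print(fg_independent(factors, 'M', 'Ra', {'W'}))   # True
print(fg_independent(factors, 'M', 'Ra', set()))   # False, even though the
# Bayes net certifies M ⊥ Ra marginally: a preview of the next two slides.
```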

  15. Independence example [figure omitted]

  16. What gives?
     Take a Bayes net and list its (conditional) independences
     Convert it to a factor graph and list the (conditional) independences
     Are they the same list?
     What happened?

  17. Inference: same kind of dynamic programming as before
     Typical query: given Ra = F and Ru = T, what is P(W)?

  18. Incorporate evidence
     Condition on Ra = F, Ru = T: fix those values as arguments in every factor that mentions them

  19. Eliminate nuisance nodes
     Remaining nodes: M, O, W
     Query: P(W)
     So O and M are nuisance variables; marginalize them away
     Marginal: P(W | Ra=F, Ru=T) ∝ Σ_M Σ_O P(M) P(O) P(W | Ra=F, O) P(Ru=T | M, W)

  20. Elimination order
     Sum out the nuisance variables in turn
     Can do it in any order, but some orders may be easier than others; do O then M (as in the sketch below)
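A NumPy sketch of this elimination, with invented CPT numbers (the slides leave the tables blank). Evidence Ra=F, Ru=T is instantiated by slicing the CPTs, then O and M are summed out in turn; index 0 means False, 1 means True.

```python
import numpy as np

P_M = np.array([0.4, 0.6])               # P(M)
P_O = np.array([0.3, 0.7])               # P(O)
P_W_given_RaF_O = np.array([[0.8, 0.2],  # P(W | Ra=F, O): rows O=F/T,
                            [0.4, 0.6]]) # cols W=F/T
P_RuT_given_M_W = np.array([[0.1, 0.5],  # P(Ru=T | M, W): rows M=F/T,
                            [0.3, 0.9]]) # cols W=F/T

# Eliminate O: g(W) = sum_O P(O) P(W | Ra=F, O)
g = np.einsum('o,ow->w', P_O, P_W_given_RaF_O)
# Eliminate M: h(W) = sum_M P(M) P(Ru=T | M, W)
h = np.einsum('m,mw->w', P_M, P_RuT_given_M_W)

unnorm = g * h                           # only W remains
print(unnorm / unnorm.sum())             # P(W | Ra=F, Ru=T)
```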

  21. Discussion
     Directed vs. undirected: advantages to both
     Normalization
     Each elimination introduces a new table (over all current neighbors of the eliminated variable) and makes some old tables irrelevant
     Each elimination order introduces different tables
     Some tables are bigger than others
     ‣ FLOP count; treewidth
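The table sizes can be computed mechanically. Below is a sketch that reports the largest table a given elimination order builds; the minimum of that number over all orders, minus one, is the treewidth.

```python
def largest_table(factors, order):
    """Size of the biggest table created when eliminating in `order`,
    working on the graph where each factor's variables form a clique."""
    nbrs = {v: set() for f in factors for v in f}
    for f in factors:
        for v in f:
            nbrs[v] |= set(f) - {v}
    biggest = 0
    for v in order:
        biggest = max(biggest, len(nbrs[v]) + 1)   # new table's arguments
        for u in nbrs[v]:                 # eliminating v connects all of
            nbrs[u] |= nbrs[v] - {u}      # its remaining neighbors
            nbrs[u].discard(v)
        del nbrs[v]
    return biggest

factors = [('M',), ('Ra',), ('O',), ('W', 'Ra', 'O'), ('Ru', 'M', 'W')]
print(largest_table(factors, ['O', 'M', 'Ra', 'Ru', 'W']))   # 3
```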

  22. Treewidth examples
     Chain: treewidth 1 (eliminate from one end) [figure omitted]
     Tree: treewidth 1 (eliminate leaves inward) [figure omitted]

  23. Treewidth examples
     Parallel chains: treewidth grows with the number of chains, since eliminating along the chains builds tables over whole cross-sections [figure omitted]
     Cycle: treewidth 2 [figure omitted]

  24. Inference in general models
     Prior + evidence → (marginals of) posterior
     ‣ several examples so far, but no general algorithm
     General algorithm: message passing
     ‣ aka belief propagation
     ‣ build a junction tree, instantiate evidence, pass messages (calibrate), read off the answer, eliminate nuisance variables
     Share the work of building the junction tree among multiple queries
     ‣ there are many possible junction trees; different ones are better for different queries, so we might want to build several

  25. Better than variable elimination
     Suppose we want all 1-variable marginals
     ‣ could do N runs of variable elimination
     ‣ or: BP simulates N runs for the price of 2 (see the chain example below)
     Further reading: Kschischang et al., “Factor Graphs and the Sum-Product Algorithm,” www.comm.utoronto.ca/frank/papers/KFL01.pdf, or Daphne Koller's book
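A concrete instance on a chain: one forward sweep and one backward sweep of sum-product messages give all n single-variable marginals, versus n separate variable-elimination runs. The pairwise factor values are arbitrary, for illustration only.

```python
import numpy as np

n, k = 5, 2                                   # 5 binary variables
rng = np.random.default_rng(0)
pair = [rng.random((k, k)) for _ in range(n - 1)]   # psi_i(x_i, x_{i+1})

fwd = [np.ones(k)]                            # message arriving from the left
for i in range(n - 1):
    fwd.append(fwd[-1] @ pair[i])             # sum out x_i moving right
bwd = [np.ones(k)]                            # messages from the right
for i in reversed(range(n - 1)):
    bwd.append(pair[i] @ bwd[-1])             # sum out x_{i+1} moving left
bwd = bwd[::-1]

for i in range(n):
    m = fwd[i] * bwd[i]                       # product of incoming messages
    print(f"P(x{i+1}) =", m / m.sum())
```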

  26. What you need to understand
     How expensive will inference be?
     ‣ what tables will be built, and how big are they?
     What does a message represent, and why?

  27. Junction tree (aka clique tree, aka join tree)
     Represents the tables that we build during elimination
     ‣ many junction trees for each graphical model
     ‣ many-to-many correspondence with elimination orders
     A junction tree for a model is:
     ‣ a tree
     ‣ whose nodes are sets of variables (“cliques”)
     ‣ that contains a node for each of our factors
     ‣ that satisfies the running intersection property (below)

  28. Example network
     Elimination order: CEABDF
     Factors: ABC, ABE, ABD, BDF [figure omitted]

  29. Building a junction tree (given an elimination order)
     S₀ ← ∅, V ← ∅ [S = table args; V = visited]
     For i = 1…n: [elimination order]
     ‣ Tᵢ ← Sᵢ₋₁ ∪ (nbr(Xᵢ) \ V) [extend table to unvisited neighbors]
     ‣ Sᵢ ← Tᵢ \ {Xᵢ} [marginalize out Xᵢ]
     ‣ V ← V ∪ {Xᵢ} [mark Xᵢ visited]
     Build a junction tree from the values Sᵢ, Tᵢ:
     ‣ nodes: local maxima of Tᵢ (Tᵢ ⊈ Tⱼ for j ≠ i)
     ‣ edges: local minima of Sᵢ (after a run of marginalizations without adding new nodes)
     (a sketch of the recurrence follows)
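A sketch of the recurrence; I take nbr(Xᵢ) to include Xᵢ itself, so that each variable appears in its own table (the slide leaves this implicit).

```python
def elimination_tables(nbr, order):
    """Return the (X_i, T_i, S_i) triples produced by the recurrence.
    nbr: neighbor sets in the (moralized) graph."""
    S, V, out = set(), set(), []
    for X in order:
        T = S | ((nbr[X] | {X}) - V)   # extend table to unvisited nbrs
        S = T - {X}                    # marginalize out X
        V.add(X)
        out.append((X, sorted(T), sorted(S)))
    return out

# Example network: factors ABC, ABE, ABD, BDF; order CEABDF.
nbr = {}
for f in ['ABC', 'ABE', 'ABD', 'BDF']:
    for v in f:
        nbr.setdefault(v, set()).update(set(f) - {v})

for X, T, S in elimination_tables(nbr, 'CEABDF'):
    print(f"eliminate {X}: T = {T}, S = {S}")
# The maximal T's (here ABC, ABE, ABD, BDF) become the junction tree nodes.
```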

  30. Example: CEABDF [figure omitted]

  31. Edges, cont'd
     Pattern: Tᵢ … Sⱼ₋₁ Tⱼ … Sₖ₋₁ Tₖ …
     Pair each T with the S that follows it (e.g., Tᵢ with Sⱼ₋₁)
     Can connect Tᵢ to Tₖ iff k > i and Sⱼ₋₁ ⊆ Tₖ
     Subject to this constraint, we are free to choose the edges
     ‣ always OK to connect in a line, but we may be able to skip

  32. Running intersection property
     Once a node X is added to T, it stays in T until it is eliminated, then never appears again
     In the junction tree, this means all the sets containing X form a connected region of the tree
     ‣ true for all X = the running intersection property (checked in the sketch below)
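A sketch that checks the property directly on a candidate tree: for every variable, the cliques containing it must induce a connected subtree.

```python
def has_running_intersection(cliques, edges):
    """cliques: list of variable sets; edges: pairs of clique indices."""
    adj = {i: set() for i in range(len(cliques))}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    for v in set().union(*cliques):
        holding = {i for i, c in enumerate(cliques) if v in c}
        start = next(iter(holding))
        seen, stack = {start}, [start]
        while stack:                       # DFS restricted to `holding`
            for j in adj[stack.pop()] & holding:
                if j not in seen:
                    seen.add(j)
                    stack.append(j)
        if seen != holding:
            return False                   # v's cliques are disconnected
    return True

# The CEABDF junction tree, connected in a line: ABC - ABE - ABD - BDF.
cliques = [set('ABC'), set('ABE'), set('ABD'), set('BDF')]
print(has_running_intersection(cliques, [(0, 1), (1, 2), (2, 3)]))  # True
```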

  33. Moralize & triangulate
     Moralize: marry each node's parents and drop the edge directions; triangulate: add chords until every cycle of length ≥ 4 has one [figure omitted]
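A sketch of the moralization step (triangulation then amounts to choosing a good elimination order); same parents-dict encoding as the earlier snippets:

```python
from itertools import combinations

def moralize(parents):
    """Marry each node's parents and drop edge directions; returns the
    undirected neighbor dict used by the junction-tree construction."""
    nbr = {v: set() for v in parents}
    for v, ps in parents.items():
        for p in ps:
            nbr[v].add(p)                 # keep the (now undirected) edge
            nbr[p].add(v)
        for p, q in combinations(ps, 2):  # marry co-parents
            nbr[p].add(q)
            nbr[q].add(p)
    return nbr

parents = {'M': [], 'Ra': [], 'O': [], 'W': ['Ra', 'O'], 'Ru': ['M', 'W']}
print(moralize(parents))   # moralization adds the Ra-O and M-W edges
```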
