Bayesian Networks: Capturing Independence by Directed Graphs - - PowerPoint PPT Presentation

bayesian networks capturing independence by directed
SMART_READER_LITE
LIVE PREVIEW

Bayesian Networks: Capturing Independence by Directed Graphs - - PowerPoint PPT Presentation

Bayesian Networks: Capturing Independence by Directed Graphs COMPSCI 276, Spring 2017 Set 3: Rina Dechter (Reading: Pearl chapter 3, Darwiche chapter 4) 1 d-separation To test whether X and Y are d-separated by Z in dag G, we need to


slide-1
SLIDE 1

1

Bayesian Networks: Capturing Independence by Directed Graphs

COMPSCI 276, Spring 2017 Set 3: Rina Dechter

(Reading: Pearl chapter 3, Darwiche chapter 4)

slide-2
SLIDE 2
slide-3
SLIDE 3

d-separation

To test whether X and Y are d-separated by Z in dag G, we need to consider every path between a node in X and a node in Y, and then ensure that the path is blocked by Z.

A path is blocked by Z if at least one valve (node) on the path is ‘closed’ given Z.

A divergent valve or a sequential valve is closed if it is in Z

A convergent valve is closed if it is not on Z nor any of its descendants are in Z.

3

slide-4
SLIDE 4

No path Is active = Every path is blocked

slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8
slide-9
SLIDE 9
slide-10
SLIDE 10

Constructing a Bayesian Network for any Distribution P

slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13

Bayesian Networks as i-maps

 E: Employment  V: Investment  H: Health  W: Wealth  C: Charitable

contributions

 P: Happiness 13

E E E C E V W C P H Are C and V d-separated give E and P? Are C and H d-separated?

slide-14
SLIDE 14

Idsep(R,EC,B)?

slide-15
SLIDE 15

Idsep(R,EC,B)?

slide-16
SLIDE 16

Idsep(R,∅,C)?

slide-17
SLIDE 17

¬Idsep(R,∅,C)?

slide-18
SLIDE 18

d-separation using ancestral graph

 X is d-separated from Y given Z (<X,Z,Y>d) iff:

Take the ancestral graph that contains X,Y,Z and their ancestral subsets.

Moralized the obtained subgraph

Apply regular undirected graph separation

Check: (E,{},V),(E,P,H),(C,EW,P),(C,E,HP)?

18

E E E C E V W C P H

slide-19
SLIDE 19

Idsep(C,S,B)=?

slide-20
SLIDE 20

Idsep(C,S,B)

slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23

We already showed that:

slide-24
SLIDE 24

It is not a d-map

slide-25
SLIDE 25

Perfect Maps for Dags

 Theorem 10 [Geiger and Pearl 1988]: For any dag D

there exists a P such that D is a perfect map of P relative to d-separation.

 Corollary 7: d-separation identifies any implied

independency that follows logically from the set of independencies characterized by its dag.

25

slide-26
SLIDE 26

The ancestral undirected graph G of a directed graph D is An i-map of D. Is it a Markov network of D?

slide-27
SLIDE 27

Blanket Examples

slide-28
SLIDE 28

Blanket Examples

slide-29
SLIDE 29

Markov assumptions in G:

slide-30
SLIDE 30
slide-31
SLIDE 31

Bayesian Networks as Knowledge-Bases

 Given any distribution, P, and an ordering we can

construct a minimal i-map.

 The conditional probabilities of x given its parents is

all we need.

 In practice we go in the opposite direction: the

parents must be identified by human expert… they can be viewed as direct causes, or direct influences.

31

slide-32
SLIDE 32
slide-33
SLIDE 33
slide-34
SLIDE 34
slide-35
SLIDE 35
slide-36
SLIDE 36
slide-37
SLIDE 37

37

The role of causality

slide-38
SLIDE 38
slide-39
SLIDE 39

39

Back to Markov random fields (if time)

slide-40
SLIDE 40

How can we construct a probability Distribution that will have all these independencies?

slide-41
SLIDE 41

So, How do we learn Markov networks From data?

Markov Random Field (MRF)

slide-42
SLIDE 42

G is locally markov If neighbors make every Variable independent From the rest.

Markov Random Field (MRF)

slide-43
SLIDE 43

Pearl says: this information must come from measurements or from experts. But what about learning?

slide-44
SLIDE 44
slide-45
SLIDE 45
slide-46
SLIDE 46

Product form over Markov trees

slide-47
SLIDE 47
slide-48
SLIDE 48

Trees are not the only distributions that have product meaningful forms. They can generalize to join-trees

slide-49
SLIDE 49

49

slide-50
SLIDE 50

The running intersection property

slide-51
SLIDE 51