Probabilistic Graphical Models
David Sontag
New York University
Lecture 4, February 16, 2012
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 1 / 27
Probabilistic Graphical Models David Sontag New York University - - PowerPoint PPT Presentation
Probabilistic Graphical Models David Sontag New York University Lecture 4, February 16, 2012 David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 1 / 27 Undirected graphical models Reminder of lecture 2 An alternative
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 1 / 27
x1,...,ˆ xn
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 2 / 27
x1,...,ˆ xn
B A C 10 1 1 10 A B 1 1
φA,B(a, b) =
10 1 1 10 B C 1 1
φB,C(b, c) = φA,C(a, c) =
10 1 1 10 A C 1 1
a,ˆ b,ˆ c∈{0,1}3
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 3 / 27
= +1 = -1
i<j
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 4 / 27
1
2
3
4
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 5 / 27
X1 X2 X3 X4 X5 X6 Y1 Y2 Y3 Y4 Y5 Y6
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 6 / 27
1
2
A C B D A C B D
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 7 / 27
1
A C B D A C B D
2
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 8 / 27
XA XB XC
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 9 / 27
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 10 / 27
1
2
3
4
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 11 / 27
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 12 / 27
Y X1 X2 X3 Xn
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 13 / 27
Y X
Y X
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 14 / 27
Generative Discriminative
Y X1 X2 X3 Xn
Y X1 X2 X3 Xn
1
2
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 15 / 27
1 In the generative model, how do we parameterize p(Xi | Xpa(i), Y )? 2 In the discriminative model, how do we parameterize p(Y | X)?
Generative Discriminative
Y X1 X2 X3 Xn
Y X1 X2 X3 Xn
1 For the generative model, assume that Xi ⊥ X−i | Y (naive Bayes) 2 For the discriminative model, assume that
i=1 αixi
i=1 αixi =
i=1 αixi
(To simplify the story, we assume Xi ∈ {0, 1}) David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 16 / 27
1 For the generative model, assume that Xi ⊥ X−i | Y (naive Bayes)
Y X1 X2 X3 Xn
Y X1 X2 X3 Xn
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 17 / 27
2 For the discriminative model, assume that
i=1 αixi
i=1 αixi =
i=1 αixi
Y X1 X2 X3 Xn
z
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 18 / 27
1 For the generative model, assume that Xi ⊥ X−i | Y (naive Bayes) 2 For the discriminative model, assume that
i=1 αixi
i=1 αixi =
i=1 αixi
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 19 / 27
Generative (naive Bayes) Discriminative (logistic regression)
Y X1 X2 X3 Xn
Y X1 X2 X3 Xn
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 20 / 27
1 Using a conditional model is only possible when X is always observed
2 Estimating the generative model using maximum likelihood is more
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 21 / 27
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 22 / 27
y
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 23 / 27
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 24 / 27
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 25 / 27
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 26 / 27
t (Yt, Yt+1) represents dependencies between neighboring target variables
t (Yt, X1, · · · , XT) represents dependencies between a target and its
David Sontag (NYU) Graphical Models Lecture 4, February 16, 2012 27 / 27