Probabilistic Graphical Models
David Sontag
New York University
Lecture 12, April 23, 2013
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 1 / 24
Probabilistic Graphical Models David Sontag New York University - - PowerPoint PPT Presentation
Probabilistic Graphical Models David Sontag New York University Lecture 12, April 23, 2013 David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 1 / 24 What notion of best should learning be optimizing? This depends on what
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 1 / 24
1
2
3
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 2 / 24
input: two images!
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 3 / 24
y
M ED
w
w
c
w
(x,y)∈D
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 4 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 5 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 6 / 24
t′,t
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 7 / 24
y
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 8 / 24
c) − log Z(x; w) ≥ w ·
c) −
c)
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 9 / 24
c)
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 10 / 24
c fc(x, yc). Then, the constraints that we want to satisfy are
y∈Y w · f(xm, y)
1
2
3
4
5
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 11 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 12 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 13 / 24
w,ξ
m = max
y∈Y 1 − w ·
w
y∈Y 1 − w ·
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 14 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 15 / 24
i∈V 1
i ]
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 16 / 24
w
y∈Y
1
2
3
4
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 17 / 24
w
y∈Y
y∈Y ∆(y, ym) + w(t) · f(xm, y)
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 18 / 24
y∈Y ∆(y, ym) + w(t) · f(xm, y)
i∈V 1
i ], this corresponds to adding additional
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 19 / 24
w,ξ
y∈Y ∆(y, ym) + w · f(xm, y)
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 20 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 21 / 24
Optimization algorithm Online Primal/Dual Type of guarantee Oracle type # Oracle calls dual extragradient (Taskar et al., 2006) no primal-‘dual’ saddle point gap Bregman projection O
λ"
(Collins et al., 2008) yes dual expected dual error expectation O
λ"
gap reduction (Zhang et al., 2011) no primal-dual duality gap expectation O
q
log |Y| λ"
no primal ≥primal error maximization O
λ"
et al., 2009) no primal-dual duality gap maximization O
λ"
subgradient (Shalev-Shwartz et al., 2010a) yes primal primal error w.h.p. maximization ˜ O
λ"
block-coordinate Frank-Wolfe yes primal-dual expected duality gap maximization O
λ"
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 22 / 24
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 23 / 24
the major
these consultations is to make sure that the recovery benefits all . le un de les grands
de les consultations est de faire en sorte que la relance profite ´ egalement ` a tous .
David Sontag (NYU) Graphical Models Lecture 12, April 23, 2013 24 / 24