1/24
CS7015 (Deep Learning) : Lecture 22
Autoregressive Models (NADE, MADE) Mitesh M. Khapra
Department of Computer Science and Engineering Indian Institute of Technology Madras
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
CS7015 (Deep Learning) : Lecture 22 Autoregressive Models (NADE, - - PowerPoint PPT Presentation
CS7015 (Deep Learning) : Lecture 22 Autoregressive Models (NADE, MADE) Mitesh M. Khapra Department of Computer Science and Engineering Indian Institute of Technology Madras 1/24 Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22 Module
1/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
2/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
3/24 v1 v2 · · · vm V ∈ {0, 1}m b1 b2 bm h1 h2 · · · hn H ∈ {0, 1}n c1 c2 cn
W ∈ Rm×n
w1,1 wm,n
+ ǫ
z x Qθ(z|x) Σ µ Pφ(x|z) ˆ x
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
4/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
5/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
6/24
1
2
1
3
1
2
4
1
2
3
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
7/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
8/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
9/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
10/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
11/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
12/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
13/24
1
2
1
3
1
2
4
1
2
3
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
14/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
15/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
16/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
17/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
18/24
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
19/24 x1 x2 x3 x4 ˆ x1 ˆ x2 ˆ x3 ˆ x4
W1 W2 V Masks
= MV = MW2 = MW1
x1 x2 x3 x4
1 2 3 4 1 2 1 2 3 1 1 2 1 3 1 2 3 4
p ( x
1
) p ( x
2
| x
1
) p ( x
3
| x
1
, x
2
) p ( x
4
| x
1
, x
2
, x
3
)
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
20/24 x1 x2 x3 x4 ˆ x1 ˆ x2 ˆ x3 ˆ x4
W1 W2 V Masks
= MV = MW2 = MW1
x1 x2 x3 x4
1 2 3 4 1 2 3 4 1 2 3 4 1 2 1 2 3 1 2 1 2 1 1 2 1 3 2 1 2 3 4
p ( x
1
) p ( x
2
| x
1
) p ( x
3
| x
1
, x
2
) p ( x
4
| x
1
, x
2
, x
3
)
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
21/24 x1 x2 x3 x4 ˆ x1 ˆ x2 ˆ x3 ˆ x4
W1 W2 V Masks
= MV = MW2 = MW1
x1 x2 x3 x4
1 2 3 4 1 2 1 2 3 1 1 2 1 1 1 2 1 3 1 2 3 4
p ( x
1
) p ( x
2
| x
1
) p ( x
3
| x
1
, x
2
) p ( x
4
| x
1
, x
2
, x
3
)
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
22/24 x1 x2 x3 x4 ˆ x1 ˆ x2 ˆ x3 ˆ x4
W1 W2 V Masks
= MV = MW2 = MW1
x1 x2 x3 x4
1 2 3 4 1 2 1 2 3 1 1 2 1 3 1 2 3 4
p(x1) p(x2|x1) p(x3|x1, x2) p(x4|x1, x2, x3)
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
23/24 x1 x2 x3 x4 ˆ x1 ˆ x2 ˆ x3 ˆ x4
W1 W2 V Masks
= MV = MW2 = MW1
x1 x2 x3 x4
1 2 3 4 1 2 1 2 3 1 1 2 1 3 1 2 3 4
p(x1) p(x2|x1) p(x3|x1, x2) p(x4|x1, x2, x3)
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22
24/24
Masks
= MV = MW2 = MW1
x1 x2 x3 x4
1 2 3 4 1 2 3 1 2 1 2 3 1 1 2 1 3 1 2 3 4 1 2 3 4 1 1 1 1 1 1 2 1 2 1 1 2 1 1 2 1 2 3 1 1 2 1 3
p(x1) p(x2|x1) p(x3|x1, x2) p(x4|x1, x2, x3) p(x1) p(x2|x1) p(x3|x1, x2) p(x4|x1, x2, x3)
Mitesh M. Khapra CS7015 (Deep Learning) : Lecture 22