Lecture 13
Deep Belief Networks
Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen, Markus Nussbaum-Thom
Watson Group IBM T.J. Watson Research Center Yorktown Heights, New York, USA {picheny,bhuvana,stanchen,nussbaum}@us.ibm.com
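Sigmoid: \( f(z) = \frac{1}{1 + \exp(-z)} \)
Hyperbolic tangent: \( f(z) = \tanh(z) = \frac{e^{z} - e^{-z}}{e^{z} + e^{-z}} \)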
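The two activation functions can be sketched in NumPy as follows. This is a minimal illustration, assuming the second fragment on the slide is the denominator of the hyperbolic tangent; the function names `sigmoid` and `tanh_explicit` are chosen here and do not come from the lecture.

```python
import numpy as np

def sigmoid(z):
    """Logistic sigmoid, 1 / (1 + exp(-z)); output lies in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def tanh_explicit(z):
    """Hyperbolic tangent written out as (e^z - e^-z) / (e^z + e^-z)."""
    return (np.exp(z) - np.exp(-z)) / (np.exp(z) + np.exp(-z))

# Evaluate both activations on a few sample pre-activations.
z = np.array([-2.0, 0.0, 2.0])
sig_values = sigmoid(z)
tanh_values = tanh_explicit(z)
print(sig_values)
print(tanh_values)
```

The explicit form is only for comparison with the formula; in practice `np.tanh` is the numerically safer choice for large |z|. The two functions are related by tanh(z) = 2 * sigmoid(2z) - 1.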
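\[ a^{(2)}_1 = f\left( W^{(1)}_{11} x_1 + W^{(1)}_{12} x_2 + W^{(1)}_{13} x_3 + b^{(1)}_1 \right) \]
\[ a^{(2)}_2 = f\left( W^{(1)}_{21} x_1 + W^{(1)}_{22} x_2 + W^{(1)}_{23} x_3 + b^{(1)}_2 \right) \]
\[ a^{(2)}_3 = f\left( W^{(1)}_{31} x_1 + W^{(1)}_{32} x_2 + W^{(1)}_{33} x_3 + b^{(1)}_3 \right) \]
\[ a^{(3)}_1 = f\left( W^{(2)}_{11} a^{(2)}_1 + W^{(2)}_{12} a^{(2)}_2 + W^{(2)}_{13} a^{(2)}_3 + b^{(2)}_1 \right) \]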
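The forward pass through this small network (three inputs, three hidden units, one output) can be written as two matrix-vector products. The sketch below assumes the sigmoid as the activation f and uses randomly initialised weights purely for illustration; the function name `forward` and the variable names are hypothetical, not taken from the lecture.

```python
import numpy as np

def sigmoid(z):
    """Logistic sigmoid, assumed here as the activation f."""
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, W1, b1, W2, b2, f=sigmoid):
    """Compute hidden activations a2 and output activation a3.

    Row i of W1 holds the weights W^(1)_{i1..i3} feeding hidden unit i,
    so W1 @ x + b1 evaluates all three hidden pre-activations at once.
    """
    a2 = f(W1 @ x + b1)   # layer-2 (hidden) activations, shape (3,)
    a3 = f(W2 @ a2 + b2)  # layer-3 (output) activation, shape (1,)
    return a2, a3

# Illustrative parameters only: small random weights, zero biases.
rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.1, size=(3, 3))
b1 = np.zeros(3)
W2 = rng.normal(scale=0.1, size=(1, 3))
b2 = np.zeros(1)

x = np.array([1.0, 2.0, 3.0])
a2, a3 = forward(x, W1, b1, W2, b2)
print(a2, a3)
```

Storing each layer's weights as a matrix keeps the code aligned with the index convention in the equations: entry (i, j) connects unit j in one layer to unit i in the next.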
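\[ J(W, b) = \left[ \frac{1}{m} \sum_{i=1}^{m} J(W, b; x^{(i)}, y^{(i)}) \right] + \frac{\lambda}{2} \sum_{l=1}^{n_l - 1} \sum_{i=1}^{s_l} \sum_{j=1}^{s_{l+1}} \left( W^{(l)}_{ji} \right)^2 \]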
Partial derivatives: \( \frac{\partial}{\partial W^{(l)}_{ij}} J(W, b; x, y) \) and \( \frac{\partial}{\partial b^{(l)}_{i}} J(W, b; x, y) \)
\[ z^{(2)}_i = \sum_{j=1}^{n} W^{(1)}_{ij} x_j + b^{(1)}_i \]
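\[ \delta^{(l)}_i = \left( \sum_{j=1}^{s_{l+1}} W^{(l)}_{ji} \, \delta^{(l+1)}_j \right) f'\left( z^{(l)}_i \right) \]
\[ \frac{\partial}{\partial W^{(l)}_{ij}} J(W, b; x, y) = a^{(l)}_j \, \delta^{(l+1)}_i \]
\[ \frac{\partial}{\partial b^{(l)}_{i}} J(W, b; x, y) = \delta^{(l+1)}_i \]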
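The error-term recursion and the resulting gradients can be sketched for a network with one hidden layer. Several things here are assumptions rather than content from the slides: the sigmoid activation, a squared-error loss J = 1/2 ||a3 - y||^2 for a single example (which fixes the form of the output-layer error term), and the names `backprop` and `loss`. A finite-difference comparison is included as a sanity check on one weight.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss(x, y, W1, b1, W2, b2):
    """Assumed single-example squared-error loss, 0.5 * ||a3 - y||^2."""
    a2 = sigmoid(W1 @ x + b1)
    a3 = sigmoid(W2 @ a2 + b2)
    return 0.5 * np.sum((a3 - y) ** 2)

def backprop(x, y, W1, b1, W2, b2):
    """Return gradients of the loss with respect to W1, b1, W2, b2."""
    # Forward pass: keep activations for reuse in the backward pass.
    a2 = sigmoid(W1 @ x + b1)
    a3 = sigmoid(W2 @ a2 + b2)

    # Output-layer error term (follows from the assumed squared error);
    # for the sigmoid, f'(z) = a * (1 - a).
    delta3 = (a3 - y) * a3 * (1.0 - a3)

    # Hidden-layer error term:
    # delta_i^(2) = (sum_j W_ji^(2) delta_j^(3)) * f'(z_i^(2))
    delta2 = (W2.T @ delta3) * a2 * (1.0 - a2)

    # dJ/dW_ij^(l) = a_j^(l) * delta_i^(l+1);  dJ/db_i^(l) = delta_i^(l+1)
    grad_W2 = np.outer(delta3, a2)
    grad_b2 = delta3
    grad_W1 = np.outer(delta2, x)
    grad_b1 = delta2
    return grad_W1, grad_b1, grad_W2, grad_b2

# Illustrative data and parameters only.
rng = np.random.default_rng(1)
W1, b1 = rng.normal(scale=0.5, size=(3, 3)), rng.normal(scale=0.1, size=3)
W2, b2 = rng.normal(scale=0.5, size=(1, 3)), rng.normal(scale=0.1, size=1)
x, y = np.array([0.5, -1.0, 2.0]), np.array([1.0])

grad_W1, grad_b1, grad_W2, grad_b2 = backprop(x, y, W1, b1, W2, b2)

# Central finite difference on a single weight as a gradient check.
eps = 1e-6
W1_plus, W1_minus = W1.copy(), W1.copy()
W1_plus[0, 1] += eps
W1_minus[0, 1] -= eps
numeric = (loss(x, y, W1_plus, b1, W2, b2)
           - loss(x, y, W1_minus, b1, W2, b2)) / (2.0 * eps)
print(numeric, grad_W1[0, 1])
```

The numerical check is far too slow for training, but comparing it against the analytic gradient on a handful of weights is a common way to catch index or sign mistakes in a backpropagation implementation.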