CS885 Reinforcement Learning Lecture 4a: May 11, 2018
Deep Neural Networks [GBC] Chap. 6, 7, 8
CS885 Spring 2018 Pascal Poupart 1 University of Waterloo
CS885 Reinforcement Learning Lecture 4a: May 11, 2018 Deep Neural - - PowerPoint PPT Presentation
CS885 Reinforcement Learning Lecture 4a: May 11, 2018 Deep Neural Networks [GBC] Chap. 6, 7, 8 University of Waterloo CS885 Spring 2018 Pascal Poupart 1 Quick recap Markov Decision Processes: value iteration ( " + * ,- Pr "
CS885 Spring 2018 Pascal Poupart 1 University of Waterloo
CS885 Spring 2018 Pascal Poupart 2
'
'8 4 "-, 1- − 4(", 1)]
University of Waterloo
CS885 Spring 2018 Pascal Poupart 3
University of Waterloo
CS885 Spring 2018 Pascal Poupart 4
University of Waterloo
CS885 Spring 2018 Pascal Poupart 5
University of Waterloo
– Inputs: ' – Output: + – Weights (parameters): % – Bias: ) – Activation function (usually non-linear): ℎ
CS885 Spring 2018 Pascal Poupart 6 University of Waterloo
" = ℎ%('" % ( + * " (%))
. / + *- (.))
. ℎ% ∑2 1 "2 % 32 + * " (%) + *- (.)
CS885 Spring 2018 Pascal Poupart 7 University of Waterloo
3% 3. !% !. ,% 1%%
(%)
1%.
(%)
1.%
(%)
1..
(%)
1%%
(.)
1%.
(.)
input hidden
1 1 *%
(%)
*.
(%)
*%
(.)
+ +,-./
3 /.4 5 3
CS885 Spring 2018 Pascal Poupart 8 University of Waterloo
CS885 Spring 2018 Pascal Poupart 9 University of Waterloo
'
'
( (
– For each example (*', .'), adjust the weights as follows:
23 ← 1 23 − 5 6!'
23
CS885 Spring 2018 Pascal Poupart 10 University of Waterloo
CS885 Spring 2018 Pascal Poupart 11 University of Waterloo
University of Waterloo CS885 Spring 2018 Pascal Poupart 12
13
28.2 25.8 16.4 11.7 7.3 6.7 3.57 3.07 5.1 5 10 15 20 25 30 N E C ( 2 1 ) X R C E ( 2 1 1 ) A l e x N e t ( 2 1 2 ) Z F ( 2 1 3 ) V G G ( 2 1 4 ) G
l e L e N e t ( 2 1 4 ) R e s N e t ( 2 1 5 ) G
l e L e N e t
4 ( 2 1 6 ) H u m a n Classification error (%)
Features + SVMs Deep Convolutional Neural Nets 5 8 19 22 152
depth
CS885 Spring 2018 Pascal Poupart University of Waterloo
CS885 Spring 2018 Pascal Poupart 14
large gradient medium gradient small gradient
University of Waterloo
CS885 Spring 2018 Pascal Poupart 15
University of Waterloo
CS885 Spring 2018 Pascal Poupart 16
*+ *,- = #′(0%)# 0& *+ *,2 = #3 0% $%#′(0&)# 0' ≤ *+ *,- *+ *,5 = #3 0% $%#′(0&)$&#′(0')#(0() ≤ *+ *,2 *+ *,6 = #3 0% $%#3 0& $ 0' $'#′ 0( ) ≤ *+ *,5
) ℎ( ℎ' ℎ& ! $( $' $& $%
University of Waterloo
CS885 Spring 2018 Pascal Poupart 17
University of Waterloo
CS885 Spring 2018 Pascal Poupart 18
Rectified Linear Softplus
University of Waterloo