Applications
Lecture slides for Chapter 12 of Deep Learning www.deeplearningbook.org Ian Goodfellow 2018-10-25
Applications Lecture slides for Chapter 12 of Deep Learning - - PowerPoint PPT Presentation
Applications Lecture slides for Chapter 12 of Deep Learning www.deeplearningbook.org Ian Goodfellow 2018-10-25 Disclaimer Details of applications change much faster than the underlying conceptual ideas A printed book is updated on the
Lecture slides for Chapter 12 of Deep Learning www.deeplearningbook.org Ian Goodfellow 2018-10-25
(Goodfellow 2018)
underlying conceptual ideas
limitations of my own knowledge will be much more apparent in these slides than others
(Goodfellow 2018)
1950 1985 2000 2015 2056 10−2 10−1 100 101 102 103 104 105 106 107 108 109 1010 1011 Number of neurons (logarithmic scale) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Sponge Roundworm Leech Ant Bee Frog Octopus Human
Figure 1.11
(Goodfellow 2018)
(Goodfellow 2018)
(Goodfellow 2018)
TensorFlow tutorial
(Goodfellow 2018)
Blog post
(Goodfellow 2018)
model
(Goodfellow 2018)
(TensorFlow Lite) Important for mobile deployment
(Goodfellow 2018)
(Viola and Jones, 2001)
(Goodfellow 2018)
Outrageously Large Neural Networks
(Goodfellow 2018)
Affine Distortion Noise Elastic Deformation Horizontal flip Random Translation Hue Shift
(Goodfellow 2018)
Training Data Sample Generator (CelebA) (Karras et al, 2017)
Covered in Part III Progressed rapidly after the book was written Underlies many graphics and speech applications
(Goodfellow 2018)
(Table by Augustus Odena)
(Goodfellow 2018)
(Wang et al, 2018)
(Goodfellow 2018)
(Chan et al 2018)
(Goodfellow 2018)
(Goodfellow 2018)
(Hwang et al 2018)
(Goodfellow 2018)
α(t−1) α(t−1) α(t) α(t) α(t+1) α(t+1) h(t−1) h(t−1) h(t) h(t) h(t+1) h(t+1) c × × × +
Figure 12.6 Important in many vision, speech, and NLP applications Improved rapidly after the book was written
(Goodfellow 2018)
Attention mechanism from Wang et al 2018 Image model from Zhang et al 2018
(Goodfellow 2018)
(Goodfellow 2018)
(Bousmalis et al, 2017)
(Goodfellow 2018)
P(x1, . . . , xτ) = P(x1, . . . , xn−1)
τ
Y
t=n
P(xt | xt−n+1, . . . , xt−1). (12.5)
P(THE DOG RAN AWAY) = P3(THE DOG RAN)P3(DOG RAN AWAY)/P2(DOG RAN). (12.7)
Improve with:
(Goodfellow 2018)
−34 −32 −30 −28 −26 −14 −13 −12 −11 −10 −9 −8 −7 −6 Canada Europe Ontario North English Canadian Union African Africa British France Russian China Germany French Assembly EU Japan Iraq South European 35.0 35.5 36.0 36.5 37.0 37.5 38.0 17 18 19 20 21 22 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
Figure 12.3
(Goodfellow 2018)
(Goodfellow 2018)
(1) (0) (0,0,0) (0,0,1) (0,1,0) (0,1,1) (1,0,0) (1,0,1) (1,1,0) (1,1,1) (1,1) (1,0) (0,1) (0,0) w0 w0 w1 w1 w2 w2 w3 w3 w4 w4 w5 w5 w6 w6 w7 w7
Figure 12.4
(Goodfellow 2018)
Decoder Output object (English sentence) Intermediate, semantic representation Source object (French sentence or image) Encoder
Figure 12.5
(Goodfellow 2018)
Wu et al 2016
(Goodfellow 2018)
Chan et al 2015 “Listen, Attend, and Spell” Graphic from Current speech recognition is based on seq2seq with attention
(Goodfellow 2018)
WaveNet (van den Oord et al, 2016)
(Goodfellow 2018)
(Mnih et al 2013) Convolutional network estimates the value function (future rewards) used to guide the game-playing agent.
(Note: deep RL didn’t really exist when we started the book, became a success while we were writing it, extremely hot topic by the time the book was printed)
(Goodfellow 2018)
(Silver et al, 2016) Monte Carlo tree search, with convolutional networks for value function and policy
(Goodfellow 2018)
(Google Brain)
(Goodfellow 2018)
(Google Brain)
(Goodfellow 2018)
(WayMo)
(Goodfellow 2018)