a shallow survey of deep learning
Applications, Models, Algorithms and Theory (?) Chiyuan Zhang April 22, 2015
CSAIL, Poggio Lab
a shallow survey of deep learning Applications, Models, Algorithms - - PowerPoint PPT Presentation
a shallow survey of deep learning Applications, Models, Algorithms and Theory (?) Chiyuan Zhang April 22, 2015 CSAIL, Poggio Lab the state of the art the state of the art Deep learning is achieving the state of the art performance in many
CSAIL, Poggio Lab
2
1http://image-net.org/challenges/LSVRC/
3
7.5 15 22.5 30 Top-5 error 2010 2011 2012 2013 2014 Human ArXiv 2015
2Olga Russakovsky, . . ., Andrej Karpathy and Li Fei-Fei et.al. ImageNet Large Scale
4
3http://karpathy.github.io/2014/09/02/
what-i-learned-from-competing-against-a-convnet-on-imagenet/ 5
4Christian Szegedy, et. al. Going Deeper with Convolutions. arXiv:1409.4842 [cs.CV].
6
5Rob Fergus et. al. Intriguing properties of neural networks. arXiv:1312.6199 [cs.CV] 6Ian J. Goodfellow et. al. Explaining and Harnessing Adversarial Examples.
7
image source: Li Deng and Dong Yu. Deep Learning – Methods and Applications.
e.g. Baidu Deep Speech7, RNN trained on 100,000 hours of data (100, 000/365/5 ≈ 55).
7Andrew Ng et. al. Deep Speech: Scaling up end-to-end speech recognition. arXiv:1412.5567 [cs.CL]
8
9
10
8Google Deep Mind. Human-level control through deep reinforcement learning.
11
Montezuma's Revenge Private Eye Gravitar Frostbite Asteroids
Bowling Double Dunk Seaquest Venture Alien Amidar River Raid Bank Heist Zaxxon Centipede Chopper Command Wizard of Wor Battle Zone Asterix H.E.R.O. Q*bert Ice Hockey Up and Down Fishing Derby Enduro Time Pilot Freeway Kung-Fu Master Tutankham Beam Rider Space Invaders Pong James Bond Tennis Kangaroo Road Runner Assault Krull Name This Game Demon Attack Gopher Crazy Climber Atlantis Robotank Star Gunner Breakout Boxing Video Pinball At human-level or above Below human-level 100 200 300 400 4,500% 500 1,000 600 Best linear learner DQN
12
14
x1 x2 x3 x4 x5 x6 Input layer h1 h2 h3 Hidden layer ˜ x1 ˜ x2 ˜ x3 ˜ x4 ˜ x5 ˜ x6 Reconstruction layer
15
W
16
Ristricted Boltzmann Machine
x h1 h2 h3
Deep Belief Network
m
i=1
n
j=1
17
18
19
20
image source: http://arxiv.org/abs/1412.5567
21
22
image source: http://deeplearning.net/tutorial/lenet.html
23
24
26
image source: Another 53 objects database, http://www.vision.ee.ethz.ch/datasets/index.en.html.
27
28
1
1
29
1
1
29
29
30
30
30
31
32
m(x) = Eg
33
34
35
35
37
j
j
j∈N(i) h[j]
38
39
T,θ
N
i=1
40
convolution
pooling
classifier 41
42