CS109B Data Science 2
Pavlos Protopapas and Mark Glickman
Lecture 8: Convolutional Neural Networks 1
1
Lecture 8: Convolutional Neural Networks 1 CS109B Data Science 2 - - PowerPoint PPT Presentation
Lecture 8: Convolutional Neural Networks 1 CS109B Data Science 2 Pavlos Protopapas and Mark Glickman 1 Outline CS109B, P ROTOPAPAS , G LICKMAN 2 Main drawbacks of MLPs MLPs use one perceptron for each input (e.g. pixel in an image,
1
CS109B, PROTOPAPAS, GLICKMAN
2
CS109B, PROTOPAPAS, GLICKMAN
3
CS109B, PROTOPAPAS, GLICKMAN
4
CS109B, PROTOPAPAS, GLICKMAN
5
CS109B, PROTOPAPAS, GLICKMAN
6
CS109B, PROTOPAPAS, GLICKMAN
7
Round, elongated
protuberance Long white rectangular shape (neck) Oval-shaped white blob (body)
CS109B, PROTOPAPAS, GLICKMAN
8
Round, elongated head with orange
Long white neck, square shape Oval-shaped white body with or without large white symmetric blobs (wings)
CS109B, PROTOPAPAS, GLICKMAN
9
Round, elongated head with
be turned backwards Long white neck, can bend around, not necessarily straight White tail, generally far from the head, looks feathery White, oval shaped body, with or without wings visible Black feet, under body, can have different shapes Small black circles, can be facing the camera, sometimes can see both Black triangular shaped form, on the head, can have different sizes White elongated piece, can be squared or more triangular, can be obstructed sometimes
CS109B, PROTOPAPAS, GLICKMAN
10
CS109B, PROTOPAPAS, GLICKMAN
11
CS109B, PROTOPAPAS, GLICKMAN
12
FAST corner detection algorithm SIFT feature descriptor
CS109B, PROTOPAPAS, GLICKMAN
13
CS109B, PROTOPAPAS, GLICKMAN
14
CS109B, PROTOPAPAS, GLICKMAN
15
CS109B, PROTOPAPAS, GLICKMAN
16
CS109B, PROTOPAPAS, GLICKMAN
CS109B, PROTOPAPAS, GLICKMAN
CS109B, PROTOPAPAS, GLICKMAN
CS109B, PROTOPAPAS, GLICKMAN
wikipedia.org
CS109B, PROTOPAPAS, GLICKMAN
+ ReLU + ReLU
CS109B, PROTOPAPAS, GLICKMAN
22
CS109B, PROTOPAPAS, GLICKMAN
23
MLP CNN
CS109B, PROTOPAPAS, GLICKMAN
/
/
24
Function is inverted and shifted left by t
CS109B, PROTOPAPAS, GLICKMAN
25
CS109B, PROTOPAPAS, GLICKMAN
26
CS109B, PROTOPAPAS, GLICKMAN
27
CS109B, PROTOPAPAS, GLICKMAN
28
CS109B, PROTOPAPAS, GLICKMAN
29
Full padding. Introduces zeros such that all pixels are visited the same amount of times by the filter. Increases size of output. Same padding. Ensures that the
input.
CS109B, PROTOPAPAS, GLICKMAN
30
Convolutional layer with four 3x3 filters on a black and white image (just one channel) Convolutional layer with four 3x3 filters
filters are now cubes, and they are applied on the full depth of the image..
CS109B, PROTOPAPAS, GLICKMAN
31
CS109B, PROTOPAPAS, GLICKMAN
32
CS109B, PROTOPAPAS, GLICKMAN
33
CS109B, PROTOPAPAS, GLICKMAN
34
CS109B, PROTOPAPAS, GLICKMAN
35
CS109B, PROTOPAPAS, GLICKMAN
36
CS109B, PROTOPAPAS, GLICKMAN
37
CS109B, PROTOPAPAS, GLICKMAN
38
I/O
previous set of feature maps
2D map per filter Action
extract features
learned.
function on every value of feature map Parameters
and H only, D is defined by input cube)
and value
CS109B, PROTOPAPAS, GLICKMAN
39
CS109B, PROTOPAPAS, GLICKMAN
40
I/O
previous set of feature maps
2D map per filte, reduced spatial dimensions Action
dimensionality
average of a region
approach Parameters
CS109B, PROTOPAPAS, GLICKMAN
41
CS109B, PROTOPAPAS, GLICKMAN
42
I/O
cube, previous set of feature maps
2D map per filter Action
information from final feature maps
classification Parameters
usually changes depending on role of
info, use ReLU. If producing final classification, use Softmax.
CS109B, PROTOPAPAS, GLICKMAN
43
CS109B, PROTOPAPAS, GLICKMAN
44
CS109B, PROTOPAPAS, GLICKMAN
45
CS109B, PROTOPAPAS, GLICKMAN
46
Number of filters Size of Filters Number of channels of prev layer Biases (one per filter)
CS109B, PROTOPAPAS, GLICKMAN
47
Conv1 Conv2 Dense1 Dense2
CS109B, PROTOPAPAS, GLICKMAN
48
CS109B, PROTOPAPAS, GLICKMAN
49
CS109B, PROTOPAPAS, GLICKMAN
discovered by other researchers as well)
50
1 K. Fukushima. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position.
Biological Cybernetics, 36(4): 93-202, 1980.
CS109B, PROTOPAPAS, GLICKMAN
51
1 LeCun, Yann, et al. "Gradient-based learning applied to document recognition." Proceedings of the IEEE 86.11 (1998): 2278-2324.
CS109B, PROTOPAPAS, GLICKMAN
52
AlexNet
CS109B, PROTOPAPAS, GLICKMAN
53
CS109B, PROTOPAPAS, GLICKMAN
54
CS109B, PROTOPAPAS, GLICKMAN
55
1x1 convs to Reduce number
Inception module Proto Inception module
CS109B, PROTOPAPAS, GLICKMAN
56
Residual Block
CS109B, PROTOPAPAS, GLICKMAN
57
Residual Block
CS109B, PROTOPAPAS, GLICKMAN
58
CS109B, PROTOPAPAS, GLICKMAN
59
CS109B, PROTOPAPAS, GLICKMAN
60
CS109B, PROTOPAPAS, GLICKMAN
61