CS109A Introduction to Data Science
Pavlos Protopapas and Kevin Rader
Advanced Section #8: Neural Networks for Image Analysis
1
Advanced Section #8: Neural Networks for Image Analysis Camilo - - PowerPoint PPT Presentation
Advanced Section #8: Neural Networks for Image Analysis Camilo Fosco CS109A Introduction to Data Science Pavlos Protopapas and Kevin Rader 1 Outline Image analysis: why neural networks? Multi Layer Perceptron refresher
1
CS109A, PROTOPAPAS, RADER
2
CS109A, PROTOPAPAS, RADER
3
Round, elongated
protuberance Long white rectangular shape (neck) Oval-shaped white blob (body)
CS109A, PROTOPAPAS, RADER
4
Round, elongated head with orange
Long white neck, square shape Oval-shaped white body with or without large white symmetric blobs (wings)
CS109A, PROTOPAPAS, RADER
5
Round, elongated head with
be turned backwards Long white neck, can bend around, not necessarily straight White tail, generally far from the head, looks feathery White, oval shaped body, with or without wings visible Black feet, under body, can have different shapes Small black circles, can be facing the camera, sometimes can see both Black triangular shaped form, on the head, can have different sizes White elongated piece, can be squared or more triangular, can be obstructed sometimes
CS109A, PROTOPAPAS, RADER
6
CS109A, PROTOPAPAS, RADER
7
CS109A, PROTOPAPAS, RADER
8
FAST corner detection algorithm SIFT feature descriptor
CS109A, PROTOPAPAS, RADER
9
10
CS109A, PROTOPAPAS, RADER
11
𝑦" 𝑦# 𝑦$ 𝑦%
𝑍 = 𝑔(𝛾+ + 𝛾"𝑦" + 𝛾#𝑦# + 𝛾$𝑦$ + 𝛾%𝑦%) Input layer Hidden Layer Output Layer They can be more complex…
CS109A, PROTOPAPAS, RADER
12
CS109A, PROTOPAPAS, RADER
13
CS109A, PROTOPAPAS, RADER
14
CS109A, PROTOPAPAS, RADER
15
CS109A, PROTOPAPAS, RADER
16
CS109A, PROTOPAPAS, RADER
17
CS109A, PROTOPAPAS, RADER
18
19
CS109A, PROTOPAPAS, RADER
20
CS109A, PROTOPAPAS, RADER
21
MLP CNN
CS109A, PROTOPAPAS, RADER
22
Function is inverted and shifted left by t
CS109A, PROTOPAPAS, RADER
23
CS109A, PROTOPAPAS, RADER
24
CS109A, PROTOPAPAS, RADER
25
CS109A, PROTOPAPAS, RADER
26
CS109A, PROTOPAPAS, RADER
27
Full padding. Introduces zeros such that all pixels are visited the same amount of times by the filter. Increases size of output. Same padding. Ensures that the
input.
CS109A, PROTOPAPAS, RADER
28
Convolutional layer with four 3x3 filters on a black and white image (just one channel) Convolutional layer with four 3x3 filters
filters are now cubes, and they are applied on the full depth of the image..
CS109A, PROTOPAPAS, RADER
29
CS109A, PROTOPAPAS, RADER
30
CS109A, PROTOPAPAS, RADER
31
CS109A, PROTOPAPAS, RADER
32
CS109A, PROTOPAPAS, RADER
33
CS109A, PROTOPAPAS, RADER
34
CS109A, PROTOPAPAS, RADER
35
CS109A, PROTOPAPAS, RADER
36
I/O
previous set of feature maps
2D map per filter Action
extract features
learned.
function on every value of feature map Parameters
and H only, D is defined by input cube)
and value
CS109A, PROTOPAPAS, RADER
37
CS109A, PROTOPAPAS, RADER
38
I/O
previous set of feature maps
2D map per filte, reduced spatial dimensions Action
dimensionality
average of a region
approach Parameters
CS109A, PROTOPAPAS, RADER
39
CS109A, PROTOPAPAS, RADER
40
I/O
cube, previous set of feature maps
2D map per filter Action
information from final feature maps
classification Parameters
usually changes depending on role of
info, use ReLU. If producing final classification, use Softmax.
CS109A, PROTOPAPAS, RADER
41
CS109A, PROTOPAPAS, RADER
42
CS109A, PROTOPAPAS, RADER
43
CS109A, PROTOPAPAS, RADER
44
Number of filters Size of Filters Number of channels of prev layer Biases (one per filter)
CS109A, PROTOPAPAS, RADER
45
Conv1 Conv2 Dense1 Dense2
CS109A, PROTOPAPAS, RADER
46
47
CS109A, PROTOPAPAS, RADER
48
CS109A, PROTOPAPAS, RADER
49
CS109A, PROTOPAPAS, RADER
50
CS109A, PROTOPAPAS, RADER
51
CS109A, PROTOPAPAS, RADER
52
53
CS109A, PROTOPAPAS, RADER
54
1 K. Fukushima. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position.
Biological Cybernetics, 36(4): 93-202, 1980.
CS109A, PROTOPAPAS, RADER
55
1 LeCun, Yann, et al. "Gradient-based learning applied to document recognition." Proceedings of the IEEE 86.11 (1998): 2278-2324.
CS109A, PROTOPAPAS, RADER
56
AlexNet
CS109A, PROTOPAPAS, RADER
57
CS109A, PROTOPAPAS, RADER
58
CS109A, PROTOPAPAS, RADER
59
1x1 convs to Reduce number
Inception module Proto Inception module
CS109A, PROTOPAPAS, RADER
60
Residual Block
CS109A, PROTOPAPAS, RADER
61
Residual Block
CS109A, PROTOPAPAS, RADER
62
CS109A, PROTOPAPAS, RADER
63
CS109A, PROTOPAPAS, RADER
64
CS109A, PROTOPAPAS, RADER
65
CS109A, PROTOPAPAS, RADER
66
67
CS109A, PROTOPAPAS, RADER
68