1 Image Classification
BVM 2018 Tutorial: Advanced Deep Learning Methods Jakob Wasserthal, Division of Medical Image Computing
1 Image Classification BVM 2018 Tutorial: Advanced Deep Learning - - PowerPoint PPT Presentation
1 Image Classification BVM 2018 Tutorial: Advanced Deep Learning Methods Jakob Wasserthal, Division of Medical Image Computing Author Division Classification of skin cancer 02.11.16 | vs Esteva et al., Dermatologist-level classification of
BVM 2018 Tutorial: Advanced Deep Learning Methods Jakob Wasserthal, Division of Medical Image Computing
02.11.16 | Author Division | Jakob Wasserthal 2
Esteva et al., Dermatologist-level classification of skin cancer with deep neural networks, Nature, 2017
vs
02.11.16 | Author Division | Jakob Wasserthal 3
malignant benign
Esteva et al., Dermatologist-level classification of skin cancer with deep neural networks, Nature, 2017
vs
02.11.16 | Author Division | Jakob Wasserthal 4
p(malignant) p(benign)
0.98 0.02
02.11.16 | Author Division | Jakob Wasserthal
5
2012 2013 2014 2015 top-5 error
AlexNet 15.3% ZFNet 11.2% GoogLeNet 6.67% ResNet 3.57% Human 5.1% second best 26.2% Inception v3 3.5%
2017
DenseNet ~3.5% VGG 7.3%
02.11.16 | Author Division | Jakob Wasserthal
6
2012 2013 2014 2015 top-5 error
AlexNet 15.3% ZFNet 11.2% GoogLeNet 6.67% ResNet 3.57% Human 5.1% second best 26.2% Inception v3 3.5%
2017
DenseNet ~3.5% VGG 7.3%
02.11.16 | Author Division | Jakob Wasserthal
7
Simonyan et al.,Very deep convolutional networks for large-scale image recognition, arXiv, 2014 He et al., Deep Residual Learning for Image Recognition, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
8
2012 2013 2014 2015 top-5 error
AlexNet 15.3% ZFNet 11.2% ResNet 3.57% Human 5.1% second best 26.2% Inception v3 3.5%
2017
DenseNet ~3.5% GoogLeNet 6.67% VGG 7.3%
02.11.16 | Author Division | Jakob Wasserthal
9
Inception module
Szegedy et al., Going Deeper with Convolutions, arXiv, 2014
02.11.16 | Author Division | Jakob Wasserthal
10 stride=1
Szegedy et al., Going Deeper with Convolutions, arXiv, 2014
02.11.16 | Author Division | Jakob Wasserthal
11 stride=1
WxHx256 WxHx256 WxHx256 WxHx256 WxHx256 WxHx(256+256+256+256) = WxHx1024
[Width x Height x Nr of Filters]
Szegedy et al., Going Deeper with Convolutions, arXiv, 2014
02.11.16 | Author Division | Jakob Wasserthal
12
WxHx256 WxHx128 WxHx128 WxHx192 WxHx32 WxHx96 WxHx64 WxHx256 WxHx(128+192+96+64) = WxHx480
stride=1 stride=1
[Width x Height x Nr of Filters]
WxHx256 WxHx256 WxHx256 WxHx256 WxHx256 WxHx(256+256+256+256) = WxHx1024
Szegedy et al., 2014
02.11.16 | Author Division | Jakob Wasserthal
13
Szegedy et al., Going Deeper with Convolutions, arXiv, 2014
VGG GoogLeNet
14x14x512 1x1x1024 1000 1000*1024=1M 7x 7x512=25.088 4094 25088*4094=102M Data dimensions #Parameters 7x7x1024 Data dimensions #Parameters
02.11.16 | Author Division | Jakob Wasserthal
14
Inception module
Szegedy et al., Going Deeper with Convolutions, arXiv, 2014
02.11.16 | Author Division | Jakob Wasserthal
15
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
Parameters: 5x5-convolution: 5*5=25 2* 3x3-convolution: 2* (3*3)=18 => ~30% less parameters and computations
02.11.16 | Author Division | Jakob Wasserthal
16
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
17
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
Parameters: 3x3-convolution: 3*3=9 2* 1x3-convolution: 2* (1*3)=6 => ~33% less parameters and computations
02.11.16 | Author Division | Jakob Wasserthal
18
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
19
Representational bottleneck 3x more computations
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
20
Representational bottleneck 3x more computations
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
21
Optimised Inception module
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
22
Szegedy et al., Rethinking the Inception Architecture for Computer Vision, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
23
2012 2013 2014 2015 top-5 error
AlexNet 15.3% ZFNet 11.2% ResNet 3.57% Human 5.1% second best 26.2% Inception v3 3.5%
2017
DenseNet ~3.5% GoogLeNet 6.67% VGG 7.3%
02.11.16 | Author Division | Jakob Wasserthal
24
Esteva et al., Dermatologist-level classification of skin cancer with deep neural networks, Nature, 2017
02.11.16 | Author Division | Jakob Wasserthal
25
Gulshan et al., Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs, JAMA, 2016
02.11.16 | Author Division | Jakob Wasserthal
26
2012 2013 2014 2015 top-5 error
AlexNet 15.3% ZFNet 11.2% ResNet 3.57% Human 5.1% second best 26.2% Inception v3 3.5%
2017
DenseNet ~3.5% GoogLeNet 6.67% VGG 7.3%
02.11.16 | Author Division | Jakob Wasserthal
27
He et al., Deep Residual Learning for Image Recognition, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
28
He et al., Deep Residual Learning for Image Recognition, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
29
02.11.16 | Author Division | Jakob Wasserthal
30
He et al., Deep Residual Learning for Image Recognition, arXiv, 2015
02.11.16 | Author Division | Jakob Wasserthal
31
2012 2013 2014 2015 top-5 error
AlexNet 15.3% ZFNet 11.2% ResNet 3.57% Human 5.1% second best 26.2% Inception v3 3.5%
2017
DenseNet ~3.5% GoogLeNet 6.67% VGG 7.3%
02.11.16 | Author Division | Jakob Wasserthal
32
Huang et al., Densely Connected Convolutional Networks, CVPR, 2017
02.11.16 | Author Division | Jakob Wasserthal
33
Huang et al., Densely Connected Convolutional Networks, CVPR, 2017
02.11.16 | Author Division | Jakob Wasserthal
34
Source: The Radiology Assistant : Bi-RADS for Mammography and Ultrasound 2013
02.11.16 | Author Division | Jakob Wasserthal
35
p(normal) p(diabetic)
0.98 0.02
A deep neural network is often considered as a “black box”.
02.11.16 | Author Division | Jakob Wasserthal
36
“What parts of the input image affect the decision?”
Gulshan et al., Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs, 2016
02.11.16 | Author Division | Jakob Wasserthal
37
0.98 0.02
pdog(x) pcat(x)
Slides by courtesy of Paul Jäger
02.11.16 | Author Division | Jakob Wasserthal
38
pdog(x)
0.98 0.02
“What parts of the input image affect the decision?” “backprop into image”:
pcat(x)
Slides by courtesy of Paul Jäger
02.11.16 | Author Division | Jakob Wasserthal
39
Slides by courtesy of Paul Jäger
02.11.16 | Author Division | Jakob Wasserthal
40
Jamaludin et al., SpineNet: Automated classification and evidence visualization in spinal MRIs, Medical image analysis, 2017
02.11.16 | Author Division | Jakob Wasserthal
41
02.11.16 | Author Division | Jakob Wasserthal
42
02.11.16 | Author Division | Jakob Wasserthal
43
Trick: Backprop into a mask m multiplied with the image to be the “minimal destroying region”. “Interpretable Explanations of Black Boxes by Meaningful Perturbation” Ruth et al., arXiv, 2018
02.11.16 | Author Division | Jakob Wasserthal
44
network training: image perturbation: vs
ij = wij − αdcdog(p(x))
ij = mij − αdc∗(pdog(x), m)
02.11.16 | Author Division | Jakob Wasserthal
45
Avoid high frequency artefacts by enforcing a smooth structure:
02.11.16 | Author Division | Jakob Wasserthal
46
result: ability to verify the underlying functionality
02.11.16 | Author Division | Jakob Wasserthal
47
(source: Fei-Fei Li & Justin Johnson & Serena Young, cs231n 2017, Lecture 12)
02.11.16 | Author Division | Jakob Wasserthal
48
“Primary cause of NN’s vulnerability to adversarial perturbations is their [piecewise] linear nature”
(Explaining and Harnessing Adversarial Examples, Goodfellow et al., 2015) (source: Ian Goodfellow, cs231n 2017, Lecture 16)
ReLU Sigmoid