AMMI – Introduction to Deep Learning 7.3. Networks for object detection
Fran¸ cois Fleuret https://fleuret.org/ammi-2018/ Wed Aug 29 16:58:03 CAT 2018
ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE
AMMI Introduction to Deep Learning 7.3. Networks for object - - PowerPoint PPT Presentation
AMMI Introduction to Deep Learning 7.3. Networks for object detection Fran cois Fleuret https://fleuret.org/ammi-2018/ Wed Aug 29 16:58:03 CAT 2018 COLE POLYTECHNIQUE FDRALE DE LAUSANNE The simplest strategy to move from image
ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 1 / 15
Input image Conv layers Max-pooling 1000d FC layers classication Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 2 / 15
Input image Conv layers Max-pooling 1000d FC layers classication 4d FC layers Localization Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 2 / 15
Figure 7: Examples of bounding boxes produced by the regression network, before being com- bined into final predictions. The examples shown here are at a single scale. Predictions may be more optimal at other scales depending on the objects. Here, most of the bounding boxes which are initially organized as a grid, converge to a single location and scale. This indicates that the network is very confident in the location of the object, as opposed to being spread out randomly. The top left image shows that it can also correctly identify multiple location if several objects are present. The various aspect ratios of the predicted bounding boxes shows that the network is able to cope with various object poses.
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 3 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 4 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 4 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 5 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 5 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 6 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 6 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 6 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 7 / 15
448 448 3 7 7
7x7x64-s-2 Maxpool Layer 2x2-s-2
3 3 112 112 192 3 3 56 56 256
4096
3x3x192 Maxpool Layer 2x2-s-2
1x1x128 3x3x256 1x1x256 3x3x512 Maxpool Layer 2x2-s-2
3 3 28 28 512
1x1x256 3x3x512 1x1x512 3x3x1024 Maxpool Layer 2x2-s-2
3 3 14 14 1024
1x1x512 3x3x1024 3x3x1024 3x3x1024-s-2
3 3 7 7 1024 7 7 1024 7 7 30
3x3x1024 3x3x1024
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 8 / 15
448 448 3 7 7
7x7x64-s-2 Maxpool Layer 2x2-s-2
3 3 112 112 192 3 3 56 56 256
4096
3x3x192 Maxpool Layer 2x2-s-2
1x1x128 3x3x256 1x1x256 3x3x512 Maxpool Layer 2x2-s-2
3 3 28 28 512
1x1x256 3x3x512 1x1x512 3x3x1024 Maxpool Layer 2x2-s-2
3 3 14 14 1024
1x1x512 3x3x1024 3x3x1024 3x3x1024-s-2
3 3 7 7 1024 7 7 1024 7 7 30
3x3x1024 3x3x1024
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 8 / 15
448 448 3 7 7
7x7x64-s-2 Maxpool Layer 2x2-s-2
3 3 112 112 192 3 3 56 56 256
4096
3x3x192 Maxpool Layer 2x2-s-2
1x1x128 3x3x256 1x1x256 3x3x512 Maxpool Layer 2x2-s-2
3 3 28 28 512
1x1x256 3x3x512 1x1x512 3x3x1024 Maxpool Layer 2x2-s-2
3 3 14 14 1024
1x1x512 3x3x1024 3x3x1024 3x3x1024-s-2
3 3 7 7 1024 7 7 1024 7 7 30
3x3x1024 3x3x1024
ˆ
xi,1
ˆ
yi,1
ˆ
wi,1
ˆ
hi,1
ˆ
ci,1 . . .
ˆ
xi,B
ˆ
yi,B
ˆ
wi,B
ˆ
hi,B
ˆ
ci,B
ˆ
pi,1 . . .
ˆ
pi,C
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 8 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 9 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 9 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 10 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 11 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 12 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 12 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 12 / 15
300 300 3
VGG-16 through Conv5_3 layer
19 19 Conv7 (FC7) 1024 10 10 Conv8_2 512 5 5 Conv9_2 256 3 Conv10_2 256 256 38 38 Conv4_3 3 1 Image
Conv: 1x1x1024 Conv: 1x1x256 Conv: 3x3x512-s2 Conv: 1x1x128 Conv: 3x3x256-s2 Conv: 1x1x128 Conv: 3x3x256-s1
Detections:8732 per Class
Classifier : Conv: 3x3x(4x(Classes+4))
512 448 448 3 Image 7 7 1024 7 7 30
Fully Connected
YOLO Customized Architecture Non-Maximum Suppression
Fully Connected
Non-Maximum Suppression Detections: 98 per class
Conv11_2
74.3mAP 59FPS 63.4mAP 45FPS
Classifier : Conv: 3x3x(6x(Classes+4))
19 19 Conv6 (FC6) 1024
Conv: 3x3x1024
SSD YOLO Extra Feature Layers
Conv: 1x1x128 Conv: 3x3x256-s1 Conv: 3x3x(4x(Classes+4))
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 13 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 14 / 15
AlexNet (Krizhevsky et al., 2012) Overfeat (Sermanet et al., 2013) Box regression R-CNN (Girshick et al., 2013) Region proposal + crop in image Fast R-CNN (Girshick, 2015) Crop in feature maps Faster R-CNN (Ren et al., 2015) Convolutional region proposal YOLO (Redmon et al., 2015) No crop SSD (Liu et al., 2015) Fully convolutional + multi-scale maps Multi-scale convolutions + multi boxes Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 7.3. Networks for object detection 15 / 15