Object Detection
- Prof. Kuan-Ting Lai
2020/5/5
Object Detection Prof. Kuan-Ting Lai 2020/5/5 2 YOLO v2 - - PowerPoint PPT Presentation
Object Detection Prof. Kuan-Ting Lai 2020/5/5 2 YOLO v2 https://www.youtube.com/watch?v=VOC3huqHrss&t=40s 3 Detection vs Classification Classification Ex: ImageNet Large-scale Visual Recognition Challenge (Classify 1000
2020/5/5
2
3
−Ex: ImageNet Large-scale Visual Recognition Challenge (Classify 1000 categories)
4
5
6
7
9
10
shallow architecture
11
image and get all the regions of interest (RoI)
12
with neural networks
13
14
15
16
Algorithm Features Prediction time Limitations RCNN
generate regions.
from each image. 40-50 secs High computation time as each region is passed to the CNN separately Fast RCNN
to the CNN and feature maps are extracted.
maps to generate predictions. 2 secs Selective search is slow and hence computation time is still high. Faster RCNN
method with region proposal network. 0.2 secs Object proposal takes time
17
18
19
20
https://heartbeat.fritz.ai/gentle-guide-on-how-yolo-object-localization-works-with-keras-part-2-65fe59ac12d
21
https://www.coursera.org/lecture/convolutional-neural-networks/anchor-boxes-yNwO0
22
− VGG (30.69 billion FLOPS) − GoogLeNet (8.52 billion FLOPS) − DarkNet (5.58 billion FLOPS)
to extract features and 1 × 1 filters to reduce output channels
23
24
25
26
27
− WRC, CSP, CmBN, SAT, Mish activation, Mosaic data augmentation, CmBN, DropBlock regularization, and CIoU loss
28
29
30
31
32
33
34
35
36
37
− Mingxing Tan Ruoming Pang Quoc V. Le, ‘‘EfficientDet: Scalable and Efficient Object Detection”, Google Research, Brain Team
38
39
40
41
42
https://www.analyticsvidhya.com/blog/2019/07/computer-vision-implementing-mask-r-cnn-image-segmentation/
https://github.com/matterport/ Mask_RCNN.git
4 3
*Create a virtual environment with TensorFlow=1.3 and Keras=2.1
44
k_rcnn_coco.h5
45
tensorflow-using-google-colab-7cbc484f83d7
46
detection-algorithms-36d53571365e
introduction-to-the-basic-object-detection-algorithms-part-1/
works-with-keras-part-2-65fe59ac12d
shot-detection-cb320e3bb0de
47