Mincheul Kang
You Only Look Once: Unified, Real-Time Object Detection
1
Redmon et al., CVPR 2016
You Only Look Once: Unified, Real-Time Object Detection Redmon et - - PowerPoint PPT Presentation
You Only Look Once: Unified, Real-Time Object Detection Redmon et al., CVPR 2016 Mincheul Kang 1 Image Retrieval using Scene Graphs Develop novel framework for semantic image retrieval based on the notion of a scene graph Use scene
Mincheul Kang
You Only Look Once: Unified, Real-Time Object Detection
1
Redmon et al., CVPR 2016
Image Retrieval using Scene Graphs
2
retrieval based on the notion of a scene graph
scene graphs grounded to images
Object & Attribute Relationship
Query
Output
Measure Score
Contents
3
Background
4
Localization Recognition What? Where?
Fast R-CNN slides : Ross Girshick
Background
5
Need a fast and accurate algorithms
http://www.nvidia.com/object/drive-px.html http://kitschthingoftheday.blogspot.com/2011/06/breakfast-making-robots-at-tum.html
Background
6 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 80% 70% 60% 50% 40% 30% 20% 10% 0%
year mean Average Precision (mAP)
PASCAL VOC R-CNN DPM Fast R-CNN Faster R-CNN
After CNN Machine learning + Computer vision
Related work
7
each region
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, Ross Girshick et al., CVPR 2014
Related work
8
Related work
9
Fast R-CNN, Ross Girshick et al., ICCV 2015
Related work
10
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, Shaoqing Ren et al., NIPS 2015 and Slides
Related work
11
precision
Overview
12
1) Resizes the input images to 448 X 448 2) Runs a single convolutional networks on the image 3) Thresholds the resulting detections by the model’s confidence
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Approach
13
Input image
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Approach
14
confidence scores for those boxes.
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Approach
15
probabilities
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Approach
16
model’s confidence
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Approach
17
image simultaneously
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Approach
18
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Result
19
from internet
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Result
20
average precision
69.0
You only look once: Unified, real-time object detection, J Redmon et al., CVPR 2016
Conclusion
21
directly on detection
simultaneously
as flocks of birds
22