Object Detection
Ujjwal Post-Doc, STARS Team INRIA Sophia Antipolis
Object Detection Ujjwal Post-Doc, STARS Team INRIA Sophia - - PowerPoint PPT Presentation
Object Detection Ujjwal Post-Doc, STARS Team INRIA Sophia Antipolis Outline What is Object Detection ? Qualitative Definition. Machine Learning Definition. Ingredients of Object Detection. Components of a typical deep
Ujjwal Post-Doc, STARS Team INRIA Sophia Antipolis
Classification There is a dog. Detection There is a dog with a bounding box around it.
Classification There is a dog. Detection There is a dog with a bounding box around it.
Object Detection = Classification + Localization
Classification ( N classes)
Detection ( N classes)
bounding box.
Base Network Detection Specific Post-Processing Pre-Processing Loss Functions Image
connected layers.
framework.β
Base Network Pre-Processing RPN RCNN Post-Processing RCNN Loss RPN Loss Detection Specific Components
intractable.
which need be processed.
RPN Output: Proposals Original Image
Base Network Extra Convolutional Layers (Optional and must be decided by experimentation) Feature Map
Feature Map with object bounding box ππ β Pool of predefined anchors
ππ
bounding boxes (called anchors) with different scales/aspect-ratios.
and measure the intersection-over- union with every GT box.
anchor i.e donβt do any computations.
and measure the intersection-over- union with every GT box.
anchor i.e donβt do any computations.
and measure the intersection-over- union with every GT box.
anchor i.e donβt do any computations.
and measure the intersection-over- union with every GT box.
anchor i.e donβt do any computations.
Feature Probing It is just a convolution of a feature map with a kernel. 2 X #anchors per location 4 X #anchors per location
An Anchor A Convolutional Kernel
An Anchor A Convolutional Kernel
completely inside an anchor.
convolution is relatively incomplete.
representative of all the confocal anchors.
all.
better fit it to the bounding box of the object in the training set.
RPN Output: Proposals Original Image
Base Network Pre-Processing RPN RCNN Post-Processing RCNN Loss RPN Loss Detection Specific Components Covered
RCNN during test time.
function.
maintained during RPN training.
refers to negative examples.
deep learning system.
Feature Pooling A RPN Proposal Number of classes + 1 4 + 1 because of background
to extract features inside a subregion of an image or feature map.
Features inside the shaded area are extracted.
feature vector from a region.
speed.
interpolation.
RCNN.
for bounding box regression ?
bounding box coordinates are ignored.
bounding box coordinates are not ignored.
(60%).
intuition.