YOLO: You Only Look Once Unified Real-Time Object Detection Joseph - - PowerPoint PPT Presentation

β–Ά
yolo you only look once
SMART_READER_LITE
LIVE PREVIEW

YOLO: You Only Look Once Unified Real-Time Object Detection Joseph - - PowerPoint PPT Presentation

YOLO: You Only Look Once Unified Real-Time Object Detection Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi [Website] [Paper] [arXiv] [Reviews] Slides by: Andrea Ferri For: Computer Vision Reading Group (08/03/16) INTRODUCTION


slide-1
SLIDE 1

YOLO: You Only Look Once

Unified Real-Time Object Detection

Slides by: Andrea Ferri For: Computer Vision Reading Group (08/03/16)

Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi [Website] [Paper] [arXiv] [Reviews]

slide-2
SLIDE 2

INTRODUCTION

slide-3
SLIDE 3

Nowadays State of the Art approach, are so architected:

Conv Layer 5 Conv layers RPN RPN Proposals RPN Proposals Class probabilities RoI pooling layer FC layers Class scores

slide-4
SLIDE 4

This complex pipeline means that:

Slow Pipeline Single Pipelines Hard to Optimize Need Parallel Training for Components

slide-5
SLIDE 5

WHAT’S NEW?

(In the architecture approach.)

slide-6
SLIDE 6

Developed as Single Convolutional Network Reason Globally on the Entire Image Learns Generalizable Representations

Easy & Fast

Detection as Single Regression Problem

Concepts

slide-7
SLIDE 7

Unified Detection

slide-8
SLIDE 8

Divide the image into a SxS grid.

If the center of an object fall into a grid cell, it will be the responsible for the object.

Each grid cell predict:

B bounding boxes; B confidence scores as C=Pr(Obj)*IOU;

Confidence Prediction is obtained as IOU of predicted box and any ground truth box.

C cond. Class prob. as P=Pr(π‘«π’Žπ’ƒπ’•π’•π’‹|Object);

slide-9
SLIDE 9

We obtain the class-specific confidence score as:

Pr(π‘«π’Žπ’ƒπ’•π’•π’‹|Object)*Pr(Object)*IOU = Pr(π‘«π’Žπ’ƒπ’•π’•π’‹)*IOU

slide-10
SLIDE 10

Design

slide-11
SLIDE 11

Loss-Function

slide-12
SLIDE 12

Limitations

Struggle with Small Object. Loss function threats errors in different boxes ratio at the same. Struggle with Different aspects and ratios

  • f objects.

Loss function is an approximation.

slide-13
SLIDE 13

EXPERIMENTS

(How performs?.)

slide-14
SLIDE 14

General Comparison

slide-15
SLIDE 15

Fast R-CNN & YOLO

slide-16
SLIDE 16

Fast R-CNN & YOLO

Using YOLO accuracy for Big object to avoid detection mistakes into Fast R-CNN:

slide-17
SLIDE 17

Fast R-CNN & YOLO

slide-18
SLIDE 18

SUMMARY

(Why is an interesting approach.)

slide-19
SLIDE 19

The fastest general-purpose object detector in the literature. Trained on a loss function that directly corresponds to detection performance. The entire model is trained jointly. At least detection at 45fps.

Pros

slide-20
SLIDE 20
  • You Only Look Once: Unified, Real-Time Object Detection,

Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi.

References

slide-21
SLIDE 21

QUESTIONS?

THANKS !!!