YOLO: You Only Look Once Unified Real-Time Object Detection

Jul 24, 2023

SLIDE 1

YOLO: You Only Look Once

Unified Real-Time Object Detection

Slides by: Andrea Ferri For: Computer Vision Reading Group (08/03/16)

Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi [Website] [Paper] [arXiv] [Reviews]

SLIDE 2

INTRODUCTION

SLIDE 3

Current state-of-the-art approaches are architected as follows:

[Figure: detection pipeline. Labels: conv layers (5 conv layers), RPN, RPN proposals, class probabilities, RoI pooling layer, FC layers, class scores.]

SLIDE 4

This complex pipeline means that:

The pipeline is slow.

Individual pipeline stages are hard to optimize.

Components need to be trained separately.

SLIDE 5

WHAT’S NEW?

(In the architecture approach.)

SLIDE 6

Developed as a single convolutional network.

Reasons globally about the entire image.

Learns generalizable representations.

Easy & Fast

Detection as Single Regression Problem

Concepts

SLIDE 7

Unified Detection

SLIDE 8

Divide the image into a SxS grid.

If the center of an object falls into a grid cell, that grid cell is responsible for detecting the object.
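The cell assignment described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name `responsible_cell`, the use of center coordinates normalized to [0, 1], and the default S = 7 are assumptions made for the example.

```python
from typing import Tuple


def responsible_cell(cx: float, cy: float, S: int = 7) -> Tuple[int, int]:
    """Return (row, col) of the grid cell that contains an object's center.

    cx, cy are the center coordinates normalized to [0, 1] (assumption).
    S is the grid size; 7 is used here only as an illustrative default.
    """
    if not (0.0 <= cx <= 1.0 and 0.0 <= cy <= 1.0):
        raise ValueError("center coordinates must be normalized to [0, 1]")
    # Scale to grid units and truncate; clamp so that cx == 1.0 or cy == 1.0
    # maps to the last cell instead of falling outside the grid.
    col = min(int(cx * S), S - 1)
    row = min(int(cy * S), S - 1)
    return row, col
```

For example, an object centered in the middle of the image on a 7x7 grid is assigned to the center cell, row 3 and column 3.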

Each grid cell predicts:

B bounding boxes;

B confidence scores, each defined as confidence = Pr(Object) * IOU;

The confidence prediction represents the IOU between the predicted box and any ground truth box.

C conditional class probabilities, P = Pr(Class_i|Object).
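To make the confidence definition concrete, here is a small sketch of IOU and of the product Pr(Object) * IOU. The corner box format `(x_min, y_min, x_max, y_max)` and the helper names `iou` and `box_confidence` are assumptions for illustration; the slides do not specify a box encoding.

```python
from typing import Tuple

# Assumed box format: (x_min, y_min, x_max, y_max)
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    # Width and height of the overlap; zero if the boxes do not intersect.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def box_confidence(pr_object: float, pred: Box, truth: Box) -> float:
    """Confidence as defined on the slide: Pr(Object) * IOU."""
    return pr_object * iou(pred, truth)
```

With this definition, a cell that contains no object (Pr(Object) = 0) has confidence 0 regardless of where its predicted box lies.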

SLIDE 9

We obtain the class-specific confidence score as:

Pr(Class_i|Object) * Pr(Object) * IOU = Pr(Class_i) * IOU
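The product above can be sketched as a per-box computation: each conditional class probability of the cell is multiplied by the box confidence. The function name `class_specific_scores` and the list-based inputs are hypothetical and chosen only for this example.

```python
from typing import List


def class_specific_scores(cond_class_probs: List[float],
                          box_conf: float) -> List[float]:
    """Class-specific confidence scores for one predicted box.

    cond_class_probs: Pr(Class_i | Object) for each class of the grid cell.
    box_conf: the box confidence, Pr(Object) * IOU.
    """
    return [p * box_conf for p in cond_class_probs]
```

For instance, with conditional class probabilities 0.5 and 0.25 and a box confidence of 0.5, the class-specific scores are 0.25 and 0.125.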

SLIDE 10

Design

SLIDE 11

Loss-Function

SLIDE 12

Limitations

Struggles with small objects.

The loss function treats errors the same in boxes of different sizes.

Struggles with objects of different aspect ratios.

The loss function is an approximation.

SLIDE 13

EXPERIMENTS

(How does it perform?)

SLIDE 14

General Comparison

SLIDE 15

Fast R-CNN & YOLO

SLIDE 16

Fast R-CNN & YOLO

Using YOLO's accuracy on large objects to avoid detection mistakes in Fast R-CNN:

SLIDE 17

Fast R-CNN & YOLO

SLIDE 18

SUMMARY

(Why is it an interesting approach?)

SLIDE 19

The fastest general-purpose object detector in the literature.

Trained on a loss function that directly corresponds to detection performance.

The entire model is trained jointly.

Detection at 45 fps or faster.

Pros

SLIDE 20

You Only Look Once: Unified, Real-Time Object Detection,

Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi.

References

SLIDE 21

QUESTIONS?

THANKS !!!