Category-level localization g y Cordelia Schmid Cordelia Schmid - - PowerPoint PPT Presentation

category level localization g y
SMART_READER_LITE
LIVE PREVIEW

Category-level localization g y Cordelia Schmid Cordelia Schmid - - PowerPoint PPT Presentation

Category-level localization g y Cordelia Schmid Cordelia Schmid Recognition Recognition Classification Classification Object present/absent in an image Often presence of a significant amount of background clutter


slide-1
SLIDE 1

Category-level localization g y

Cordelia Schmid Cordelia Schmid

slide-2
SLIDE 2

Recognition Recognition

  • Classification
  • Classification

– Object present/absent in an image – Often presence of a significant amount of background clutter

  • Localization / Detection

– Localize object within the frame – Bounding box or pixel- level segmentation

slide-3
SLIDE 3

Pixel-level object classification Pixel level object classification

slide-4
SLIDE 4

Difficulties Difficulties

Intra class variations

  • Intra-class variations
  • Scale and viewpoint change
  • Multiple aspects of categories
slide-5
SLIDE 5

Approaches Approaches

Intra class variation

  • Intra-class variation

=> Modeling of the variations, mainly by learning from a large dataset for example by SVMs large dataset, for example by SVMs

  • Scale + limited viewpoints changes
  • Scale + limited viewpoints changes

=> multi-scale approach or invariant local features

  • Multiple aspects of categories

> separate detectors for each aspect front/profile face => separate detectors for each aspect, front/profile face, build an approximate 3D “category” model

slide-6
SLIDE 6

Approaches Approaches

Localization (bounding box)

  • Localization (bounding box)

– Hough transform – Sliding window approach Sliding window approach

  • Localization (segmentation)

( g )

– Shape based – Pixel-based +MRF – Segmented regions + classification

slide-7
SLIDE 7

Hough voting

  • Use Hough space voting to find objects of a class

y y

Learning

g p g j

  • Implicit shape model [Leibe and Schiele ’03,’05]

y x s x s y

Learning

  • Learn appearance codebook

– Cluster over interest points on training images

y y

  • Learn spatial distributions

– Match codebook to training images – Record matching positions on object – Centroid + scale is given

Spatial occurrence distributions

x s x s Probabilistic Interest Points Matched Codebook

Recognition

g

Probabilistic Voting Interest Points Entries

Recognition

slide-8
SLIDE 8

Hough voting Hough voting

[O lt Pi Zi ECCV 2006] [Opelt, Pinz,Zisserman, ECCV 2006]

slide-9
SLIDE 9

Localization with sliding window Localization with sliding window

Training

Positive examples Negative examples

Description + Learn a classifier

slide-10
SLIDE 10

Localization with sliding window Localization with sliding window

T ti t lti l l ti d l Testing at multiple locations and scales Find local maxima non-maxima suppression Find local maxima, non-maxima suppression

slide-11
SLIDE 11

Sliding Window Detectors

Detection Phase

Scan image(s) at all scales and locations Scale-space pyramid Extract features over windows

`

Run window classifier at all locations Fuse multiple Detection window detections in 3-D position & scale space

11

Object detections with bounding boxes

slide-12
SLIDE 12

Haar Wavelet / SVM Human Detector

Haar wavelet descriptors

Haar Wavelet / SVM Human Detector

Training set (2k positive / 10k negative) S t training

1326-D descriptor

Support vector machine test d i results test descriptors Multi-scale search

12

Test image

[Papageorgiou & Poggio, 1998]

slide-13
SLIDE 13

Which Descriptors are Important? c esc p o s a e po a

32x32 descriptors 16x16 descriptors

Mean response difference between positive & Mean response difference between positive & negative training examples Essentially just a coarse-scale human silhouette template!

slide-14
SLIDE 14

Some Detection Results

slide-15
SLIDE 15

PASCAL VOC dataset - localization

  • 20 object classes (aeroplane, bicycle, bird, etc.)
  • Bounding box annotations for training and evaluation
  • Viewpoint information : front, rear, left, right, unspecified
  • Other information : truncated, occluded, difficult
slide-16
SLIDE 16

PASCAL dataset PASCAL dataset

slide-17
SLIDE 17

PASCAL dataset PASCAL dataset

slide-18
SLIDE 18

Evaluating localization with bounding boxes Evaluating localization with bounding boxes

slide-19
SLIDE 19

Evaluation

Evaluating localization with bounding boxes

Evaluation

Evaluating localization with bounding boxes