Particular Object Retrieval with Integral Max-pooling of CNN - - PowerPoint PPT Presentation

particular object retrieval
SMART_READER_LITE
LIVE PREVIEW

Particular Object Retrieval with Integral Max-pooling of CNN - - PowerPoint PPT Presentation

Particular Object Retrieval with Integral Max-pooling of CNN Activations Tolias et al. ICLR 2016 Presented by Jaehyeong Cho Contents Introduction Related works Main approaches Results Conclusion Introduction How to find


slide-1
SLIDE 1

Particular Object Retrieval

with

Integral Max-pooling of CNN Activations

Tolias et al. ICLR 2016 Presented by Jaehyeong Cho

slide-2
SLIDE 2

Contents

  • Introduction
  • Related works
  • Main approaches
  • Results
  • Conclusion
slide-3
SLIDE 3

Introduction

  • How to find similar images?
  • Convert an image into a single feature (e.g. BoW, VLAD, CNN)
  • Measure the similarity between features

=> Quality of features highly affects the retrieval results

  • Are all parts of an image equally representative?
  • No, it is better to focus on important regions only
  • Main contribution
  • Encodes several image regions into single compact feature
  • Localizes matching objects
slide-4
SLIDE 4

Related works

  • Retrieval methods considering spatial information
  • Babenko and Lempitsky, Aggregating Deep Convolutional Features

for Image Retrieval, ICCV 2015

  • Aggregates multiple convolutional features from various position in

an image

  • Gives higher weights for the features near the center
slide-5
SLIDE 5

Related works

  • Retrieval methods considering spatial information
  • Kalantidis et al., Cross-dimensional Weighting for Aggregated Deep

Convolutional Features, ECCV workshop 2016

  • Gives different weights according to the channel and location
slide-6
SLIDE 6

Related works

  • Retrieval methods considering spatial information
  • Xie et al., Image Classification and Retrieval are ONE, ICMR 2015
  • Extract CNN features from object regions
  • Represent an image with multiple features
slide-7
SLIDE 7

Main approaches

  • Maximum activations of convolutions (MAC)
  • Proposed by Azizpour et al., 2014
  • CNN activations for an image I
  • W × H × K
  • Utilizes only maximum activations from each channel
  • Enables to capture representative regions
  • But lacks location information
slide-8
SLIDE 8

Main approaches

  • Regional maximum activations of convolutions (R-MAC)
  • Extract MAC from multiple regions

=> Encodes the location information

  • Makes a single feature by summation
slide-9
SLIDE 9

Main approaches

  • Object localization
  • q : MAC feature from the query object (blue)
  • Find the region that maximize the similarity
  • Fast computation of

fR

T is required

slide-10
SLIDE 10

Main approaches

  • Object localization
  • Approximation of

fR

T

  • Localization result helps re-ranking
slide-11
SLIDE 11

Results

  • Comparison of retrieval accuracy
  • without post-processing
slide-12
SLIDE 12

Results

  • Comparison of retrieval accuracy
  • with post-processing
slide-13
SLIDE 13

Results

  • Re-ranking with object localization
slide-14
SLIDE 14

Conclusion

  • Generated improved feature vector by encoding location

information into the feature

  • Approximated max-pooling process for fast computation
  • Localized the target object and effectively used it for re-ranking
slide-15
SLIDE 15

References

  • Babenko, Artem, and Victor Lempitsky. "Aggregating local deep features for image

retrieval." Proceedings of the IEEE International Conference on Computer Vision. 2015.

  • Kalantidis, Yannis, Clayton Mellina, and Simon Osindero. "Cross-dimensional weighting

for aggregated deep convolutional features." arXiv preprint arXiv:1512.04065 (2015).

  • Xie, Lingxi, et al. "Image classification and retrieval are one." Proceedings of the 5th

ACM on International Conference on Multimedia Retrieval. ACM, 2015.

  • Azizpour, Hossein, et al. "From generic to specific deep representations for visual

recognition." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2015.