sift
play

SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon - PowerPoint PPT Presentation

SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University SIFT (Scale Invariant Feature Transform) SIFT describes both a detector and descriptor 1. Multi-scale extrema detection 2. Keypoint localization 3. Orientation assignment


  1. SIFT 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University

  2. SIFT (Scale Invariant Feature Transform) SIFT describes both a detector and descriptor 1. Multi-scale extrema detection 2. Keypoint localization 3. Orientation assignment 4. Keypoint descriptor

  3. 1. Multi-scale extrema detection Second octave First octave Gaussian Difference of Gaussian (DoG)

  4. Gaussian Laplacian

  5. Scale-space extrema Scale of Gaussian variance Selected if larger than all 26 neighbors Difference of Gaussian (DoG)

  6. 2. Keypoint localization 2nd order Taylor series approximation of DoG scale-space x = { x, y, σ } Take the derivative and solve for extrema Additional tests to retain only strong features

  7. 3. Orientation assignment For a keypoint, L is the Gaussian-smoothed image with the closest scale, x-derivative y-derivative Detection process returns { x, y, σ , θ } location scale orientation

  8. 4. Keypoint descriptor Image Gradients SIFT descriptor (4 x 4 pixel per cell, 4 x 4 cells) (16 cells x 8 directions = 128 dims) Gaussian weighting (sigma = half width)

  9. �������������������� �������������������� Raw pixels Sampled Locally orderless Global histogram

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend