Image Pyramids
COMPSCI 527 — Computer Vision
COMPSCI 527 — Computer Vision Image Pyramids 1 / 12
Image Pyramids COMPSCI 527 Computer Vision COMPSCI 527 Computer - - PowerPoint PPT Presentation
Image Pyramids COMPSCI 527 Computer Vision COMPSCI 527 Computer Vision Image Pyramids 1 / 12 Outline 1 Pyramids and Scale 2 (Spatial Frequency) Aliasing 3 Downsampling and Upsampling 4 Bilinear Interpolation 5 Gaussian (and Laplacian)
Image Pyramids
COMPSCI 527 — Computer Vision
COMPSCI 527 — Computer Vision Image Pyramids 1 / 12Outline
1 Pyramids and Scale 2 (Spatial Frequency) Aliasing 3 Downsampling and Upsampling 4 Bilinear Interpolation 5 Gaussian (and Laplacian) Pyramid
COMPSCI 527 — Computer Vision Image Pyramids 2 / 12Pyramids and Scale
↑ smallest denticle
we look for
Scale Budgets
templates: n2(k2 + k 2s2 + k2s4 + . . .) = n2k2(1 + s2 + s4 + . . .)
template: k2(n2 + n2/s2 + n2/s4 + . . .) = k2n2(1 + 1/s2 + 1/s4 + . . .)
image alone
COMPSCI 527 — Computer Vision Image Pyramids 4 / 12g
µ
k's
w
pi
Finer Scales
larger
O
Aliasing
(Spatial frequency) aliasing
O
O
Downsampling = Smoothing + Sampling
average of pixel values in the original image
COMPSCI 527 — Computer Vision Image Pyramids 7 / 12JG.ch 2jGCi.jlIfr i
cs
lq
3oJf3oq3ob
z zaciij I 303 4
M
Key Questions
and noise)
an integer?
COMPSCI 527 — Computer Vision Image Pyramids 8 / 12Bilinear Interpolation
s = 1/phi is not an integer?
ξ = bxc , η = byc ∆x = x ξ , ∆y = y η I(x) = I(ξ, η) (1 ∆x) (1 ∆y) + I(ξ + 1, η) ∆x (1 ∆y) + I(ξ, η + 1) (1 ∆x) ∆y + I(ξ + 1, η + 1) ∆x ∆y
COMPSCI 527 — Computer Vision Image Pyramids 9 / 12a
i
Abstracting Pyramid Operations
J = resize(I, φ):
Filter with σ = 1.6/φ, then sample every s = 1/φ > 1 pixels
No filter. Just sample every s = 1/φ 1 pixels
then define
down(X) = resize(X, φ) up(X) = resize(X, 1/φ)
Cannot restore lost information
COMPSCI 527 — Computer Vision Image Pyramids 10 / 12O
ee
A Gaussian Pyramid (φ = 1/2)
lower spatial frequencies that are in the next-higher resolution level (blurring attenuates high frequencies)
COMPSCI 527 — Computer Vision Image Pyramids 11 / 12A Laplacian Pyramid (φ = 1/2)