In the name of Allah the compassionate, the merciful Digital Video - - PowerPoint PPT Presentation

in the name of allah
SMART_READER_LITE
LIVE PREVIEW

In the name of Allah the compassionate, the merciful Digital Video - - PowerPoint PPT Presentation

In the name of Allah the compassionate, the merciful Digital Video Systems S. Kasaei S. Kasaei Room: CE 307 Department of Computer Engineering Sharif University of Technology E-Mail: skasaei@sharif.edu Webpage: http://sharif.edu/~skasaei


slide-1
SLIDE 1
slide-2
SLIDE 2

In the name of Allah

the compassionate, the merciful

slide-3
SLIDE 3

Digital Video Systems

  • S. Kasaei
  • S. Kasaei

Room: CE 307 Department of Computer Engineering Sharif University of Technology E-Mail: skasaei@sharif.edu Webpage: http://sharif.edu/~skasaei

  • Lab. Website: http://ipl.ce.sharif.edu
slide-4
SLIDE 4

Acknowledgment

Most of the slides used in this course have been provided by: Prof. Yao Wang (Polytechnic University, Brooklyn) based on the book: Video Processing & Communications written by: Yao Wang, Jom Ostermann, & Ya-Oin Zhang Prentice Hall, 1st edition, 2001, ISBN: 0130175471. [SUT Code: TK 5105 .2 .W36 2001].

slide-5
SLIDE 5

Chapter 2

Fourier Analysis of Video Signals & Frequency Response of the HVS

slide-6
SLIDE 6

Kasaei 6

Outline

Fourier transform over multidimensional

space:

Continuous-space FT (CSFT) Discrete-space FT (DSFT)

Frequency domain characterization of video

signals:

Spatial frequency Temporal frequency Temporal frequency caused by motion

slide-7
SLIDE 7

Kasaei 7

Outline

Frequency response of the HVS:

Spatial frequency response Temporal frequency response & flicker Spatio-temporal response Smooth pursuit eye movement

Video sampling (a brief discussion)

slide-8
SLIDE 8

Kasaei 8

Continuous-Space Signals

K-D space continuous signals: Convolution: Example function:

Dirac delta function:

k K

R x x x ∈ = ] ,..., , [ ), (

2 1

x x ψ y y y x x x d h h

k

R

− = ) ( ) ( ) ( * ) ( ψ ψ

real or complex

R: set of real numbers X: K-D continuous variable

slide-9
SLIDE 9

Kasaei 9

Continuous-Space Fourier Transform (CSFT)

Forward transform: Inverse transform: Convolution theorem:

x x f x f d j

k

R T c

− = Ψ ) 2 exp( ) ( ) ( π ψ

f x f f x d j

k

R T c

∫Ψ

= ) 2 exp( ) ( ) ( π ψ ) ( * ) ( ) ( ) ( ) ( ) ( ) ( * ) ( f f x x f f x x

c c c c

H h H h Ψ ⇔ Ψ ⇔ ψ ψ

slide-10
SLIDE 10

Kasaei 10

Continuous-Space Fourier Transform (CSFT)

More on inverse transform:

The inverse CSFT shows that any signal can be

expressed as a linear combination of complex exponential function with different frequencies.

The CSFT at a particular frequency represents the

contribution of the corresponding complex exponential basis function.

The transform determines the correlation between

the input signal & its projection on some defined basis function.

Orthogonal basis functions preserve the signal

energy in the transform domain.

f x f f x d j

k

R T c

∫Ψ

= ) 2 exp( ) ( ) ( π ψ

slide-11
SLIDE 11

Kasaei 11

Continuous-Space Systems

General system over K-D continuous space: Linear & (Space) Shift-Invariant (LSI)

System:

LSI system can be completely described by

its impulse response:

k

R T ∈ = x x x )), ( ( ) ( ψ φ

( )

) ( ) ( ) ( ) ( * ) ( ) ( ) ( ) ( f f f x x x x x

c c c

H h T h Ψ = Φ ⇔ = = ψ φ δ ) ( )) ( ( )) ( ) ( ( ) ( ) (

2 2 1 1 2 2 1 1

x x x x x x x x + = + + = + φ ψ ψ α ψ α φ α φ α T T

slide-12
SLIDE 12

Kasaei 12

Discrete-Space Signals

K-D space discrete signals: Convolution: Example function:

Kronecker delta function:

K K

Z n n n ∈ = ] ,..., , [ ), (

2 1

n n ψ

− =

K

Z

h h

m

m m n n n ) ( ) ( ) ( * ) ( ψ ψ

slide-13
SLIDE 13

Kasaei 13

Discrete-Space Fourier Transform (DSFT)

Forward transform: Inverse transform: Convolution theorem:

{ }

) 2 / 1 , 2 / 1 ( , : period l Fundamenta 1

  • f

period with dimension each in periodic is ) ( ) 2 exp( ) ( ) ( − ∈ = Ψ − = Ψ

∈ k K d R T d

f I j

K

f f n f n f

n

π ψ

Ψ =

K

I T d

d j

f

f n f f n ) 2 exp( ) ( ) ( π ψ

) ( * ) ( ) ( ) ( ) ( ) ( ) ( * ) ( f f n n f f n n

d d d d

H h H h Ψ ⇔ Ψ ⇔ ψ ψ

freq:

unit hypercube repeats @ integer points

slide-14
SLIDE 14

Kasaei 14

Frequency Domain Characterization

  • f Video Signals

Spatial frequency Temporal frequency Temporal frequency caused by motion

slide-15
SLIDE 15

Kasaei 15

Spatial Frequency

Spatial frequency measures how fast the

image intensity changes in the image plane.

Spatial frequency can be completely

characterized by the variation frequencies in two orthogonal directions (e.g., horizontal & vertical):

fx: cycles/horizontal unit distance. fy : cycles/vertical unit distance.

It can also be specified by magnitude & angle

  • f change:

) / arctan( ,

2 2 x y y x m

f f f f f = + = θ

slide-16
SLIDE 16

Kasaei 16

Illustration of Spatial Frequency

slide-17
SLIDE 17

Kasaei 17

Angular Frequency

ee) cycle/degr ( f 180 f f (degree) 180 n) h/2d(radia 2 (radian) ) 2 / arctan( 2

s s

h d d h d h π θ π θ

θ

= = = ≈ =

Problem with previous defined spatial frequency:

Perceived speed of change depends on the

viewing distance.

slide-18
SLIDE 18

Kasaei 18

Angular Frequency

For the same picture, the angular frequency

increases as the viewing distance increases.

For a fixed viewing distance, a larger screen

size leads to lower angular frequency.

The same picture appears to change more

rapidly when viewed farther away, & it changes more slowly if viewed from larger screen.

It depends on both the spatial frequency in

the signal & the viewing conditions.

slide-19
SLIDE 19

Kasaei 19

Temporal Frequency

Temporal frequency measures temporal

variation (cycles/s).

In a video, the temporal frequency is spatial

position dependent, as every point may change differently.

Temporal frequency is caused by camera or

  • bject motion.

It depends not only on the motion, but also

  • n the spatial frequency of the object.
slide-20
SLIDE 20

Kasaei 20

Temporal Frequency caused by Linear Motion

slide-21
SLIDE 21

Kasaei 21

) ( : frequency temporal and spatial, motion, between Relation ) ( ) , ( ) , , ( ) , ( ) , , (

y y x x t y y x x t y x t y x y x

f v f v f f v f v f f f f f f t v y t v x t y x + − = + + Ψ = Ψ ⇔ − − = δ ψ ψ

The temporal frequency of the image of a moving object depends on motion as well as the spatial frequency of the object (projection of v on f). Example: A plane with vertical bar pattern, moving vertically, causes no temporal change; but moving horizontally, it causes fastest temporal change.

Relation between Motion, Spatial, & Temporal Frequency

is at time pattern image the ), , ( is at pattern image the Assume ). , ( speed with moving

  • bject

an Consider t y x t v v

y x

ψ =

under CIA

X’ y’ 2-D CSFT nonzero

  • n plane

convolution

slide-22
SLIDE 22

Kasaei 22

Illustration of the Relation

= ⇒ = =

t y x

f f f

slide-23
SLIDE 23

Kasaei 23

Frequency Response of HVS

Temporal frequency response & flicker Spatial frequency response Spatio-temporal response Smooth pursuit eye movement

slide-24
SLIDE 24

Kasaei 24

Frequency Responses of HVS

Most of the video systems are ultimately

targeted for human viewers.

It is important to understand how the

human perceives a video signal.

Sensitivity of the HVS to a visual pattern

depends on the spatial & temporal frequency content of the pattern.

slide-25
SLIDE 25

Kasaei 25

Frequency Responses of HVS

The visual sensitivity is highest at some

intermediate spatial & temporal frequencies.

It then falls off & diminishes at some cut-off

frequencies.

Spatial or temporal changes above these

frequencies are invisible to the human eye.

They form the basis for determining the

frame & line rates in video capture & display systems.

slide-26
SLIDE 26

Kasaei 26

Temporal Frequency Responses

The temporal frequency response of the

HVS refers to the visual sensitivity to a temporally varying pattern at different frequencies.

The temporal response of an observer

depends on viewing distance, display brightness, ambient lightning, & …

The temporal response of the HVS is similar

to a BPF.

slide-27
SLIDE 27

Kasaei 27

Temporal Frequency Responses

The peak increases with the mean

brightness of the image.

One reason that the eye reduces sensitivity

at higher temporal frequencies is because the eye can retain the sensitivity of an image for a short time interval (even when the actual image has been removed).

This phenomenon is known as the

persistence of vision.

slide-28
SLIDE 28

Kasaei 28

Flicker Perception

It causes temporal blurring, if a pattern

changes in a rate faster than the refresh rate

  • f the HVS.

It allows the display of a video signal as a

consecutive sequence of frames.

As long as the frame interval is shorter than

the visual persistence period, the eye perceives a continuously varying image.

Otherwise, the eye will observe frame

flicker.

slide-29
SLIDE 29

Kasaei 29

Flicker Perception

The lowest frame rate at which the eye does

not perceive flicker is known as the critical flicker frequency.

The brighter the display, the higher the

critical flicker frequency.

The motion picture industry uses 24

frames/sec.

slide-30
SLIDE 30

Kasaei 30

Flicker Perception

The TV industry uses 50/60 fields/sec. The computer display uses 72 frames/sec. A troland is the unit used to describe the

intensity of light entered the retina.

slide-31
SLIDE 31

Kasaei 31

Temporal Response

200 100 50 20 10 5 2 1 2 5 10 20 50 Frequency (Hz) Contrast sensitivity

0.06 trolands 850 trolands 9300 trolands 7.1 trolands 77 trolands 0.65 trolands

Figure 2.5 The temporal frequency response of the HVS obtained by a visual

  • experiment. Different curves represent the responses obtained with different mean

brightness levels, B, measured in trolands. The horizontal axis represents the flicker frequency f , measured in Hz. Reprinted from D. H. Kelly, Visual responses to time-dependent stimuli. I. Amplitude sensitivity measurements, J. Opt. Soc. Am. (1961) 51:422–29, by permission of the Optical Society of America.

Critical flicker frequency: The lowest frame rate at which the eye does not perceive flicker. It provides a guideline for determining the frame rate, when designing a video system. Critical flicker frequency depends on the mean brightness of the display:

60 Hz is typically sufficient

for watching TV.

Watching a movie needs

lower frame rate than TV.

slide-32
SLIDE 32

Kasaei 32

MTF of the Visual System

Modulation transfer function (MTF) of human visual system (HVS). (a) Contrast versus spatial frequency sinusoidal grating. (b) A typical MTF plot. cpd

slide-33
SLIDE 33

Kasaei 33

Spatial Frequency Response

The spatial frequency response of the HVS

refers to the visual sensitivity to a stationary spatial pattern with different spatial frequencies.

The visual sensitivity is isotropic with

respect to the direction of spatial variation.

The spatial frequency response of the HVS is

also similar to a BPF, with a peak response at about 2-5 cpd & diminishing at about 30 cpd.

slide-34
SLIDE 34

Kasaei 34

Spatial Frequency Response

The eye often jumps from one fixation position

to another very rapidly. This is known as saccadic eye movement.

It enhances the contrast sensitivity, but reduces

the peak of the frequency response.

The viewer’s sensitivity is about 10 fold higher

with normal eye movement than without.

The peak response occurs at about 2 cpd with

normal eye movement, but is shifted to about 4 cpd with complete removal of eye movement.

slide-35
SLIDE 35

Kasaei 35

Spatial Frequency Response

The spatial frequency

response of the HVS,

  • btained by a visual

experiment.

  • Filled circles: under

normal, unstabilized conditions.

  • Open squares: with
  • ptical gain setting for

stabilization.

  • Open circles: with
  • ptical gain changed

about 5%.

slide-36
SLIDE 36

Kasaei 36

Spatiotemporal Frequency Response

At higher temporal frequencies, both the

peak & cut-off frequencies in the spatial frequency response shift downwards.

A similar trend happens with the temporal

frequency response.

When an image patter removes very fast, the

eye will not be able to differentiate the very high spatial frequencies.

The eye is more sensitive to temporal

variations caused be motion than by flickering.

slide-37
SLIDE 37

Kasaei 37

Spatiotemporal Response

The reciprocal relation between spatial & temporal sensitivity was used in TV system design: Interlaced scan provides a tradeoff between spatial & temporal resolution. (a) SF:

  • Open circles: 1 Hz.
  • Filled circles: 6 Hz.
  • Open triangles: 16 Hz.
  • Filled triangles: 22 Hz.

(b) TF:

  • Open circles: 0.5 cpd.
  • Filled circles: 4 cpd.
  • Open triangles: 16 cpd.
  • Filled triangles: 22 cpd.
slide-38
SLIDE 38

Kasaei 38

Smooth Pursuit Eye Movement (SPEM)

Smooth pursuit: the eye tracks moving objects. Net effect: reduces the velocity of moving objects on the

retinal plane, so that the eye can perceive much higher raw temporal frequencies than indicated by the temporal frequency response (80 Hz).

y y x x t y y x x t t y x y y x x t y x

v v v v f f v f v f f v v f v f v f v v = = = + + = + − = ~ , ~ if ~ ) ~ ~ ( ~ : ) ~ , ~ ( at moving is eye when the retina at the frequency temporal Observed ) ( : ) , ( at moving is

  • bject

n the motion whe

  • bject

by caused frequency Temporal Tracking the object motion, reduces the observed temporal frequency at the retina.

slide-39
SLIDE 39

Kasaei 39

Smooth Pursuit Eye Movement (SPEM)

Spatiotemporal response of HVS under SPEM: (a) Without SPEM. (b) With eye velocity of 2 deg/s. (c) With eye velocity of 10 deg/s. SPEM extends the nonzero region of the visual response to a large TF range (1000 Hz).

slide-40
SLIDE 40

Kasaei 40

Homework 2

Reading assignment:

Chapter 2

Written assignment:

  • Prob. 2.1,2.2,2.5,2.6, & 2.7
slide-41
SLIDE 41

The End