Visual Perception
Rich Clarke
Visual Perception Rich Clarke Q: Why should we care about humans in - - PowerPoint PPT Presentation
Visual Perception Rich Clarke Q: Why should we care about humans in the loop? A: Imagine building VR system... Asked to specify the ultimate, fully immersive visual display device. Want the Virtual World to be utterly indistinguishable
Rich Clarke
activity?
We’re going to keep in mind: The So What? Implications for Computer Graphics?
POSTERIOR: ~mesh of photoreceptors… ANTERIOR: Modelled as camera optics…
Basic visual hardware: Visual anatomy & physiology
Pigmented layer
Basic visual hardware: Visual anatomy & physiology
Start of the ‘brain’… To the optic nerve
illumination
at ~500nm
illumination
‘B’ peak at 437nm, ‘G’ peak at 533nm, ‘R’ peak: 564nm Much smaller receptive fields
Basic visual hardware: Visual anatomy & physiology
can be coded
Basic visual hardware: Visual anatomy & physiology
SO WHAT?
The fundamental unit of visual coding: Receptive fields
Eye
‘On centre’ cells ‘Off centre’ cells Higher up in the brain (in V1): integrate simple cells: complex cells
photoreceptors serving some ganglion cell
the ganglion cell is responsive
On centre and off centre Spectral as well as spatial: Red/Green Yellow Blue Absolute physical values lost:
Recall last slide...
contrast & context sensitivity only (c.f. illusions)
behaviourally relevant in ontogenetic and phylogenetic history: fast ‘hardware’ recognition
The fundamental unit of visual coding: Receptive Fields
SO WHAT?
Visual pathways: The split of information streams
LGN: 6 layers: Magnocellular layers – primary input from peripheral retina – non spectrally opponent ganglions, large receptive fields Parvocellular layers – primary input from the foveal region – spectrally opponent cells, small receptive fields
Area V1 (Visual Cortex): more complex cells (I.e. with more complex receptive fields) Deals with What? And Where? …Separately?
(Magnocellular layers)
(Parvocellular layers)
Visual pathways: The split of information streams
SO WHAT?
Eyes may be serving 2 relatively separate visual systems:
Rods & Cones: Outersegment: billions of light sensitive pigment molecules Molecules embedded in disks, stacked like pancakes Rods: Pigment is Rhodopsin γ
How well can you really see? Resolution and acuity
entry aperture Resolution limited to 30arcsec
image representation. Spacing well matched to optics (Sampling theory)
Three things determine resolution: 1. Optical filtering, 2. receptor sampling, (and 3. receptive field organisation) Visual acuity is a function of contrast sensitivity ~30sec Vernier (hyper)acuity: Ability to localise position of
Can detect misalignments of ~5sec Unknown exactly how it’s done
How well can you really see? Resolution and acuity
SO WHAT?
display system
artefacts in digital images – simple analysis of the visual system would predict that some artefacts should not be seen (below the limit of supposed visual acuity)
Seeing the sun and the stars: Brightness adaption
How many orders of magnitude difference between the dimmest and the brightest things we can see?
Three mechanisms Mechanical
Photochemical
Neural
Changes in: Contrast Sensitivity: Pattern Acuity: Colour Perception
Seeing the sun and the stars: Brightness adaption
Seeing the sun and the stars: Brightness adaption
Purkeinje break
Seeing the sun and the stars: Brightness adaption
SO WHAT?
image is in brightness contrast
range of ~10 log units
perception changes markedly
allocation etc.)
Colour perception
Colour perception
Colour perception
Relative stimulation of each cone type in your retina (RGB) in the context of some visual field Different spectral distributions of light should be able to stimulate the photoreceptors identically: Receptor sensitivity SD b SD a Spectral distributions Distinct distributions that are perceived identically w.r.t some visual system - METAMERS
Colour Perception
SO WHAT?
re-create any visible colour simply (without recreating the whole C(λ) distribution
colour gamut that covers most visible colours
Ref 1: [Ferwerder] Ferwerda, J. A. (2001) Elements of Early Vision for Computer Graphics, IEEE Computer Graphics and Applications, 21(5), pp. 22-33. Ref 2: [Atkinson] R.C. Atkinson, ed., Steven’s Handbook of Experimental Psychology, 2nd ed., John Wiley & Sons, New York, 1988. Ref 3: [Purves & Lotto] www.lottolab.org, also D. Purves & R. Beau Lotto, Why we see what we do: An Empirical Theory of Vision, Sinauer Associates, 2003 Ref 7: [Sekuler & Blake] 7. R. Sekuler and R. Blake, Perception, McGraw-Hill, New York, 1994. Ref 15: [Spillman & Werner] L. Spillman and J.S. Werner, eds., Visual Perception: The Neurophysiological Foundations, Academic Press, San Diego, 1990. Ref 28: [Bollin & Mayer] M.R. Bolin and G.M. Meyer, “A Frequency Based Ray Tracer,”