SLIDE 1 NYU/CNS
Center for Neural Science
The elements of early vision
- r, what vision (and this course) is all about
SLIDE 2
The Electric Monk was a labor saving device, like a dishwasher or a video recorder. Dishwashers washed tedious dishes for you, thus saving you the bother of washing them yourself, video recorders watched tedious television for you, thus saving you the bother of looking at it yourself; Electric Monks believed things for you, thus saving you what was becoming an increasingly onerous task, that of believing all the things the world expected you to believe.
SLIDE 3 What’s in the image? The task of early vision How do neurons encode visual information? How are neuronal representations decoded? P s y c h
h y s i c s N e u r
h y s i
y Theory
SLIDE 4 after Churchland and Sejnowski, 1988
The challenge of scale
SLIDE 5 0.0001 0.001 0.01 0.1 1 10 100 1000 10000 100000 1000000 Time (s) 0.0001 0.001 0.01 0.1 1 10 100 1000 Size (mm) 0.0001 0.001 0.01 0.1 1 10 100 1000 10000 100000 1000000 0.0001 0.001 0.01 0.1 1 10 100 1000 Millisecond Second Minute Hour Day Month Synapse Dendrite Neuron Layer Nucleus Map Lobe Brain Single units Patch clamp Light microscopy Electron microscopy VSD imaging EEG and MEG fMRI imaging PET imaging Brain lesions 2-DG imaging Calcium imaging Optogenetics Microstimulation Field potentials TMS
Sejnowski, Churchland & Movshon (2014) after Churchland & Sejnowski (1988)
The challenge of scale
SLIDE 6 Marr, 1982
David Courtnay Marr (1946-1980)
The challenge of level
SLIDE 7 What’s in the image? The task of early vision How do neurons encode visual information? How are neuronal representations decoded? P s y c h
h y s i c s N e u r
h y s i
y Theory
SLIDE 8 Every body in light and shade fills the surrounding air with infinite images of itself; and these, by infinite pyramids diffused in the air, represent this body throughout space and
- n every side. Each pyramid that is composed of a long
assemblage of rays includes within itself an infinite number of pyramids and each has the same power as all, and all as each. – The Notebooks of Leonardo da Vinci
In M. Landy and J. A. Movshon (eds), Computational Models of Visual Processing (pp. 3-20). Cambridge, MA: MIT Press (1991).
The Plenoptic Function and the Elements of Early Vision
Edward H. Adelson and James R. Bergen
SLIDE 9
What is there to see? Black and white photo: x, y Black and white movie: x, y, t Color movie: x, y, t, λ Holographic movie: x, y, t, λ, Vx, Vy, Vz The plenoptic function describes all the information available to an observer anywhere in space and time
SLIDE 10 Practical applications
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 14, NO. 2, FEBRUARY 1992 99
Single Lens Stereo with a Plenoptic Camera
Edward H. Adelson and John Y.A. Wang
(a) (b) (c) (d)
- Fig. 2. (a) Pinhole camera forms an image from a single viewpoint; (b) in a
stereo system, two images are formed from different viewpoints; (c) in a motion parallax system, a sequence of images are captured from many adjacent viewpoints; (d) a lens gathers light from a continuum of viewpoints; in an ordinary camera these images are averaged at the sensor plane.
SLIDE 11
Practical applications
SLIDE 12
Proposition 1. The task of early vision is to deliver a small set of useful measurements about each observable location in the plenoptic function. Proposition 2. The elemental operations of early vision measure local change along various directions within the plenoptic function. What is there to see? Black and white photo: x, y Black and white movie: x, y, t Color movie: x, y, t, λ Holographic movie: x, y, t, λ, Vx, Vy, Vz Hence, the plenoptic function: P(x, y, t, λ, Vx, Vy, Vz)
SLIDE 13 picture plane moving red bar grey background
A hypothetical scene that produces a variety of simple plenoptic structures. The plenoptic structures found along various planes. Each panel represents a slice through the plenoptic function.
x
1 x
1 x
1 x
1 x
1 x
1 y
1
1 t
1 Vz x V y V
1
1
700
SLIDE 14 Some edge-like structures found in particular planes within the plenoptic function
x y x y x t x t x y x t x
Vx
Edge with horizontal binocular parallax Vertical edge Horizontal edge Static edge Brightness step Tilted edge Moving edge Color sweep
SLIDE 15
Local derivatives will turn out to be handy
SLIDE 16 Local derivatives will turn out to be handy
*g(s) f(s) f(s)*g(s) derivative
derivative
filter *g'(s) local average *g(s) local average f'(s) f(s)*g'(s)
SLIDE 17 x y t
x vertical "edge" horizontal "edge" blue-yel.
binocular anticorrel. ("luster") dimension
response
x y t
x vertical "bar" horizontal "bar" green-red
dimension
response flicker (brightening) flicker (pulse)
Derivatives along single dimensions yield some basic visual measurements Low-order derivatives lead to a few two-dimensional operators (receptive fields)
SLIDE 18 x y t x y x y x x x t t
Horizontal edgelike structure Vertical edgelike structure Diagonal edgelike structure Uniform brightening Static edgelike structure Moving edgelike structure
x x x x x
Vx V
x
Uniform bluishness Achromatic edgelike structure Spatiochromatic structure Uniform intensity change with eye position No horizontal parallax Horizontal parallax
The same receptive field structures produce different measurements when placed along different planes in plenoptic space
SLIDE 19 x y t
x y t
static achromatic no dispar
leftward achromatic no dispar vertical static hue-sweep no dispar horiz. static no dispar
downward achromatic no dispar
static achromatic
static achromatic
full-field sequential no dispar full-field sequential achromatic eye-order full-field static hue-shift luster hue-sweep hue-sweep
What can you measure with a tilted second derivative?
SLIDE 20 Hubel & Wiesel (1962, 1968)
Orientation selectivity in V1
SLIDE 21 Hawken & Parker, 1991
Orientation in space can be detected and measured by oriented filters
SLIDE 22 5 10 15 20 5 10 15 20 c/deg +
- +
- DeValois, Albrecht and Thorell, 1982
Orientation in space can be detected and measured by oriented filters
SLIDE 23
Motion is orientation in space-time
SLIDE 24 Motion is orientation in space-time and spatiotemporally oriented filters can be used to detect and measure it
Adelson & Bergen (1985)
SLIDE 26
-
-
-
- DeAngelis, Ohzawa
& Freeman, 1995
SLIDE 27
DIsparity is orientation in space-eye position
SLIDE 28 DIsparity is orientation in space-eye position
L.E. R.E. L.E. R.E.
Vx x
L.E. R.E. L.E. R.E.
Vx x
L.E. R.E.
Vx x
L.E. R.E. L.E. R.E.
V
x
x
L.E. R.E.
Four examples of binocular receptive fields. Humans only take two samples from the Vx axis, as shown by the two lines labeled R.E. and L.E. for right eye and left eye. The curves beneath each receptive field indicate the individual weighting functions for each eye alone. Binocular correlation: “tuned excitatory” Binocular anticorrelation: “tuned inhibitory” Uncrossed disparity: “far” Crossed disparity: “near”
SLIDE 29 DIsparity is orientation in space-eye position
Poggio (1981)
SLIDE 30 x y x t
y t x t V
x R.E. L.E.
V
x R.E. L.E.
x y t
x
x y t
x
static achromatic no dispar.
jump left achromatic no dispar. 2 vert. lines static change
static achromatic h disp.
jump down achromatic no dispar. 2 hor. lines static no dispar.
static achromatic v disp. full-field pulse order blue-yel. no dispar. full-field pulse order achromatic anticorr. full-field static anticorr. change
static achromatic no dispar.
pulse order achromatic no dispar.
pulse order achromatic no dispar. no dispar.
no dispar.
diff.
change
sequential static static diff.
- diff.
- Many psychophysical tasks look like Vernier acuity in different planes
SLIDE 31
Proposition 1. The task of early vision is to deliver a small set of useful measurements about each observable location in the plenoptic function. Proposition 2. The elemental operations of early vision measure local change along various directions within the plenoptic function. What is there to see? Black and white photo: x, y Black and white movie: x, y, t Color movie: x, y, t, λ Holographic movie: x, y, t, λ, Vx, Vy, Vz Hence, the plenoptic function: P(x, y, t, λ, Vx, Vy, Vz)
SLIDE 32
What is there to see? Black and white photo: x, y Black and white movie: x, y, t Color movie: x, y, t, λ Holographic movie: x, y, t, λ, Vx, Vy, Vz Hence, the plenoptic function: P(x, y, t, λ, Vx, Vy, Vz) The analysis of the plenoptic function tells us what elementary measurements can be computed, but not which ones are computed or which ones should be computed. What parts of the possible information are present in the world? What parts of the information present in the world are important to the organism?
SLIDE 33 What’s in the image? The task of early vision How do neurons encode visual information? How are neuronal representations decoded? P s y c h
h y s i c s N e u r
h y s i
y Theory
SLIDE 34 Hubel, 1988
Neural circuits perform computations
SLIDE 35 cornea light lens iris
ganglion cells interneurons photoreceptors retina
Vertebrate retina
SLIDE 36 Kandel, Schwartz & Jessell, 2001
Vertical and horizontal pathways for information flow in the retina
SLIDE 37 Retinal neuronal diversity and circuit specificity
Masland, 2001
SLIDE 38
SLIDE 39
Spatial structure is first measured by neurons with center-surround receptive fields
SLIDE 40 Ganglion cell receptive field modeled as difference of Gaussians
Rodieck, 1965; Enroth-Cugell & Robson, 1966
SLIDE 41
Spatial contrast sensitivity
SLIDE 42 Ganglion cell receptive field modeled as difference of Gaussians
Enroth-Cugell & Robson, 1966
SLIDE 43 Frequency-domain representation of the difference of Gaussians
Enroth-Cugell & Robson, 1984
SLIDE 44 Temporal linearity in X cells
Enroth-Cugell, Robson, Schweitzer-Tong & W atson, 1983
SLIDE 45 Temporal linearity in X cells
Enroth-Cugell, Robson, Schweitzer-Tong & W atson, 1983
SLIDE 46
Diversity of retinal ganglion cell types
SLIDE 47 Hubel, 1988
Neural circuits perform computations