The Center for Brains, Minds and Machines
tomaso poggio CBMM McGovern Institute, BCS, LCSL, CSAIL MIT
I-tutorial
Learning of Invariant Representations in Sensory Cortex
I-tutorial Learning of Invariant Representations in Sensory Cortex - - PowerPoint PPT Presentation
The Center for Brains, Minds and Machines I-tutorial Learning of Invariant Representations in Sensory Cortex tomaso poggio CBMM McGovern Institute, BCS, LCSL, CSAIL MIT I-theory Learning of Invariant Representations in Sensory Cortex
tomaso poggio CBMM McGovern Institute, BCS, LCSL, CSAIL MIT
Learning of Invariant Representations in Sensory Cortex
2
1.Intro and background 2.Mathematics of invariance 3.Biophysical mechanisms for tuning and pooling 4.Retina and V1: eccentricity dependent RFs; V2 and V4: pooling, crowding and clutter 5.IT: Class-specific approximate invariance and remarks
Learning of Invariant Representations in Sensory Cortex
3
Class 24 Mon Dec 1 Learning Invariant Representations: Retina and V1: eccentricity dependent RFs; V2 and V4: pooling, crowding and clutter
4
Summary of previous class
The ventral stream hierarchy: V1, V2, V4, IT A gradual increase in the receptive field size, in the complexity of the preferred stimulus, in tolerance to position and scale changes
Kobatake & Tanaka, 1994
8
End Summary
Note: ¡we ¡focus ¡on ¡the ¡ sampling ¡layout ¡of ¡the ¡ retinal ¡ganglion ¡cells ¡ (RGCs) ¡-‑ ¡the ¡outputs ¡of ¡ the ¡retina.
(Also: ¡focusing ¡on ¡the ¡Parvo ¡ pathway, ¡ignoring ¡Magno.)
Hubel and Wiesel, 1971
Schiller, P ., Finlay, B., Volman S. Quantitative Studies of Single Cells Properties in monkey striate cortex, 1976
to have invariant representation
Usual recipe:
each template memorize observed transformations
13
15
16
17
18
19
25’ !!! total 40x40 units 5 degree! total 40x40 units
Anstis, 1974
22
Empfeh len 11
l=4 l=3 l=2 l=1
HW module
24
different size at different locations
∑ = signature⋅vector ⋅
Associative memory
28
(from ¡Freeman ¡& ¡Simoncelli ¡2011)
30
l=4 l=3 l=2 l=1
HW module
The predictions are:
the fovea, activating the smallest simple cells at the bottom of the inverted
a complex cell at the smallest scale, that is d=1’ 20” in V1 and d=2’40” in V2. If the letter is made larger, then the activation of the simple cells shifts to a larger scale and thus does the critical spacing which is proportional to the size of the
Levi and Carney, 2011.
(positive say). The critical separation for avoiding crowding outside the foveola is 12) d ~ b x since the RF size of the complex cells increases linearly with eccentricity, with depending on the cortical area responsible for the recognition signal. Thus the theory ``predicts'' Bouma's law , (Bouma, 1970) of crowding!
35
36
37
38
+ + Evangelopoulos, Zhang, Voinea Also: ¡ ¡L. ¡Isik, ¡S. ¡Ullman, ¡S. ¡Smale, ¡ ¡C. ¡Tan, ¡M. ¡Riesenhuber, ¡T. ¡Serre, ¡G. ¡Kreiman, ¡S. ¡Chikkerur, ¡
40