Where-What Network 3 (WWN-3): Developmental Top-Down Attention for Multiple Foregrounds and Complex Backgrounds
Matthew Luciw www.cse.msu.edu/~luciwmat Juyang Weng www.cse.msu.edu/~weng Embodied Intelligence Lab www.cse.msu.edu/ei
Where-What Network 3 (WWN-3): Developmental Top-Down Attention for - - PowerPoint PPT Presentation
Where-What Network 3 (WWN-3): Developmental Top-Down Attention for Multiple Foregrounds and Complex Backgrounds Matthew Luciw www.cse.msu.edu/~luciwmat Juyang Weng www.cse.msu.edu/~weng Embodied Intelligence Lab www.cse.msu.edu/ei
Matthew Luciw www.cse.msu.edu/~luciwmat Juyang Weng www.cse.msu.edu/~weng Embodied Intelligence Lab www.cse.msu.edu/ei
Michigan State University 2
Michigan State University 3
Michigan State University 5
1993): how were they developed?
down in training
``attentional shroud’’ --- potential difficulty with complex backgrounds
Essen’s study (1993)…
V (bottom-up weight matrix) M (top-down weight matrix) For a single neuron:
training and testing
parameterizations allow different attention modes (i.e., what- imposed, where-imposed)
points or features: each area learns through Lobe Component Analysis
LCA incrementally approximates joint distribution of bottom-up + top-down, in a dually optimal way LCA used for learning bottom-up and top-down weights in each area
(Above): Example training images, from 5 classes with 3 rotation variations in depth Location and Type are imposed at the motors Right: response-weighted input of a slice of V4: shows bottom-up sensitivities Current object representation pathway is limited
IT spatial representation
(a): IT learned type-specific (here: duck) but allows location variations: we show response- weighted input of 4 single neurons here (b): PP learned location-specifc but allows type variation These effects are enabled by top- down connections in training
From IT (PP has a low weight in search tasks) To IT and PP (right): bottom-up response (below): top-k (40) Integration of bottom-up and top-down Type: cat imposed at motor Top-k (4):
From IT To IT and PP (right): bottom-up response (below): top-k (40) Integration of bottom-up and top-down Type: pig imposed at motor Top-k (4):
Disjoint views used in testing
describes a highly recurrent architecture of a multi-sensor, multi-effector brain. Multi-sensory and multi-effector integration are achieved through developmental learning.
internal ``canonical views’’ (combination neurons)
external percept
Michigan State University 23
Michigan State University 24