Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant - PowerPoint PPT Presentation

Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May 2, 2014 Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 1/51

Co-authors Yu Group: Julien Mairal and Yuval Benjemani (leads) Gallant Lab at UC Berkeley: Ben Willmore, Michael Oliver, Jack Gallant Supported by NSF STC, Center for Science of Information (CSoI) Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 2/51

Brain Science 2013 – new ”genomics” Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 3/51

!"#"$%&'(&)*+",'-*+&".'(/-%",'"+&"- ,'%)!* !& !% !"#$%&'%(%')#%!* !" $'()*#$ #$ +$"#%!* ! ! Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 4/51

!"#"$%&'"()&"*+(#,-$*".&%/(0#1,22(01"02 ! ! !"#$%$&'()(*$+$&$,(-../ Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 5/51

!"#$%&$'("#'()*&'(+"&'%+,"-."%+'($% ! ! /*0'1*&23"4"5$%%$(6"7888 Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 6/51

!"#$%&'()*"'+$,-.).$'-"./-,("01"-,/+'- ! ! 2)3$(4"5)6(,-"7"8)**)-.4"9::; Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 7/51

Sources of Knowledge About the Brain Ways of Understanding the Visual Cortex study of lesions and associated impairments; electrodes (single or arrays); imaging studies (fMRI, ...); image below from Hansen et al. [2007] Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 8/51

Classical Models for V1 “Simple” Cells Image from Olshausen and Field [2005]: V1 Models based on Gabor filters achieve impressive prediction performance with experimental data(signle neuron and fMRI); V1 receptive fields are relatively small and well localized. V1 is the most well understood area, but not all is ... It serves as a performance benchmark for other areas. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 9/51

V4: an intermediate area on the ”what” pathway [see Roe et al., 2012] What we know about V4 affected by attention; diverse selectivity; larger receptive fields and is more invariant than V1/V2; no good predictive model with natural image inputs. Question about V4 what are the roles of V4? Roe et al advocated a background-foreground thesis, among other things. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 10/51

Experimental set-up in the Gallant Lab for single neuron data collection Image Subject Recording Response Sequence Signal Filtering Spike Sorting Firing Rate Time Binning Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 11/51

Objectives and data Aim: a statistical/computational model with good prediction performance on natural scenes (validation data); elucidation of properties of a population of V4 neurons; biologically interpretability. Data consists of 4000 − 12000 grayscale images (with no motion or color content) and average firing rates for 71 neurons; the image sequence is shown at 30 Hz; the stimuli is centered on an estimated receptive field (RF) while the subject performs a fixation task; the stimuli size is 2 − 4 times larger than the estimated RF. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 12/51

Outline of Today’s Talk Multi-layer invariant feature extraction Prediction model via low-rank regularization Model interpretation Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 13/51

Part I: Invariant Image Representation: Feature Extraction Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 14/51

Methodology Classical computer vision image representation for scene analysis 1 dense low-level feature extraction (local histograms of gradient orientations) [Lowe, 2004]; 2 feature encoding into visual words using vector quantization or sparse coding [Olshausen and Field, 1997]; 3 feature pooling. A state-of-the-art pipeline for scene and object recognition [Lazebnik et al., 2006, Yang et al., 2009, Boureau et al., 2010]; Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 15/51

Methodology Classical computer vision image representation for scene analysis 1 dense low-level feature extraction (local histograms of gradient orientations) [Lowe, 2004]; 2 feature encoding into visual words using vector quantization or sparse coding [Olshausen and Field, 1997]; 3 feature pooling. A state-of-the-art pipeline for scene and object recognition [Lazebnik et al., 2006, Yang et al., 2009, Boureau et al., 2010]; Can we exploit some ideas from this line of thinking to mimic invariance properties of V4 neurons? obtain a model tuned to natural images? be as biologically compatible as possible? Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 15/51

Our pipeline Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 16/51

First Layer Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 17/51

Patch Extraction Across Orientation Maps The 3D-patches are of size 4 × 4 × 8 = 128 (8 orientations); correspond in the original image domain to 32 × 32 patches; are invariant to local deformations in the original image domain; Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 18/51

Patch Extraction Across Orientation Maps What is the effect of the first processing layer? Comparing patches on orientation maps and in the image domain leads to a different similarity measure. 1.0 0.92 0.92 0.91 0.86 0.76 1.0 0.65 0.76 0.97 0.95 0.93 In blue, correlation in the original image domain. In red, correlation in the new domain. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 19/51

Constrast Normalization Given a 3D-patch x in R 128 + , we apply x ← x / max( � x � 2 , c ) . In short, our 3D-patches have similar properties as dense SIFT descriptors [Lowe, 2004] but are based on simple filtering/subsampling and normalization step. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 20/51

Second Layer Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 21/51

Sparse Coding 1 learn a dictionary on a database of 3D-patches once in for all; 2 encode all 3D-patches of an image to obtain feature maps . Dictionary Learning Formulation n 1 � 2 � x i − D α i � 2 + λ � α i � 1 min , 2 A ∈ R p × n , D ∈D � �� i =1 � �� sparsity data fitting We also force the codes α and the dictionary D to be non-negative . The dictionary is fairly large ( p = 2 048). The original formulation of Olshausen and Field [1997] was in the image domain. It was successfully used on image descriptors in computer vision [Yang et al., 2009, Boureau et al., 2010]. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 22/51

Third Layer Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 23/51

Feature Pooling   α 1   β 1 α 2 .    .  A = = ⇒   .   ·   β p α 29 ∗ 29 � �� single vector sparse codes of an image �� 29 ∗ 29 △ i =1 ( α k i ) 2 The pooling operation is the ℓ 2 -norm of features β k = for one pooling region and k = 1 , ..., 2048. Another alternative is the max-pooling operation [Riesenhuber et al., 1999, Cadieu et al., 2007], often used in computer vision [Lazebnik et al., 2006, Yang et al., 2009, Boureau et al., 2010]. Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 24/51

Summary Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 25/51

Our Feature Extraction Process in a Nutshell It consists of local simple operations and uses the sparse coding principle. In particular, it has some invariance to small image deformation (first layer); has some selectivity to features learned from natural image statistics (second layer); is shift invariant within the receptive field (third layer); Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 26/51

Part II: Prediction Model based on Extracted Features Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 27/51

Prediction Pipeline Neuron Responses Linear Model Input Image Sequence Nonlinear Encoding Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 28/51

Temporal Aspect of Data Typical Time Response to an Excitatory Stimulus Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant 29/51

Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant - PowerPoint PPT Presentation

Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May 2, 2014 Modeling Visual Cortex V4 in

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Chapter 6 Vision Exam 1 Anatomy of vision Primary visual cortex (striate cortex, V1)

Neurobiology HMS 130/230 Harvard/GSAS 78454 Visual Object Recognition Primary Visual Cortex

Geometric visual hallucinations and the architecture of the visual cortex Jack Cowan Department

Neurobiology HMS 230 Harvard/GSAS 78454 Visual Object Recognition Primary Visual Cortex Camille

Agenda Technical updates Project Cortex availability Project Cortex partners

Cortex-A15 Processor ARMs next generation mobile applications processor Travis Lanier Senior

Cortex: Prometheus as a Service, One Year On Tom Wilkie, PromCon 2017 tom.wilkie@gmail.com

Telencephalon/Cerebral Cortex - Anatomy Cerebral Cortex Box 26D Brain Size and Intelligence

Schematic view of the MNS1 model Posterior cortex Frontal cortex Role of PFC in Neural network

From heuristic to optimal models in naturalistic visual search Angela Radulescu 1,2 *, Bas van

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

10 mm Cytoarchitecture and function layer 4: input layer 5: output Motor cortex: expanded layer

Trim Conditions Trim Conditions Trim Conditions Trim Conditions 1 1 VPC Trim Screen VPC Trim

uXOM: Efficient eXecute-Only Memory on Cortex-M Donghyun Kwon 1,4 , Jangseop Shin 1 , Giyeol Kim 1

Integer GEMM (under)performance Marat Dukhan Software Engineer on Ca ff e 2 GEMM in Neural

Instructions to reviewers Reference lines Atlanto-planar line (APL) Posterior axial

Standard Lattice-Based Key Encapsulation on Embedded Devices James Howe , Tobias Oder ,

Multitasking on Cortex-M(0) class MCU A deepdive into the Chromium-EC scheduler $whoami

Modeling and interpretation of extracellular potentials Gaute T . Einevoll 1 , Szymon ski 2

Cortical responses evoked by continuous wrist manipulation Alfred C. Schouten , Martijn P. Vlaar,

27.10.2016 The Team Dipl.-Ing. Tobias Rohde Dipl.-Ing. Tobias Kaupat