fic graphs
play

FIC graphs Goal: estimate cerebral pathways between brain regions - PowerPoint PPT Presentation

FIC graphs Goal: estimate cerebral pathways between brain regions (ROIs) Graph estimation and selection strategies using the focused information criterion Motivation: ROIs collaborate/connect to other ROIs: functional connectivity


  1. FIC graphs Goal: estimate cerebral pathways between brain regions (ROIs) Graph estimation and selection strategies using the focused information criterion Motivation: ROIs ‘collaborate’/connect to other ROIs: functional connectivity estimate and summarize the structure of the brain graphs Eugen Pircalabelu targeted FIC approach to focus on parts of the brain Joint work with Gerda Claeskens , L. Waldorp and S. Jahfari How: Framework with both directed & undirected edges ORSTAT and Leuven Statistic Research Center Lagged & contemporaneous ROI effects Penalized nodewise models with graph ‘composition’ rules Main contribution: FICology Workshop, Oslo, May 9-11th, 2016 select targeted (via the focus) graphs that account for ROI effects general framework where nodes can be other entities, not just brain ROIs 1 / 24 2 / 24 Motivation fMRI data focuses 75 fMRI data: 35 SIGNAL 8 subjects in a resting state study i.e. subjects are not performing any task; 0 Prefrontal cortex parcellation of the brain was obtained containing 68 (further split to 448) −35 regions of interest (ROIs); −75 for each region, 240 volumes of the BOLD signal were obtained; Regions of Interest for PFC 90 45 SIGNAL Default mode network 0 −25 −50 Regions of Interest for DMN 3 / 24 4 / 24

  2. Prefrontal cortex focused graph Graph theoretical framework Define the graph G = ( E , V ) based on edges E nodes V r.v X i : i ∈ V . To each X 1 . . . X p a node in V = { 1 . . . p } is associated. E a subset of V × V set of pairs of distinct nodes. ( i , j ) ∈ E i → j i parent of j ( j , i ) ∈ E j → i i child of j ( j , i ) ∪ ( i , j ) ∈ E i − j i neighbor of j One or more connections between i and j (directed/undirected); Self-loop edges are allowed (such as i → i ) and denote a lagged effect. 5 / 24 6 / 24 Graph selection procedures Proposed method Graphical Lasso: undirected edges 1 the ‘best graph’ should depend on the focus Σ − 1 ≻ 0 (log det Σ − 1 − tr( W Σ − 1 ) − λ || Σ − 1 || 1 ) max for different focuses, possibly different selected models the focus guides the graph selection aspect ∈ E ≡ Σ − 1 X i ⊥ X j | X { 1 ,..., p }\{ i , j } iff ( i , j ) ∪ ( j , i ) / = 0 ij Gaussian AR model; Neighborhood selection: undirected edges 2 penalized nodewise neighborhood selection; X i ⊥ X k | X ne i ∀ i ∈ V & ∀ k ∈ V \ { ne i ∪ i } mean squared error (MSE) to decide on the selection; X i ‘response node’, other X j ( j � = i ) covariates in a linear regression. Lasso reg. to determine the neighborhood of node i Let { j ∈ V : ˆ Y ≡ X i : i ∈ V the node of interest. ne λ θ i ˆ = j � = 0 } i E λ, AND ˆ ne λ ne λ W ≡ set of protected nodes, always included as neighbors of Y = { ( i , j ) : i ∈ ˆ j AND j ∈ ˆ i } E λ, OR ˆ ne λ ne λ = { ( i , j ) : i ∈ ˆ j OR j ∈ ˆ i } Z ≡ set of unprotected nodes, potential neighbors Time Series Chain Graphical Models (TSCGM): undirected+directed edges 3 Subset S ⊆ V \ i of possible neighbors is to be selected. X t | X t − 1 ∼ N (Γ X t − 1 , Σ) . 7 / 24 8 / 24

  3. Local misspecification Penalty function Penalty needed for parameter estimation in p > n settings. FIC takes care of selection so sparse penalties are not necessary. Y k has density f ( y k | w k , z k , θ 0 , γ 0 + δ/ √ n ) for k = 1 , . . . , n Local quadratic approximation to ψ when not differentiable at zero. lasso : ψ l ( | γ j − γ j 0 | ) = | γ j − γ j 0 | f is 2 times diff. in a ‘proximity’ of ( θ 0 , γ 0 ) adaptive lasso : ψ al ( | γ j − γ j 0 | ) = w j | γ j − γ j 0 | , for a weight w j bridge : ψ b ( | γ j − γ j 0 | ) = | γ j − γ j 0 | α α > 0 ( θ 0 , γ 0 + δ/ √ n ) hard thresholding : ψ h ( | γ j − γ j 0 | ) = λ 2 − ( | γ j − γ j 0 | − λ ) 2 I ( | γ j − γ j 0 | < λ ) δ controls the proximity around narrow model ′ SCAD : ψ s ( | γ j − γ j 0 | ) = I ( | γ j − γ j 0 | ≤ λ )+ w k protected neighbors ⇒ θ is present in all models (never subject to model ( a λ − | γ j − γ j 0 | ) + I ( | γ j − γ j 0 | > λ ); a > 2 . selection or exclusion) ( a − 1) λ ′ ( | γ japx | ) ψ ( | γ j − γ j 0 | ) ≈ ψ ( γ japx ) + 1 ψ ( γ j − γ j 0 ) 2 − γ 2 � � d γ japx � n � 2 | γ japx | 1 log f ( y k | w k , z k , θ, γ ) − λ (ˆ � � θ, ˆ γ ) = arg max ψ ( | γ j − γ j 0 | ) , ′ ( | γ japx | ) n n ′ ≈ ψ θ,γ k =1 j =1 ψ ( | γ j − γ j 0 | ) ( γ j − γ j 0 ) | γ japx | for a given penalty function ψ (2 times diff. in 0) and an external value λ . ′ ( | γ japx | ) ′′ ≈ ψ ψ ( | γ j − γ j 0 | ) | γ japx | 9 / 24 10 / 24 Selecting λ Focus parameters Define a focus parameter µ = µ ( θ, γ, w , z ) a function of θ and γ . � � ω T { J 11 , S , 0 cc T ( J 11 , S , 0 ) t − 2( I − G S ) δ c T ( J 11 , S , 0 ) T } ω min c µ ( σ, β, w , z ) = [ w , z ] t β ; mean of a ROI for fixed vals of other ROIs Eg: For ROI 1 k we define the focus, as the conditional mean on covariates √ n z = ( ROI 2 k , ROI 3 k , ROI 4 k , ROI 1 k − 1 , ROI 2 k − 1 , ROI 3 k − 1 , ROI 4 k − 1 ) as λ S = ω T ( I − G S ) δ 1 q T ( J 11 , S , 0 ) T ω ψ ′′ (0) . ω T J 11 , S , 0 1 q 1 q T ( J 11 , S , 0 ) T ω µ = E [ ROI 1 k | z = ( − 20 , 100 , 25 , 45 , 80 , 10 , 50)] µ ( σ, β, w , z ) = β l ; reg.coef of a certain ROI l Endogeneity problem: MSE ≻ c ≻ λ ≻ ( θ, γ ) ≻ λ Two-step solution: estim. ( θ, γ ) on a λ grid, retain (ˆ θ, ˆ γ ) with best GCV µ ( σ, β, w , z ) = G − 1 (0 . 9 | w , z ); 0.90-quantile of the response distribution at performance, estimate c and based on c estimate MSE fixed vals for other ROIs. Goal: estimate µ as precisely as possible (in the MSE sense). 11 / 24 12 / 24

  4. Limiting distribution Graph composition rules For any model S , we compute Once all nodewise models are selected (some might include only contemporaneous √ n (ˆ d or only dynamic effects, while others might contain both) we apply the ‘OR’ rule µ S − µ true ) → Λ S ∼ N (Bias( µ, S , δ, c ) , Var( µ, S )) , µ S ) = Bias( µ, S , δ, c ) 2 + Var( µ, S ) . MSE(ˆ � � E λ, or ˆ ne λ ne λ = ( i , j ) ∪ ( j , i ) : i k ∈ ˆ j k or j k ∈ ˆ instantaneous undirected i − j i k p � � � E λ, or ˆ � ne λ FIC( G ( E S , V )) = MSE(ˆ µ l ; S l ) , = ( i , j ) : i k − 1 ∈ ˆ lag 1 directed i → j j k l =1 � � E λ, or ˆ ne λ = ( j , i ) : j k − 1 ∈ ˆ lag 1 directed i k i ← j where S = { S 1 , . . . , S p | S 1 ⊆ { V \ 1 } ; . . . ; S p ⊆ { V \ p }} . � � ˆ E λ, or ˆ ∪ ˆ E λ, or i → j ∪ ˆ E λ, or E λ, or = directed and undirected i − j i ← j ne λ denotes the neighborhood of the considered node for a certain value of where ˆ The mean depends on the chosen focus µ , the submodel S , the value of δ the penalty. indicating the distance between the parameters from the simplest model and the ′′ (0)1 q / √ n . true model, and on the chosen penalization via the value c = λψ 13 / 24 14 / 24 Prefrontal cortex and Default mode network as focuses Focus FIC LQA ℓ 1 FIC ℓ 2 Ratio vs FIC GL-CV(Lik) GL-StARS MSPE (PFC focus) 1.52 1.11 MSPE (DMN focus) 1.44 1.06 No. of edges (PFC focus) 4.67 10.3 No. of edges (DMN focus) 4.40 9.70 Table: Ratios of the empirical MSPE and number of edges in the estimated graphs for GL-CV(Lik) and GL-StARS relative to the graphs estimated using FIC - LQA ℓ 1 for the PFC and DMN focuses. 15 / 24 16 / 24

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend