directed network topology inference via graph filter
play

Directed Network Topology Inference via Graph Filter Identification - PowerPoint PPT Presentation

Directed Network Topology Inference via Graph Filter Identification Rasoul Shafipour, Santiago Segarra , Antonio G. Marques and Gonzalo Mateos Institute for Data, Systems, and Society Massachusetts Institute of Technology segarra@mit.edu


  1. Directed Network Topology Inference via Graph Filter Identification Rasoul Shafipour, Santiago Segarra , Antonio G. Marques and Gonzalo Mateos Institute for Data, Systems, and Society Massachusetts Institute of Technology segarra@mit.edu http://www.mit.edu/~segarra/ Data Science Workshop, June 5, 2018 Santiago Segarra Inference of Directed Nets via Graph Filter Id 1 / 16

  2. Network Science analytics Online social media Internet Clean energy and grid analy,cs ◮ Desiderata: Process, analyze and learn from network data [Kolaczyk09] Santiago Segarra Inference of Directed Nets via Graph Filter Id 2 / 16

  3. Network Science analytics Online social media Internet Clean energy and grid analy,cs ◮ Desiderata: Process, analyze and learn from network data [Kolaczyk09] ◮ Network as graph G : encode pairwise relationships ◮ Sometimes both G and data at the nodes are available ⇒ Leverage G to process network data ⇒ Graph Signal Processing Santiago Segarra Inference of Directed Nets via Graph Filter Id 2 / 16

  4. Network Science analytics Online social media Internet Clean energy and grid analy,cs ◮ Desiderata: Process, analyze and learn from network data [Kolaczyk09] ◮ Network as graph G : encode pairwise relationships ◮ Sometimes both G and data at the nodes are available ⇒ Leverage G to process network data ⇒ Graph Signal Processing ◮ Sometimes we have access to network data but not to G itself ⇒ Leverage the relation between them to infer G from the data Santiago Segarra Inference of Directed Nets via Graph Filter Id 2 / 16

  5. Graph signal processing: Notation ◮ Graph G with N nodes and adjacency A ⇒ A ij = Proximity between i and j ◮ Define a signal x ∈ R N on top of the graph ⇒ x i = Signal value at node i Santiago Segarra Inference of Directed Nets via Graph Filter Id 3 / 16

  6. Graph signal processing: Notation ◮ Graph G with N nodes and adjacency A ⇒ A ij = Proximity between i and j ◮ Define a signal x ∈ R N on top of the graph ⇒ x i = Signal value at node i ◮ Associated with G is the graph-shift operator S = VΛV − 1 ∈ R N × N ⇒ S ij = 0 for i � = j and ( i , j ) �∈ E (local structure in G ) ⇒ Ex: A and Laplacian L = D − A matrices Santiago Segarra Inference of Directed Nets via Graph Filter Id 3 / 16

  7. Graph signal processing: Notation ◮ Graph G with N nodes and adjacency A ⇒ A ij = Proximity between i and j ◮ Define a signal x ∈ R N on top of the graph ⇒ x i = Signal value at node i ◮ Associated with G is the graph-shift operator S = VΛV − 1 ∈ R N × N ⇒ S ij = 0 for i � = j and ( i , j ) �∈ E (local structure in G ) ⇒ Ex: A and Laplacian L = D − A matrices ◮ Graph filters → Matrix polynomials: H = � N − 1 l =0 h l S l = V diag(˜ h ) V − 1 ◮ Graph Signal Processing → Exploit structure encoded in S to process x ◮ Take the reverse path. How to use GSP to infer the graph topology? Santiago Segarra Inference of Directed Nets via Graph Filter Id 3 / 16

  8. Topology inference: Motivation and context ◮ Network topology inference from nodal observations [Kolaczyk09] ◮ Partial correlations and conditional dependence [Dempster74] ◮ Sparsity [Friedman07] and consistency [Meinshausen06] ◮ [Banerjee08], [Lake10], [Slawski15], [Karanikolas16] ◮ Key in neuroscience [Sporns10] ⇒ Functional net inferred from activity Santiago Segarra Inference of Directed Nets via Graph Filter Id 4 / 16

  9. Topology inference: Motivation and context ◮ Network topology inference from nodal observations [Kolaczyk09] ◮ Partial correlations and conditional dependence [Dempster74] ◮ Sparsity [Friedman07] and consistency [Meinshausen06] ◮ [Banerjee08], [Lake10], [Slawski15], [Karanikolas16] ◮ Key in neuroscience [Sporns10] ⇒ Functional net inferred from activity ◮ Noteworthy GSP-based approaches ◮ Gaussian graphical models [Egilmez16] ◮ Smooth signals [Dong15], [Kalofolias16] ◮ Stationary signals [Pasdeloup15], [Segarra16] ◮ Directed graphs [Mei15], [Shen16] ◮ Low-rank excitation [Wai18] ◮ Contribution: Inference for directed networks from diffused signals Santiago Segarra Inference of Directed Nets via Graph Filter Id 4 / 16

  10. Generating structure of a diffusion process ◮ Signal y is the response of a linear network diffusion process to an input x ∞ ∞ � � β l S l x y = α 0 ( I − α l S ) x = l =1 l =0 ⇒ Structure of y depends on structure of x and S Santiago Segarra Inference of Directed Nets via Graph Filter Id 5 / 16

  11. Generating structure of a diffusion process ◮ Signal y is the response of a linear network diffusion process to an input x ∞ ∞ � � β l S l x y = α 0 ( I − α l S ) x = l =1 l =0 ⇒ Structure of y depends on structure of x and S ◮ Cayley-Hamilton asserts we can write diffusion as � N − 1 � � h l S l y = x := Hx l =0 ⇒ y is the output of a GF H ⇒ Use this and info on ( y , x ) to find S ⇒ Key property: H is diagonalized by the eigenvectors of S ◮ GF ≡ linear maps which are analytic functions of the sparse matrix S Ex.: S , S − 1 , ( I − S ) − 1 , ( I − α S ) − 2 , ( I − S − S 2 ) − 1 , ( I − β S )( I − α S ) − 1 Santiago Segarra Inference of Directed Nets via Graph Filter Id 5 / 16

  12. Problem formulation ◮ We have access to M diffusion processes � L − 1 � � h l S l y m = x m := Hx m l =0 ◮ For each process, we gather the realizations Y m := { y ( p ) m } P m p =1 ⇒ Every realization corresponds to an independent input x ( p ) m ◮ We do not have access to L , h l , or the inputs ⇒ We do know that inputs are zero mean with covariance C x , m Santiago Segarra Inference of Directed Nets via Graph Filter Id 6 / 16

  13. Problem formulation ◮ We have access to M diffusion processes � L − 1 � � h l S l y m = x m := Hx m l =0 ◮ For each process, we gather the realizations Y m := { y ( p ) m } P m p =1 ⇒ Every realization corresponds to an independent input x ( p ) m ◮ We do not have access to L , h l , or the inputs ⇒ We do know that inputs are zero mean with covariance C x , m Problem : Given observations Y = � M m =1 Y m and the input covari- ances C x , m , find sparsest (asymmetric) S that is consistent with the observations Santiago Segarra Inference of Directed Nets via Graph Filter Id 6 / 16

  14. Blueprint of our approach ˆ Y H System' GSO' ˆ S Iden+fica+on' Inference' { C x,m } Sparsity'and' GSO'feasibility' Santiago Segarra Inference of Directed Nets via Graph Filter Id 7 / 16

  15. A first pass at filter ID ◮ The covariance matrix of the output process y m is � � T � H T = HC x , m H T x m x T � � � C y , m = E Hx m Hx m = H E m ◮ Each obs. pair C y , m = HC x , m H T gives rise to a set of potential solutions ⇒ Intersection smaller (unique) as M ↑ , try to solve Santiago Segarra Inference of Directed Nets via Graph Filter Id 8 / 16

  16. A first pass at filter ID ◮ The covariance matrix of the output process y m is � � T � H T = HC x , m H T x m x T � � � C y , m = E Hx m Hx m = H E m ◮ Each obs. pair C y , m = HC x , m H T gives rise to a set of potential solutions ⇒ Intersection smaller (unique) as M ↑ , try to solve M � T || 2 argmin || C y , m − H L C x , m H R s. to H L = H R F H L , H R ∈M N m =1 ⇒ Variables of size N 2 , smarter way to formulate the recovery? ⇒ Parametrize the set of feasible solutions Santiago Segarra Inference of Directed Nets via Graph Filter Id 8 / 16

  17. Filter ID for directed networks ◮ For each m , we have a matrix equation of the form C y , m = HC x , m H T (1) Santiago Segarra Inference of Directed Nets via Graph Filter Id 9 / 16

  18. Filter ID for directed networks ◮ For each m , we have a matrix equation of the form C y , m = HC x , m H T (1) If C x , m and C y , m are full rank, the set H m containing all the (possibly asymmetric) matrices H that solve (1) for a particular m is given by and UU T = I } . H m = { H | H = C 1 / 2 y , m UC − 1 / 2 x , m Santiago Segarra Inference of Directed Nets via Graph Filter Id 9 / 16

  19. Filter ID for directed networks ◮ For each m , we have a matrix equation of the form C y , m = HC x , m H T (1) If C x , m and C y , m are full rank, the set H m containing all the (possibly asymmetric) matrices H that solve (1) for a particular m is given by and UU T = I } . H m = { H | H = C 1 / 2 y , m UC − 1 / 2 x , m ◮ Optimization over unitary matrices ⇒ N ( N − 1) / 2 degrees of freedom in lieu of N 2 ⇒ Each m kills N ( N + 1) / 2 degrees of freedom ⇒ M = 2 may suffice ⇒ Non convex, but tailored algorithms are available ◮ Solving system id (non-symm. square roots) by leveraging structure Santiago Segarra Inference of Directed Nets via Graph Filter Id 9 / 16

  20. Manopt for non-symmetric graph-filter id ◮ Original formulation: M || ˆ � T || 2 argmin C y , m − H L C x , m H R s. to H L = H R ( P 1) F H L , H R m =1 ◮ Leveraging structure: optimize over U m ∈ U N M � � H − ˆ C y , m U m C − 1 / 2 x , m � 2 argmin ( P 2) F H , { U m } M m =1 m =1 ⇒ Approach: projected gradient descent (manopt) ⇒ Sensitive to initialization Santiago Segarra Inference of Directed Nets via Graph Filter Id 10 / 16

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend