  1. Graph Convolutional Networks (GCNs). Dimitris Papatheodorou, Aalto University, dimitrispapatheodorou95@gmail.com. May 21, 2019.

  2. Overview. 1: Introduction (Problem Setting; Graph Laplacian). 2: Graph Convolutional Networks (The ideas behind the problem; GCN idea and convolutions on graphs; Spectral Graph Convolutions (SGC); Implementation and results).

  3. Problem Setting. Graphs are structured representations of data, such as citation networks, social networks, the World-Wide-Web, protein-interaction networks, and others. There has been recent work on generalizing neural networks to graphs in various ways for different tasks (graph classification, node classification, clustering, link prediction, node embeddings, and more). We denote an undirected graph as G = (V, E), with the following notation (made concrete in the sketch after this list):
     - nodes v_i ∈ V (N nodes)
     - edges ε_ij = (v_i, v_j) ∈ E (M edges)
     - adjacency matrix A ∈ R^{N×N} (binary or weighted)
     - degree matrix D_ii = Σ_j A_ij
     - unnormalized graph Laplacian ∆ = D − A
     - normalized graph Laplacian L = I_N − D^{−1/2} A D^{−1/2}
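
A minimal NumPy sketch of these definitions; the 4-node example graph and all names here are illustrative assumptions, not from the slides:

    import numpy as np

    # Assumed example graph: 4 nodes, M = 4 undirected edges.
    N = 4
    edges = [(0, 1), (1, 2), (2, 3), (1, 3)]

    A = np.zeros((N, N))                  # binary adjacency matrix
    for u, v in edges:
        A[u, v] = A[v, u] = 1.0

    D = np.diag(A.sum(axis=1))            # degree matrix, D_ii = sum_j A_ij
    delta = D - A                         # unnormalized Laplacian, Delta = D - A
    D_inv_sqrt = np.diag(1.0 / np.sqrt(np.diag(D)))
    L = np.eye(N) - D_inv_sqrt @ A @ D_inv_sqrt   # normalized Laplacian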

  4. Intuition of graph Laplacian. The graph Laplacian can be considered the discrete analogue, applied on graphs, of the Laplacian operator ∇², the differential operator given by the divergence of the gradient of a function f on Euclidean space: ∆f = ∇²f = div(grad(f)).
     The gradient operator. For a function on Euclidean space, the gradient operator gives the derivative of the function along each direction at every point. For a function on a discrete "graph space", the graph gradient operator gives the difference of the function along each edge: for an edge ε = (u, v), grad(f)|_ε = f(u) − f(v). In matrix form, grad(f) = K^⊤ f, where K is the incidence matrix of size N × M (obtained by assigning an arbitrary orientation to the edges).
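
A small sketch of the graph gradient on the same assumed toy graph; the orientation of each edge is chosen arbitrarily, as noted above:

    import numpy as np

    # Assumed example graph: 4 nodes, edges listed with an arbitrary orientation.
    N = 4
    edges = [(0, 1), (1, 2), (2, 3), (1, 3)]

    # Incidence matrix K of size N x M: column e has +1 at the edge's tail
    # and -1 at its head.
    K = np.zeros((N, len(edges)))
    for e, (u, v) in enumerate(edges):
        K[u, e], K[v, e] = 1.0, -1.0

    f = np.array([1.0, 2.0, 0.0, 3.0])    # a function on the vertices
    grad_f = K.T @ f                       # grad(f)|_(u,v) = f(u) - f(v), one value per edge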

  5. Intuition of graph Laplacian. The divergence operator. In Euclidean space, the divergence at a point gives the net outward flux of a vector field. On a graph, the vector field is just the gradient of a graph function: in the discrete "graph space", the graph divergence of a function g defined over the edges (e.g. the graph gradient) is the mapping from g to Kg. Hence ∆f = ∇²f = div(grad(f)) = K K^⊤ f, so K K^⊤ is the Laplacian. Notice that ∆ = K K^⊤ is a Gram-matrix factorization, so the Laplacian is positive semi-definite.
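
A sketch verifying, on the same assumed toy graph, that K K^⊤ recovers D − A and that it is positive semi-definite:

    import numpy as np

    # Same assumed 4-node example graph as above.
    N = 4
    edges = [(0, 1), (1, 2), (2, 3), (1, 3)]

    A = np.zeros((N, N))
    K = np.zeros((N, len(edges)))
    for e, (u, v) in enumerate(edges):
        A[u, v] = A[v, u] = 1.0
        K[u, e], K[v, e] = 1.0, -1.0

    D = np.diag(A.sum(axis=1))
    assert np.allclose(K @ K.T, D - A)        # Delta = K K^T = D - A

    eigvals = np.linalg.eigvalsh(K @ K.T)     # symmetric matrix -> eigvalsh
    assert np.all(eigvals >= -1e-10)          # PSD, up to floating-point error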

  6. Intuition of graph Laplacian. [Figure: an example graph and its Laplacian matrix, with the degrees of the vertices circled.] Circled items: the degrees of the vertices! Now the definition ∆ = D − A is clearer.

  7. Intuition of graph Laplacian. [Figure: another example of ∆ = D − A.]

  8. Intuition of graph Laplacian. More intuition: for continuous spaces, the Laplacian is the second derivative, so it measures how smooth a function is over its domain. It is the same for graph Laplacians: a graph function is smooth if its values do not change by much from one node to an adjacent one. Formally, in the general case of weighted graphs,
     E(f) = (1/2) Σ_{u∼v} w_uv (f(u) − f(v))² = ‖K^⊤ f‖² = f^⊤ ∆ f.
     This is equivalent to the Dirichlet energy of a function f : Ω → R on an open set Ω ⊆ R^n,
     E(f) = (1/2) ∫_Ω ‖∇f(x)‖² dx,
     a measure of how variable a function is.
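
A sketch checking, on the assumed toy graph with unit weights (w_uv = 1), that the three forms of E(f) agree:

    import numpy as np

    # Same assumed example graph, unit edge weights.
    N = 4
    edges = [(0, 1), (1, 2), (2, 3), (1, 3)]
    A = np.zeros((N, N))
    K = np.zeros((N, len(edges)))
    for e, (u, v) in enumerate(edges):
        A[u, v] = A[v, u] = 1.0
        K[u, e], K[v, e] = 1.0, -1.0
    delta = np.diag(A.sum(axis=1)) - A

    f = np.array([1.0, 2.0, 0.0, 3.0])
    E_edges = sum((f[u] - f[v]) ** 2 for u, v in edges)  # sum over edges u ~ v
    E_grad = float((K.T @ f) @ (K.T @ f))                # ||K^T f||^2
    E_quad = float(f @ delta @ f)                        # f^T Delta f
    assert np.isclose(E_edges, E_grad) and np.isclose(E_grad, E_quad)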

  9. Intuition of graph Laplacian. So, minimizing the variation of a graph function leads us to the Laplacian. Among unit-norm functions, the minimizers of f^⊤ ∆ f are the eigenvectors of ∆ belonging to its smallest eigenvalues. This can be shown either directly, or via the Courant-Fischer-Weyl min-max (variational) theorem applied to the Rayleigh quotient of the Laplacian. (See more in the Algorithmic Methods of Data Mining course slides.)
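
A sketch of this variational statement on the assumed toy graph: the eigenvector of the smallest eigenvalue attains the minimum of f^⊤ ∆ f over unit-norm vectors, and random unit vectors never do better:

    import numpy as np

    # Same assumed example graph.
    N = 4
    edges = [(0, 1), (1, 2), (2, 3), (1, 3)]
    A = np.zeros((N, N))
    for u, v in edges:
        A[u, v] = A[v, u] = 1.0
    delta = np.diag(A.sum(axis=1)) - A

    eigvals, eigvecs = np.linalg.eigh(delta)   # ascending eigenvalues
    u0 = eigvecs[:, 0]                         # unit eigenvector of the smallest one
    best = u0 @ delta @ u0                     # equals eigvals[0] (0: graph is connected)

    rng = np.random.default_rng(0)
    for _ in range(1000):                      # no random unit vector beats it
        g = rng.standard_normal(N)
        g /= np.linalg.norm(g)
        assert g @ delta @ g >= best - 1e-10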

  10. Intuition of graph Laplacian. Interesting properties (checked numerically in the sketch after this list):
     - ∆ = K K^⊤, thus the Laplacian is a Gram matrix.
     - The multiplicity of its zero eigenvalue λ₀ equals the number of connected components of the graph (multiplicity: recall the characteristic polynomial det(∆ − λI) = 0).
     - The second smallest eigenvalue (aka the Fiedler value) of the Laplacian is zero if and only if the graph is disconnected. The smaller the second smallest eigenvalue, the less "connected" the graph.
     - Interlacing property: for a graph with Laplacian ∆ and eigenvalues λ₁ ≥ λ₂ ≥ · · · ≥ λ_n, if we delete an edge, the new eigenvalues µ₁ ≥ µ₂ ≥ · · · ≥ µ_n interlace the old ones: λ₁ ≥ µ₁ ≥ λ₂ ≥ µ₂ ≥ · · · ≥ λ_n ≥ µ_n ≥ 0 (with λ₁ ≤ 2 for the normalized Laplacian). The analogous interlacing holds for the adjacency matrix when deleting a node.
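
A sketch checking two of these properties on an assumed graph with two connected components (two disjoint triangles), including the interlacing of the spectrum after deleting an edge:

    import numpy as np

    edges2 = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)]
    A2 = np.zeros((6, 6))
    for u, v in edges2:
        A2[u, v] = A2[v, u] = 1.0
    delta2 = np.diag(A2.sum(axis=1)) - A2

    lam = np.linalg.eigvalsh(delta2)               # ascending eigenvalues
    print(int(np.sum(lam < 1e-10)))                # 2: one zero eigenvalue per component

    A3 = A2.copy()
    A3[0, 1] = A3[1, 0] = 0.0                      # delete edge (0, 1)
    mu = np.linalg.eigvalsh(np.diag(A3.sum(axis=1)) - A3)

    lam_d, mu_d = lam[::-1], mu[::-1]              # descending, as on the slide
    assert np.all(lam_d >= mu_d - 1e-10)           # lambda_i >= mu_i
    assert np.all(mu_d[:-1] >= lam_d[1:] - 1e-10)  # mu_i >= lambda_{i+1}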

  11. The ideas behind the problem. Kipf and Welling's paper [Kipf and Welling(2016)] focuses on node classification where node labels are available for only a small number of nodes. That is a graph-based semi-supervised learning problem.
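
As a preview of what follows, a minimal sketch of the layer-wise propagation rule from the cited paper, H' = ReLU(D̂^{−1/2} Â D̂^{−1/2} H W) with Â = A + I; the graph, the features H, and the weights W here are random illustrative placeholders:

    import numpy as np

    # Same assumed 4-node example graph.
    N = 4
    edges = [(0, 1), (1, 2), (2, 3), (1, 3)]
    A = np.zeros((N, N))
    for u, v in edges:
        A[u, v] = A[v, u] = 1.0

    A_hat = A + np.eye(N)                              # adjacency with self-loops
    d_hat = A_hat.sum(axis=1)
    A_norm = A_hat / np.sqrt(np.outer(d_hat, d_hat))   # D_hat^{-1/2} A_hat D_hat^{-1/2}

    rng = np.random.default_rng(0)
    H = rng.standard_normal((N, 8))                    # input node features (N x F)
    W = rng.standard_normal((8, 4))                    # layer weights (F x F')
    H_next = np.maximum(A_norm @ H @ W, 0.0)           # one GCN layer: ReLU(A_norm H W)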
