Distributed, Egocentric Representations of Graphs for Detecting - - PowerPoint PPT Presentation

distributed egocentric representations of graphs for
SMART_READER_LITE
LIVE PREVIEW

Distributed, Egocentric Representations of Graphs for Detecting - - PowerPoint PPT Presentation

Distributed, Egocentric Representations of Graphs for Detecting Critical Structures Ruo-Chun Tzeng 1 Shan-Hung Wu 2 1 Microsoft Inc, Taiwan 2 National Tsing Hua University, Taiwan 36th International Conference on Machine Learning Ruo-Chun Tzeng,


slide-1
SLIDE 1

Distributed, Egocentric Representations of Graphs for Detecting Critical Structures

Ruo-Chun Tzeng1 Shan-Hung Wu2

1Microsoft Inc, Taiwan 2National Tsing Hua University, Taiwan

36th International Conference on Machine Learning

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 1 / 9

slide-2
SLIDE 2

Goal

To learn representations of graphs by generalizing convolutions while keeping all the nice properties of CNNs being able to

detect shift-invariant graph patterns by the filters enlarge the receptive fields by multi-layer architecture identify the critical parts (critical structures) most important to the jointly learned task

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 1 / 9

slide-3
SLIDE 3

What are the critical structures?

Local-Scale Critical Structures: Alkane vs Alcohol Global-Scale Critical Structures: Symmetric vs Asymmetric

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 2 / 9

slide-4
SLIDE 4

STOA: Graph Attention Networks (Bengio et al. ICLR’18)

The 1-head self-attentional network (1-head GAT) is the state-of-the-art solution to the problem 1-head GAT learns the attention score αij for each edge (i, j) − → h′

i = σ

 

j∈Ni

αijW − → hj

 

In supervised tasks, α are the critical structures ∵ αij represents the contribution of edge (i, j) to the model prediction

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 3 / 9

slide-5
SLIDE 5

Drawback: limited learning ability

However, learning α sacrifices the learning ability − → h′

i = σ

 

j∈Ni

αijW − → hj

 

It is not obvious in node classification, but severely affects the performance in graph classification

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 4 / 9

slide-6
SLIDE 6

A fix: Ego-CNN

Our idea: learning critical structures by the filters just like CNNs

Traditional Convolution H(l)

ij,d = σ

  • E (l)

ij

⊛ W (l,d) + b(l)

d

  • Ego-Convolution (ours)

H(l)

i,d = σ

  • E (l)

i

⊛ W (l,d) + b(l)

d

  • 1-head GAT (ICLR’18)

H(l)

i,d = σ

  • E (l)

i

  • αi,: ⊗ w(l,d)

+ b(l)

d

  • The filter of Ego-CNN captures the interaction of nodes in Ni

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 5 / 9

slide-7
SLIDE 7

Challenge: variable-sized Ni makes W ill-defined

H(l)

i,d = σ

  • E (l)

i

⊛ W (l,d) + b(l)

d

  • , where E (l)

i

= j∈N iH(l−1)

j

How to define Ni? In Ego-CNN, we define Ni as the top K nodes in the L-hop ego-networks at the L-th layer

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 6 / 9

slide-8
SLIDE 8

Improved learning ability on graph classification

Graph classification benchmark datasets With K = 16, Ego-CNN is comparable to the state-of-the-arts

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 7 / 9

slide-9
SLIDE 9

Ego-CNN can learng critical structure WITHOUT α

Backtracking W with CNN visualization techniques shows the identified critical structures

Local-Scale: Alkane vs Alcohol Global-Scale: Symmetric vs Asymmetric (a) C14H29OH (c) Symmetric Isomer (b) C82H165OH (d) Asymmetric Isomer

Table: Visualization of the critical structures detected by Ego-CNN

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 8 / 9

slide-10
SLIDE 10

More benefits... and let’s chat at Pacific Ballroom#22

Ego-CNN can detect self-similar patterns i.e., same ptterns that exist at different zoom levels commonly exist in social networks

How?

By simply tying the weights (W ’s) across different layers

For more details, let’s chat at Pacific Ballroom#22 6:30-9:00 PM

Ruo-Chun Tzeng, Shan-Hung Wu (Microsoft Inc, Taiwan, National Tsing Hua University, Taiwan) Ego-CNN ICML 2019 9 / 9