Decoupled smoothing on graphs Alex Chin, Yatong Chen , Kristen M. - - PowerPoint PPT Presentation

decoupled smoothing on graphs
SMART_READER_LITE
LIVE PREVIEW

Decoupled smoothing on graphs Alex Chin, Yatong Chen , Kristen M. - - PowerPoint PPT Presentation

Decoupled smoothing on graphs Alex Chin, Yatong Chen , Kristen M. Altenburger, Johan Ugander Stanford University The Web Conference, San Francisco May 17, 2019 1 Attribute prediction on graphs 2 Attribute prediction on graphs Given: Social


slide-1
SLIDE 1

Decoupled smoothing on graphs

Alex Chin, Yatong Chen, Kristen M. Altenburger, Johan Ugander Stanford University

1

The Web Conference, San Francisco May 17, 2019

slide-2
SLIDE 2

Attribute prediction on graphs

2

slide-3
SLIDE 3

Attribute prediction on graphs

Given:

  • Social Network

3

Yatong Chen, Stanford University, WWW 2019

slide-4
SLIDE 4

Attribute prediction on graphs

Given:

  • Social Network
  • Labels for some subset nodes

Male Female

4

Yatong Chen, Stanford University, WWW 2019

slide-5
SLIDE 5

Given:

  • Social Network
  • Labels for some subset nodes

Goal:

  • Infer labels for unlabeled nodes

Attribute prediction on graphs

Male Female ? ? ?

5

Yatong Chen, Stanford University, WWW 2019

slide-6
SLIDE 6

Semi-supervised learning Problem

Given:

  • Social Network
  • Labels for some subset nodes

Goal:

  • Infer labels for unlabeled nodes

Male Female ? ? ?

6

Yatong Chen, Stanford University, WWW 2019

slide-7
SLIDE 7

Approaches for attribute prediction

  • Approach 1: Graph Smoothing based on Gaussian

Random Field [Zhu, Ghahramani, Lafferty 2003]

○ Assumption: Gaussian Markov Random Field Prior on true label of all the nodes ○ Get the Bayes estimator of on unlabeled nodes under the GMRF prior ○ Be referred as ZGL later

slide-8
SLIDE 8

Approaches for attribute prediction

  • Approach 2: LINK classification [Lu-Getoor 2003]

○ Learn a function F: F(row i in adjacency matrix) = i’s label ○ Example F: regularized logistic regression

8

? [1,0,...0,1] [0,1, ...0,1] ? [1,0,...0,1] [0,1, ...0,1] [0,0,..,0,1] [0,0,..,0,1] Labeled Labeled regularized logistic regression

slide-9
SLIDE 9

ZGL’s assumption: Homophily

  • Homophily [one-hop similarity]:

○ Individuals are similar to their friends

9

Yatong Chen, Stanford University, WWW 2019

slide-10
SLIDE 10

ZGL’s assumption: Homophily

  • Homophily [one-hop similarity]:

○ ZGL assumes information of a given node decays smoothly across the topology of the graph (by imposing the GMRF prior)

10 10

? ? ? ? ? ? ?

Labeled Labeled smooth!

slide-11
SLIDE 11

Homophily Assumption: NOT always necessary!

  • [Altenburger-Ugander 2018]:

○ LINK does well even without assuming homophily ○ Homophily assumption is not necessary for inference to succeed

  • ...but ZGL and a lot of other graph smoothing methods

all assumes homophily. Can we do graph smoothing without it? ○ Yes (this talk)

11

Yatong Chen, Stanford University, WWW 2019

slide-12
SLIDE 12
  • Gender example:

12

Male Female

Yatong Chen, Stanford University, WWW 2019

A situation where Homophily assumption fails

slide-13
SLIDE 13

A situation where Homophily assumption fails

  • Gender example:

○ Want to predict the gender of the center node

13

Male Female

Yatong Chen, Stanford University, WWW 2019 ?

slide-14
SLIDE 14

A situation where Homophily assumption fails

  • Gender example:

○ Want to predict the gender of the center node: ■ Assume Homophily (1-hop majority vote): false

14

Male Female

Yatong Chen, Stanford University, WWW 2019 ? ?

slide-15
SLIDE 15

A situation where Homophily assumption fails

  • Observation: there are difference between one’s

identity and preference

15

Identity: male Preference: female

Male Female

Yatong Chen, Stanford University, WWW 2019

slide-16
SLIDE 16

Decoupled smoothing Method

16

slide-17
SLIDE 17

Decoupled smoothing: idea

  • Idea: decoupling one’s “identity” and “preference”
  • Use separate parameters to model them accordingly!

17

Identity: male Preference: female

Male Female

Without homophily (or heterophily) assumption

Yatong Chen, Stanford University, WWW 2019

slide-18
SLIDE 18

Decoupled smoothing: idea

  • Idea: decoupling one’s “identity” and “preference”

18

Yatong Chen, Stanford University, WWW 2019

slide-19
SLIDE 19

Decoupled smoothing: idea

  • Idea: decoupling one’s “identity” and “preference”
  • Intuition: a person’s identity will reveal information

about their friend’s preference, and vice versa

identity preference

19

slide-20
SLIDE 20

Decoupled smoothing: idea

  • Idea: decoupling one’s “identity” and “preference”
  • Intuition: a person’s identity will reveal information

about their friend’s preference, and vice versa

identity preference: can’t observe, how to reveal?

20

?

slide-21
SLIDE 21

Decoupled smoothing: model

  • Intuition: a person’s identity will reveal information

about their friend’s preference, and vice versa

  • Assumption:

21

Yatong Chen, Stanford University, WWW 2019

slide-22
SLIDE 22

Decoupled smoothing: model

  • Intuition: a person’s identity will reveal information

about their friend’s preference, and vice versa

  • Assumption:
  • Goal: to obtain predictions for the identity

the preference is nuisance!

  • Get the marginal prior for :

22

Yatong Chen, Stanford University, WWW 2019

slide-23
SLIDE 23

How to estimate W?

23

Yatong Chen, Stanford University, WWW 2019

Decoupled smoothing: impose a prior

slide-24
SLIDE 24

How to estimate W?

  • Intuition:

○ Node j’th preference will imply a 2-hop similarity between node i and node k’s identities

24

Node i Node j Node k

?

slide-25
SLIDE 25

How to estimate W?

  • Intuition:

○ Node j’th preference will imply a 2-hop similarity between node i and node k’s identities ○ Make use of the information of k when predicting i

25

Node i Node j Node k

?

slide-26
SLIDE 26

How to estimate W?

  • Intuition:

○ Node j’th preference will imply a 2-hop similarity between node i and node k’s identities ○ Make use of the information of k when predicting i

  • Assumption:

○ i’s 2-hop friend k has the distribution:

26

Node i Node j Node k

?

slide-27
SLIDE 27

How to estimate W?

  • Why as mean:

○ Similarity among i and k

  • Why as variance?

○ The more friends j have→ the better its preference being revealed → the less uncertainty about the similarity between i and k

27

Node i Node j Node k

Trust More!

slide-28
SLIDE 28

How to estimate W?

  • Assumption:

○ 2-hop friend k has the distribution ○ Homogeneous standard error

  • Then W can be reduced to

28

We get W!

slide-29
SLIDE 29

Decoupled smoothing: model

  • Now we know everything about the marginal prior for :

  • Next step:

○ Compute the Bayes estimator of for unlabeled node and then make the prediction (recall ZGL)

  • Done!

29

Yatong Chen, Stanford University, WWW 2019

slide-30
SLIDE 30

Relationship between Decoupled smoothing and some phenomenon/concept/method that are related to it

30

slide-31
SLIDE 31

Decoupled smoothing and Monophily

  • The phenomenon of Monophily [Altenburger-Ugander 2018]

○ Two-hop similarity: individuals are similar to their friends’ friends ○ Innovative concept compared to Homophily

31

Identity: male Preference: female

Male Female

slide-32
SLIDE 32

Decoupled smoothing implies Monophily

  • The phenomenon of Monophily [Altenburger-Ugander 2018]

○ Two-hop similarity: ■ similarity among the friends of a person is the result of personal preference ○ The 2 hop similarity phenomenon is implied by our decoupling smoothing idea!

32

Identity: male Preference: female

Male Female

slide-33
SLIDE 33

Decoupled smoothing and 2-hop MV

  • Decoupled smoothing reduces to iterative 2-hop

majority vote (under homogeneous standard error):

33

2 hop Majority Vote (MV): Average over the labeled nodes in 2-hop friend sets

slide-34
SLIDE 34

Decoupled smoothing and ZGL

  • ZGL:
  • Assume Homophily
  • Prior:
  • Matrix A:

adjacency matrix

  • Reduce to iteratively

1-hop majority vote update method!

34

  • Decoupled smoothing:

○ Don’t assume Homophily ○ Prior: ○ Auxiliary matrix: ○ Reduce to iteratively 2-hop majority vote update method!

Decoupled smoothing and ZGL

slide-35
SLIDE 35

35

Empirical Results

Yatong Chen, Stanford University, WWW 2019

slide-36
SLIDE 36

Dataset

  • Facebook 100: Single-day snapshots of Facebook in

September 2005.

  • Goal: gender prediction

36

School Name Number of Nodes Number of Edges Amherst 2032 78733 Reed 962 18812 Haverford 1350 53904 Swarthmore 1517 53725

slide-37
SLIDE 37

Decoupled smoothing: empirical result

37

Reed Swarthmore

Yatong Chen, Stanford University, WWW 2019

slide-38
SLIDE 38

Decoupled smoothing: empirical result

38

Reed

Yatong Chen, Stanford University, WWW 2019

Swarthmore

ZGL ZGL

slide-39
SLIDE 39

Decoupled smoothing: empirical result

39

Reed

Yatong Chen, Stanford University, WWW 2019

Swarthmore

ZGL ZGL

slide-40
SLIDE 40

Decoupled smoothing: empirical result

40

Haverford Amherst

Yatong Chen, Stanford University, WWW 2019

ZGL ZGL

slide-41
SLIDE 41

Why 2-hop Majority Vote beats Decoupled Smoothing?

41

Amherst

Yatong Chen, Stanford University, WWW 2019

ZGL

slide-42
SLIDE 42

Summary

  • Introduce the idea of decoupling one’s “identity”

and “preference”

  • Justify/explain the phenomenon of 2-hop similarity

without assuming homophily

  • Open questions:

○ How to choose the weighted matrix W? ○ Why 2-hop Majority Vote outperforms decoupled smoothing: can you do better?

42

Yatong Chen, Stanford University, WWW 2019

slide-43
SLIDE 43

Questions?

43

slide-44
SLIDE 44

Thank you for your attention!

44