SLIDE 1

CS224W: Analysis of Networks Jure Leskovec, R. Ying and J. You, Stanford University

http://cs224w.stanford.edu

SLIDE 2

Three topics for today:

  • 1. GNN recommendation (PinSage)
  • 2. Heterogeneous GNN (Decagon)
  • 3. Goal-directed generation (GCPN)

12/5/19 Jure Leskovec, Stanford CS224W: Machine Learning with Graphs, http://cs224w.stanford.edu 2

SLIDE 3

SLIDE 4

[Figure: users, items, and interactions; “You might also like”]

• Users interact with items
  – Watch movies, buy merchandise, listen to music
• Goal: Recommend items users might like
  – Customer X buys Metallica and Megadeth CDs
  – Customer Y buys Megadeth, so the recommender system suggests Metallica as well

SLIDE 5

Goal: Learn what items are related

• For a given query item(s) Q, return a set of similar items that we recommend to the user

Idea:

• User interacts with a set of items
• Formulate a query Q
• Search the items and return recommendations

[Figure: items (products, web sites, movies, posts, ads, …) -> query -> recommendations]

SLIDE 6

Query:

SLIDE 7

Query: Recommendations:

SLIDE 8

Query:

SLIDE 9

Query: Recommendations:

SLIDE 10

Having a universal similarity function allows for many applications:

• Homefeed (endless feed of recommendations)
• Related pins (find the most similar/related pins)
• Ads and shopping (use the organic result as the query and search the ads database)

SLIDE 11

Question: How do we define similarity?

• 1) Content-based: User and item features, in the form of images, text, categories, etc.
• 2) Graph-based: User-item interactions, in the form of graph/network structure
  – This is called collaborative filtering:
    – For a given user X, find others who liked similar items
    – Estimate what X will like based on what those similar users like
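The collaborative-filtering idea above can be sketched with a toy item-item similarity computation (a minimal illustration with invented data and function names, not a production recommender):

```python
import numpy as np

# Toy user-item interaction matrix (rows: users, columns: items).
# Customer X bought Metallica and Megadeth; customer Y bought Megadeth and Mozart.
#             Metallica  Megadeth  Mozart
R = np.array([[1,         1,        0],    # customer X
              [0,         1,        1]])   # customer Y

def item_similarity(R, i, j):
    """Cosine similarity between the interaction columns of items i and j."""
    a, b = R[:, i].astype(float), R[:, j].astype(float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

sim = item_similarity(R, 0, 1)   # Metallica vs. Megadeth: bought together by X
```

Since Megadeth is the item most similar to Metallica under this measure, a collaborative filter would suggest Metallica to customer Y, matching the Metallica/Megadeth example above.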

SLIDE 12

How do we define similarity:

• (1) Gathering “known” similarities
  – How to collect the data about what users like
• (2) Extrapolating unknown similarities from the known ones
  – Mainly interested in high unknown similarities: we are not interested in knowing what you don’t like, but what you do like
• (3) Evaluating methods
  – How to measure the success/performance of recommendation methods

SLIDE 13

• 300M users
• 4+B pins, 2+B boards

SLIDE 14

Pinterest: Human-curated collections of pins

Pin: A visual bookmark someone has saved from the internet to a board they’ve created; a pin consists of an image, text, and a link.
Board: A collection of ideas (pins having something in common)

SLIDE 15

Two sources of signal:

Features:
• Image and text of each pin

Graph:
• The graph is dynamic: we need to apply the model to new nodes without retraining

SLIDE 16

Goal: Learn embeddings for items

• Related Pins query: Which pin should we recommend when a user interacts with a pin whose embedding is w_q?
• Answer: Find the embedding w_r closest to w_q by nearest-neighbor search, and recommend that pin.

[Figure: item embeddings: previously pinned items w_1, w_2, the query pin w_q, and the related-pin recommendation w_r]

SLIDE 17

• Goal 1: Efficiently learn embeddings for billions of pins (items, nodes) using neural networks
• Goal 2: Perform nearest-neighbor queries to recommend items in real time

[Figure: query pin -> embed -> embedding space -> “predicted” related pin. The closer the embeddings are, the more similar the pins are.]
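Goal 2 (real-time nearest-neighbor recommendation) can be illustrated with a brute-force cosine lookup; the function name and toy data are invented, and a production system at this scale would use approximate nearest-neighbor search (e.g., locality-sensitive hashing) instead:

```python
import numpy as np

def recommend(query_emb, item_embs, k=2):
    """Return the indices of the k items whose embeddings have the highest
    cosine similarity to the query embedding."""
    q = query_emb / np.linalg.norm(query_emb)
    items = item_embs / np.linalg.norm(item_embs, axis=1, keepdims=True)
    sims = items @ q                 # cosine similarity of every item to the query
    return np.argsort(-sims)[:k]     # indices of the top-k most similar items

# Toy embedding table: items 0 and 2 point roughly in the query's direction.
items = np.array([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]])
query = np.array([1.0, 0.05])
top = recommend(query, items, k=2)   # -> indices [0, 2]
```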

SLIDE 18

Task: Recommend related pins to users

• Predict whether two nodes in a graph are related
• Task: Learn node embeddings z_v such that

d(z_cake1, z_cake2) < d(z_cake1, z_sweater)

[Figure: two embeddings z_1, z_2 and the distance d(z_1, z_2) between them]

SLIDE 19

Approach:

• Pins have embeddings at each layer
• The layer-0 embedding of a node is its features:
  – Text, image, …

[Figure: pin-board graph with a stack of aggregators; predict whether two nodes in a graph are related]

SLIDE 20

[Ying et al., WWW 2018]

• PinSage graph convolutional network:
  – Goal: Generate embeddings for nodes (e.g., pins) in the Pinterest graph containing billions of objects
  – Key idea: Borrow information from nearby nodes
    – E.g., a bed-rail pin might look like a garden fence, but gates and beds are rarely adjacent in the graph
  – Pin embeddings are essential to many different tasks; aside from the “Related Pins” task, they can also be used to:
    – Recommend related ads
    – Power Homefeed recommendations
    – Cluster users by their interests

SLIDE 21
  • 1. Collect billions of training pairs from logs
    – Positive pair: Two pins that are consecutively saved into the same board within a time interval (1 hour)
    – Negative pair: A random pair of two pins
      – With high probability the pins are not on the same board

SLIDE 22
  • 1. Collect billions of training pairs from logs
    – Positive pair: Two pins that are consecutively saved into the same board within a time interval (1 hour)
    – Negative pair: A random pair of two pins
      – With high probability the pins are not on the same board
  • 2. Train the GNN to generate similar embeddings for training pairs
  • 3. Inference: Generate embeddings for all pins
  • 4. Nearest-neighbor search in embedding space to make recommendations

SLIDE 23

• Train so that pins that are consecutively pinned have similar embeddings
• Max-margin loss:

L = Σ_{(u,v)∈D} max(0, −z_u⊤z_v + z_u⊤z_n + Δ)

where D is the set of training pairs from user logs, (u, v) is a “positive”/true training pair, n is a “negative” example, and Δ is the “margin” (i.e., how much larger the positive-pair similarity should be compared to the negative).
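The max-margin loss can be sketched for a single training triple (a minimal NumPy illustration; the variable names mirror the slide's notation):

```python
import numpy as np

def max_margin_loss(z_u, z_v, z_n, delta=1.0):
    """Hinge loss for one triple: the positive pair's dot-product similarity
    should exceed the negative pair's by at least the margin delta."""
    pos = float(z_u @ z_v)   # similarity of the true pair (u, v)
    neg = float(z_u @ z_n)   # similarity to the negative example n
    return max(0.0, -pos + neg + delta)

z_u = np.array([1.0, 0.0])
z_v = np.array([0.9, 0.1])   # similar to z_u:  pos = 0.9
z_n = np.array([0.0, 1.0])   # dissimilar:      neg = 0.0
loss = max_margin_loss(z_u, z_v, z_n)   # max(0, -0.9 + 0.0 + 1.0) = 0.1
```

Summing this over all pairs (u, v) in D gives the loss on the slide; whenever the margin is violated, the gradient pulls z_v toward z_u and pushes z_n away.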

SLIDE 24

• Four key innovations:
  • 1. On-the-fly graph convolutions
    – Sample the neighborhood around a node and dynamically construct a computation graph

[Figure: a minibatch of sampled neighborhoods]

SLIDE 25

• Four key innovations:
  • 1. On-the-fly graph convolutions
    – Perform a localized graph convolution around a particular node
    – Does not need the entire graph during training
    – At every iteration, only source-node embeddings are computed

SLIDE 26

• Four key innovations:
  • 2. Selecting neighbors via random walks
    – Performing aggregation over all neighbors is infeasible: how do we select the set of neighbors of a node to convolve over?
    – Personalized PageRank can help!
    – Importance pooling: Define importance-based neighborhoods by simulating random walks and selecting the neighbors with the highest visit counts

SLIDE 27

• Proximity to query node(s) Q

[Figure: pins and boards around query Q (e.g., “Strawberries Smoothies”, “Yummm”, “Smoothie Madness!”) annotated with random-walk visit counts such as 16, 14, 9, 8, …]

SLIDE 28

• Proximity to query node(s) Q
• Importance pooling
  – Choose the nodes with the top K visit counts
  – Pool over the chosen nodes
  – The chosen nodes are not necessarily direct neighbors

[Figure: the same graph, with the most-visited nodes highlighted]

SLIDE 29

• Example: suppose L = 5
• Rank nodes based on their random-walk visit counts
• Pick the top L nodes and normalize the counts:

16/55, 14/55, 9/55, 8/55, 8/55

• Aggregate messages from the top L nodes

[Figure: visit counts around Q, with the top L nodes highlighted]

SLIDE 30

• Pick the top L nodes and normalize the counts:

16/55, 14/55, 9/55, 8/55, 8/55

• GraphSAGE mean pooling
  – Average the messages from direct neighbors
• PinSage importance pooling
  – Use the normalized counts as weights for a weighted mean of the messages from the top L nodes
• PinSage uses L = 50
  – Negligible performance gain for L > 50
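The random-walk neighbor selection above can be sketched as follows (a toy illustration: the graph, walk length, and function name are invented; PinSage's actual walks alternate between pins and boards):

```python
import random
from collections import Counter

def importance_weights(adj, source, L=2, num_walks=10000, walk_len=2, seed=0):
    """Simulate short random walks from `source`, count visits to other
    nodes, keep the L most-visited nodes, and normalize their counts
    into importance-pooling weights."""
    rng = random.Random(seed)
    visits = Counter()
    for _ in range(num_walks):
        node = source
        for _ in range(walk_len):
            node = rng.choice(adj[node])
            if node != source:
                visits[node] += 1
    top = visits.most_common(L)
    total = sum(count for _, count in top)
    return {node: count / total for node, count in top}

# Toy graph: source 0 touches 1 and 2; node 3 is reachable only through 1.
adj = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1]}
weights = importance_weights(adj, source=0, L=2)   # most-visited: nodes 1 and 2
```

Importance pooling then takes a weighted mean of the top-L nodes' messages using these weights, instead of GraphSAGE's plain mean over direct neighbors.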

SLIDE 31

Four key innovations:

  • 3. Efficient MapReduce inference
    – Problem: Much computation is repeated if we use localized graph convolutions naively at inference time
    – Need to avoid this repeated computation

SLIDE 32

• Recall how we obtain negative examples:

L = Σ_{(u,v)∈D} max(0, −z_u⊤z_v + z_u⊤z_n + Δ)

where D is the set of training pairs from logs, (u, v) is the “positive”/true example, n is the “negative” example, and Δ is the “margin” (i.e., how much larger the positive-pair similarity should be compared to the negative).

SLIDE 33

Goal: Identify the target pin among 3B pins

• Issue: Need to learn with a resolution of 100 vs. 3B
• Massive size: 3 billion nodes, 20 billion edges
• Idea: Use harder and harder negative samples in this loss, forcing the model to learn subtle distinctions between pins

SLIDE 34

• Hard negative examples improve performance
• How to obtain hard negatives: use random walks
  – Use nodes with visit counts ranked at 1000-5000 as hard negatives
  – They have something in common with the query, but are not too similar

[Figure: a positive pair vs. negatives that are harder to distinguish from it]

SLIDE 35

• Hard negative examples improve performance
• Curriculum training on hard negatives
  – Start with random negative examples
  – Provide harder negative examples over time
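The curriculum can be sketched as a sampler that switches from uniform negatives to the hard band of pins ranked roughly 1000-5000 by random-walk visit count (the rank cutoffs follow the slides; the function name and epoch schedule are invented for illustration):

```python
import random

def sample_negative(pins_by_visit_rank, epoch, seed=0):
    """Curriculum negative sampling: epoch 0 draws a fully random negative;
    later epochs draw a 'hard' negative ranked 1000-5000 by visit count
    (similar to the query pin, but not a true match)."""
    rng = random.Random(seed)
    if epoch == 0:
        return rng.choice(pins_by_visit_rank)         # easy random negative
    return rng.choice(pins_by_visit_rank[1000:5000])  # hard-negative band

pins = list(range(100_000))   # pin ids sorted by visit count w.r.t. a query
easy = sample_negative(pins, epoch=0)
hard = sample_negative(pins, epoch=5)
```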

SLIDE 36

Related Pins recommendations

• Given that a user just saved pin Q, predict which pin X they are going to save next
• Setup: Embed 3B pins, find the nearest neighbors of Q
• Baseline embeddings:
  – Visual: VGG visual embeddings
  – Annotation: Word2vec embeddings
  – Combined: Concatenate the embeddings
• Metrics:
  – MRR: Mean reciprocal rank of the positive example X w.r.t. Q
  – Hit rate: Fraction of times the positive example X is among the top K closest to Q

SLIDE 37

• Pixie (graph-based): Simulate random walks starting at the query pin using the Pixie algorithm covered in class; the items with the top scores are retrieved as recommendations
• Visual, Annot. (feature-based): Nearest-neighbor recommendation using the visual (CNN) and annotation features of pins

SLIDE 38

[Figure: an example query pin with recommendations from Pixie, GraphSAGE, and PinSage]

SLIDE 39

[Figure: another example query pin with recommendations from Pixie, GraphSAGE, and PinSage]

SLIDE 40
  • 1. GNN recommendation (PinSage)
  • 2. Heterogeneous GNN (Decagon)
  • 3. Goal-directed generation (GCPN)


SLIDE 41

SLIDE 42

• So far we have only applied GNNs to simple graphs
  – GNNs do not explicitly use node and edge type information
• Real networks are often heterogeneous
• How do we use GNNs for heterogeneous graphs?

SLIDE 43

[Figure: a patient’s side effects and medications; a polypharmacy side effect of a drug combination]

Polypharmacy: using multiple drugs for a disease

SLIDE 44

• Polypharmacy is common for treating complex diseases and co-existing conditions
• High risk of side effects due to drug-drug interactions
• 15% of the U.S. population is affected
• Annual costs exceed $177 billion
• Difficult to identify manually:
  – Side effects are rare and occur only in a subset of patients
  – They are not observed in clinical testing

SLIDE 45

• Systematic experimental screening of drug interactions is challenging
• Idea: Computationally screen/predict polypharmacy side effects
  – Use molecular, pharmacological, and patient-population data
  – Guide translational strategies for combination treatments in patients

SLIDE 46

How likely is a pair of drugs d, e to lead to side effect s?

Model and predict the side effects of drug pairs

[Figure: drugs d and e linked by a side-effect edge s]

SLIDE 47

• Heterogeneous (multimodal) graphs: graphs with different node types and/or edge types

[Figure: an example graph with 2 node types and several edge types]

SLIDE 48

Goal: Given a partially observed graph, predict labeled edges between drug nodes

[Figure: drug nodes Simvastatin (S), Ciprofloxacin (C), Mupirocin (M), and Doxycycline (D) connected by side-effect edges r1, r2]

Query: Given a drug pair d, e, how likely is it that an edge (d, s2, e) exists?
(I.e., do co-prescribed drugs d and e lead to side effect s2?)

SLIDE 49

• Predict labeled edges between drug nodes
  – I.e., predict the likelihood that an edge (d, s2, t) exists between drug nodes d and t
  – Meaning: drug combination (d, t) leads to polypharmacy side effect s2

Predictions:

SLIDE 50

• Key insight: Compute GNN messages from each edge type, then aggregate across the different edge types

One layer of a heterogeneous GNN:
  – Input: heterogeneous graph
  – Output: node embeddings

[Figure: one GNN per edge type (e.g., s2, s3, drug-target), with the results summed]
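One heterogeneous-GNN layer of the kind sketched above (per-edge-type message passing followed by a sum) can be written as a small NumPy function; the weight matrices, toy graph, and function name are illustrative, not Decagon's actual parameterization:

```python
import numpy as np

def hetero_gnn_layer(h, edges_by_type, W_by_type, W_self):
    """Aggregate neighbor messages separately per edge type (each type has
    its own weight matrix), sum the per-type results with a self transform,
    and apply a ReLU."""
    n, d = h.shape
    out = h @ W_self
    for etype, edges in edges_by_type.items():
        agg, deg = np.zeros((n, d)), np.zeros(n)
        for u, v in edges:                 # message flows from v into u
            agg[u] += h[v]
            deg[u] += 1
        deg[deg == 0] = 1                  # avoid dividing by zero
        out += (agg / deg[:, None]) @ W_by_type[etype]
    return np.maximum(out, 0.0)            # ReLU

h = np.eye(3)                                        # 3 nodes, one-hot features
edges = {"s2": [(0, 1)], "drug_target": [(0, 2)]}    # two edge types
W = {"s2": np.eye(3), "drug_target": np.eye(3)}
h1 = hetero_gnn_layer(h, edges, W, W_self=np.eye(3))  # node 0 hears from 1 and 2
```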

SLIDE 51

• Key insight: Use the pair of computed node embeddings to make edge predictions
  – Input: node embeddings of the query drug pairs
  – Output: predicted edges

[Figure: predict possible edges with a neural network]
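This edge-prediction step can be sketched with a bilinear decoder: score a candidate edge (d, s, t) from the two drug embeddings and a per-side-effect relation matrix, then squash to a probability. The names here are illustrative; Decagon's actual decoder is a tensor factorization, and a small neural network over the concatenated embeddings would serve the same role:

```python
import numpy as np

def edge_prob(z_d, z_t, R_s):
    """Probability that drug pair (d, t) exhibits side effect s, scored as
    sigmoid(z_d^T R_s z_t) from the two node embeddings."""
    score = float(z_d @ R_s @ z_t)
    return 1.0 / (1.0 + np.exp(-score))

z_d = np.array([1.0, 0.0])           # embedding of drug d
z_t = np.array([1.0, 0.0])           # embedding of drug t
R_s = 3.0 * np.eye(2)                # relation matrix for side effect s
p = edge_prob(z_d, z_t, R_s)         # aligned embeddings: high probability
```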

SLIDE 52

[Figure: predicted probability p for each candidate edge]

SLIDE 53

• Data:
  – Graph over molecules: protein-protein interactions and drug-target relationships
  – Graph over the population: side effects of individual drugs, polypharmacy side effects of drug combinations
• Setup:
  – Construct a heterogeneous graph of all the data
  – Train: Fit a model to predict known associations between drug pairs and polypharmacy side effects
  – Test: Given a query drug pair, predict candidate polypharmacy side effects

SLIDE 54

• Up to 54% improvement over baselines
• First opportunity to computationally flag polypharmacy side effects for follow-up analyses

Method               AUROC   AUPRC   AP@50
Decagon (3-layer)    0.834   0.776   0.731
Decagon (2-layer)    0.809   0.762   0.713
RESCAL               0.693   0.613   0.476
Node2vec             0.725   0.708   0.643
Drug features        0.736   0.722   0.679

SLIDE 55

[Figure: a predicted polypharmacy side effect between drug c and drug d]

SLIDE 56

Evidence found

[Figure: literature evidence supporting the predicted side effect between drug c and drug d]

SLIDE 57
  • 1. GNN recommendation (PinSage)
  • 2. Heterogeneous GNN (Decagon)
  • 3. Goal-directed generation (GCPN)


SLIDE 58

SLIDE 59

• Given: Graphs sampled from q_data(G)
• Goal:
  – Learn the distribution q_model(G)
  – Sample from q_model(G)

[Figure: q_data(G) -> learn & sample -> q_model(G)]

SLIDE 60

Generating graphs by sequentially adding nodes and edges

[You et al., ICML 2018]

[Figure: a graph G and its step-by-step generation process]

SLIDE 61

Quick summary of GraphRNN:
  – Generate a graph by generating a two-level sequence
  – Use an RNN to generate the sequences

[Figure: a graph G, its adjacency matrix, and the node-level and edge-level RNNs]
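The “two-level sequence” can be made concrete: the node-level sequence adds nodes one at a time, and for each new node the edge-level sequence is a binary vector over the previously added nodes (a minimal sketch with an invented helper name):

```python
def graph_to_sequences(adj, order):
    """Flatten a graph into a GraphRNN-style two-level sequence: for the
    i-th node in `order`, emit a 0/1 vector marking which of the i-1
    previously added nodes it connects to."""
    seq = []
    for i, v in enumerate(order):
        seq.append([1 if u in adj[v] else 0 for u in order[:i]])
    return seq

# Toy graph: a triangle on nodes 0, 1, 2.
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}}
seq = graph_to_sequences(adj, order=[0, 1, 2])   # -> [[], [1], [1, 1]]
```

The node-level RNN consumes one entry of `seq` per step, while an edge-level RNN emits the bits inside each entry.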

SLIDE 62


SLIDE 63

Can we do more than imitating given graphs?

SLIDE 64

Question: Can we learn a model that can generate valid and realistic molecules with a high value of a given chemical property?

[Figure: model -> output molecule that optimizes the property, e.g., drug_likeness = 0.95]

[You et al., NeurIPS 2018]

Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation. J. You, B. Liu, R. Ying, V. Pande, J. Leskovec. Neural Information Processing Systems (NeurIPS), 2018.

SLIDE 65

• Node types: C, N, O, …
• Edge types: single bond, double bond, …
• Note: hydrogen atoms can be automatically inferred via chemical-validity rules, and are thus omitted from molecular graphs

[Figure: a molecular graph with C/N/O nodes and bond edges]

SLIDE 66

Generating graphs that:

• Optimize a given objective (high scores)
  – E.g., drug-likeness
• Obey underlying rules (valid)
  – E.g., chemical-validity rules
• Are learned from examples (realistic)
  – E.g., imitating a molecule-graph dataset

SLIDE 67

Generating graphs that:

• Optimize a given objective (high scores)
  – E.g., drug-likeness
• Obey underlying rules (valid)
  – E.g., chemical-validity rules

Including a “black box” in ML: objectives like drug-likeness are governed by physical laws, which are assumed to be unknown to us!

SLIDE 68

• An ML agent observes the environment, takes an action to interact with the environment, and receives a positive or negative reward
• The agent then learns from this loop
• Key: The environment is a black box to the agent

[Figure: ML agent -> action -> environment -> observation, reward]

SLIDE 69

• Policy: The agent’s behavior, which maps observations to actions
• Policy-based RL: The agent directly learns an optimal policy from data

[Figure: agent with a policy -> action -> environment -> observation, reward]

SLIDE 70

Graph Convolutional Policy Network (GCPN) combines graph representation + RL:

• A graph neural network captures complex structural information, and enables a validity check in each state transition (valid)
• Reinforcement learning optimizes intermediate/final rewards (high scores)
• Adversarial training imitates examples in given datasets (realistic)

SLIDE 71

• (a) Insert nodes/scaffolds
• (b) Compute the state via a GCN
• (c) Sample the next action
• (d) Take the action (check chemical validity)
• (e, f) Compute the reward

SLIDE 72

• Learn to take valid actions
  – At each step, assign a small positive reward for a valid action
• Optimize desired properties
  – At the end, assign a positive reward for a high desired-property score
• Generate realistic graphs
  – At the end, adversarially train a GCN discriminator and compute adversarial rewards that encourage realistic molecule graphs

SLIDE 73

Reward: r_t = Final reward + Step reward

• Final reward = domain-specific reward
• Step reward = step-wise validity reward

SLIDE 74

• Two parts:
• (1) Supervised training: Train the policy by imitating the actions given by real observed graphs; use the gradient of the imitation loss
• (2) RL training: Train the policy to optimize rewards; use the standard policy-gradient algorithm (refer to any RL course, e.g., CS234)
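The policy-gradient part can be illustrated on a trivial two-action bandit with a softmax policy and REINFORCE updates (reward times grad log pi(action)); this is a generic sketch of the algorithm family the slide names, not GCPN's actual graph-building policy:

```python
import numpy as np

def reinforce_step(theta, rng, lr=0.5):
    """One REINFORCE update: sample an action from softmax(theta), observe
    a reward, and ascend reward * grad log pi(action)."""
    probs = np.exp(theta) / np.exp(theta).sum()
    action = rng.choice(2, p=probs)
    reward = 1.0 if action == 1 else 0.0   # action 1 is the rewarded one
    grad_log = -probs                      # gradient of log softmax(theta)[action]
    grad_log[action] += 1.0                # ... equals onehot(action) - probs
    return theta + lr * reward * grad_log

rng = np.random.default_rng(0)
theta = np.zeros(2)
for _ in range(200):
    theta = reinforce_step(theta, rng)
# After training, the policy strongly prefers the rewarded action.
```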

SLIDE 75

[Figure: GCPN training overview. Supervised training: given a graph from the query dataset, a cross-entropy loss on the next action imitates the real next graph, trained by gradient descent. RL training: the environment is run one step (or until stop) to produce generated graphs, with step rewards (e.g., 0.1) and a final reward (e.g., 1) used for the policy gradient.]

SLIDE 76

[Figure: generated molecules evaluated on validity, property score, and realism]

SLIDE 77

• Property optimization
  – Generate molecules with a high specified property score
• Property targeting
  – Generate molecules whose specified property score falls within a given range
• Constrained property optimization
  – Edit a given molecule for a few steps to achieve a higher specified property score

SLIDE 78

• ZINC250k dataset
  – 250,000 drug-like molecules with at most 38 atoms each
• Baselines:
  – ORGAN: String representation + RL [Guimaraes et al., 2017]
  – JT-VAE: VAE-based vector representation + Bayesian optimization [Jin et al., 2018]

SLIDE 79

Property optimization

• +60% higher property scores

logP: octanol-water partition coefficient; indicates solubility
QED: an indicator of drug-likeness

SLIDE 80

Property targeting

• 7x higher success rate than JT-VAE, with 10% lower diversity

logP: octanol-water partition coefficient; indicates solubility
MW: molecular weight, an indicator of drug-likeness
Diversity: average pairwise Tanimoto distance between the Morgan fingerprints of molecules

SLIDE 81

Constrained property optimization

• +180% higher scores than JT-VAE

SLIDE 82

Visualization of GCPN graphs: Property optimization

SLIDE 83

Visualization of GCPN graphs: Constrained optimization

[Figure: starting structure -> finished structure]

SLIDE 84

• Complex graphs can be successfully generated via sequential generation
• At each step, a decision is made based on a hidden state, which can be:
  – Explicit: intermediate generated graphs, decoded with a GCN
  – Implicit: a vector representation, decoded with an RNN
• Possible tasks:
  – Imitating a set of given graphs
  – Optimizing graphs towards given goals

SLIDE 85

PinSage:
• Graph Convolutional Neural Networks for Web-Scale Recommender Systems. R. Ying, R. He, K. Chen, P. Eksombatchai, W. Hamilton, J. Leskovec. KDD 2018.

Decagon:
• Modeling Polypharmacy Side Effects with Graph Convolutional Networks. M. Zitnik, M. Agrawal, J. Leskovec. Bioinformatics 2018.
• Website: http://snap.stanford.edu/decagon/

GCPN:
• Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation. J. You, B. Liu, R. Ying, V. Pande, J. Leskovec. NeurIPS 2018.
• Code: https://github.com/bowenliu16/rl_graph_generation

SLIDE 86

• Project write-ups:
  – Due Tue Dec 10 (11:59 PM) Pacific Time. No late days!
  – One team member uploads the PDF to Gradescope
  – Don’t forget to tag your other team members!
• Poster session:
  – Thu Dec 12, 12:15 – 3:15 pm in Huang Foyer
  – All groups with at least one non-SCPD member must present
  – There should be 1 person at the poster at all times
  – Prepare a 2-minute elevator pitch of your poster
  – More instructions on Piazza

SLIDE 87

• CS246: Mining Massive Datasets (Winter 2020)
  – Data mining & machine learning for big data
  – (big == doesn’t fit in memory / on a single machine), SPARK
• CS341: Project in Data Mining (Spring 2020)
  – Groups do a research project on big data
  – We provide interesting data, projects, and access to the Google Cloud infrastructure
  – A nice way to finish up your CS224W project & publish it!

SLIDE 88

• Conferences / journals:
  – KDD: Conf. on Knowledge Discovery & Data Mining
  – ICML: Intl. Conf. on Machine Learning
  – NeurIPS: Neural Information Processing Systems
  – ICLR: Intl. Conf. on Learning Representations
  – WWW: ACM World Wide Web Conference
  – WSDM: ACM Web Search and Data Mining
  – ICWSM: AAAI Intl. Conf. on Weblogs & Social Media
  – Journal of Network Science
  – Journal of Complex Networks

SLIDE 89

• Other relevant courses:
  – CS229: Machine Learning
  – CS230: Deep Learning
  – MSE231: Computational Social Science
  – MSE334: The Structure of Social Data
  – CS276: Information Retrieval and Web Search
  – CS245: Database System Principles
  – CS347: Transaction Processing & Databases

SLIDE 90

Thank You


SLIDE 91

• You have done a lot!!!
• And (hopefully) learned a lot!!!
  – Answered questions and proved many interesting results
  – Implemented a number of methods
  – And are doing excellently on the class project!

Thank You for the Hard Work!!!

SLIDE 92