

SLIDE 1

Hyper Edge-Based Embedding in Heterogeneous Information Networks

Jiawei Han, Computer Science, University of Illinois at Urbana-Champaign, February 12, 2018


SLIDE 2

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 3

Big Data Challenge: The Curse of High Dimensionality

Text: word co-occurrence statistics matrix

• High dimensionality: there are over 171K words in the English language
• Redundancy: many words share similar semantic meanings
  • E.g., sea, ocean, marine, ...

SLIDE 4

Multi-Genre Network Challenge: High-Dimensional Data Too!

[Figure: a sparse 0/1 adjacency matrix over nodes 1-15, ...]

• High dimensionality: Facebook had 1,860 million monthly active users (Mar. 2017)
• Redundancy: users in the same cluster are likely to be connected

SLIDE 5

Why Low-Dimensional Space?

• Visualization
• Compression
• Exploratory data analysis
• Filling in (imputing) missing entries (link/node prediction)
• Classification and clustering

How can we automatically identify the lower-dimensional space that the high-dimensional data (approximately) lie in?

Solution to the data & network challenge: dimension reduction

SLIDE 6

Dimension Reduction Approaches: Low-Rank Estimation vs. Embedding Learning

• Low-rank estimation
  • Data recovery: impose a low-rank assumption as regularization
  • Low-dimensional vector space: spanned by the singular vectors (columns of $U$)
  • Low-rank model (SVD): $X \approx U \Sigma V^\top$, with $X \in \mathbb{R}^{m_1 \times m_2}$, where $U$ holds the left singular vectors, $V$ the right singular vectors, $\Sigma$ the singular values, and $r$ is the rank of $X$
• Embedding learning (representation learning)
  • Project data into a low-dimensional space
  • Low-dimensional vector space: spanned by the columns of $U$
  • Generalized low-rank model: $X \approx U V^\top$, where the rows of $U$ and $V$ are the latent factor vectors (embeddings), and $f$ is the dimension of the low-dimensional space
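To make the contrast concrete, a minimal NumPy sketch of both views; the matrix X and the choices of r and f are illustrative placeholders, not tied to any dataset in the talk.

```python
import numpy as np

X = np.random.rand(100, 80)      # stand-in for, e.g., a co-occurrence matrix
U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Low-rank estimation: recover X itself with the best rank-r approximation
r = 10
X_hat = U[:, :r] @ np.diag(s[:r]) @ Vt[:r, :]

# Embedding learning: keep only f-dimensional latent factors, one row per
# object, and hand them to downstream tasks instead of recovering X
f = 10
embeddings = U[:, :f] * s[:f]    # rows are the embedding vectors
```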

SLIDE 7

Word2Vec and Word Embedding

• Word2vec: created by T. Mikolov et al. at Google (2013)
• Input: a large corpus; output: a vector space of roughly 10^2 dimensions
• Words sharing common contexts lie in close proximity in the vector space
• Embedding vectors created by Word2vec: better than LSA (Latent Semantic Analysis)
• Models: shallow, two-layer neural networks
• Two model architectures:
  • Continuous bag-of-words (CBOW): word order does not matter; faster
  • Continuous skip-gram: weighs nearby context words more heavily than more distant ones; slower, but does a better job for infrequent words
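As a concrete illustration, a minimal sketch with gensim's Word2Vec on a toy corpus; the corpus and hyperparameter values are placeholders.

```python
from gensim.models import Word2Vec

corpus = [["the", "sea", "is", "deep"],
          ["the", "ocean", "is", "deep"],
          ["marine", "life", "thrives", "in", "the", "ocean"]]

# sg=0 selects CBOW (faster); sg=1 selects skip-gram (better on rare words)
model = Word2Vec(sentences=corpus, vector_size=100, window=5,
                 min_count=1, sg=1)

# words sharing contexts end up close together in the vector space
print(model.wv.most_similar("sea", topn=3))
```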

SLIDE 8

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 9

Embedding Networks into Low-Dimensional Vector Space

SLIDE 10

Recent Research Papers on Network Embedding (2013-2015)

• Distributed Large-scale Natural Graph Factorization (2013)
• Translating Embeddings for Modeling Multi-relational Data (TransE) (2013)
• DeepWalk: Online Learning of Social Representations (2014)
• Combining Two- and Three-Way Embeddings Models for Link Prediction in Knowledge Bases (TATEC) (2015)
• Holographic Embeddings of Knowledge Graphs (HolE) (2015)
• Diffusion Component Analysis: Unraveling Functional Topology in Biological Networks (2015)
• GraRep: Learning Graph Representations with Global Structural Information (2015)
• Deep Graph Kernels (2015)
• Heterogeneous Network Embedding via Deep Architectures (2015)
• PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks (2015)
• LINE: Large-scale Information Network Embedding (2015)

• J. Tang, M. Qu, M. Wang, M. Zhang, J. Yan, and Q. Mei, "LINE: Large-scale information network embedding", WWW'15 (cited 134 times)

SLIDE 11

Recent Research Papers on Network Embedding (2016)

• A General Framework for Content-enhanced Network Representation Learning (CENE)
• Variational Graph Auto-Encoders (VGAE)
• ProSNet: Integrating Homology with Molecular Networks for Protein Function Prediction
• Large-Scale Embedding Learning in Heterogeneous Event Data (HEBE) (Huan Gui, et al., ICDM 2016)
• AFET: Automatic Fine-Grained Entity Typing by Hierarchical Partial-Label Embedding (Xiang Ren, et al., EMNLP 2016)
• Deep Neural Networks for Learning Graph Representations (DNGR)
• subgraph2vec: Learning Distributed Representations of Rooted Sub-graphs from Large Graphs
• Walklets: Multiscale Graph Embeddings for Interpretable Network Classification
• Asymmetric Transitivity Preserving Graph Embedding (HOPE)
• Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding (PLE) (Xiang Ren, et al., KDD 2016)
• Semi-Supervised Classification with Graph Convolutional Networks (GCN)
• Revisiting Semi-Supervised Learning with Graph Embeddings (Planetoid)
• Structural Deep Network Embedding
• node2vec: Scalable Feature Learning for Networks

SLIDE 12

LINE: Large-scale Information Network Embedding

• J. Tang, M. Qu, M. Wang, M. Zhang, J. Yan, and Q. Mei, "LINE: Large-scale information network embedding", WWW'15

• 1st-order similarity: nodes with strong ties tend to be similar (e.g., nodes 6 & 7)
• 2nd-order similarity: nodes that share many neighbors tend to be similar (e.g., nodes 5 & 6)
• Well-learned embeddings should preserve both 1st-order and 2nd-order similarity (see the sketch below)
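A minimal NumPy sketch of the first-order objective on a toy graph, trained with plain SGD and one negative sample per edge; the real LINE implementation uses weighted edge sampling and asynchronous SGD for scale, and models second-order similarity with separate context vectors.

```python
import numpy as np

edges = [(0, 1), (1, 2), (2, 0), (3, 4)]    # toy undirected edge list
n, dim, lr = 5, 8, 0.1
rng = np.random.default_rng(0)
emb = rng.normal(scale=0.1, size=(n, dim))  # one vector per node

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

for _ in range(200):
    for i, j in edges:
        # push connected nodes together: gradient of log sigma(u_i . u_j)
        g = 1.0 - sigmoid(emb[i] @ emb[j])
        emb[i], emb[j] = emb[i] + lr * g * emb[j], emb[j] + lr * g * emb[i]
        # one negative sample: push a random node away from node i
        k = int(rng.integers(n))
        g_neg = -sigmoid(emb[i] @ emb[k])
        emb[i] += lr * g_neg * emb[k]
        emb[k] += lr * g_neg * emb[i]
```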

SLIDE 13

Experiment Setup

• Dataset
• Tasks:
  • Word analogy: evaluated on accuracy
  • Document classification: evaluated on Macro-F1 / Micro-F1
  • Vertex classification: evaluated on Macro-F1 / Micro-F1
• Result visualization

SLIDE 14

Results: Language Networks

[Tables: word analogy and document classification results; baseline GF = Graph Factorization (Ahmed et al., WWW 2013)]

SLIDE 15

Results: Social Networks

Flickr dataset

YouTube dataset

SLIDE 16

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 17

Task-Guided and Path-Augmented Heterogeneous Network Embedding

• T. Chen and Y. Sun, "Task-guided and Path-augmented Heterogeneous Network Embedding for Author Identification", WSDM'17

• Given an anonymized paper (often under double-blind review) with:
  • Venue (e.g., WSDM)
  • Year (e.g., 2017)
  • Keywords (e.g., "heterogeneous network embedding")
  • References (e.g., [Chen et al., IJCAI'16])
• Can we predict its authors?
• Previous work on author identification: feature engineering
• New approach: heterogeneous network embedding
  • Embedding: automatically represent nodes as low-dimensional feature vectors
  • Key challenge: selecting the best type of information, due to the heterogeneity of the network

SLIDE 18

Task-Guided Embedding

The embedding architecture for author identification:

• Consider the ego-network of paper $q$: $Y_q = (Y_q^1, Y_q^2, \dots, Y_q^U)$, where
  • $U$: the number of node types associated with the paper type
  • $Y_q^u$: the set of nodes of type $u$ associated with paper $q$
• $v_a$: the embedding of author $a$; $v_n$: the embedding of node $n$
• $W_q$: the embedding of paper $q$, a weighted average of the embeddings of all its neighbors
• The score function between paper $q$ and author $a$ compares the paper embedding against the author embedding (see the sketch below)
• Ranking-based objective: maximize the score difference between a true author $a$ and a sampled non-author $b$, via a soft hinge loss
• [Equation components labeled on the slide: soft hinge loss, author score, paper embedding, node type embedding, node embedding]
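A minimal NumPy sketch of these building blocks, under my reading of the slide: the paper embedding as a weighted neighbor average, a dot-product score, and a softplus-style soft hinge; the margin value and function names are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def paper_embedding(neighbor_embs, weights):
    """W_q: weighted average of the embeddings of the paper's neighbors."""
    w = np.asarray(weights, dtype=float)
    return (w / w.sum()) @ np.asarray(neighbor_embs)

def score(paper_emb, author_emb):
    """f(q, a): dot-product affinity between paper and candidate author."""
    return paper_emb @ author_emb

def soft_hinge(pos_score, neg_score, margin=1.0):
    """Penalize a true author failing to outscore a sampled non-author."""
    return np.log1p(np.exp(margin - (pos_score - neg_score)))
```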

SLIDE 19

Identification of Anonymous Authors: Experiments

• Dataset: AMiner citation dataset
  • Papers before 2012 are used for training; papers from 2012 onward are used for testing
• Baselines:
  • Supervised feature-based baselines (LR, SVM, RF, LambdaMART) with manually crafted features
  • Task-specific embedding
  • Network-general embedding
  • Pre-training + task-specific embedding: take the general embedding as the initialization of the task-specific embedding

SLIDE 20

Which Meta-Paths Are Selected?

• A-P-P: author writes paper, paper cites paper
• A-P-W: author writes paper, paper contains keyword
• P-A: paper written by author
• Paths are sorted according to their performance; only paths that improve the author identification task are shown (horizontal line: the performance of the task-specific-only embedding model)
• The first several paths are the most relevant and helpful; later ones can be harmful when used in network-general embedding (see the sketch below)
• [Figure: performance of the combined model as meta-paths are added gradually]
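A minimal sketch of the greedy procedure the slide implies: rank candidate meta-paths by their individual performance, then keep only those whose addition improves the author-identification metric. `evaluate` is a hypothetical stand-in for training and validating the combined embedding model on a set of meta-paths.

```python
def select_meta_paths(candidate_paths, evaluate):
    """Greedily add meta-paths (e.g., 'A-P-P', 'A-P-W') while they help."""
    ranked = sorted(candidate_paths, key=lambda p: evaluate([p]), reverse=True)
    selected, best = [], evaluate([])   # task-specific-only baseline
    for path in ranked:
        trial = evaluate(selected + [path])
        if trial > best:                # keep helpful paths, drop harmful ones
            selected.append(path)
            best = trial
    return selected
```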

SLIDE 21

The Real Game and Case Study

• Treat all the authors as candidates
• [Table: top-ranked authors for the queried paper]

SLIDE 22

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 23

Large-Scale Embedding Learning in Heterogeneous Events (HEBE)

• Embedding in heterogeneous information networks:
  • Multiple types of objects
  • Multiple types of interactions
  • How to preserve information among objects?
• Event: interactions that happen simultaneously
• Hyper-edge embedding is better than pairwise embedding

• H. Gui, J. Liu, F. Tao, M. Jiang, B. Norick, L. Kaplan, J. Han, "Large-Scale Embedding Learning in Heterogeneous Event Data", ICDM'16 + IEEE TKDE'17

SLIDE 24

SubEvent Sampling

• More than one object for each object type
• Sample objects to form subevents
• [Figure: subevent sampling over the object types Author, Term, Venue, Publication]

SLIDE 25

Hyper-Edge Based Embedding Framework (I)

[Figure: a scoring function relating a target object to its context object set, with alternative objects of the same type]

• Embedding learning model: object driven, for object prediction
• Distance measure: KL-divergence between the empirical conditional probability distribution and the model conditional probability (via softmax)
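A minimal NumPy sketch of the object-driven model as described: the conditional probability of a target object given the event's context objects is modeled with a softmax over alternative objects of the same type. The mean-dot-product scoring function is my assumption for illustration, not necessarily the paper's exact choice.

```python
import numpy as np

def score(target_emb, context_embs):
    """Score a candidate target against an event's context object set."""
    return float(np.mean(np.asarray(context_embs) @ target_emb))

def p_target_given_context(candidate_embs, context_embs):
    """Model conditional probability via softmax over same-type candidates."""
    s = np.array([score(c, context_embs) for c in candidate_embs])
    e = np.exp(s - s.max())          # numerically stabilized softmax
    return e / e.sum()
```

Training would then minimize the KL-divergence between this model distribution and the empirical one; the event-driven variant on the next slide swaps in events, rather than objects, as prediction targets.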

slide-26
SLIDE 26

31

Hyper-Edge Based Embedding Framework (II)

[Figure: a scoring function relating a target event to its context, with alternative events drawn from the subevent object set]

• Embedding learning model: event driven, for event prediction
• Distance measure: KL-divergence between the empirical conditional probability distribution and the model conditional probability (via softmax)

SLIDE 27

Experiments: Dataset Statistics

[Table: statistics of the DBLP and Yelp datasets]

SLIDE 28

Experiments: Classification Results

• DBLP: authors in four research groups/areas
• Yelp: restaurants in eleven cuisine categories
• HEBE (Hyper-Edge Based Embedding) gives better classification accuracy and is more robust to data sparsity

SLIDE 29

Experiments: Data Sparsity (DBLP)

• HEBE is more robust to data sparsity
• Density measure: average number of publications per author

SLIDE 30

Experiments: Data Sparsity (Yelp)

• HEBE is more robust to data sparsity
• Density measure: average number of reviews per business

SLIDE 31

Experiments: Noise Objects

• HEBE is more robust to noise in the data

SLIDE 32

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 33

AspEm: Aspect Embedding in Heterogeneous Networks

• Y. Shi, H. Gui, Q. Zhu, L. Kaplan, J. Han, "AspEm: Large-Scale Embedding Learning from Aspects in Heterogeneous Information Networks", SDM 2018

• Typed edges may not fully align with each other
  • E.g., why does a user like a movie? The director vs. the genre
• AspEm: preserve the semantic information in heterogeneous information networks based on multiple aspects
  • An embedding is learned on each aspect individually (see the sketch below)
• AspEm outperforms existing network embedding learning methods
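A minimal sketch of the aspect-wise idea: embed each aspect's subnetwork separately rather than the full heterogeneous network at once. `extract_subnetwork` and `embed` are hypothetical stand-ins for an aspect-restricted view of the network and any network-embedding learner.

```python
def aspem_embeddings(network, aspects, embed, extract_subnetwork):
    """One embedding table per aspect (e.g., a director-centered aspect
    vs. a genre-centered aspect of a movie network)."""
    return {aspect: embed(extract_subnetwork(network, aspect))
            for aspect in aspects}
```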

SLIDE 34

AspEm Captures More Semantic Info. in Heter. Info. Nets

• [Table: classification accuracy on DBLP-group, DBLP-area, and IMDb, using LR and SVM as classifiers]
• [Table: link prediction results on DBLP and IMDb]

SLIDE 35

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 36

Problem: Expert Finding in Bibliographic Networks

• Given a set of keywords, find related experts
  • Ex.: find experts on "information extraction"
• Challenge: the vocabulary gap
  • "relation extraction", "named entity recognition", ...
• The power of word embedding: use word embeddings to close the vocabulary gap (see the sketch below)
• Difficulty: discrepancy in queries
  • Specific queries, with narrow semantic meanings: "information extraction", "ontology alignment"
  • General queries, with broad semantic meanings: "data mining", "planning"
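A minimal sketch of using word embeddings to close the vocabulary gap: expand the query with its nearest neighbors in the embedding space. It assumes a trained gensim Word2Vec model whose vocabulary contains underscore-joined phrases; the model and phrase convention are illustrative, not from the paper.

```python
from gensim.models import Word2Vec

def expand_query(model: Word2Vec, query: str, topn: int = 5):
    """E.g., 'information_extraction' may pull in 'relation_extraction',
    'named_entity_recognition', ... if the corpus supports it."""
    neighbors = model.wv.most_similar(query, topn=topn)
    return [query] + [term for term, _ in neighbors]
```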

SLIDE 37

Local Embedding Training with Concept Hierarchy

• Use a concept hierarchy as guidance
• For an arbitrary query, a local embedding can be learned on the sub-corpus constrained to the parent topic; the parent topic becomes the background
• Recursive local embedding training (see the sketch below)
• The idea was proposed and developed in Huan Gui, et al., "Expert Finding in Heterogeneous Bibliographic Networks with Locally-trained Embeddings" (submitted to ECML-PKDD 2017)

[Figure: a global low-dimensional vector space covering topics such as Data Mining, Information Retrieval, Natural Language Processing, Formal Methods, and Programming Languages, and a local low-dimensional vector space under Natural Language Processing covering Information Extraction, Named Entity Recognition, Machine Translation, Speech Recognition, and Speech Segmentation]
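A minimal sketch of local embedding training under these assumptions: word vectors are retrained on only the sub-corpus under the parent topic (gensim Word2Vec; the relevance test `is_about` is a hypothetical helper standing in for however the sub-corpus is selected).

```python
from gensim.models import Word2Vec

def train_local_embedding(corpus, parent_topic, is_about):
    """corpus: tokenized documents; keep only those under the parent topic,
    so the learned space is local to that topic (recurse down the concept
    hierarchy for narrower queries)."""
    sub_corpus = [doc for doc in corpus if is_about(doc, parent_topic)]
    return Word2Vec(sentences=sub_corpus, vector_size=100,
                    window=5, min_count=2, sg=1)
```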

SLIDE 38

Ranking Experts in Heterogeneous Information Networks

• Expert finding: based on both relevance and importance, via ranking in networks
• Relevance:
  • A candidate may have expertise on multiple topics
  • Only papers relevant to the query can serve as evidence
• Heterogeneous information networks:
  • Citation may have a time-delay factor
  • Papers published in a higher-ranked venue are more likely to be important, so venues play an important role in ranking
• Ranking philosophy (see the sketch below):
  • Important & relevant papers are cited by many important & relevant papers
  • Relevant experts publish many important & relevant papers
  • Relevant venues publish many important & relevant papers
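A minimal NumPy sketch of this philosophy as mutual reinforcement (power iteration) over a toy paper-author-venue network; the update rule and normalization are illustrative assumptions, not the paper's exact equations.

```python
import numpy as np

def rank_experts(C, A, V, iters=50):
    """C: paper-cites-paper, A: author-writes-paper, V: venue-publishes-paper
    (nonnegative, relevance-weighted matrices). Scores reinforce each other."""
    p = np.ones(C.shape[0])
    for _ in range(iters):
        # papers gain score from relevant citing papers, authors, and venues
        p = C.T @ p + A.T @ (A @ p) + V.T @ (V @ p)
        p /= p.sum()                   # keep scores normalized
    return p, A @ p, V @ p             # paper, author, venue scores
```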

SLIDE 39

Experiments: LE-expert vs. Other Methods

• Dataset (DBLP): 2,244,018 documents; 1,274,360 authors
• Labels (20 queries):
  • General: machine-learning, natural-language-processing, planning, ...
  • Specific: face-recognition, information-extraction, kernel-methods, ontology-alignment, ...
• Significant improvement compared with the document-based model (Balog)

Case study: top-5 experts per query

• boosting
  • Co-ranking: Robert E. Schapire, Yoav Freund, Ron Kohavi, Thomas G. Dietterich, Yoram Singer
  • LE-expert: Robert E. Schapire, Yoav Freund, Leo Breiman, Yoram Singer, David P. Helmbold
• support vector machine
  • Co-ranking: Qi Wu, Isabelle Guyon, Jason Weston, Vladimir Vapnik, Bao-Liang Lu
  • LE-expert: Bernhard Scholkopf, Vladimir Vapnik, Christopher J. C. Burges, Thorsten Joachims, Chih-Jen Lin
• information extraction
  • Co-ranking: Ralph Grishman, Andrew McCallum, Ellen Riloff, Oren Etzioni, Dayne Freitag
  • LE-expert: Dayne Freitag, Ralph Grishman, Andrew McCallum, Nicholas Kushmerick, Stephen Soderland
• ontology alignment
  • Co-ranking: Jerome Euzenat, Patrick Lambrix, Jason J. Jung, He Tan, Marc Ehrig
  • LE-expert: W. Marco Schorlemmer, Yannis Kalfoglou, AnHai Doan, Jerome Euzenat, Alon Y. Halevy

SLIDE 40

Outline

• Dimension Reduction: Low-Rank Estimation vs. Embedding Learning
• Network Embedding for Homogeneous Networks
• Network Embedding for Heterogeneous Networks
• HEBE: Hyper-Edge Based Embedding in Heterogeneous Networks
• Aspect Embedding in Heterogeneous Networks
• Locally-Trained Embedding for Expert Finding in Heterogeneous Networks
• Summary and Discussions

SLIDE 41

Summary

• Embedding will play an important role in the whole game of turning data into networks and networks into knowledge
• Lots can be explored for network embedding in heterogeneous information networks!

[Figure: pipeline from text corpus through phrases and typed entities to heterogeneous information networks, multi-dimensional cubes, and a general knowledge base]

SLIDE 42

Acknowledgements


Thanks for the research support from: ARL/NSCTA, NIH, NSF, DARPA, DTRA, ...