DM-MEETING 4/20/2016 Bijaya Adhikari OUTLINE 1. Nonlinear - - PowerPoint PPT Presentation

dm meeting 4 20 2016
SMART_READER_LITE
LIVE PREVIEW

DM-MEETING 4/20/2016 Bijaya Adhikari OUTLINE 1. Nonlinear - - PowerPoint PPT Presentation

DM-MEETING 4/20/2016 Bijaya Adhikari OUTLINE 1. Nonlinear Laplacian for Digraphs and its Application for Network Analysis 2. Rare Category Detection on Time-Evolving Graphs NONLINEAR LAPLACIAN FOR DIGRAPHS OUTLINE 1. Introduction 2.


slide-1
SLIDE 1

DM-MEETING 4/20/2016

Bijaya Adhikari

slide-2
SLIDE 2

OUTLINE

1. Nonlinear Laplacian for Digraphs and its Application for Network Analysis 2. Rare Category Detection on Time-Evolving Graphs

slide-3
SLIDE 3

NONLINEAR LAPLACIAN FOR DIGRAPHS …

OUTLINE

  • 1. Introduction
  • 2. Preliminaries
  • 3. Related Works
  • 4. Spectral Theroy for Digraphs
  • 5. Experiments
slide-4
SLIDE 4

INTRODUCTION

Spectral Graph Theory: Relations between graph theoretic measures and eigenvalues and eigenvectors of Laplacian Laplacian Normalized Laplacian Where D is Diagonal degree matrix and A is adjacency matrix

slide-5
SLIDE 5

PRELIMINARIES FOR UNDIRECTED GRAPHS

Volume of a Node Set: Cut of a Node Set: , where Conductance of a Node set: Conductance of Graph :

slide-6
SLIDE 6

PRELIMINARIES FOR DIGRAPHS

Out degree : and In degree: Degree : Cut+: , where Out-Conductance : Conductance: Conductance of Graph:

slide-7
SLIDE 7

RELATED WORK

Chung’s Normalized Laplacian: Where is diagonal matrix with , is stationary distribution Following inequality holds for Chung’s Normalized inequality is the conductance with respect to random walk process Where ,

slide-8
SLIDE 8

RELATED WORK

Second eigenvector of Chung’s Normalized Laplacian turns out be minimizer of Where and x is a variable vector Arc (u,v) brings nodes u and v closer in spectral ordering The effect is larger when π is larger.

slide-9
SLIDE 9

SPECTRAL THEORY FOR DIGRAPHS

slide-10
SLIDE 10

NORMALIZED LAPLACIAN FOR DIGRAPHS

slide-11
SLIDE 11

EIGENVALUES AND EIGENVECTORS

Normalized Laplacian has eigenvalue 0 and associated eigenvector What about other ? 1. Since is nonlinear markov operator the number of eigenvalues and eigenvectors are not known. 2. Calculating eigenvalues of nonlinear markov operator is NP-hard in general

slide-12
SLIDE 12

EIGENVALUES AND EIGENVECTORS

They define second eigenvalue as the smallest eigenvalue of

slide-13
SLIDE 13

CHEEGER’S INEQUALITY FOR DIGRAPHS

They show the following: This is more natural extension of cheeger’s inequality for undirected graphs than Chung’s method.

slide-14
SLIDE 14

ALGORITHM

slide-15
SLIDE 15

SPECTRAL ORDERING

The second eigenvector of normalized laplacian is minimizer of Where and

slide-16
SLIDE 16

EXPERIMENTS

Running Time for Algorithm 1:

slide-17
SLIDE 17

EXPERIMENTS

slide-18
SLIDE 18

SPECTRAL EMBEDDING

slide-19
SLIDE 19

SPECTRAL ORDERING

slide-20
SLIDE 20

CONDUCTANCE

slide-21
SLIDE 21

RARE CATEGORY DETECTION ON TIME EVOLVING GRAPHS

Rare category detection: Find minority classes (rare category) in big data by requesting minimum number of labels from the oracle. For static graph: RACH, MUVIR, GRADE and so on This paper is extension of GRADE.

slide-22
SLIDE 22

GRADE

1. Compute pair-wise similarity matrix (Adjacency matrix for graph data) 2. Calculate normalized matrix W, 3. Calculate global similarity matrix A by applying random walk with restart 4. Identify rare classes by querying oracle for nodes (data points) near the boundaries Intuition is that changes in A becomes sharp at the boundary of minority classes.

slide-23
SLIDE 23

DYNAMIC RARE CATEGORY DETECTION

Instead of performing GRADE at each step, make incremental changes to A and neighborhoods of nodes Assumptions 1) Number of examples is fixed 2) Dataset in imbalanced 3) Minority classes are not separable from Majority classes

slide-24
SLIDE 24

SINGLE UPDATE

If only one edge (self-loop) is added at time step t: Where and

slide-25
SLIDE 25
slide-26
SLIDE 26

BATCH UPDATE

slide-27
SLIDE 27

QUERY DYNAMICS

= 1) allocate all budgets at the first time step 2) allocate all budgets at the last time step 3) Allocate all budget at time T_opt 4) Allocate query budget evenly 5) Allocate query budget following exponential distribution

slide-28
SLIDE 28

EXPERIMENTS

slide-29
SLIDE 29

EXPERIMENTS

slide-30
SLIDE 30

EXPERIMENTS

slide-31
SLIDE 31

EXPERIMENTS