di discovering graph temporal association rules
play

Di Discovering Graph Temporal Association Rules Qi Song Mohammad - PowerPoint PPT Presentation

Di Discovering Graph Temporal Association Rules Qi Song Mohammad Hossein Namaki Yinghui Wu Peng Lin Tingjian Ge* Washington State University, *UMass Lowell Temporal association rules in networks Time-aware POI recommendation check-in z


  1. Di Discovering Graph Temporal Association Rules Qi Song Mohammad Hossein Namaki Yinghui Wu Peng Lin Tingjian Ge* Washington State University, *UMass Lowell

  2. Temporal association rules in networks Ø Time-aware POI recommendation check-in z retweet ū u ū POI ≤ 2 hours user P 2 P 1 user user R 1 w w POI POI Left hand side event(LHS) Right hand side event(RHS) Ø in P 2 can be recommended as a point of interest for Ø is a potential customer for Requirement: AR’s with topological , semantic and temporal constraints 2

  3. Outline Ø Graph temporal association rules (GTARs) definition Ø GTARs discovery problem formalization Ø A feasible GTAR discovery algorithm Ø Experiment study: verify the effectiveness of GTARs, and the efficiency of GTAR discovery algorithm. 3

  4. Temporal Graph Ø Temporal graph G T (V,E,L,T). Ø Snapshot G t : induced by the set of all edges associated with time stamp t. G 3 4

  5. Graph temporal association rules (GTAR) Ø GTAR φ = (P 1 ⇒ P 2 , ū, Δt) Ø ū: common shared focus. Ø Δt: a constant that specifies a time interval. If there exists an occurrence of event P 1 at an entity specified by ū at some time t, then it is likely that an event P 2 occurs at the same entity, within a time window [t, t + Δt] RHS LHS check-in retweet z ū u ū ≤ 2 hours POI P 1 user P 2 user user R 2 w w POI POI φ = (P 1 ⇒ P 2 , ū, Δt=2hours) 5

  6. Events and Matching Ø Events Ø Connected subgraph pattern carry a designated focus node. retweet u ū Ø Event matching user user Ø An event P occurred in G T at time t if there is a matching relation (R t ) between P and snapshot G t w P 1 Ø focus occurrence o(P, ū , t) : the nodes in V that matches ū induced by R t POI Ø Example: Ø Matches of ū induced by R 3 in G 3 contains {(x 1 ,3),(x 2 ,3),(x 3 ,3)} Ø o(P 1 , ū, 3) is {x 1 ,x 2 ,x 3 } One subgraph matching of P 1 6

  7. GTAR occurrence Ø Given a time window [t 1 ,t 2 ], φ occurs if at least a node matches the focus of both P 1 and P 2 at t 1 and t 2 , respectively. Ø A time window may contain multiple occurrences of a GTAR. Ø Minimal occurrence Ø O(v)=[t 1 ,t 2 ] is an occurrence of φ in G T supported by node v Ø There exists no O’(v) ⊂ O(v), such that O’(v) is also an occurrence P 1 P 2 7

  8. Support and Confidence Ø Based on minimal occurrences O ( ϕ , G T ) # Occurrence of this rule Supp( ϕ , G T ) = O ( ϕ , G T ) C ( u ) T Normalizer Ø Confidence: measures how likely P 2 occurs within Δt time at the focus occurrence of P 1 # Support of this rule Conf ( ϕ , G T ) = Supp ( ϕ , G T ) Supp ( P 1 , G T ) # Support of LHS 8

  9. GTAR Discovery Informative GTARs Ø Interested in GTARs with high support and confidence Ø Maximal GTARs with size bound to be more informative Ø In a b -maximal GTAR, both LHS and RHS have at most b edges. The Discovery Problem Ø Input : Temporal graph G T , focus ū , time interval Δt , size bound b , support threshold σ, and confidence threshold θ; Ø Output: The set of b -maximal GTARs Σ pertaining to ū and Δt such that for each GTAR φ ∈ Σ, Supp(φ, G T ) ≥ σ, and Conf(φ, G T ) ≥ θ. 9

  10. GTAR Discovery Ø Integrate event mining and rule discovery as a single process Ø Intuition: Rule with high support Conf ( ϕ , G T ) = Supp ( ϕ , G T ) Supp ( P 1 , G T ) LHS with low support Ø LHS generation by best-first strategy. Ø Generate and verify best new LHS events Ø RHS generation given fixed LHS Ø To generate and validate new GTAR candidates by appending best RHS events to verified LHS events. Ø It prefers RHS events with high support. 10

  11. GTAR Discovery Ø GTAR discovery: queue L P’ 1 (ū) queue R retweet user user 1. event P 2 … u ū spawning P 1 (ū) 4. rule validation backtracking check-in POI retweet user user user u ū z ū … P 2 P 7 s w POI show 3. rule spawning 2. event verification 11

  12. Performance analysis and optimization Ø Complexity: Ø Time: O(|T|N(b)(b+|V|)(b+|E|)+N(b) 2 |T|) Ø Space: O(N(b)|C(ū)||T|) Ø Size bound b is small in practice and Ø Number of events N(b) is significantly reduced by pruning rules Ø Optimization Ø Pruning rules: extend (conditional) anti-monotonicity to GTARs Ø Anytime performance: returning GTARs as the events are discovered Ø Batch matching: merge snapshots to a graph and perform one matching 12

  13. Experimental Study Ø Datasets #Nodes #Edges #Labels #Snapshots Citation 4.3M 21.7M 273 80 Panama 839k 3.6M 433 12k Movielens 81.5k 10M 21 1439 Ø Algorithms Ø DisGTAR : our integrated algorithms including all pruning rules Ø DisGTARn : without the pruning strategies. ( Pruning ) Ø IsoGTAR : isolating the snapshots and computes event matching over each snapshots one by one. ( Batch matching ) Ø SeqGTAR: separating event mining and rule discovery to two independent processes. ( Integrate mining ) 13

  14. Performance of GTAR discovery DisGTAR DisGTARn SeqGTAR IsoGTAR Time(s) # verif. Time(s) # verif. Time(s) # verif. Time(s) # verif. Panama 9 1,194 276 8,393 560 8,393 N/A Citation 22 157 994 12,507 1,621 12,507 12,721 11,461 MovieLens 558 191 2,432 1,423 2,445 1,423 N/A DisGTAR outperforms DisGTARn, SeqGTAR, and IsoGTAR by 6.28, 7.85 and 64.79 times on average 14

  15. Anytime performance 18 seconds 8 seconds Time vs. Accuracy (Citation) Time vs. Accuracy (Panama) ∑ Conf ( ϕ , G T ) ϕ ∈Σ t anytime quality(t) = ∑ Conf ( ϕ , G T ) ϕ ∈Σ * DisGTAR converges with high quality GTARs much faster than SeqGTAR 15

  16. Scalability of DisGTAR Varying |G| , # of edges Varying |T| (Synthetic) (Synthetic) DisGTAR is much less sensitive than IsoGTAR DisGTAR is less sensitive to |G| The “packing” of consecutive Pruning rules timestamps to time intervals 16

  17. Case Study Matches: F.Geneve Project Management Matches: Prof. Christopher Manning(Stanford Univ.) 17

  18. Conclusion and future work Conclusion Ø We have proposed a class of temporal association rules over graphs Ø We have studied the discovery problem of GTARs Ø Despite the enhanced expressive power of GTARs, it is feasible to find and apply GTARs in practice. Future work Ø Extending GTARs to multi-focus and exploring other quality metrics Ø Fast online discovery of GTARs over graph streams. Sponsored By: 18 18

  19. Th Thank you! Related Work Ø Event Pattern Discovery by Keywords in Graph Streams (BigData’17) Ø BEAMS: Bounded Event Detection in Graph Streams (ICDE’16) (http://eecs.wsu.edu/~ksasani/BEAMS/Display.php )

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend