Graph Analytics: Complexity, Scalability, and Architectures Peter - PowerPoint PPT Presentation

Graph Analytics: Complexity, Scalability, and Architectures Peter M. Kogge McCourtney Prof. of CSE Univ. of Notre Dame IBM Fellow (retired) Please Sir, I want more GABB: May 23, 2016 1

Thesis • Graph computation is increasing • To date: most benchmarks are batch • Streaming becoming more important • This talk: Combine batch and streaming • Emerging architectures have real promise GABB: May 23, 2016 2

Graph Kernels and Benchmarks GABB: May 23, 2016 3

Graphs • Graph: https://www.researchgate.net/profile/ Mehmet_Bakal/publication/266968024/ – Set of objects called vertices figure/fig4/AS: 295737989582855@1447520839376/ Figure-42-A-sample-basic-retweet- graph.png – Set of links called edges between vertices – May have “properties” • Graph computing of increasing importance – Social Networks – Communication & power networks – Recommendation systems – Genomics – Cyber-security http://icensa.com/sites/default/files/styles/research_image/public/Unknown.png?itok=HfBBjbJK GABB: May 23, 2016 4

Classes of Graph Computation • Characteristics of individual vertices – E.g. “properties” such as degree • Characteristics of graph as a whole – E.g. diameter, max distance, covering • Characteristics of pairs of vertices – E.g. Shortest paths • Characteristics of subgraphs – E.g. Connected components, spanning tree – Similarities of subgraphs, … GABB: May 23, 2016 5

Classes of Application Computations • Batch : function applied to entire graph of major subgraph as it exists at some time • Streaming : – Incoming sequence of small-scale updates • New vertices or edges • Modification of a property of specific vertex or edge • Deletions – Sequence of localized queries GABB: May 23, 2016 6

Current Benchmark Suites Kernel ¡Class Benchmarking ¡Efforts Outputs Kernel Class: what class of Graph ¡Algorithm ¡Platform Compute ¡Vertex ¡Property Output ¡O(|V| k ) ¡List ¡(k>1) computing kernel performs Subgraph ¡Isomorphism Output ¡Global ¡Value Graph ¡Modification HPC ¡Graph ¡Analysis Output ¡O(1) ¡Events Output ¡O(|V|) ¡List Kepner ¡& ¡Gilbert Graph ¡Challenge Connectedness Path ¡Analysis Standalone GraphBLAS Clustering Graph500 Centrality Benchmarking Efforts Firehose Stinger Other VAST Kernel • S => Streaming Anomaly ¡-‑ ¡Fixed ¡Key X S X • B => Batch Anomaly ¡-‑ ¡Unbounded ¡Key X S X • B/S => Both Anomaly ¡-‑ ¡Two-‑level ¡Key X S X BC: ¡Betweeness ¡Centrality X B B B S X BFS: ¡Breadth ¡First ¡Search X B B B B B B X X Search ¡for ¡"Largest" X B X Outputs: what is size or CCW: ¡Weakly ¡Connected ¡Components X B B S X X CCS: ¡ ¡Strongly ¡Connected ¡Components X B B X structure of result of CCO: ¡Clustering ¡Coefficients X B S X kernel execution? CD: ¡Community ¡Detection X X S X X GC: ¡Graph ¡Contraction X B B X GP: ¡Graph ¡Partitioning X B/S B X GTC: ¡Global ¡Triangle ¡Counting X B X Insert/Delete X S X Jaccard X B/S X MIS: ¡Maximally ¡Independent ¡Set B B PR: ¡PageRank X B X SSSP: ¡Single ¡Source ¡Shortest ¡Path X B B/S B X X APSP: ¡All ¡pairs ¡Shortest ¡Path X B X SI: ¡General ¡Subgraph ¡Isomorphism X B/S TL: ¡Triangle ¡Listing X B/S X Geo ¡& ¡Temporal ¡Correlation X B/S X GABB: May 23, 2016 7

A Real World App GABB: May 23, 2016 8

Real World vs. Benchmarks • Processing more than single kernel • Many different classes of vertices • Many different classes of edges • Vertices may have 1000’s of properties • Edges may have timestamps • Both batch & streaming are integrated – Batch to clean/process existing data sets, add properties – Streaming (today) to query graph – Streaming (tomorrow) to update graph in real-time • “Neighborhoods” more important than full graph connectivity GABB: May 23, 2016 9

Sample Real-World Batch Analytic (From Lexis Nexis) Auto Insurance Co: “Tell me about giving auto policy to Jane Doe” in < 0.1sec • 2012: 40+ TB of Raw Data • Periodically clean up & combine to 4-7 TB Look up answers to • Weekly “Boil the Ocean” to precomputed queries precompute answers to all for “Jane Doe”, and combine standard queries – Does X have financial difficulties? – Does X have legal problems? “Jane Doe has no indicators – Has X had significant driving But problems? Relationships she has shared multiple – Who has shared addresses addresses with Joe Scofflaw with X? Who has the following negative – Who has shared property indicators ….” ownership with X? GABB: May 23, 2016 10

Sample Analytic Details • Given: 14.2+ billion records from – 800+ million entities (people, businesses) Vertices – 100+ million addresses – records on who has resided at what address Edges • Goal: for each entity ID, find all other IDs such that – Share at least 2 addresses in common – Or have one address in common and “close” last name – Matching last names requires processing to check for typos (“Levenshtein distance”) • Akin to a join based on common address, with grouping and thresholding on # of join results • Dozens of similar analytics computed once a week on 400 node cluster GABB: May 23, 2016 11

Sample Batch Implementation Platform: Lexis Nexis • Entity data kept in huge persistent tables – Often with 1,000s of columns • Programming in declarative ECL • THOR : runs “offline” on 400+ node systems – Batch analytic processing over large data sets – Large distributed parallel file system – Leaves all data sets for queries in indexed files • ROXIE : runs “online” on smaller system – User queries using output files from THOR – Dynamically interrogate indexed files Software Architecture: – Can perform localized ECL on data subsets https://upload.wikimedia.org/wikipedia/ commons/0/02/Fig4b_HPCC.jpg • No dynamic data updates GABB: May 23, 2016 12

Execution on Today’s Architectures • Model built to estimate usage of following – Bandwidth: Network, Disk, Memory – Processing capability • Baseline: cluster of 400 dual-Xeon nodes • Menu of improvement options investigated • “Conventional” improvements – No one option >45% increase in performance – Significant gains only when all applied at once • “Unconventional” improvements even better – ARMs for Xeons – 2-level memory – Computing in “3D memory” GABB: May 23, 2016 13

A Model Based on Contemporary Architecture 1.E+03 Baseline: ¡1026s Resources ¡Used/node ¡(sec) 10 ¡racks 1.E+02 1.E+01 1.E+00 1.E-‑01 1.E-‑02 1 2 3 4 5 6 7 8 9 Step ¡# Disk CPU Memory Network • Optimal code streams data thru multiple kernels till barrier • No one resource is consistent bottleneck • Inter-node comm: dynamically random small message GABB: May 23, 2016 14

The Core of This Computation as a Benchmark Kernel GABB: May 23, 2016 15

Sample Analytic Details • Given: 14.2+ billion records from – 800+ million entities (people, businesses) Vertices – 100+ million addresses – records on who has resided at what address Edges • Goal: for each entity ID, find all other IDs such that – Share at least 2 addresses in common – Or have one address in common and “close” last name – Matching last names requires processing to check for typos (“Levenshtein distance”) • Akin to a join based on common address, but with grouping and thresholding on # of join results • Dozens of similar analytics computed once a week on 400 node cluster GABB: May 23, 2016 16

Neighborhoods & Jaccard Coefficients: The Essence of NORA problems N(u) = set of neighbors of u u i Γ (u,v) = fraction of neighbors of u and j v v that are in common Γ (u,v) = |N(u) ∩ N(v)|/(N(u) U N(v)| Alternative: d(u) = # of neighbors of u ɤ (u, v) = # of common neighbors Γ (u,v) = ɤ (u, v) /(d(u)+d(v)- ɤ (u, v)) The LexisNexis shared address NORA problem is an extension of this Green and Purple lead to common neighbors Blue lead to non-common neighbors GABB: May 23, 2016 17

Results of a Map-Reduce Batch Implementation RMAT matrices, average d(i) = 16, on 1000 node system, each with 12 cores & 64GB 1.E+05 1.E+13 Time also grows more 1.E+12 than linearly 1.E+04 1.E+11 Coefficients Time ¡(Sec) 1.E+10 1.E+09 # Coefficients grows 1.E+03 more rapidly than # 1.E+08 vertices 1.E+07 1.E+02 1.E+04 1.E+05 1.E+06 1.E+07 1.E+08 1.E+04 1.E+05 1.E+06 1.E+07 1.E+08 Vertices Vertices Measured Modeled Measured Modeled JACS (Jaccard Coefficients / Sec) = 1.6E6*V 0.26 Entire LN Analytic approx 10X faster Burkhardt “Asking Hard Graph Questions,” Beyond Watson Workshop, Feb. 2014. GABB: May 23, 2016 18

Graph Analytics: Complexity, Scalability, and Architectures Peter - PowerPoint PPT Presentation

Graph Analytics: Complexity, Scalability, and Architectures Peter M. Kogge McCourtney Prof. of CSE Univ. of Notre Dame IBM Fellow (retired) Please Sir, I want more GABB: May 23, 2016 1 Thesis Graph computation is increasing To

Scalability and Replication Marco Serafini COMPSCI 532 Lecture 13 Scalability 2 Scalability

Performance and Scalability (Chapter 11) Performance and Scalability Performance: How long

Root zone scalability model Bart Gijsen October 28, 2009 Root zone scalability model

Analytics and Data Summit 2020 Analytics and Data Summit 2020 Analytics and Data Summit 2020

Architectures Architectural styles Software architectures Architectures versus middleware

Inherently Lower Complexity Inherently Lower Complexity Architectures using Architectures using

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Versioning of Topic Map Templates Structuring Versioning and Scalability Scalability Proc.

WITH RAPIDS Joe Eaton, Ph.D. Technical Lead for Graph Analytics AGENDA Introduction - Why

Split clique graph complexity L. Alcn and M. Gutierrez La Plata, Argentina L. Faria and C. M.

Undergraduate Business Analytics Minor Spreadsheet Analytics BANA-2081 Business Analytics

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Massively Parallel Graph Analytics Supercomputing for large-scale graph analytics George M. Slota

Hans Vangheluwe Modelling and Simulation Causes of Complexity Dealing with Complexity

Hans Vangheluwe Modelling and Simulation Causes of Complexity Dealing with Complexity

Complexity and Character of Human Languages The Faculty of Language Informatics 2A: Lecture 28

Management: Evidence from the Pension Fund Industry David Blake, Alberto Rossi, Allan Timmermann,

News System Environment bbsd Innd Jail Server Requirement(Innd) Install INN news server

Anonymous and Transferable Electronic Ticketing Scheme Data Privacy Management, 8th

Generation CMSC 426 - Computer Security Slides originally by Dr. Marron, modified by Robert Joyce

Recent results on branching Brownian motion on the positive real axis Pascal Maillard

Symbolic Execution and Fuzz Testing ISSISP Summer School 2018 Prof. Abhik Roychoudhury National

Automatic Groups Redux Robert Gilman Stevens Institute of Technology Geometric and Asymptotic

Security and Privacy at Scale Geetanjali Sampemane geta@google.com Cloud All your stuff is