
Evaluation of Parallel Graph Loading Techniques
Manuel Then, Moritz Kaufmann, Alfons Kemper, Thomas Neumann
Technical University of Munich, Chair of Database Systems

Manuel Then (TUM) | Evaluation of Parallel Graph Loading Techniques 3


General Graph Loading Pipeline
Goal: Efficiently load a given graph dataset for explorative analytics
Read
• Parse edges and create relabeling
• Write edges to worker-local buffer
Sync
• Find unique vertices
• Count neighbors
Write
• Create final graph data structure
• Apply final relabeling
Analytics
• The actual analytics work
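The Sync and Write phases above can be sketched as follows. This is a minimal, single-threaded illustration and not the authors' implementation: the names `Edge`, `CsrGraph` and `load_graph` are hypothetical, the target structure is assumed to be a compressed sparse row (CSR) adjacency array, and the Read phase is assumed to have already filled one edge buffer per worker. For simplicity the sketch builds the relabeling during Sync rather than during Read.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <unordered_map>
#include <vector>

// Hypothetical edge record holding the original vertex identifiers.
struct Edge {
    std::uint64_t src;
    std::uint64_t dst;
};

// Assumed target structure: CSR over dense vertex ids 0..n-1.
struct CsrGraph {
    std::vector<std::uint64_t> original_id;  // dense id -> original identifier
    std::vector<std::size_t> offsets;        // n + 1 entries
    std::vector<std::uint32_t> targets;      // dense ids of the neighbors
};

CsrGraph load_graph(const std::vector<std::vector<Edge>>& worker_buffers) {
    CsrGraph g;

    // Sync: find the unique vertices across all worker-local buffers.
    for (const auto& buffer : worker_buffers) {
        for (const Edge& e : buffer) {
            g.original_id.push_back(e.src);
            g.original_id.push_back(e.dst);
        }
    }
    std::sort(g.original_id.begin(), g.original_id.end());
    g.original_id.erase(std::unique(g.original_id.begin(), g.original_id.end()),
                        g.original_id.end());
    assert(g.original_id.size() <= std::numeric_limits<std::uint32_t>::max());

    // Relabeling: map each original identifier to a dense id.
    std::unordered_map<std::uint64_t, std::uint32_t> dense;
    dense.reserve(g.original_id.size());
    for (std::size_t i = 0; i < g.original_id.size(); ++i) {
        dense[g.original_id[i]] = static_cast<std::uint32_t>(i);
    }

    // Sync: count the neighbors of every vertex, then turn counts into offsets.
    const std::size_t n = g.original_id.size();
    g.offsets.assign(n + 1, 0);
    for (const auto& buffer : worker_buffers) {
        for (const Edge& e : buffer) {
            ++g.offsets[dense.at(e.src) + 1];
        }
    }
    for (std::size_t i = 0; i < n; ++i) {
        g.offsets[i + 1] += g.offsets[i];
    }

    // Write: create the final structure and apply the relabeling to each edge.
    g.targets.resize(g.offsets[n]);
    std::vector<std::size_t> cursor(g.offsets.begin(), g.offsets.end() - 1);
    for (const auto& buffer : worker_buffers) {
        for (const Edge& e : buffer) {
            g.targets[cursor[dense.at(e.src)]++] = dense.at(e.dst);
        }
    }
    return g;
}
```

Counting neighbors before writing lets the Write phase allocate the adjacency array once and place every edge directly at its final position. In a parallel version each worker could presumably run these loops over its own buffer, with synchronization around the shared counts.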

Scenario-specific Graph Loading
Problem: The optimal way of loading the graph depends on various factors:
• Format of the graph data
• Source of the data
• Properties of the input data
• Target graph data structure
• Execution machine
The graph loading pipeline must be adapted to the scenario at hand.

General Graph Loading Pipeline: scenario questions
• Identifier data type: binary, decimal, or string?
• Can the input data be read multiple times?
• Is random access possible?
• Is an explicit vertex list available?
• Which data structure to generate?

Parsers
[Slide annotations: 2x, 20x, 200x]
Binary reader
• No parsing necessary => directly copy vertex identifiers
• Every edge same size => work splitting trivial
Library-provided decimal parsing
• Readily available for many languages
• We evaluated C++’s stream operator and strtol
• Varying edge length => work splitting more complex
Iterative decimal parsing
• Multiply by ten and add character’s respective digit
Vectorized decimal parsing
• Leverage wide vector units for identifier parsing
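The first three parser variants can be sketched as below, under several assumptions that are not in the slides: text input holds one edge per line as two decimal identifiers separated by whitespace, binary input is a packed array of two 64-bit identifiers in host byte order, and input is well formed (no overflow or error handling). All function names are hypothetical. The library variant uses `std::strtoull`, the unsigned 64-bit sibling of the `strtol` named above. `align_to_line_start` illustrates why text input makes work splitting harder: a chunk boundary chosen by byte offset has to be moved to the next line start so that no edge is cut in half.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <cstring>
#include <vector>

// Hypothetical edge record: two 64-bit vertex identifiers.
struct Edge {
    std::uint64_t src;
    std::uint64_t dst;
};

// Binary reader: fixed-size records, so identifiers are copied directly.
std::vector<Edge> read_binary(const unsigned char* data, std::size_t bytes) {
    std::vector<Edge> edges(bytes / sizeof(Edge));
    if (!edges.empty()) {
        std::memcpy(edges.data(), data, edges.size() * sizeof(Edge));
    }
    return edges;
}

// Library-provided parsing; `text` must be NUL-terminated.
std::vector<Edge> parse_text_strtoull(const char* text) {
    std::vector<Edge> edges;
    const char* p = text;
    for (;;) {
        char* next = nullptr;
        const std::uint64_t src = std::strtoull(p, &next, 10);
        if (next == p) break;  // no further number found
        p = next;
        const std::uint64_t dst = std::strtoull(p, &next, 10);
        if (next == p) break;  // source id without a target id
        p = next;
        edges.push_back(Edge{src, dst});
    }
    return edges;
}

// Iterative decimal parsing: multiply by ten and add the character's digit.
// Advances `p` past the digits it consumed.
std::uint64_t parse_decimal(const char*& p, const char* end) {
    std::uint64_t value = 0;
    while (p != end && *p >= '0' && *p <= '9') {
        value = value * 10 + static_cast<std::uint64_t>(*p - '0');
        ++p;
    }
    return value;
}

// Skips separators (anything that is not a decimal digit).
void skip_non_digits(const char*& p, const char* end) {
    while (p != end && (*p < '0' || *p > '9')) ++p;
}

std::vector<Edge> parse_text_iterative(const char* p, const char* end) {
    std::vector<Edge> edges;
    for (;;) {
        skip_non_digits(p, end);
        if (p == end) break;
        Edge e;
        e.src = parse_decimal(p, end);
        skip_non_digits(p, end);
        e.dst = parse_decimal(p, end);
        edges.push_back(e);
    }
    return edges;
}

// Work splitting for text input: move a tentative chunk boundary forward
// to the start of the next line.
const char* align_to_line_start(const char* begin, const char* pos, const char* end) {
    assert(begin <= pos && pos <= end);
    if (pos == begin) return pos;
    while (pos != end && pos[-1] != '\n') ++pos;
    return pos;
}
```

For binary input the equivalent split is plain arithmetic on record counts, since every edge occupies `sizeof(Edge)` bytes.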
