Pattern-guided Big Data Processing on Hybrid Parallel Architectures
Fahad Khalid, Frank Feinbube, and Andreas Polze
Operating Systems and Middleware Group
Motivation
Insights from developing simulations for:
– Enumeration of Elementary Flux Modes in Metabolic Networks
– Prediction of aftershocks following earthquakes
– Prediction of volcanic events
– Adiabatic Quantum Computing
– Max Planck Institute of Molecular Plant Physiology
– GFZ German Research Center for Geosciences
September 25, 2014 Frank Feinbube | BigSys 2014 2
[Figure: serial processing of stages compared with pipelined processing of stages; each data item passes through stages T1, T2, T3]
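The difference between serial and pipelined processing of the stages T1, T2, T3 can be made concrete with a small timing model. The sketch below is an illustration under assumptions, not a measurement from the talk: it assumes an ideal pipeline in which the stages overlap fully and every item takes the same time in a given stage. The function names and the stage durations are hypothetical.

```python
def serial_time(stage_times, n_items):
    """Total time when each item finishes all stages before the next item starts."""
    return n_items * sum(stage_times)


def pipelined_time(stage_times, n_items):
    """Total time for an ideal pipeline.

    The first item needs the sum of all stage times to come out; after that,
    one item completes per period of the slowest (bottleneck) stage.
    """
    if n_items == 0:
        return 0
    return sum(stage_times) + (n_items - 1) * max(stage_times)


# Hypothetical durations for T1, T2, T3 in seconds (not taken from the slides).
stages = [2.0, 5.0, 3.0]
items = 10

print(serial_time(stages, items))     # 100.0
print(pipelined_time(stages, items))  # 55.0
```

Under this model the pipelined schedule is bounded by the slowest stage, so balancing stage durations matters more than shortening an already fast stage.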
– Analytical solutions to 3D Partial Differential Equations in Vectors
– Numerical solution to a System of Linear Equations
– Read input data from file
– Write output data to file
– Main memory and Cache sizes are limited
– Total memory required/available
– Impact of partition size on pipeline performance
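[Figure: processing partition P1 as a whole results in "Out of Memory"; split into sub-partitions P1,1, P1,2, P1,3, each one fits ("OK"). The complete dataset is divided into chunks, and each chunk into partitions (Partition 0, Partition 1, ...).]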
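The idea of splitting a partition that does not fit into memory can be sketched as follows. This is a minimal illustration, assuming fixed-size elements and a known memory budget. The function name `split_to_fit` and all numbers are hypothetical and do not come from the talk.

```python
def split_to_fit(n_elements, bytes_per_element, memory_limit_bytes):
    """Return (start, end) index ranges whose data each fit within the limit.

    The ranges are balanced so that no sub-partition is much smaller than
    the others, which keeps pipeline stages evenly loaded.
    """
    if bytes_per_element <= 0 or memory_limit_bytes < bytes_per_element:
        raise ValueError("memory limit must hold at least one element")

    max_elems = memory_limit_bytes // bytes_per_element
    n_parts = -(-n_elements // max_elems)  # ceiling division
    if n_parts == 0:
        return []

    base, extra = divmod(n_elements, n_parts)
    ranges = []
    start = 0
    for i in range(n_parts):
        size = base + (1 if i < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


# Hypothetical partition: 1000 elements of 8 bytes, 3000 bytes available.
print(split_to_fit(1000, 8, 3000))  # [(0, 334), (334, 667), (667, 1000)]
```

With these numbers a single partition would need 8000 bytes, so it is split into three sub-partitions of at most 375 elements each.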
[Figure: three-stage filter pipeline consisting of Device Filter, Callback Filter, and PostProcess Filter, shown repeatedly]
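A three-filter pipeline of this shape can be sketched with one thread per filter and bounded queues between them. This is a simplified illustration and not the framework used in the talk: the bodies of `device_filter`, `callback_filter`, and `postprocess_filter` are placeholders (the device filter here is plain Python rather than a GPU call), and `run_pipeline` is a hypothetical helper.

```python
import queue
import threading

_STOP = object()  # sentinel that marks the end of the stream


def run_stage(func, inbox, outbox):
    """Apply func to every item from inbox and forward the result to outbox."""
    while True:
        item = inbox.get()
        if item is _STOP:
            outbox.put(_STOP)
            return
        outbox.put(func(item))


def run_pipeline(filters, items, capacity=2):
    """Run items through the filters in order, each filter in its own thread."""
    queues = [queue.Queue(maxsize=capacity) for _ in range(len(filters) + 1)]

    def feed():
        for item in items:
            queues[0].put(item)
        queues[0].put(_STOP)

    threads = [threading.Thread(target=feed)]
    for i, func in enumerate(filters):
        threads.append(
            threading.Thread(target=run_stage, args=(func, queues[i], queues[i + 1]))
        )
    for t in threads:
        t.start()

    # Drain the last queue in the calling thread so bounded queues never stall.
    results = []
    while True:
        out = queues[-1].get()
        if out is _STOP:
            break
        results.append(out)
    for t in threads:
        t.join()
    return results


# Placeholder filter bodies (assumptions, for illustration only).
def device_filter(partition):
    return [x * x for x in partition]


def callback_filter(values):
    return sum(values)


def postprocess_filter(total):
    return f"partial={total}"


partitions = [[1, 2], [3, 4], [5]]
print(run_pipeline([device_filter, callback_filter, postprocess_filter], partitions))
```

The bounded queues limit how many partitions are in flight at once, which ties the pipeline depth to the memory budget. Error handling is omitted: an exception inside a filter would stall this sketch.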
[Figure: execution time in seconds (axis from 5 to 60) for dataset sizes of 500 million, 2 billion, 2.5 billion, 3.5 billion, 4.5 billion, 6.3 billion, and 8.1 billion; series: CPU-only Parallel, Custom Pipeline, HPF Pipeline]
parts:
1. Suitable for execution on the GPU
2. Suitable for execution on the CPU
kernels
– Everything goes, except for massive parallelism
the algorithm is suitable for GPUs?
[Figure: an algorithm decomposed into Pattern 1, Pattern 2, ..., Pattern n − 1, Pattern n, with the patterns assigned to the Accelerator or the CPU]
[Figure: the <<Map>> GPU Kernel produces an Intermediate Result, which is consumed by the <<Reduce>> CPU Kernel]
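The split into a map step on the GPU and a reduce step on the CPU can be sketched as below. This is only an illustration of the pattern under assumptions: `gpu_map_kernel` is a plain Python stand-in for a real GPU kernel, and the squaring and summing operations are hypothetical placeholders for the actual computation.

```python
from functools import reduce


def gpu_map_kernel(partition):
    """Stand-in for the <<Map>> GPU kernel: independent element-wise work."""
    return [x * x for x in partition]


def cpu_reduce_kernel(accumulator, intermediate):
    """Stand-in for the <<Reduce>> CPU kernel: fold one intermediate result in."""
    return accumulator + sum(intermediate)


def map_reduce(partitions):
    """Map each partition, then reduce the intermediate results one at a time."""
    # A generator keeps only one intermediate result alive at any moment.
    intermediates = (gpu_map_kernel(p) for p in partitions)
    return reduce(cpu_reduce_kernel, intermediates, 0)


print(map_reduce([[1, 2], [3, 4], [5]]))  # 55
```

The map step has no dependencies between elements, which is what makes it a candidate for massively parallel hardware, while the reduce step carries state across partitions and is kept on the CPU in this sketch.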