Stream Processing for Remote Collaborative Data Analysis
Scott Klasky146, C. S. Chang2, Jong Choi1, Michael Churchill2, Tahsin Kurc51, Manish Parashar3, Alex Sim7, Matthew Wolf14, John Wu7
1 ORNL, 2 PPPL, 3 Rutgers, 4GT, 5 SBU, 6UTK, 7 LBNL
Stream Processing for Remote Collaborative Data Analysis Scott - - PowerPoint PPT Presentation
Stream Processing for Remote Collaborative Data Analysis Scott Klasky 146 , C. S. Chang 2 , Jong Choi 1 , Michael Churchill 2 , Tahsin Kurc 51 , Manish Parashar 3 , Alex Sim 7 , Matthew Wolf 14 , John Wu 7 1 ORNL, 2 PPPL, 3 Rutgers, 4 GT, 5 SBU, 6
1 ORNL, 2 PPPL, 3 Rutgers, 4GT, 5 SBU, 6UTK, 7 LBNL
computing power
– From 1992 NSF grand challenge – Now a possible Exascale Computing Application – “Much more difficult than original thought” – Couple Codes from different time scales, different physics, different spatial regimes.
– Edge simulation “today” producing about 100 PB in 10 days. – Couple MHD effects, Core effects, …. 1 EB in 2 days. – Need data triage techniques – Separate out information from data in near-real-time
– Large number of workflows for each set of analysis
4
– Lead to the loss of energy from tokamak plasmas – Could damage multi-billion tokamak
– Make more processing power available – Allow more scientists to participate in the data analysis operations and monitor the experiment remotely – Enable scientists to share knowledge and processes
Blobs in fusion reaction (Source: EPSI project) Blob trajectory
study ECE-Image movies of blobby turbulence and instabilities between experimental shots in near real-time.
(~550MB/s, over 300 seconds in the future) + Metadata (experimental setting)
feedback within <15min (inter-shot time)
with ADIOS ICEE method
KSTAR KISTI/ORNL
WAN Transfer (ADIOS) Measure Digitizer Buffer Server
KSTAR Storage
ADIOS (Staging) Movie/physic files
Postech KSTAR
(PPPL, GA, MIT, …)
Feedback to next or next day shot
Data post processing/ Making movies in Parallel
Demo at SC14
Simulation (XGC1) Experiment (NSTX GPI)
simulation (pre/post) and experiment at remote locations
camera images from NSTX and XGC1 edge simulation data
real-time for inter-shot experimental comparison, experiment-simulation validation
processing with ADIOS ICEE method, optimized detection algorithms for near real-time analysis
Podhorszki, J. Choi, S. Klasky, R. Tchoua, J. Lofstead, R. Oldfield, M. Parashar, N. Samatova, K. Schwan, A. Shoshani, M. Wolf,
and lessons of developing leadership class I/O frameworks”, Concurrency and Computation: Practice and Experience, 2013
Data Services I/O Services Core ADIOS components Buffering (generic) BP Posix BP MPI-IO HDF5 serial HDF5 parallel NetCDF4 GRIB2 BPC BP container I/O Aggregation Staging User created output Execution engine spatial aggregation BPC BPC temporal aggregation compression FastBit Indexing Data API Control API
– MPI process, Single diagnostic
“sub-files” are generated
Ensem emble e of
chunks = = file file
– Too much data to move in too little time – Storage sizes/speed doesn’t keep up with making NRT decisions
– From our physics understanding – From our understanding of the entropy in the data – From our understanding of the error in the data – Change data into data = model + information (f = F + df)
– Reconstruct “on-the-fly”
11
HPC System Application P F S Application P F S
Staging nodes
Application P F S
Analytics
Application P F S
Analy tics Viz
HPC System Application hpc system
Analy tics Viz
F S
Experiment
Diagnostic
Analytics
1 2 3 4 5 6
Analytics
hierarchies to optimize overall workflow for power vs. performance tradeoffs
hierarchy access
visualization tasks in a complex system
movement compared to memory movement
– On-same core – On different cores – On different nodes – On different machines – Through the storage system
ADIOS ADIOS
In transit Analysis Visualization
data source (data generator) and client (data consumer)
Wide Area Network (WAN)
Data Generation FastBit Indexing ICEE Server Raw Data Index Analysis Analysis FastBit Query Data Hub (Staging)
Data Source Site Remote Client Sites
Analysis Analysis Data Hub (Staging) Analysis
Data Flow Feedback
https://cug.org/proceedings/cug2014_proceedings/includes/files/pap121-file2.pdf