real time monitoring of large scientific simulations d e
play

Real-time Monitoring of Large Scientific Simulations D. E. Laney - PowerPoint PPT Presentation

Real-time Monitoring of Large Scientific Simulations D. E. Laney V. Pascucci, R. J. Frank, G. Scorzelli, L. Linsen, B. Hamann, F. Gygi This work was performed under the auspices of the U.S. Department of Energy by the University of California,


  1. Real-time Monitoring of Large Scientific Simulations D. E. Laney V. Pascucci, R. J. Frank, G. Scorzelli, L. Linsen, B. Hamann, F. Gygi This work was performed under the auspices of the U.S. Department of Energy by the University of California, Lawrence Livermore National Laboratory under Contract No. W-7405-Eng-48.

  2. We would like to enable real-time monitoring of a running simulation with a desktop workstation. Turbulent mixing: 2048 ^ 3 Data post-processed for visualization

  3. Two ideas are combined in our system. • Multiresolution re-ordering of the data • Real-time streaming of the reordered data. • Monitoring via visualization of data or derived-data.

  4. A data streaming framework must utilize a heterogeneous computational environment. Data Data Servers Server Data Sources Data Client Data Client (Powerwall Display) (Desktop Display)

  5. A brute force approach: send the data in the order it occurs in the simulation. • Disadvantages • Bad cache coherence will slow down lower dimensional queries (slices, isosurfaces). • Stopping the stream early results in an unusable partial data set. • Requires preprocessing and delay if scalable visualization is required.

  6. We propose a new multiresolution streaming system. • Light weight: • Only requires init() and send() calls. • Not a problem solving framework • Cache-oblivious out-of-core processing and data streaming. • Simple, scalable infrastructure.

  7. Our approach enables a coarse-to-fine construction of multi-resolution models. Jeep run (Francois Gygi) of 64 carbon atoms, tiled to 256^3 (final iteration). Computation on AIX system, reordered data cached on SGI server, viewed on my workstation (200MHz SGI).

  8. We exploit the correlation of bin/quad/oct- trees with the Lebesgue space-filling curves . The Lebesgue curve is also known as Z-order, Morton, …. Curve. Special case of the general definition introduced by Guiseppe Peano in 1890.

  9. Progressive data streams enable multiresolution visualization. • On the fly hierarchical Z- ordering – Embedded preprocessing (a) (b) – Fast computation of hierarchical index – Stream can be truncated (c) (d) coarse data new level data

  10. Our cache oblivious data layout optimizes visualization queries. • Data is laid out by level and broken into blocks. • Simple load balancing on visualization side. • Fast slicing, isosurfacing, and volume rendering.

  11. The data layout generates increasingly local coverage. Distribution in the grid of each constant size block of data 0 1 2 3 4 5 6 7 8

  12. We implement immediate streaming of simulation data to the storage location. Simulation Visualization System Data Source 1 Data Server 1 Data Source 2 Data Server 2 Data Server M Data Source N

  13. We exploit the data decomposition used in the simulation. • Takes advantage of simulation load balancing • One Data Source per B13 B14 B15 B16 compute node B9 B10 B11 B12 • Each compute node B5 B6 B7 B8 connected to all Data Servers B1 B2 B3 B4

  14. Data Servers are buffering and caching components. Data Source 1 Data Server 1 Data Source 2 Data Server 2 Data Server M Data Source N • Cache data and respond to visualization queries • Filter and buffer data to produce multi- layer configurations

  15. We achieve load balancing with a static data decomposition. • No contention • No data duplication • Streaming: – D = I/d mod N – D is Data server, I is HZ index of sample, d is chunk size • Visualization queries: – Choose number of Data Servers wisely �

  16. The load balancing of visualization queries depends on the total number of data servers. 4.5 Max Load / Ideal Load 4 3.5 3 2.5 2 1.5 Optimal 1 0.5 0 0 10 20 30 40 50 60 70 80 90 100 Number of Data Servers

  17. The measured streaming time is small compared to the compute time. Number of Data Sources 1 8 64 Number of Data Servers 1 3 3 Total Domain Size 128 3 256 3 512 3 Equivalent simulation time/time- 270s 736s 4224s step Send Time 5.31s 7.03s 38.6s

  18. A progressive data stream enables visualization on desktop workstations. Progressive refinement (left to right) of the volume rendering of the electron density distribution.

  19. Data servers handle localized queries.

  20. On a desktop machine you can explore the finest resolution data available. ~ 8 billion values

  21. Future Work • Extend to non-power-of-two grid sizes • Extend to more general mesh structures • Include fast compression between data source and data server • Enable multiple layers of data servers to handle larger simulation sizes.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend