towards optimizing large scale data transfers with end to
play

Towards Optimizing Large-Scale Data Transfers with End-to-End - PowerPoint PPT Presentation

Towards Optimizing Large-Scale Data Transfers with End-to-End Integrity Verification Raj Ke'muthu Argonne Na2onal Laboratory & University of Chicago Si Liu, Xian-He Sun, Illinois Ins2tute of Technology Eun-Sung Jung, Jongik University


  1. Towards Optimizing Large-Scale Data Transfers with End-to-End Integrity Verification Raj Ke'muthu Argonne Na2onal Laboratory & University of Chicago Si Liu, Xian-He Sun, Illinois Ins2tute of Technology Eun-Sung Jung, Jongik University

  2. Exploding data volumes 2004: 36 TB 2014: 3,300 TB Astronomy Climate 2020: 100+ EB MACHO et al.: 1 TB Palomar: 3 TB 2MASS: 10 TB GALEX: 30 TB Sloan: 40 TB Pan-STARRS: 40,000 TB Genomics 10 5 increase in data 100,000 TB volumes in 6 years

  3. End-to-end wide-area data transfers Data Transfer Node Data Transfer Node Storage Storage

  4. Pipeline Transfer and Checksum ! # " " ! ' # ' ! & # & … … ! $%" # $%" ! # $ $ Time

  5. Pipelining Data Transfer and End-to-End Data Integrity Check § Pipelining File-level pipelining: overlap a file transfer and a file integrity check – Block-level pipelining: overlap a block transfer and block data integrity – check • Block size is less than the average file size in a dataset § Analytical Modeling t : Transfer 2me of 500MB data c : Checksum 2me of 500MB data • § Enhancing Block-level Pipelining – Based on the analysis, the best performance can be achieved when the data transfer 2me is close to the data checksum 2me – Checksum-Dominant case: reduce the data checksum 2me (Current Work) – Transfer-Dominant case: reduce the transfer 2me (Future Work) 5 11/13/16

  6. Block-level pipelining -- Results § Results on Cooley § Results on Rain 6 11/13/16

  7. Block-level Pipelining – Perfect Pipeline Comparison of the performance of 1-Checksum-Thread and 2-Checksum-Thread on Cooley 7 11/13/16

  8. Questions

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend