data caching in the virtual imaging platform
play

Data caching in the Virtual Imaging Platform Tristan Glatard 1 1 - PowerPoint PPT Presentation

Data caching in the Virtual Imaging Platform Tristan Glatard 1 1 Creatis, CNRS, INSERM, Universit de Lyon, France on behalf of the VIP project consortium EGI technical forum, September 2011 vip@creatis.insa-lyon.fr Virtual Imaging Platform


  1. Data caching in the Virtual Imaging Platform Tristan Glatard 1 1 Creatis, CNRS, INSERM, Université de Lyon, France on behalf of the VIP project consortium EGI technical forum, September 2011 vip@creatis.insa-lyon.fr

  2. Virtual Imaging Platform Enabling distributed computing for medical image simulation • Computed Tomography • Magnetic Resonance Imaging Sindbad SIMRI {CEA-Leti} {CREATIS} Brainweb + MS lesions + USPIO XCAT • Positrons Emission Tomography • Ultrasound imaging Field-II PET-Sorteo {Technical Univ. Denmark} {CERMEP} Echocardiography vip@creatis.insa-lyon.fr Zubal + tumors

  3. Integration of simulators ● Workflow descriptions Example for Sindbad - CT Task graphs – Simulation parameters Data dependencies – Biological model (XCAT) ● Motivations Express data parallelism – Automated processing of applications (e.g. in portals) – No modification of the simulator codes – ● Integrated 4 simulators 360 x 5h CPU 5h CPU 8.5 days CPU 25 min CPU Simulated data MRI CT US PET vip@creatis.insa-lyon.fr

  4. Platform outline Local clusters Computing VIP Computing cluster bundle Storage {CREATIS} Grid sites Storage Computing Ordonnanceur Information store {CPPM, LHCb} {VPH Exemplar Project} Local Data Manager Workflows and tools Logs and traces Authentication Workflow engine {MAAT-France} Simulated data {MAAT-France} Object models Web portal {INRIA, UNS, CEA-Leti, CREATIS} {CREATIS} {VIP partners} Workflow editor {collaborations} {UNS} vip@creatis.insa-lyon.fr

  5. Platform usage ● Subset of LSGC, biomed VO 8 Consumed CPU years Other 7 VIP 6 5 GATE 4 3 2 1 Dec. 2010 July 2011 vip@creatis.insa-lyon.fr

  6. Data management problem ● EGI three-tier data management – Logical File Catalog (LFC) – single index space – Storage Elements (SE) – DPM, dCache, STORM, Castor – Challenge: data availability between 80-95% ● Data management in VIP – Users upload input files to process on LFC (web interface) – Platform replicates these files 3 times – Files are cached by (pilot) jobs – Output files are stored on site SE; central SEs as failovers – Job error rates due to data transfer issues: 5-10% vip@creatis.insa-lyon.fr

  7. Local Data Manager ● Principle – Dedicated cache SE, used at failover storage – Periodically tries to replicate its files to grid SEs – Available for users and grid jobs ● Implementation – Overlay of DPM SE – Not published in BDII – Accessed using –no-bdii options of lcg-utils vip@creatis.insa-lyon.fr

  8. Reliable Data Management Use-cases Input download Output upload vip@creatis.insa-lyon.fr www.creatis.insa-lyon.fr/vip 8

  9. Evaluation ● Conditions – EGI, biomed VO (production infrastructure) – Ultrasonic simulation of 128 jobs – Each job has 5 input files + 1 output file – Failure rate: 1% ● Results ● Job data transfers failure rate is almost zeroed when failover dedicated storage is used vip@creatis.insa-lyon.fr

  10. Conclusion ● Developments related to data management in VIP – Web-interface to access LFC – Local cache used as failover SE, periodically tries to replicate its files to grid Ses ● Substantial improvement of job reliability ● Web interface vip@creatis.insa-lyon.fr

  11. Thank you ! ● More information – VIP project website http://www.creatis.insa-lyon.fr/vip – VIP platform: http://vip.creatis.insa-lyon.fr – Development roadmap: http://vip.creatis.insa-lyon.fr:9002 ● Demo and poster at France-Grilles booth (15:30- 16:00) vip@creatis.insa-lyon.fr

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend