
SLIDE 1
  • D. Ciangottini - Distributed and on-demand cache for CMS experiment at LHC - IEEE eScience 2018

Distributed and on-demand cache for CMS experiment at LHC

Diego Ciangottini

  • On behalf of the CMS Collaboration and the INFN-Cache team


29 October - 1 November 2018 Amsterdam, the Netherlands

SLIDE 2

Outline

  • Introduction
  • 2 scenarios of evaluation

    ○ cache on ephemeral storage for opportunistic resources
    ○ geo-distributed cache with unmanaged storage

  • Performance results
  • Conclusion and future activities


SLIDE 3

CMS current model in a nutshell

  • Hierarchical, centrally managed storage at computing sites (Tiers)
  • Payloads run at the site that stores the requested data
  • Remote data access is already technically supported
    ○ fallback to remote reads in case of local read failure
    ○ overflow of jobs to nearby sites

SLIDE 4

Extension: dynamic resource provisioning

Computing resources are opportunistically deployed on cloud/HPC resources

  • local storage is not necessarily available
    ○ remote read latency
    ○ inefficient I/O

Introducing a cache may offer:

  • ephemeral storage for hot data near the computing provider
  • optimized WAN access, used only for data not already in the cache

[Figure: on-demand cache resources deployed next to cloud/HPC computing — Scenario 1]

SLIDE 5

Cache layer in data-lake for HL-LHC

A few worldwide custodial centers hold data replicas managed by the experiment

  • Computing Tiers access data directly from the closest custodial center

Using caches in a Content Delivery Network approach:

  • geo-distributed network of unmanaged storages
  • common namespace (no data replication)
  • fewer requests reaching the custodial sites

[Figure: custodial data centers behind a distributed cache serving HPC, Tier-2 and Tier-3 sites — Scenario 2]

SLIDE 6

Technology: XCache evaluation

Two scenarios for evaluation:

  • cache on ephemeral storage for opportunistic resources
  • geo-distributed cache with unmanaged storage

XCache has been used in both activities:

  • Part of the XRootD technology already widely used in WLCG for federating storages
    ○ storage resources are accessible for any data, anywhere, at any time (AAA)
    ○ the XRootD infrastructure spans all Tier-1 and Tier-2 sites in EU and US CMS

SLIDE 7


XCache mechanics

Open file (client request against the file cache / storage federation):

  1. Cold cache (miss): remote open through the storage federation
  2. Warm cache (hit): open the file on local disk

Note: a remote open is only initiated if/when a requested block is not available in the cache.

Read file:

  1. If the block is in RAM or on disk ➞ serve it from RAM/disk
  2. Otherwise request the data from remote and
     a. serve it to the client
     b. write it to disk via a write queue (so the data remains in RAM until written to disk)
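The open/read flow above can be sketched in Python. The class and all names below are illustrative only (the real XCache is a C++ XRootD plugin), but the hit/miss and write-queue logic follows the steps just described:

```python
from queue import Queue

class BlockCacheSketch:
    """Toy model of the XCache read path described above (not the real API)."""

    def __init__(self):
        self.ram = {}             # blocks still in RAM, pending the disk write
        self.disk = {}            # blocks already persisted to local disk
        self.write_queue = Queue()

    def read_block(self, block_id, fetch_remote):
        # 1. Hit: serve from RAM or local disk.
        if block_id in self.ram:
            return self.ram[block_id]
        if block_id in self.disk:
            return self.disk[block_id]
        # 2. Miss: fetch from the remote federation, then
        #    a. serve it to the client immediately, and
        #    b. enqueue it for the disk writer; the block stays
        #       in RAM until it has been written to disk.
        data = fetch_remote(block_id)
        self.ram[block_id] = data
        self.write_queue.put(block_id)
        return data

    def drain_write_queue(self):
        # Background writer: move queued blocks from RAM to disk.
        while not self.write_queue.empty():
            block_id = self.write_queue.get()
            self.disk[block_id] = self.ram.pop(block_id)
```

Note how a warm read never touches the remote side: once a block is in RAM or on disk, `fetch_remote` is not called again.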

SLIDE 8


Clustering with xrootd cache redirector

[Figure: clients contact an XRootD cache redirector that federates several caches; the caches in turn read from the storages through an XRootD storage redirector]

  • Through XRootD redirection it is possible to federate caches in a content-aware manner
    ○ the client is redirected to the cache that actually has the file on disk
  • Load balancing: if no cache has the requested file, a round-robin selection among the cache servers is used (configurable)
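The selection policy above can be modeled in a few lines. This is a toy sketch with hypothetical names (the real logic lives in the XRootD redirector daemons): prefer a cache that already holds the file, otherwise fall back to round robin.

```python
import itertools

class CacheRedirectorSketch:
    """Toy model of the cache-redirector policy described above."""

    def __init__(self, caches):
        # caches: dict mapping cache name -> set of files it holds on disk
        self.caches = caches
        self._round_robin = itertools.cycle(sorted(caches))

    def select(self, filename):
        # Content-aware step: redirect to a cache that has the file on disk.
        for name, files in sorted(self.caches.items()):
            if filename in files:
                return name
        # Fallback: no cache has the file, pick the next cache round-robin.
        return next(self._round_robin)
```

A miss therefore still lands on exactly one cache, which will pull the file in and serve later requests for it content-aware.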

SLIDE 9

Cache for opportunistic resources

[Figure: worker nodes on opportunistic resources read through a cache redirector and disk proxy caches (RAM + disk), which fetch misses from the remote CMS AAA federation — Scenario 1]

When computing on opportunistic resources, the remote data access pattern can be improved by providing:

  • an on-demand cache layer near the CPU resources (same cloud provider)
    ○ scaling horizontally
    ○ caches managed in a content-aware manner
      ■ the client is redirected to the cache that currently has the file on disk

SLIDE 10


Testing with CMS workflows

  • Real CMS analysis workflows on cloud resources (2 volunteer users)
    ○ 2k jobs @OpenTelekomCloud (OTC)
    ○ ~150k user jobs completed reading from a standalone cache cluster deployed at OTC
  • DODAS (*) has been used for:
    ○ the same setup configuration on different cloud providers
    ○ automated deployment through:
      ■ Ansible for the infrastructure
      ■ K8s or Mesos/Marathon for container orchestration

[Figure: cloud resource provider running an opportunistic CMS startd service, an opportunistic cache service (redirector + XCaches on the WNs) and an opportunistic storage service (Ceph/HDFS/IOVolumes/?), backed by the WLCG XRootD Federation — Scenario 1]

(*) https://dodas-ts.github.io/dodas-doc/

SLIDE 11

Results

All in all, good performance:

  • partial healing of high-latency remote access failures (timeouts)
  • local-like performance when a cache hit occurs
  • on-demand deployment recipes and easy maintenance

Automated deployment through:

  • Ansible
  • K8s (soon also via Helm)
  • Mesos/Marathon

https://cloud-pg.github.io/CachingOnDemand/

[Figure: average CPU efficiency — cache hits reach the local-read reference, failures appear only for high-latency remote reads, and no cache overhead is observed — Scenario 1]

SLIDE 12


Distributed testbed deployment

  • Deployment of a geo-distributed cache:
    ○ clients contact the cache redirector
    ○ the redirector steers the client to
      ■ the cache that actually has the file on disk
      ■ a round-robin selection of cache servers, if no cache has the requested file
  • Network of unmanaged storages for hot data
  • A one-line configuration tweak on the computing resources seamlessly integrates the distributed cache into CMS workflows

[Figure: clients contact a cache redirector federating XCache servers at T2_IT_Bari and CNAF, backed by the WLCG XRootD Federation — Scenario 2]

SLIDE 13

Distributed testbed deployment: setup

Current functional test setup:

  • A CNAF XCache redirector federating 2 servers:
    ○ CNAF XCache server (5 TB)
    ○ T2 Bari XCache server (10 TB)
  • Part of the CMS analysis workflows is redirected to the national redirector
    ○ based on the requested dataset name
  • 2 more sites (the Tier-2s at Pisa and Legnaro) are planning to join the testbed

Scenario 2
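The dataset-name-based steering can be sketched as follows. All endpoint URLs and dataset patterns here are made-up placeholders, not the actual testbed configuration:

```python
def route_read(dataset, cache_redirector, default_redirector, cached_patterns):
    """Steer only part of the analysis workflows to the national cache
    redirector, based on the requested dataset name; everything else
    follows the usual federation path. (Illustrative sketch only.)"""
    if any(pattern in dataset for pattern in cached_patterns):
        return cache_redirector       # testbed caches (e.g. CNAF + Bari)
    return default_redirector         # usual AAA federation path

# Hypothetical endpoints and pattern, for illustration:
endpoint = route_read(
    "/SingleMuon/Run2018A-PromptReco/MINIAOD",
    cache_redirector="root://xcache-redirector.example.infn.it/",
    default_redirector="root://xrd-global.example.cern.ch/",
    cached_patterns=("/SingleMuon/",),
)
```

Routing on the dataset name keeps the change client-side and incremental: only the selected datasets exercise the cache testbed, while all other reads are untouched.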

SLIDE 14


Italian XCache federation: functional checks

  • Test tasks submitted to T2_IT_Bari with an empty cache
  • Comparing jobs running at Bari (pointing to the cache) with “Ignore locality” ones at other sites
    ○ average job CPU efficiency: Bari → cache; Pisa → no cache; Legnaro → no cache

No penalty in CPU efficiency with an empty cache: the performance of jobs reading from an empty cache is comparable with remote reading.

Scenario 2

SLIDE 15

Conclusions and plans

  • The two analyzed scenarios have been presented:
    ○ cache for dynamic resources
    ○ distributed cache layer for the HL-LHC data-lake model
  • The performance evaluation motivates further activities
    ○ on-demand deployment and easy maintenance
    ○ partial healing of high-latency remote access failures
      ■ no penalty in case of an empty cache
      ■ local-like performance when a hit occurs

Work in progress:

  • evaluate cache benefits within the CMS computing model through simulation
  • smart (ML-based) data fetching and request routing based on real-time and historical information
  • deployment in production @INFN

In the context of the DOMA-Access WG

SLIDE 16

Thank you


SLIDE 17

Backup

