Data Management in ATLAS
Angelos Molfetas on behalf of the ATLAS DQ2 team
ATLAS DDM COLLABORATION: A. Molfetas (CERN), F. Barreiro (CERN), A. Tykhonov (Jožef Stefan Institute), V. Garonne (CERN), S. Campana (CERN), M. Lassnig
[Figure: data flow between Tier-0, Tier-1, and Tier-2. Labels: online filter farm, reconstruction farm, re-reconstruction farm, analysis farm, Monte Carlo farm. Data types exchanged: RAW, ESD, AOD, MC.]
[Figure: map of clouds and sites. Clouds: FR Cloud, BNL Cloud. Labels: CERN; LYON, BNL, FZK, TRIUMF, ASGC, PIC, SARA, RAL, CNAF, NG; LPC, Tokyo, Beijing, Romania, GRIF, Clermont, LAPP, CCPM, NW, NET2, SW, GL, SLAC, TWT2, Melbourne; T3.]
[Figure: DQ2 architecture. Labels: DQ2 Clients & API; DQ2 Common Modular Framework; Production, Analysis, Interactive, Physics Metadata; WLCG (Open Science Grid, LHC Computing Grid, NorduGrid); Site Services; Central Catalogs (Repository, Content, Location, Accounting, Subscription); Database; Deletion, Transfer, Consistency; Tracer; Data Export.]
¤ Users need to be able to:
  ¤ Download/upload data from the grid
  ¤ Transfer data between sites
¤ Users should not need to know about each storage system
¤ Many different mass storage systems are used; we need a simplified interface that hides the grid's heterogeneity.
  ¤ Not trivial
  ¤ In ATLAS this is done by the DQ2 middleware and abstraction layers like SRM
¤ For example:
  ¤ User downloads a dataset from the CLI: "dq2-get user.angelos.xxxxxxx"
  ¤ No specific knowledge is required about CASTOR, dCache, xrootd, etc.
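The idea of hiding storage heterogeneity behind one interface can be sketched in a few lines of Python. This is only an illustration, not DQ2 code: `StorageBackend`, `InMemoryBackend`, `get_dataset`, the catalog layout, and the file paths are all hypothetical names made up for the example.

```python
from abc import ABC, abstractmethod


class StorageBackend(ABC):
    """Hypothetical common interface over one mass storage system."""

    @abstractmethod
    def fetch(self, path: str) -> bytes:
        """Return the content of the file stored at `path`."""


class InMemoryBackend(StorageBackend):
    """Stand-in for a real system such as CASTOR or dCache."""

    def __init__(self, name, files):
        self.name = name
        self._files = dict(files)

    def fetch(self, path):
        return self._files[path]


def get_dataset(dataset, catalog, backends):
    """Fetch every file of a dataset; the caller never names a backend.

    `catalog` maps a dataset name to a list of (backend name, path) replicas.
    """
    return {path: backends[site].fetch(path) for site, path in catalog[dataset]}


# Two different "storage systems" holding files of the same dataset.
backends = {
    "castor": InMemoryBackend("castor", {"/a/f1": b"raw"}),
    "dcache": InMemoryBackend("dcache", {"/b/f2": b"aod"}),
}
catalog = {"user.angelos.xxxxxxx": [("castor", "/a/f1"), ("dcache", "/b/f2")]}

files = get_dataset("user.angelos.xxxxxxx", catalog, backends)
```

The caller passes only the dataset name, in the same spirit as the dq2-get example: which storage system holds each file is resolved behind the interface.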
¤ Maintain global state of data (central catalog of all datasets on the grid)
  ¤ This has to scale
  ¤ Central point of failure
¤ In ATLAS we also have Local File Catalogs (LFC), which have to be maintained as well.
¤ For example, uploading data to the grid:
  ¤ dq2-put -s files_location user.angelos.xxxxxxx
  ¤ Has to handle different storage systems
  ¤ Has to register files in central catalogs
  ¤ Has to register files in LFC
¤ Not trivial. E.g. the order of operations in dq2-put can create dark data
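Why the order of operations matters can be shown with a toy model. The sketch below assumes that "dark data" means files present on storage that no catalog knows about; the storage is a plain dict, the catalog a plain set, and every function name is hypothetical. It does not reproduce how dq2-put is actually implemented.

```python
class RegistrationError(Exception):
    """Raised when the (simulated) catalog cannot be reached."""


def put_copy_first(name, data, storage, catalog, catalog_up=True):
    """Copy to storage, then register. A catalog failure leaves dark data."""
    storage[name] = data
    if not catalog_up:
        raise RegistrationError("catalog unavailable")
    catalog.add(name)


def put_with_cleanup(name, data, storage, catalog, catalog_up=True):
    """Same order, but the copy is removed again if registration fails."""
    storage[name] = data
    try:
        if not catalog_up:
            raise RegistrationError("catalog unavailable")
        catalog.add(name)
    except RegistrationError:
        del storage[name]
        raise


def dark_data(storage, catalog):
    """Files present on storage but unknown to the catalog."""
    return set(storage) - catalog


# Naive upload: the file is copied, then registration fails.
storage, catalog = {}, set()
try:
    put_copy_first("f1", b"x", storage, catalog, catalog_up=False)
except RegistrationError:
    pass
dark_after_naive = dark_data(storage, catalog)

# Upload with cleanup: the failed registration rolls the copy back.
storage2, catalog2 = {}, set()
try:
    put_with_cleanup("f1", b"x", storage2, catalog2, catalog_up=False)
except RegistrationError:
    pass
dark_after_cleanup = dark_data(storage2, catalog2)
```

In the naive variant the file stays on storage without any catalog entry. Reversing the order would avoid that but risks the opposite problem, a catalog entry pointing at a file that was never written, so some form of rollback or later consistency check is needed either way.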
¤ Primary: mandated by the Computing Model
¤ Secondary: in excess of the Computing Model
¤ Collecting traces
¤ Aggregating traces
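As a rough illustration of the collect-then-aggregate step, the sketch below counts accesses per dataset and per site from a list of trace records. The record fields (`dataset`, `site`, `op`) and the sample values are assumptions made for the example; the slides do not specify the actual trace format.

```python
from collections import Counter

# Hypothetical trace records, one per file access.
traces = [
    {"dataset": "data.A", "site": "CERN", "op": "get"},
    {"dataset": "data.A", "site": "BNL", "op": "get"},
    {"dataset": "data.B", "site": "CERN", "op": "get"},
]


def aggregate(traces, key):
    """Count how many traces share each value of `key`."""
    return Counter(trace[key] for trace in traces)


accesses_per_dataset = aggregate(traces, "dataset")
accesses_per_site = aggregate(traces, "site")
```

Counts of this kind are one plausible input for deciding which replicas are actually used, for instance when choosing which secondary copies could be removed.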
¤ 'srm': 'token:ATLASPRODDISK:srm://lcg-se0.ifh.de:8443/srm/managerv2?SFN=/pnfs/ifh.de/data/atlas/atlasproddisk/'
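The structure of such an 'srm' entry can be made explicit by splitting it into its parts. The helper below is a minimal sketch using only the Python standard library; `parse_srm_entry` and the names of the returned fields are hypothetical, and the "token:<space token>:<SRM URL>" layout is inferred from the single example above.

```python
from urllib.parse import parse_qs, urlsplit


def parse_srm_entry(entry):
    """Split 'token:<space token>:<srm url>' into its components."""
    prefix, space_token, url = entry.split(":", 2)
    if prefix != "token":
        raise ValueError("expected entry to start with 'token:'")
    parts = urlsplit(url)
    # The storage file name (SFN) is carried in the query string.
    sfn = parse_qs(parts.query).get("SFN", [""])[0]
    return {
        "space_token": space_token,
        "host": parts.hostname,
        "port": parts.port,
        "service_path": parts.path,
        "sfn": sfn,
    }


entry = (
    "token:ATLASPRODDISK:srm://lcg-se0.ifh.de:8443/srm/managerv2"
    "?SFN=/pnfs/ifh.de/data/atlas/atlasproddisk/"
)
info = parse_srm_entry(entry)
```

Here `info["space_token"]` holds the space token, `info["host"]` and `info["port"]` identify the SRM endpoint, and `info["sfn"]` is the base path on the storage system.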
[Figure labels: CASTOR, dCache, DPM, StoRM, BeStMan; local access]