CVMFS for Data Federations
Derek Weitzel University of Nebraska - Lincoln
CVMFS for Data Federations Derek Weitzel University of Nebraska - - - PowerPoint PPT Presentation
CVMFS for Data Federations Derek Weitzel University of Nebraska - Lincoln Problem with Data Federations Users must know the exact filenames for each job. They have to use special tools they are unfamiliar with in order to use it (such
Derek Weitzel University of Nebraska - Lincoln
with in order to use it (such as xrdcp or stashcp).
OSG has already created one StashCache.
in CVMFS developed by him and I have enabled CVMFS’s use in data federations.
HTTP gateways.
OASIS Stratum-1 infrastructure.
in standard CVMFS hashed’ format
server, i.e. a XRootD server.
source at FNAL
accessible storage at OSG-Connect
federation
Nebraska
source at FNAL
accessible storage at OSG-Connect
federation
Nebraska
filesystem at UChicago, recording differences since last scan.
public found on OSG-Connect.
the CVMFS repository server. Data stays on Stash.
elements requires a CMS- or ATLAS-sized commitment.
caches across the country.
service on OSG-Connect.
and read the data from jobs through StashCache
stashcache.github.io
For a full overview of StashCache, see Brian’s talk from last years AHM.
Standard Site Worker Node CVMFS Worker Node CVMFS StashCache StashCache Server StashCache Server StashCache Server HTTP HTTP StashCache Federation CVMFS Repository Server StashCache Redirector Stash Origin Site Metadata Actual Data Files (XrootD) XrootD
Federation
servers over HTTP
federation for the data
the caching servers
Will work fine for smaller sizes, but OASIS may be more efficient for distribution. *Number of unique bytes touched in 24 hours
Online - pick your favorite.
created, and when the it appears in CVMFS.
0% 25% 50% 75% 100% 0.0 0.5 1.0 1.5 2.0 2.5
Delay in Hours Probability of File Existance
Cumulative Distribution of the CVMFS Publish Delay
0% 25% 50% 75% 100% 0.0 0.5 1.0 1.5 2.0 2.5
Delay in Hours Probability of File Existance
Cumulative Distribution of the CVMFS Publish Delay
using services such as Oasis
software into CVMFS
public files
and even namespace visibility
to enable VOMS authentication.
to CVMFS HTTP(S) server.
authentication.
(mod_gridsite)
cannot currently proxy authenticated access.
preview:
widespread availability will probably occur in July.
yum install --enablerepo=osg-upcoming cvmfs cvmfs-config-osg