3/27/07 ISGC2007: OSG 1
Open Science Grid
Ruth Pordes Fermilab V2
Open Science Grid Ruth Pordes Fermilab V2 3/27/07 ISGC2007: OSG - - PowerPoint PPT Presentation
Open Science Grid Ruth Pordes Fermilab V2 3/27/07 ISGC2007: OSG 1 First of All I am in Taipai in spirit today but not able to be there in person. A big thank you to Simon and Vicky for enabling me to give my talk remotely And
3/27/07 ISGC2007: OSG 1
Ruth Pordes Fermilab V2
3/27/07 ISGC2007: OSG 2
3/27/07 ISGC2007: OSG 3
3/27/07 ISGC2007: OSG 4
People and Organizations as members & partners of a Consortium Consortium with Common Goals Common Goals to support Computer Based Research over wide and local areas of distribution.
Includes Facilities, Experiments, Computer Scientists, Grid organizations etc.
− − Global in Reach: Currently include Taiwan, South Africa, South Global in Reach: Currently include Taiwan, South Africa, South America, Australia, UK America, Australia, UK
Contributor to the vision of Cyberinfrastructure Cyberinfrastructure “.. a cultural community that .. enables distributed knowledge communities .. collaborate and communicate across disciplines, distances and cultures.”
Dr Arden Bement, Director of National Science Foundation in “Cyberinfrastructure Vision for 21st Century Science.
− − Global in Scope: NSF actively discussing how to increasingly Global in Scope: NSF actively discussing how to increasingly sponsor international partnerships. sponsor international partnerships.
endeavor”
From Wikipedia definition of Consortium
3/27/07 ISGC2007: OSG 5
deliver to the goals of the Consortium.
SciDAC-2 program and the National Science Foundation. − − International components for International Science Grid This International components for International Science Grid This Week online newsletter, and outreach and educational Week online newsletter, and outreach and educational activities (currently to Scandinavia - IceCube - and South activities (currently to Scandinavia - IceCube - and South Africa - ATLAS). Africa - ATLAS).
University of Iowa University of Chicago/Argonne National Laboratory University of Florida University of Wisconsin, Madison University of California, San Diego Stanford Linear Accelerator Center Rennaisance Computing Institute Lawrence Berkeley National Laboratory Indiana University Fermi National Accelerator Laboratory Cornell University Columbia University California Institute of Techology Brookhaven National Laboratory Boston University
3/27/07 ISGC2007: OSG 6
3/27/07 ISGC2007: OSG 7
Secure Production Quality Common Distributed Infrastructure Distributed Infrastructure providing access to & sharing of access to & sharing of OSG Consortium members and partners computational and storage computational and storage resources resources over production and research networks.
A reference distribution of common technologies for high throughput distributed computing throughput distributed computing -- the Virtual Data Toolkit (VDT) Virtual Data Toolkit (VDT)
and which is built, tested, integrated, packaged and supported.
− External projects that develop the software and/or we rely on and/or we deliver to are part of the OSG management structure.
Support for Distributed Communities -- of resource owners, science and research users, educators and students, technology developers -
to use and evolve the use and evolve the system. system.
− Including a multi-site integration testbed for bringing new software and services into production use.
3/27/07 ISGC2007: OSG 8
Petascale Science (CEDPS)
(CDIGS)/Globus
University Network(DISUN)
Information Frontier
the WLCG
Computing (APAC)
Network (DISUN)
University
Iowa (GROW)
Education (TIGRE)
Computing)
Collaboration (WLCG)
3/27/07 ISGC2007: OSG 9
3/27/07 ISGC2007: OSG 10
100 Resources across production & integration infrastructures !ncrease in ~15 in last 6 months 27 Virtual Organizations (+ 3 operations VOs) 25% non-physics. ~20,000 cores (from 30 to 4000 cores per cluster) ~6 PB accessible Tapes ~4 PB Shared Disk Sustaining through OSG submissions: Measuring ~180K CPUhours/day. ~Factor of 50% more (being measured) in last 6 months. Using production & research networks
3/27/07 ISGC2007: OSG 11
Courtesy: Frank Wuerthwein
3/27/07 ISGC2007: OSG 12
Infrastructure Applications
Core grid technology distributions:
Condor, Globus, Myproxy: shared with TeraGrid and
Virtual Data Toolkit (VDT)
core technologies + software needed by stakeholders:many components shared with EGEE
OSG Release Cache: OSG specific configurations, utilities etc.
HEP Data and workflow management etc
Biology Portals, databases etc
User Science Codes and Interfaces Existing Farms, Storage, Networks
Astrophysics Data replication etc
3/27/07 ISGC2007: OSG 13
Infrastructure Applications
Core grid technology distributions:
Condor, Globus, Myproxy: shared with TeraGrid and
Virtual Data Toolkit (VDT)
core technologies + software needed by stakeholders:many components shared with EGEE
OSG Release Cache: OSG specific configurations, utilities etc.
HEP Data and workflow management etc
Biology Portals, databases etc
User Science Codes and Interfaces Existing Farms, Storage, Networks
Astrophysics Data replication etc
3/27/07 ISGC2007: OSG 14
Infrastructure Applications
Core grid technology distributions:
Condor, Globus, Myproxy: shared with TeraGrid and
Virtual Data Toolkit (VDT)
core technologies + software needed by stakeholders:many components shared with EGEE
OSG Release Cache: OSG specific configurations, utilities etc.
HEP Data and workflow management etc
Biology Portals, databases etc
User Science Codes and Interfaces Existing Farms, Storage, Networks
Astrophysics Data replication etc
3/27/07 ISGC2007: OSG 15
3/27/07 ISGC2007: OSG 16
3/27/07 ISGC2007: OSG 17
(project) and Council (consortium).
experiments for that Collaboration.
and meets deliverables defined by those projects.
3/27/07 ISGC2007: OSG 18
3/27/07 ISGC2007: OSG 19
Increased need for and scope of Agreements Awareness training and tracking ~50 page Risk Assessment ~30 item Security Plan Comprehensive set of Examination Audits and Reviews. TeraGrid testing Shibboleth -> X509. OSG watching until integrations done by NCSA, EGEE and end-to-end deployments on TeraGrid work.
3/27/07 ISGC2007: OSG 20
tests) Validation Probes Validation Probes, add framework for Site Control of Site Control of Execution Execution, report to WLCG availability repository: June 2007.
− Yes, working with/as part of of the WLCG Monitoring Group,
CPU accounting now at 38 production sites. Already used to spot problems (lack of use, too much use?)
− Also measure of measure of “ “sharing sharing” ”: : use of CPU by VOs other than those that Own the resource. − Interfacing form “pull-/glide-in” mode jobs.
dCache transport accounting available in production - only just starting to gather information.
− Expect to also collect GridFTP storage elements transport accounting.
3/27/07 ISGC2007: OSG 21
mapping of identification certificate role attribute to account UID.
− Preemption allowed but causes problems for many applications. − Will soon need agreements between Sites and VOs. − OSG VO has allocations for the Consortium (e.g. LBNL) and for these decides on priorities and expectations.
role attribute to access control and root path to data.
− Still no good test of interoperation with EGEE.
system allocations. In the future will support reservation and release through SRM.
3/27/07 ISGC2007: OSG 22
− External projects that develop the software and/or we rely on and/or we deliver to are part of the OSG management structure.
3/27/07 ISGC2007: OSG 23
TeraGrid, EGEE, OSG, APAC, NWICG and others.
VDT 1.6.0 and OSG 0.6.0 released and being installed on OSG.
− Includes SRM/dCache distribution, now installed at ~5 sites. − EGEE CEMON used to publish GLUE information to OSG-BDII. − Class-Ad based resource selector available. − Integration and provisioning of OSG release took 2-3 months.
VDT 1.8.0 for release in June list (OSG 0.8.0 for August) and priorities in discussion.
− Build subset collections for EGEE − Add VOMRS, glEXEC, SRM V2.2
− Discussions underway to build subset of VDT under ROCKS for PRAGMA distribution. − Looking to make better use of NMI Build and Test (Metronome) and commonality with ETICS.
3/27/07 ISGC2007: OSG 24
− Accounting, Authz are external projects. − Condor, Globus, dCache are external projects. − EGEE develops software that OSG is interested in. − Etc.
3/27/07 ISGC2007: OSG 25
− Including a multi-site integration testbed for bringing new software and services into production use.
3/27/07 ISGC2007: OSG 26
production on OSG.
to test their legacy applications on OSG. No large production runs recently though.
under ATLAS PANDA framework.
ITB for deliverable from LBL in late summer.
− Will also look at use cases where Globus Auditing service might benefit .
− Resulting in enhancements to prevent overload and better error messages.
3/27/07 ISGC2007: OSG 27
3/27/07 ISGC2007: OSG 28
3/27/07 ISGC2007: OSG 29
− Planning more site administrators training.
3/27/07 ISGC2007: OSG 30