EGEE-II INFSO-RI-031688
Enabling Grids for E-sciencE
www.eu-egee.org
EGEE and gLite are registered trademarks
EGEE and Interoperation
Laurence Field CERN-IT-GD ISGC 2008
Overview
– The grid problem definition
Cross-organizational Grids, Intra-organizational Grids, Data Centers, Virtualization, Volunteer Computing, Campus Grids, Clusters, Cloud Computing
Vaporware?
– Independent policies, systems and authentication mechanisms
– Pool the resources
– Split tasks by specialty
– Share common frameworks
Organization B Organization A
– Users have a unique identity but also the identity of the VO
– Place “grid” interfaces at the organizational boundary
– These map the generic “grid” functions/information/credentials to the local security functions/information/credentials
Organization B Organization A Virtual Organization
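The mapping done at the organizational boundary can be illustrated with a minimal sketch. This is not EGEE's implementation: the certificate subject, the VO names, the local account names and the `map_to_local_account` helper are all hypothetical, chosen only to show a generic (user, VO) credential being translated into each organization's own local credential.

```python
# Hypothetical sketch: a "grid" interface at the organizational boundary
# maps a generic (user identity, VO) credential to a local account.
from typing import NamedTuple, Optional


class GridCredential(NamedTuple):
    subject: str  # the user's unique identity (hypothetical certificate subject)
    vo: str       # the Virtual Organization the user is acting for


# Each organization keeps its own independent policy: VO name -> local account.
LOCAL_POLICY_ORG_A = {"physics-vo": "grid_phys", "biomed-vo": "grid_bio"}
LOCAL_POLICY_ORG_B = {"physics-vo": "vo001"}


def map_to_local_account(cred: GridCredential, policy: dict) -> Optional[str]:
    """Return the local account for this credential, or None if the VO is not supported."""
    return policy.get(cred.vo)


user = GridCredential("/O=Example/CN=Alice", "physics-vo")
print(map_to_local_account(user, LOCAL_POLICY_ORG_A))  # grid_phys
print(map_to_local_account(user, LOCAL_POLICY_ORG_B))  # vo001
```

The same credential resolves to a different local account in each organization, which is the point of placing the mapping at the boundary: local policies stay independent.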
interfaces required for building a grid infrastructure which enables resource sharing across administrative domains.
components from multiple software providers, including components from gLite, with the documentation and tools required for deploying this as a production quality service.
methodology used by many Linux distributions; A major baseline release to which updates are continually added.
create the software repositories, which are logically separated by service to allow them to evolve independently.
in both 32- and 64-bit flavors. Availability for other operating systems is a high priority, and the order of priority is driven by demand. http://glite.web.cern.ch/glite/
– 139 partner institutes from over 32 countries
– Providing a service grid infrastructure of ~50000 CPUs and ~5 PB (5 million gigabytes) of disk storage + tape MSS
Distributed across 260+ sites in 48 countries
– Which is available to more than 7500 users
Organized into over 200 Virtual Organizations across 10 application domains
– Who are running more than 190K jobs per day
24 hours-a-day, 7 Days a week, 365 days a year
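To put these figures in perspective, the short calculation below converts them into a few derived rates. It uses only the approximate numbers quoted above and assumes decimal units (1 PB = 10^6 GB); the averages are illustrative and say nothing about how load is actually distributed across sites.

```python
# Approximate figures quoted on the slide.
cpus = 50_000
disk_pb = 5
sites = 260
jobs_per_day = 190_000

disk_gb = disk_pb * 1_000_000            # 1 PB = 10**6 GB in decimal units
jobs_per_second = jobs_per_day / 86_400  # 86,400 seconds in a day
cpus_per_site = cpus / sites             # plain average, not a real distribution

print(disk_gb)                    # 5000000
print(round(jobs_per_second, 1))  # 2.2
print(round(cpus_per_site))       # 192
```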
Batch systems: PBS/Torque, LSF, Condor, Load Leveler, Sun Grid Engine
Interfaces: GRAM v2, GRAM v4, ARC, CREAM, NAREGI, Unicore
Grids: OSG, Nordugrid, Naregi, DEISA, EGEE, Teragrid
– Using different interfaces at the organizational boundary
– Independent policies, systems and authentication mechanisms
– Pool the resources
– Split tasks by specialty
– Share common frameworks
Grid B Grid A Virtual Organization
– Now have multiple “common” interfaces
– Tried to solve one problem, but created another
– The infrastructures were developed independently
Funding based on regions and application domains
– Grid infrastructures are based on different middleware
Experimentation with different approaches Initially there were no standards
– Standards take time to mature
We need to build the infrastructures now!
Good standards require experience
“The ability to exchange information and to use what has been exchanged” (software)
“The use of interoperable systems” (infrastructures)
– Compatibility matrix
– Security
– Information Services
– Job Management
– Data Management
– Monitoring
– Accounting
– Operational links and joint policies
– Trouble ticket systems
– Operational security
                            EGEE        ARC         OSG
Job Submission              GRAM        GridFTP     GRAM
Service Discovery           LDAP/BDII   LDAP/GIIS   LDAP/GIIS
Schema                      GLUE v1.2   ARC         GLUE v1
Storage Transfer Protocol   GridFTP     GridFTP     GridFTP
Storage Control Protocol    SRM         SRM         SRM
Security                    GSI/VOMS    GSI/VOMS    GSI/VOMS
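A compatibility matrix like this one can be checked mechanically. The sketch below encodes the slide's matrix for EGEE, ARC and OSG as a dictionary and reports which functions already share one interface across all three grids and which do not. The assignment of cells to grids is read from the flattened slide text and should be treated as an assumption; `common_interfaces` and `differing_interfaces` are hypothetical helper names.

```python
# Compatibility matrix as read from the slide (cell placement is an assumption).
MATRIX = {
    "Job Submission": {"EGEE": "GRAM", "ARC": "GridFTP", "OSG": "GRAM"},
    "Service Discovery": {"EGEE": "LDAP/BDII", "ARC": "LDAP/GIIS", "OSG": "LDAP/GIIS"},
    "Schema": {"EGEE": "GLUE v1.2", "ARC": "ARC", "OSG": "GLUE v1"},
    "Storage Transfer Protocol": {"EGEE": "GridFTP", "ARC": "GridFTP", "OSG": "GridFTP"},
    "Storage Control Protocol": {"EGEE": "SRM", "ARC": "SRM", "OSG": "SRM"},
    "Security": {"EGEE": "GSI/VOMS", "ARC": "GSI/VOMS", "OSG": "GSI/VOMS"},
}


def common_interfaces(matrix: dict) -> list:
    """Functions for which every grid uses the same interface."""
    return [name for name, by_grid in matrix.items() if len(set(by_grid.values())) == 1]


def differing_interfaces(matrix: dict) -> list:
    """Functions for which at least two grids use different interfaces."""
    return [name for name, by_grid in matrix.items() if len(set(by_grid.values())) > 1]


print(common_interfaces(MATRIX))
# ['Storage Transfer Protocol', 'Storage Control Protocol', 'Security']
print(differing_interfaces(MATRIX))
# ['Job Submission', 'Service Discovery', 'Schema']
```

On this reading, security and storage are already aligned, while job submission, service discovery and the information schema are where interoperation work is needed.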
– Common interfaces
– Standards
– Gateways
– Adaptors and Translators
– Parallel Infrastructures
– The user joins both grids
Uses different clients
– More work for the User
Required for each infrastructure
– Keyhole approach
Restricts functionality
– Method initially used by ATLAS
Split workload between grids
– The site joins both grids
Deploys both interfaces
– User only sees their grid interface
– More work for the site
Can only be supported by large sites
– Used by FZK
Participating in EGEE, Nordugrid and D-grid
– Single point of failure
– Gateway breaks, grid disappears
– Scalability bottleneck
– All the load through one service
Gateway
– The middleware can then work with both interfaces
Useful feature even when using standards!
– Existing service interfaces can still be used
API Plugin Plugin
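The API-plus-plugins arrangement can be sketched as a small adaptor pattern. This is an illustration only, not code from any middleware named in this talk: `JobSubmissionPlugin`, `GridAPlugin`, `GridBPlugin` and the returned identifiers are hypothetical, and a real plugin would call the target grid's native interface instead of building a string.

```python
from abc import ABC, abstractmethod


class JobSubmissionPlugin(ABC):
    """Hypothetical adaptor interface that the middleware codes against."""

    @abstractmethod
    def submit(self, executable: str) -> str:
        """Submit a job and return an identifier for it."""


class GridAPlugin(JobSubmissionPlugin):
    def submit(self, executable: str) -> str:
        # A real plugin would call Grid A's native interface here.
        return f"grid-a:{executable}"


class GridBPlugin(JobSubmissionPlugin):
    def submit(self, executable: str) -> str:
        # A real plugin would call Grid B's native interface here.
        return f"grid-b:{executable}"


# One plugin per interface; existing service interfaces stay untouched.
PLUGINS = {"grid-a": GridAPlugin(), "grid-b": GridBPlugin()}


def submit(target: str, executable: str) -> str:
    """Single API for the caller; the plugin is selected by target infrastructure."""
    return PLUGINS[target].submit(executable)


print(submit("grid-a", "/bin/hostname"))  # grid-a:/bin/hostname
print(submit("grid-b", "/bin/hostname"))  # grid-b:/bin/hostname
```

Adding a standards-based interface later means adding one more plugin, which is why the approach remains useful even once standards exist.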
– Already interoperating since Autumn 2005
– Working on interoperability since Summer 2005
– Anticipated completion May 2008
– Started Summer 2006
– Prototype components available
– Working on interoperability with EGEE since Winter 2006
– Interoperable components available
– Separate project
– See talk in the next session!
– Applications, Security, Job Management
– Information Systems, Data Management
[Diagram labels: Generic Information Provider; Providers for EGEE, OSG, NDGF, Naregi, Teragrid, Pragma; EGEE Site, OSG Site, NDGF Site; Naregi Grid, Teragrid Grid, Pragma Grid; Translators; Glue; GIN BDII; ARC BDII]
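The diagram on this slide appears to show one provider per infrastructure acting as a translator into a common schema. A minimal sketch of that idea follows. Everything in it is hypothetical: the two native record formats, the field names and the common record layout are invented for illustration and are not real Glue or ARC schema attributes.

```python
# Hypothetical sketch: per-grid translators convert native records into one
# common record layout, so a single information index can hold them all.


def translate_grid_x(record: dict) -> dict:
    # Hypothetical Grid X publishes total and used CPUs.
    return {
        "id": record["cluster"],
        "free_cpus": record["cpus_total"] - record["cpus_used"],
    }


def translate_grid_y(record: dict) -> dict:
    # Hypothetical Grid Y publishes free slots directly.
    return {"id": record["host"], "free_cpus": record["free_slots"]}


TRANSLATORS = {"grid-x": translate_grid_x, "grid-y": translate_grid_y}


def aggregate(sources: list) -> list:
    """Translate (grid name, native record) pairs into common records."""
    return [TRANSLATORS[grid](record) for grid, record in sources]


index = aggregate([
    ("grid-x", {"cluster": "ce.x.example.org", "cpus_total": 100, "cpus_used": 40}),
    ("grid-y", {"host": "ce.y.example.org", "free_slots": 25}),
])
print(index)
# [{'id': 'ce.x.example.org', 'free_cpus': 60}, {'id': 'ce.y.example.org', 'free_cpus': 25}]
```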
EGEE OSG Naregi Teragrid Pragma Nordugrid
– From the various interoperation activities
– Critical interfaces at the organizational boundary
Security Information Computing Storage
– Problem constrained within the VO
Choose one solution and somewhere to host it.
– Users belong to a VO and do work on behalf of the VO
Their identity is their experiment, not their institution
– All other standards will inherit from this one
– Already an existing standard ☺
– This has significantly reduced interoperability problems
– Roots of trust, CAs, coordinated by the IGTF
– Groups and roles within a VO
– Capabilities etc.
– Glue Schema created to facilitate interoperation
Currently v1.3
– Now an OGF working group
Draft of v2.0 ready now!
– 55% of grids and 95% of sites provide an LDAP interface
Grids and sites participating in GIN
– Various web service interfaces
These all have problems with large query results
– Supported in most grid infrastructures
Reduced interoperability problems
– Is the proposed interface to storage
– Problems with different interpretations of the specification
– Incompatible implementations
– With a huge amount of effort it has taken 18 months to get right
– An alternative which is widely used.
– JSDL as defined by the OGF
– As many interfaces as batch systems!
– Need to agree on a common interface
OGSA-BES is the current candidate
– V1.0 draft document
– A number of prototypes exist but unproven in production
– CREAM CE and KnowARC CE will implement BES
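For readers unfamiliar with JSDL, the sketch below builds a very small job description with Python's standard XML library. The namespace URIs and element names are written as recalled from the JSDL 1.0 specification and its POSIX application extension; treat them as assumptions to verify against the OGF documents. The `build_jsdl` helper is hypothetical and the output is not schema-validated.

```python
import xml.etree.ElementTree as ET

# Namespaces as recalled from JSDL 1.0 (assumption: verify against the spec).
JSDL = "http://schemas.ggf.org/jsdl/2005/11/jsdl"
POSIX = "http://schemas.ggf.org/jsdl/2005/11/jsdl-posix"

ET.register_namespace("jsdl", JSDL)
ET.register_namespace("jsdl-posix", POSIX)


def build_jsdl(executable: str, arguments: list) -> str:
    """Return a minimal JSDL-style job description as an XML string."""
    job = ET.Element(f"{{{JSDL}}}JobDefinition")
    description = ET.SubElement(job, f"{{{JSDL}}}JobDescription")
    application = ET.SubElement(description, f"{{{JSDL}}}Application")
    posix = ET.SubElement(application, f"{{{POSIX}}}POSIXApplication")
    ET.SubElement(posix, f"{{{POSIX}}}Executable").text = executable
    for argument in arguments:
        ET.SubElement(posix, f"{{{POSIX}}}Argument").text = argument
    return ET.tostring(job, encoding="unicode")


document = build_jsdl("/bin/echo", ["hello"])
print(document)
```

The description language says what to run; a common submission interface such as OGSA-BES would still be needed to say where and how to send it.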
– A second attempt at the original problem
– Most crucially at the site boundary
– The only way forward is real standards
– Production feedback will ensure it works!
– The initial choice only selects the starting point
– But only standards are sustainable in the long term
– What problem are you addressing?
Multi-institutional e-Science Infrastructures
– Grid Interoperation is not!
– Less focus required on specific implementations
– It doesn’t matter what they are as long as we agree
– Existing use cases will ensure the standards work