Grid Interoperation and Regional Collaboration
Eric Yen ASGC Academia Sinica Taiwan 23 Jan. 2006
Grid Interoperation and Regional Collaboration Eric Yen ASGC - - PowerPoint PPT Presentation
Grid Interoperation and Regional Collaboration Eric Yen ASGC Academia Sinica Taiwan 23 Jan. 2006 Dreams of Grid Computing Global collaboration across administrative domains by sharing of people, resources, data, applications, and
Eric Yen ASGC Academia Sinica Taiwan 23 Jan. 2006
Global collaboration across administrative domains
Production efficiency enables the e-Infrastructure -- a global knowledge
Scalability of the number, robustness & performance
On-demand provisioning Global science needs global grid Bridge the digital divide
All for one and one for all ! Abstraction of system functions, commands, and user actions to
achieve system and installation independence
Retain full administrative autonomy at all participants Natural extension of each grid’s efforts to include more resources in
their own infrastructure
Efficiency through collaboration Facilitating Grid Standardization Other functionality concerns
Discovery - metadata standards for grid resources Site Verification - ongoing / one time / scheduled Job monitoring Define Policies and Rules of Engagement. Security procedures Problem resolution
A service is the logical manifestation of some
Service interaction is facilitated by message
from Atkinson, DeRoure, Dunlop, and Fox et. al., Web Service Grids: An Evolutionary Approach
Success of the Grid infrastructure needs to be
Decompose over the Network -- Ian Foster
Client can integrate dynamically
select & compose services select “best of breed” providers publish result as a new service
Problems
Wide diversity of software platforms used by Grid projects/systems. re-invention of similar services in multiple projects limited sharing of software prototypes between projects
Model the world as a collection of services Resource Descriptions and Aggregation Discovery Composition Adaptation & Evolution Quality of Services: security, performance,
Workflow (lifecycle management) Open Source Implementation
Oliver Keeble
Enabling Grids for E-sciencE
INFSO-RI-508833 7
Component LCG/EGEE OSG ARC Monitoring/IS MDS, LDAP, GLUE, BDII, R-GMA MDS, GLUE, GridCat, MonALISA, LDAP LDAP, ARC schema, MDS Security GSI GSI GSI Software Installation Privileged users, Publish utility All VO members, Flatfile Job submission & description GRAM / JDL GRAM / RSL, Condor-G GridFTP / RSL VO support and management VOMS, LDAP VOMS, GUMS VOMS Data Management GridFTP, LFC, RLS SRM GridFTP, SRM v1.1 GridFTP, RC, RLS SRM v1.1 client
Source: Oliver Keeble, LCG Interoperation
Oliver Keeble
Enabling Grids for E-sciencE
INFSO-RI-508833 9
Source: Oliver Keeble, LCG Interoperation
Oliver Keeble
Enabling Grids for E-sciencE
INFSO-RI-508833 10
– LCG client tools installed as exp software – Site Passes Gstat Grid Status tests – LCG to OSG job submissions working via RB
– SFTs running at the OSG site.
– OSG to LCG job submissions working
Source: Oliver Keeble, LCG Interoperation
Oliver Keeble
Enabling Grids for E-sciencE
INFSO-RI-508833 11
– OSG appears as a single site – 3 OSG sites moving to full interoperability – IS, monitoring, job matches, data transfer – Modified SFTs pass with the exception of Accounting and CAs – Generic Info Provider installed – Dteam supported
– Job submission works – Further work required on OSG monitoring of LCG sites – No standard monitoring interface
Source: Oliver Keeble, LCG Interoperation
Oliver Keeble
Enabling Grids for E-sciencE
INFSO-RI-508833 12
– Requires RB fix
– Harmonisation?
– Common monitoring VO? – GUMS / VOMS versioning
– Adequate logging for audit
– What happen when sites have problems – EGEE has a very proactive operations policy
– Suffer from lack of common interfaces, N² problem – MonALISA/R-GMA interoperation – MIS-CI
– FTS
Source: Oliver Keeble, LCG Interoperation
Oliver Keeble
Enabling Grids for E-sciencE
INFSO-RI-508833 13
31st August 2005, three options were presented.
– LONG TERM - Agree on the interfaces at the site level and work towards producing code that works with these interfaces.
ARC
– MEDIUM TERM - Present these interfaces at the Grid boundary and create a portal that does forwarding and translation. – SHORT TERM - Deploy the LCG and ARC CE in parallel at large sites.
– Document the LCG CE – LCG to ARC job submission – ARC to LCG job submission – Service Discovery / Glue 2
Source: Oliver Keeble, LCG Interoperation
Joint OSG and EGEE Operations Workshop, Culham, September 2005 15
–Short term: Multiple Middlewares at large sites –Medium term: Gateways between grids –Long term: Common Interfaces
Source: Michael Gronager, EGEE/ARC Interoperability Status
Joint OSG and EGEE Operations Workshop, Culham, September 2005 16
Service/component LCG-2, gLite ARC Basis GT2 from VDT GT2 own patch, GT3 pre-WS Data transfer GridFTP, SRM (DPM) GridFTP, SRM v1.1 client Data management EDG RLS, Fireman & Co, LFC RC, RLS, Fireman Information LDAP, GLUE1.1, MDS+BDII, R- GMA LDAP, ARC schema, MDS-GIIS Job description JDL (based on classAds) RSL Job submission Condor-G to GRAM GridFTP VO management VOMS, gLite VOMS, CAS (?) VOMS
Source: Michael Gronager, EGEE/ARC Interoperability Status
Joint OSG and EGEE Operations Workshop, Culham, September 2005 17
–LCG supports submission via Condor-G natively –LCG supports Condor as a queuing system –ARC supports Condor as a queuing system –Cooperation between ARC and Condor led in October 2004 to Condor-G version that can submit jobs to ARC GridFTP (translation from ARC infosystem schema to GLUE was developed by Rod Walker). Was meant to be used by LCG – but nobody configured an RB this way yet
interface?
Source: Michael Gronager, EGEE/ARC Interoperability Status
Joint OSG and EGEE Operations Workshop, Culham, September 2005 18
ARC
–Setup an LCG CE using Condor LRMS –Setup Condor-G queue to submit to ARC
LCG
–Setup an ARC CE using Condor LRMS –Setup Condor-G queue to submit to LCG
Source: Michael Gronager, EGEE/ARC Interoperability Status
Joint OSG and EGEE Operations Workshop, Culham, September 2005 19
jobs:
–Each new queued job spawns a process on the gate keeper, which regularly executes a Perl script –Does not perform for more than 400 jobs
–Problems with caching –Schema not complete
Source: Michael Gronager, EGEE/ARC Interoperability Status
last update 01/11/06 04:29 AM
LCG
LCG and EGEE Grid Sites in the Asia-Pacific Region
4 LCG sites in Taiwan 12 LCG sites in Asia/ Pacific Academia Sinica Grid Centre
Computing Grid (LCG)
for LCG and EGEE
Asia/Pacific Federation in EGEE
LCG site
PAEC NCP Islamabad IHEP Beijing KNU Daegu
GOG Singapore KEK Tsukuba ICEPP Tokyo Taipei - ASGC, IPAS NTU, NCU VECC Kolkata Tata Inst. Mumbai
AP Federation now shares the e-Infrastructure with WLCG
2005/12/16 Simon C. Lin / ASGC
support of the European Research Area (ERA)”
2005/12/16 Simon C. Lin / ASGC
VO Services: deployed from April 2005 in Taiwan (APROC) LCG: ATLAS, CMS BioInformatics, BioMed Geant4 APeSci : for collaboration general e-Science services in Asia
Pacific Areas
APDG: for testing and testbed only TWGRID: established for local services in Taiwan Potential Applications LCG, Belle, nano, biomed, digital archive, earthquake, GeoGrid, astronomy, Atmospheric Science
2005/12/16 Simon C. Lin / ASGC
Production CA Services AP CIC/ROC VO Support Pre-production site User Support MW and technology development Application Development Education and Training Promotion and Outreach Scientific Linux Mirroring and Services
Objectives
Building Grid Infrastructure by Application-led projects Ensure to be scalable in number, robustness & performance of
services and sites
Protect the regional investments in Grid MW components and
evolve continuously
Approaches: Application-driven
Converge to a simple, well-defined service-oriented architecture Capture generic middleware infrastructure components Construct a middleware repository for re-engineering, integrative
testing and interoperability insurance.
Adaption to Web Services Architecture Close collaboration with other major Grids projects and
standardization organizations in the world
Capturing generic middleware services from application
requirements --> closely interaction with application communitites to constuct effective science services
Simplify construction by decomposing content, function and
resource
Construct a middleware repository for re-engineering,
integrative testing and interoperability insurance.
Open Source Model provides a new kind of knowledge- and
community-building infrastructure
Diversity is the norm and healthy, but collaboration is
essential on a worldwide scale