SAN DIEGO SUPERCOMPUTER CENTER
An Intelligent Rule-Oriented Data Management System
DataGrid
Wayne Schroeder San Diego Supercomputer Center, University of California San Diego
An Intelligent Rule-Oriented Data Management System Wayne Schroeder - - PowerPoint PPT Presentation
An Intelligent Rule-Oriented Data Management System Wayne Schroeder San Diego Supercomputer Center, University of California San Diego DataGrid SAN DIEGO SUPERCOMPUTER CENTER Talk Outline Background Brief Overview of the SDSC SRB
SAN DIEGO SUPERCOMPUTER CENTER
DataGrid
Wayne Schroeder San Diego Supercomputer Center, University of California San Diego
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
Ask for data
Data delivered
SAN DIEGO SUPERCOMPUTER CENTER
Storage Resource Broker
Storage Resource Broker Metadata Catalog
DB
SAN DIEGO SUPERCOMPUTER CENTER
SRB MCAT
DB
SRB SRB SRB SRB SRB
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
The SRB is an integrated solution which includes:
(mySRB)), and APIs (including C, C++, Java, and Python).
SRB Scales Well (many millions of files, terabytes) Supports Multiple Administrative Domains / MCATs (srbZones) And includes SDSC Matrix: SRB-based data grid workflow management system to create, access and manage workflow process pipelines.
SAN DIEGO SUPERCOMPUTER CENTER
except a string of two quotes in a row
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
BIRN Toolkit Mediator
Viewing/Visualization Queries/Results Applications Data Management
File System MCAT HPSS
Data Model Data Access
Collaboration NMI Grid Management
Globus GridPort Scheduler Distributed Resources Database SRB Database
SAN DIEGO SUPERCOMPUTER CENTER
SRB server SRB agent SRB server
MCAT Read Application in Boston SRB agent
1 2 3 4 6 5 5/6
Logical Name Or Attribute Condition 1.Logical-to-Physical mapping
3.Access & Audit Control Peer-to-peer Brokering Server(s) Spawning Data Access Parallel Data Access
R1 R2 R2
San Diego Durham
SAN DIEGO SUPERCOMPUTER CENTER
Archives
HPSS, ADSM, UniTree, DMF
Databases
DB2, Oracle, Sybase
File Systems
Unix, NT, Mac OSX
Application
C, C++, Linux I/O Unix Shell
Dublin Core Resource, User User Defined Application Meta-data
Remote Proxies
DataCutter
Third-party copy
Java, NT Browsers Web Prolog Predicate
MCAT
HRM
SAN DIEGO SUPERCOMPUTER CENTER
17
Maintain and Adapt SRB to New Usages: SRB has reached a Stable Plateau
Chart New Areas
MCAT1 MCAT2 MCAT3 Server1.1 Server1.2 Server2.1 Server2.2 Server3.1
SAN DIEGO SUPERCOMPUTER CENTER
19
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER
Resources
Client Interface Admin Interface
Metadata Modifier Module Config Modifier Module Rule Modifier Module
Consistency Check Module
Confs Rule Base
Meta Data Base Engine Rule Current State
Rule Invoker
Micro Service Modules
Resource-based Services
Micro Service Modules
Metadata-based Services Service Manager
Consistency Check Module Consistency Check Module
SAN DIEGO SUPERCOMPUTER CENTER
Condition checking, rule firing Backend Processing Micro Services Setup state and interact with RCAT – updates and modifications to persistent state Cleanup state and interact with RCAT – updates and modifications to persistent state Client Operation such as srbObjCreate
(Phase I Operational Model)
23
services
not break higher-level functionalities
sure that collections are self-consistent
consistent with this rule
24
25
ingestInCollection(S) :- /* store & backup */ chkCond1(S), ingest(S), register(S) findBackUpRsrc(S.Coll, R), replicate(S,R). ingestInCollection(S) :- /*store & check */ chkCond2(S),computeClntChkSum(S,C1), ingest(S), register(S), computeSerChkSum(S,C2), checkAndRegisterChkSum(C1,C2,S). ingestInCollection(S) :- /* store, chk, backup & chk */ chkCond3(S),computeClntChkSum(S,C1), ingest(S), register(S), computeSerChkSum(S,C2), checkAndRegisterChkSum(C1,C2,S), findBackUpRsrc(S.Coll, R), replicate(S,R) computeSerChkSum(S,C3), checkAndRegisterChkSum(C2,C3,S). ingestInCollection(S) :- /*store,check, backup & extract metadata */ chkCond4(S),computeClntChkSum(S,C1), ingest(S), register(S), computeSerChkSum(S,C2), checkAndRegisterChkSum(C1,C2,S), findBackUpRsrc(S.Coll, R), [replicate(S,R) || extractRegisterMetadata(S)]. ingestInCollection(S) :- /* just store */ ingest(S), register(S).
chkCond1(S) :- user(S) == ‘adil@cclrc’. chkCond1(S) :- coll(S) like ‘*/scec.sdsc/img/*’. chkCond2(S) :- user(S) == ‘*@nara’. chkCond3(S) :- user(S) == ‘@salk’. chkCond4(S) :- user(S) == ‘@birn’ , datatype(S) == ‘DICOM’. [OprList] implies delay for later
Opr||Opr implies do them in parallel Opr, Opr implies do them serially
SAN DIEGO SUPERCOMPUTER CENTER
SAN DIEGO SUPERCOMPUTER CENTER