Shibbolized iRODS (and why it matters) 3 rd TERENA Storage Meeting, - - PowerPoint PPT Presentation

shibbolized irods and why it matters
SMART_READER_LITE
LIVE PREVIEW

Shibbolized iRODS (and why it matters) 3 rd TERENA Storage Meeting, - - PowerPoint PPT Presentation

Shibbolized iRODS (and why it matters) 3 rd TERENA Storage Meeting, Dublin, February 12 th -13 th 2009 David Corney, for Jens Jensen, e-Science centre, Rutherford Appleton Lab, UK Overview Introduction A shared e-infrastructure


slide-1
SLIDE 1

3rd TERENA Storage Meeting, Dublin, February 12th -13th 2009 David Corney, for Jens Jensen, e-Science centre, Rutherford Appleton Lab, UK

“Shibbolized iRODS” (and why it matters)

slide-2
SLIDE 2

Overview

Introduction A shared e-infrastructure – current status One area of development: ASPiS

slide-3
SLIDE 3

David Corney CISB07 Nov 19th 2007 3

About STFC...

The Science and Technology Facilities Council (UK) Created on April 1, 2007 (1 of 7 UK research Councils) Responsible for: – fundamental research in particle physics, nuclear physics, astronomy, space – major UK facilities for the physical and life sciences

  • synchrotrons, light sources, lasers, neutrons

– national laboratories at RAL, Daresbury, UKATC – international science projects

  • CERN, ESO, ESA, ILL, ESRF…

Over 2000 staff and an annual budget of over £700M

slide-4
SLIDE 4

Tier Structure

Tier 0 Tier 1

National centres

Tier 2

Regional groups Institutes Workstations Offline farm Online system

CERN computer centre

RAL,UK ScotGrid NorthGrid SouthGridLondon France Italy Germany USA Glasgow EdinburghDurham

Useful model for Particle Physics but not necessary for others

slide-5
SLIDE 5

Tier-1 Hardware

CPU Power (Reconstruction, Simulation, User Analysis etc). 600 systems, 1250 cores, 1500 KSI2K 'Tape' Storage – Long Term retention – write once – read several times a year – 1PB in SL8500 robot + 12 drives Disk Storage (Frequently Accessed)

138 Servers, 3200 drives, 750TB

Currently about 45 racks – with a further 25 due to arrive for Xmass

slide-6
SLIDE 6

6

Rutherford Appleton Laboratory

slide-7
SLIDE 7

EDNS - European Data Infrastructure for Neutron and

Science driver - enabling better science

Neutron diffraction X-ray diffraction NMR High-quality structure refinement

slide-8
SLIDE 8

e-Infrastructure – Access to Multiple Facilities

iCat

SNS - ORNL ISIS – TS1 + 2 DLS CLF ANSTO - Australia

slide-9
SLIDE 9

EDNS - European Data Infrastructure for Neutron and

Technology Driver – integration and interoperation

Single Infrastructure Single User Experience

Capacity Storage Publications Repositories Data Repositories Software Repositories

Raw Data Catalogue Data Analysi s Analysed Data Catalogue Publication Data Catalogue Publication s Catalogue Raw Data Data Analysi s Analyse d Data Publicatio n Data Publicatio ns

Facility 1

Raw Data Data Analysi s Analyse d Data Publicatio n Data Publicatio ns

Facility 2

Raw Data Data Analysi s Analyse d Data Publicatio n Data Publicatio ns

Facility 3

Different Infrastructures Different User Experiences

slide-10
SLIDE 10

Underlying Data Infrastructure

Online Proposal System

User Office System:

User Database Scheduling Health and Safety Proposal Management Metadata Catalogue Data Acquisition System Storage Management System DataAccessPortal Single Sign On Account Creation and Management

ICAT Software Suite, providing the crucial integration of key functions.

slide-11
SLIDE 11

David Corney CISB07 Nov 19th 2007 11

BBSRC Archive system

All (12) Institutes of the BBSRC All (12) Institutes of the BBSRC 6000 scientists across the UK 6000 scientists across the UK 50 TB storage capacity (currently) 50 TB storage capacity (currently) 10 year SLA agreed 10 year SLA agreed

slide-12
SLIDE 12

David Corney CISB07 Nov 19th 2007 12

Data Archive/Management Services

High Energy Physics Experiments (CMS, Atlas, LHcb, Alice, H1,...) ISIS (Neutron Muon Source) Diamond Light Source British Atmospheric Data Centre EISCAT (Radar research) National Earth Observation Data Centre BBSRC archive Solar Physics World Data Centre CICT (Standard IT backups) Central Laser Facility National Crystallography Service, University of Southampton Hartley Library, Southampton University WASP, VIRGO Consortium SOLAR-B (Hinode)

slide-13
SLIDE 13

Data Policy

  • Data Policy (ISIS)

– 3 year embargo on data (+1 if requested) – Commercial data is never made public – Instrument Scientists can access all data from their beamline – Calibration data is public – Any data that involves IPR (e.g. analysed) is private for perpetuity unless explicitly shared by user

  • Automatic Enforcement of policy
  • A research area
slide-14
SLIDE 14

EDNP

European Data Infrastructure for Neutron and Photon Sources

Combining European Neutron and Synchrotron Facilities Already a common user community Across many disciplines – Materials, chemistry, proteomics, pharmaceuticals, nuclear physics, archaeology …

ESRF

slide-15
SLIDE 15

The ASPiS project

Jens Jensen, STFC via David Corney, STFC Terena Storage TF, Dublin February 2009

slide-16
SLIDE 16

ASPiS: people

  • M Hedges, E Liao, T Blanke, CeRCH KCL
  • A Weise, Reading
  • A Hasan, Liverpool
  • J Jensen, R Downing, STFC
slide-17
SLIDE 17

ASPiS

  • iRODS as datastore
  • SSO login via Shibboleth
  • PERMIS access control policy
  • Provenance metadata in PASOA
  • Funded by JISC
slide-18
SLIDE 18

iRODS iRODS PASOA PASOA Shib service Shib service PERMIS PDP PERMIS PDP Disk Disk Apache Apache User

slide-19
SLIDE 19

Shib login

So what does it do?

  • Single password
  • Password managed by home institution
  • S.E.P.
  • Home institution provides attrs
  • ASPiS can use these for access control
  • And for provenance
slide-20
SLIDE 20

Shibboleth login

Home Inst. Home Inst.

iRODS iRODS

slide-21
SLIDE 21

iRODS

  • Rule Engine to manage data workflow
  • Microservices calling out to ext’l

services

  • No changes to iRODS itself
  • Improves maintenance
slide-22
SLIDE 22

Log attrs Log attrs Access Ctrl Access Ctrl Update metadata Update metadata PASOA PASOA PERMIS PDP PERMIS PDP Branch on file type Branch on file type Document metadata Document metadata Image metadata Image metadata Rule Engine iRODS Example Rule workflow

slide-23
SLIDE 23

UK Access Managemen t Federation (Shibboleth) UK Access Managemen t Federation (Shibboleth) STFC iRODS STFC iRODS

Reading

iRODS

Reading

iRODS King’s iRODS King’s iRODS

ASPiS iRODS Federation

Two Federations

slide-24
SLIDE 24

Target Users

  • 1. Arts and Humanities
  • 2. STFC facilities

– Was Diamond Light Source (no IdP) – Now ISIS Neutron Source

  • 3. SRB users on the National Grid

Service

slide-25
SLIDE 25

Timescale

Project start 01 Apr 2008 Today 31 June 2009

slide-26
SLIDE 26

Questions

Thanks for your attention

  • and to David for giving the presentation

For questions, please contact j dot jensen dot ral at googlemail dot com