EUDAT: Towards a pan-European Collaborative Data Infrastructure - - PowerPoint PPT Presentation

eudat towards a pan european collaborative data
SMART_READER_LITE
LIVE PREVIEW

EUDAT: Towards a pan-European Collaborative Data Infrastructure - - PowerPoint PPT Presentation

EUDAT: Towards a pan-European Collaborative Data Infrastructure Federated Identity Management and Access Control Mark van de Sanden SARA, The Netherlands Terena VAMP workshop Utrecht, 6-7 September 2012 Outline Project Core Services


slide-1
SLIDE 1

EUDAT: Towards a pan-European Collaborative Data Infrastructure

Federated Identity Management and Access Control

Mark van de Sanden SARA, The Netherlands Terena VAMP workshop Utrecht, 6-7 September 2012

slide-2
SLIDE 2

Outline

  • Project
  • Core Services
  • AAI Use Case

2

slide-3
SLIDE 3

3

slide-4
SLIDE 4

Data centers and Communities

4

slide-5
SLIDE 5

5

slide-6
SLIDE 6

6

slide-7
SLIDE 7

7

slide-8
SLIDE 8

8

slide-9
SLIDE 9

9

slide-10
SLIDE 10

Communities and Data Centers

What are the basic

10

basic requirements? Which common services are needed?

slide-11
SLIDE 11

11

Dynamic replication to HPC workspace for processing

slide-12
SLIDE 12

Service SR DR MD SSS PID AAI Community CLARIN X + X X + X ENES X X X + X EPOS X X X X

How Services are Shared?

12 EPOS X X X X VPH X X X X LifeWatch X + X + + X

NB: “X”= this service is relevant to this community, “+“ = this community has interest in this service but at a later stage or has a similar service already running in production.

slide-13
SLIDE 13

Example Use Case

Objective: Enable communities to perform (HPC) computations on the replicated data Key benefits: Access to large computing facilities Description: This service will allow the EUDAT communities to dynamically replicate subsets of their data stored in EUDAT to HPC machine workspaces for processing.

EUDAT Storage HPC Facility

CINECA

Community Storage

EPOS

1 3 2

13

processing. Differences with the safe replication scenario:

replicated data are discarded when the analysis application ends; Persistent Identifier (PID) references are not applied to replicated data into HPC workspaces; Users initiate the process of replicating data while in the safe replication scenario data are replicated automatically on a policy basis.

Technologies: GridFTP, Griffin, gTransfer, Globus Online, iRODS

EUDAT Storage HPC Facility

SARA

HPC Facility

PRACE

PID

3 4 2

slide-14
SLIDE 14

EUDAT AAI Use Case

EUDAT is one of the first multi scientific domain project to tackle the data deluge Objective: Provide common data services with a working AAI system in a federated scenario Have to work with many different identity domains: community domains, federated NRENs, e-infrastucture (EGI, PRACE, eduGAIN), local Institutions, OpenID providers, … 14 Potential user base ranges from the current core communities (>10k) to all scientists in EU and beyond. Technologies: Oauth2, OpenID, RADIUS, SAML2, X.509, XACML, etc. Access via Web based, command line, portals and/or via workflows while maintaining access rights and uphold trust and privacy Partners and communities are from across EU countries, have to coop with differences in legislation

slide-15
SLIDE 15

EUDAT Approach

Make use of existing solutions, services and policy frameworks, avoid setting up your own AAI. Distinguish between IdP and AtP providers, whereas AtP are preferably managed by communities. Make use of Credential Conversion or Security

15

Make use of Credential Conversion or Security Token Service technologies, evaluating Contrail, EMI STS and GEMBUS STS Limit the technologies with which the data centers have to coop with, piloting with Shibbolizing services Integration with Community Portals and evaluating the use of Short Lived Certificates. What about homeless and citizen scientists?

slide-16
SLIDE 16

16

sanden@sara.nl