Using iRODS to manage, share and publish research data Ton Smeele - - PowerPoint PPT Presentation

using irods to manage share
SMART_READER_LITE
LIVE PREVIEW

Using iRODS to manage, share and publish research data Ton Smeele - - PowerPoint PPT Presentation

Using iRODS to manage, share and publish research data Ton Smeele & Lazlo Westerhof ITS/ResearchIT, Utrecht University ITS Research IT Agenda Profile Utrecht University Yoda introduction and concepts Demonstration Challenges, issues


slide-1
SLIDE 1

Using iRODS to manage, share and publish research data

Ton Smeele & Lazlo Westerhof ITS/ResearchIT, Utrecht University

slide-2
SLIDE 2

ITS – Research IT

Agenda

Profile Utrecht University Yoda introduction and concepts Demonstration Challenges, issues & lessons learned

slide-3
SLIDE 3

ITS – Research IT

Organisation & people

  • Incl. faculties

Medicine

ESTABLISHED

1636

550

PROFESSORS

7+2

FACULTIES

teaching institutes

6,960

STAFF-MEMBERS

30,523

STUDENTS

slide-4
SLIDE 4

ITS – Research IT

Top ranking

NOBEL PRIZES SPINOZA PRIZES SHANGHAI RANKING 2017

12 15

The Netherlands

1

Europe

13

World

47

slide-5
SLIDE 5

ITS – Research IT

4 Strategic themes - focused research

  • Integrating Utrecht

expertise on youth development, from synapse to society

DYNAMICS OF YOUTH

  • Cooperation, Self-regulation

and Collective Action

  • Sustainability and

Resilience

  • Innovation and Economic

Growth

  • Equality, Inclusiveness and

Social Mobility

  • Democratic Governance,

Citizenship and Trust

INSTITUTIONS FOR OPEN SOCIETIES

  • One Health
  • Personalised Medicine & Health
  • Regenerative Medicine & Stem

Cells

  • Science for Life

LIFE SCIENCES

  • Towards Industry with Negative

Emissions

  • Future Food: Pathways towards

Healthy Planet Diets

  • Transforming Infrastructures for

Sustainable Cities

  • Water Climate & Future Deltas

PATHWAYS TO SUSTAINABILITY

slide-6
SLIDE 6

ITS - Research IT

Why iRODS as Research Data Management platform

  • scalable platform

– can manage billions of files, petabytes of data – infrastructure/vendor neutral solution

  • enforces data policies

– secures sensitive data – auditable controls

  • manages metadata alongside the data

– metadata based data policy execution decisions – data workflow automation

supports demonstrable research integrity can be used to manage large/many data collections facilitates research data workflows

slide-7
SLIDE 7

ITS – Research IT

Utrecht University iRODS managed research data

20 40 60 80 100 120 140 160 180 200 2 4 6 8 10 12 200 400 600 800 1000 1200 1400 1600 Internal External

11 Zones 1400 Users 180 TB Data production instances only, figures are indicative

slide-8
SLIDE 8

ITS – Research IT preconfigured iRODS based system, delivered and supported as a service

– enhanced with (graphical) user interfaces, policies and rules

iRODS

UU Data Policies and -services Apache Web Server

portal

PRODS

network-disk

Davrods

iRODS API

power-user

iCommands

Our iRODS implementation is called "Yoda":

user interaction service configuration data integration 10,000 lines of rules 14 custom microservices

slide-9
SLIDE 9

ITS – Research IT

Yoda Data compartments

Research Vault Research Vault Research Vault Each data compartment relates to an iRODS group Collaborate Deposit/ Read only

slide-10
SLIDE 10

ITS – Research IT

Yoda Communities ("category")

A community comprises

  • f multiple data

compartments Per community:

  • cost calculation/invoicing
  • appointed datamanager(s)
  • metadata schema

Research Vault Research Vault Research Vault Community concept implemented as metadata on iRODS groups

slide-11
SLIDE 11

ITS – Research IT

Collaborate during research via the Yoda disk

WebDAV access from anywhere on any workstation using Davrods

slide-12
SLIDE 12

ITS – Research IT

Data Deposit workflow

Submit Approve Secured

Research Vault Researcher

requests to deposit

Data manager

checks metadata complies with policies

System

deposits a copy in the vault

data package data folder

+

metadata bypass possible for communities that have no datamanager role

slide-13
SLIDE 13

ITS – Research IT

FAIR Data Publication workflow

Submit Approve Published

Vault Researcher

requests to publish

Data manager

checks metadata complies with publication policies

System

publishes the metadata and provides internet access to data if classified as "Open"

data package DOI + landingpage

slide-14
SLIDE 14

ITS – Research IT

'FAIR' Research Data Management using iRODS

Collaborate safely as a group ("Research" folder) Maintain integrity, deposit a folder in the vault Allow FAIR reuse, publish a data package Research Vault

slide-15
SLIDE 15

ITS – Research IT

demonstration

slide-16
SLIDE 16

ITS – Research IT

Challenges, issues and lessons learned

  • Metadata form interaction with browser: was XML now

adopting Json

  • iRODS 4.1.11 stable and reliable except for delayed rules

engine (resolved in 4.2.2+)

  • many components and architectural layers, need to simplify

implementation and configuration

slide-17
SLIDE 17

ITS – Research IT

Yoda manages data during/after research

Collaborate safely as a group ("Research" folder)

  • > membership self-managed by researchers

Maintain integrity, deposit a folder in the vault

  • > metadata can vary per community,
  • > datamanager approves deposit

Allow FAIR reuse, publish a data package

  • > datamanager approves publication, DOI citable data

Research Vault

slide-18
SLIDE 18

ITS – Research IT

Yoda is available under GPL license at https://github.com/UtrechtUniversity

Thank you

More info: Ton Smeele a.p.m.smeele@uu.nl Lazlo Westerhof l.r.westerhof@uu.nl