The implementation of national research data repository in South Africa. Mr Mbuyiselo Mqondisi Ndlovu, Software Engineer. OPEN REPOSITORIES 2019, June 10-13, 2019 | Hamburg, Germany


SLIDE 1

The implementation of national research data repository in South Africa.

Mr Mbuyiselo Mqondisi Ndlovu Software Engineer OPEN REPOSITORIES 2019, June 10-13, 2019 | Hamburg, Germany

SLIDE 2

Content

  • Introduction
  • Issues to address
  • Current view of SA institutional data repositories
  • Provisioning of a national data repository – Architecture
  • Interaction with the storage
  • Metadata
  • Persistent Identifier
  • DOI – Data Deposit Tool Interaction
  • Integrated National Research Data Infrastructure
SLIDE 3

Introduction

  • Not all institutions have the IT capability to provide digital storage for their research data.
  • We need to look at how all institutions can be supported by offering them a central repository system.
  • Providing capable, user-friendly systems that help researchers interact with a centralized data repository is essential.
  • This presentation covers the applications and the storage facility implemented by DIRISA.

SLIDE 4

Issues to address

Committed to addressing research data challenges:

  • Storage and back up
  • Sharing data
  • Preserving data
  • Discovering data
  • Data transfer
  • Re-usability
SLIDE 5

Current view of SA institutional data repositories

  • South Africa has 26 institutions
  • Each has its own repository and varied infrastructure
  • Digital content generated by faculty, staff and students
  • Currently not centralized
SLIDE 6

Provisioning of a national data repository – Architecture

  • Passive data (40 PB): archival data and staging, VM access
  • Active data (8 PB): near-real-time interactive access
  • OpenStack storage virtualisation cluster
  • iRODS
  • DIRISA cloud portal
  • Allocation of quotas per researcher. Reliable and persistent.
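
Per-researcher quota allocation can be illustrated with a minimal sketch. The slide does not say how quotas are enforced or how large they are, so the `Quota` class, the 1 TB figure, and the use of decimal units are all assumptions made for illustration only.

```python
from dataclasses import dataclass

TB = 10**12  # decimal terabyte, in bytes
PB = 10**15  # decimal petabyte, in bytes


@dataclass
class Quota:
    """Hypothetical per-researcher storage quota (not DIRISA's actual model)."""
    limit_bytes: int
    used_bytes: int = 0

    def can_store(self, size_bytes: int) -> bool:
        # An upload fits if it does not push usage past the limit.
        return self.used_bytes + size_bytes <= self.limit_bytes

    def store(self, size_bytes: int) -> None:
        if size_bytes < 0:
            raise ValueError("size must be non-negative")
        if not self.can_store(size_bytes):
            raise ValueError("quota exceeded")
        self.used_bytes += size_bytes


def quotas_that_fit(capacity_bytes: int, quota_bytes: int) -> int:
    # How many full quotas of a given size fit into a storage tier.
    return capacity_bytes // quota_bytes


quota = Quota(limit_bytes=1 * TB)   # assumed quota size
quota.store(600 * 10**9)            # a 600 GB deposit
print(quota.can_store(500 * 10**9)) # False: only 400 GB remain
print(quotas_that_fit(8 * PB, 1 * TB))
```

With these assumed numbers, an 8 PB tier holds 8000 quotas of 1 TB each, which gives a feel for the scale involved.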

SLIDE 7

Interaction with the storage

  • Metalnx, iCommands (optional)
  • Designed to work alongside iRODS
  • Metalnx key features:
    – Collection management
    – Metadata extraction
    – Metadata management
    – Metadata templates
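
For researchers who choose iCommands over the Metalnx web interface, an upload followed by metadata tagging might look roughly like the sketch below. It only builds the command lines (`iput` to upload, `imeta add -d` to attach an attribute-value pair to a data object) and does not run them. The zone, collection path, file name, and helper function names are hypothetical.

```python
import posixpath
import shlex


def iput_command(local_path, collection):
    # iput <local file> <destination collection>
    return ["iput", local_path, collection]


def imeta_add_command(object_path, attribute, value):
    # imeta add -d <data object> <attribute> <value>
    return ["imeta", "add", "-d", object_path, attribute, value]


def upload_plan(local_path, collection, metadata):
    """Return the iCommands needed to upload a file and tag it."""
    object_path = posixpath.join(collection, posixpath.basename(local_path))
    commands = [iput_command(local_path, collection)]
    for attribute, value in metadata.items():
        commands.append(imeta_add_command(object_path, attribute, value))
    return commands


plan = upload_plan(
    "/tmp/survey.csv",                  # hypothetical local file
    "/exampleZone/home/alice/project",  # hypothetical iRODS collection
    {"Title": "Survey data", "Creator": "A. Researcher"},
)
for command in plan:
    print(shlex.join(command))
```

Building the commands as argument lists keeps values with spaces (such as titles) intact if they are later passed to `subprocess.run`.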

SLIDE 8

Metadata

  • Dublin Core elements (15 metadata fields):
    – Title
    – Subject
    – Description
    – Creator
    – Publisher
    – Contributor
    – Date
    – Type
    – Format
    – Identifier
    – Source
    – Language
    – Relation
    – Coverage
    – Rights
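
A record using these 15 elements can be checked and serialized with the Python standard library. This is a sketch only: the `metadata` wrapper element and the sample values are assumptions, and it does not reflect how Metalnx or DIRISA store metadata internally. The namespace URI is the standard one for the Dublin Core element set.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
DC_ELEMENTS = (
    "title", "subject", "description", "creator", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
)


def to_dublin_core_xml(record):
    """Serialize a dict of Dublin Core fields to an XML string."""
    unknown = set(record) - set(DC_ELEMENTS)
    if unknown:
        raise ValueError(f"not Dublin Core elements: {sorted(unknown)}")
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")  # wrapper element name is an assumption
    for name in DC_ELEMENTS:
        values = record.get(name, [])
        if isinstance(values, str):
            values = [values]  # every element may be repeated
        for value in values:
            child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
            child.text = value
    return ET.tostring(root, encoding="unicode")


sample = {
    "title": "Rainfall survey",             # hypothetical values
    "creator": ["A. Researcher", "B. Analyst"],
    "date": "2019-06-10",
}
print(to_dublin_core_xml(sample))
```

All Dublin Core elements are optional and repeatable, which is why the sketch accepts either a single string or a list for each field.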

SLIDE 9

Persistent Identifier

  • Reference to a document, file, web page, or other object
  • Globally unique
  • Forms part of metadata fields
  • Requested by Metalnx through an API call during the upload process
  • The Anatomy of a Digital Object Identifier (DOI)

Source: http://www.ands.org.au/online-services/doi-service/doi-policy-statement
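
The anatomy of a DOI (a prefix beginning with `10.`, a slash, then a suffix chosen by the registrant) can be shown with a small parser. This is a sketch: the sample DOI is hypothetical, the `DOI` class and `parse_doi` function are illustrative names, and the validation is far looser than a real registration service would apply.

```python
from dataclasses import dataclass
from urllib.parse import quote


@dataclass(frozen=True)
class DOI:
    prefix: str  # "10." plus the registrant code
    suffix: str  # item identifier chosen by the registrant

    @property
    def resolver_url(self) -> str:
        # doi.org resolves a DOI to its current landing page.
        return "https://doi.org/" + quote(f"{self.prefix}/{self.suffix}", safe="/")


def parse_doi(text: str) -> DOI:
    doi = text.strip()
    # Accept the common "doi:" and resolver-URL forms as well.
    for lead in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.lower().startswith(lead):
            doi = doi[len(lead):]
            break
    prefix, slash, suffix = doi.partition("/")
    if not slash or not suffix or not prefix.startswith("10.") or len(prefix) <= 3:
        raise ValueError(f"not a DOI: {text!r}")
    return DOI(prefix, suffix)


doi = parse_doi("doi:10.5555/dirisa.example.2019")  # hypothetical DOI
print(doi.prefix)
print(doi.suffix)
print(doi.resolver_url)
```

Only the first slash separates prefix from suffix, so a suffix may itself contain slashes.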

SLIDE 10

DOI – Data Deposit Tool Interaction

SLIDE 11

Integrated National Research Data Infrastructure

  • One central application
  • All nodes connected (Tier 1, Tier 2, and Tier 3)
  • Retrieve data from all institutional repositories via one central UI
  • Interoperability
  • Improved collaboration and sharing

[Figure: tier hierarchy. Tier 1 (National): DIRISA. Tier 2 (Regional): Region 1, Region 2, Region 3. Tier 3 (Institutional): Institution 1, Institution 2.]
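
The idea of retrieving data from every institutional repository through one central entry point can be sketched as a tree of nodes, where a search at the national node fans out to the regional and institutional nodes beneath it. The `Node` class, the in-memory dataset titles, and the placement of the two institutions under Region 1 are assumptions for illustration; a real deployment would query remote repositories over the network.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    tier: int  # 1 = national, 2 = regional, 3 = institutional
    datasets: list = field(default_factory=list)
    children: list = field(default_factory=list)

    def search(self, keyword):
        """Return (node name, dataset title) pairs from this node and below."""
        needle = keyword.lower()
        hits = [(self.name, title) for title in self.datasets
                if needle in title.lower()]
        for child in self.children:
            hits.extend(child.search(keyword))
        return hits


# Hypothetical topology and dataset titles.
institution_1 = Node("Institution 1", 3, datasets=["Rainfall survey 2018"])
institution_2 = Node("Institution 2", 3, datasets=["Coastal rainfall model",
                                                   "Genome assembly"])
region_1 = Node("Region 1", 2, children=[institution_1, institution_2])
region_2 = Node("Region 2", 2)
region_3 = Node("Region 3", 2)
national = Node("DIRISA", 1, children=[region_1, region_2, region_3])

for node_name, title in national.search("rainfall"):
    print(node_name, "-", title)
```

Because every query enters at the national node, a researcher needs only one interface, while each institution keeps hosting its own data.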

SLIDE 13

Thank you