the implementation of national research data repository
play

The implementation of national research data repository in South - PowerPoint PPT Presentation

The implementation of national research data repository in South Africa. Mr Mbuyiselo Mqondisi Ndlovu Software Engineer OPEN REPOSITORIES 2019, June 10-13, 2019 | Hamburg, Germany Content Introduction Issues to address


  1. The implementation of national research data repository in South Africa. Mr Mbuyiselo Mqondisi Ndlovu Software Engineer OPEN REPOSITORIES 2019, June 10-13, 2019 | Hamburg, Germany

  2. Content • Introduction • Issues to address • Current view of SA institutional data repositories • Provisioning of a national data repository – Architecture • Interaction with the storage • Metadata • Persistent Identifier • DOI – Data Deposit Tool Interaction • Integrated National Research Data Infrastructure 2

  3. Introduction • Not all institutions have IT capability to provide digital storage for their research data. • Need to look at how all institutions can be supported by offering them a central repository system. • Provide competent and user friendly systems to help researchers interact with centralized data repository is essential. • This presentation covers applications and a storage facility implemented by DIRISA. 3

  4. Issues to address Committed to address research data challenges: • Storage and back up • Sharing data • Preserving data • Discovering data • Data transfer • Re-usability 4

  5. Current view of SA institutional data repositories • South Africa has 26 institutions • Own repositories and varied infrastructures • Digital content generated by faculty, staff and students • Currently not centralized 5

  6. Provisioning of a national data repository – Architecture DIRISA cloud portal Passive data: Openstack Storage archival data & Virtualisation Cluster staging: VM access iRODS 8 PB 40 PB Active data: near real time interactive access Allocation of quotas per researcher. Reliable and persistent. 6

  7. Interaction with the storage • Metalnx, iCommands (optional) • Designed to work alongside iRODS • Metalnx key features: – Collection management – Metadata extraction – Metadata management – Metadata templates 7

  8. Metadata • Dublin core elements - 15 metadata fields: – Title – Subject – Description – Creator – Publisher – Contributor – Date – Type – Format – Identifier – Source – Language – Relation – Coverage – Rights 8

  9. Persistent Identifier • Reference to a document, file, web page, or other object • Globally unique • Forms part of metadata fields • Invoked by metalnx through API call during upload process • The Anatomy of a Digital Object Identifier (DOI) Source: http://www.ands.org.au/online-services/doi-service/doi-policy-statement 9

  10. DOI – Data Deposit Tool Interaction 10

  11. Integrated National Research Data Infrastructure Tier 1 (National) DIRISA Tier 2 (Regional) Region 2 Region 1 Region 3 Tier 3 (Inst) Institution 1 Institution 2 • One central application • All nodes connected (Tier 1, Tier 2, and Tier 3) • Retrieve data from all institutional repositories via one central UI • Interoperability • Improved collaboration and sharing 11

  12. 12

  13. Thank you

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend