fair sequencing data repository based on irods
play

FAIR Sequencing Data Repository based on iRODS Felipe O. Gutierrez - PowerPoint PPT Presentation

FAIR Sequencing Data Repository based on iRODS Felipe O. Gutierrez AMC - Academic Medical Center - Amsterdam, Netherlands A.C.Camargo Cancer Center - So Paulo, Brazil F. Oliveira Aldo Sjoerd Diogo A.H.C. van Silvia D. P.F.G. De J.T.


  1. FAIR Sequencing Data Repository based on iRODS Felipe O. Gutierrez AMC - Academic Medical Center - Amsterdam, Netherlands A.C.Camargo Cancer Center - São Paulo, Brazil F. Oliveira Aldo Sjoerd Diogo A.H.C. van Silvia D. P.F.G. De J.T. van Jongejan Repping Gutierrez Ferreira Kampen Olabarriaga den Berg Geest Patrão

  2. Problem ● Inadequate RDM (Research Data Management) solution for NGS data (Next Generation Sequencing): ○ Individual storage and backup ○ Dispersed datasets ○ Disconnected from metadata ○ Not FAIR 2

  3. Considerations Fit within organization ● ICT culture ● Research culture ● Sustainability vision Adhere to international community best practices Reuse and extend existing solutions Freeman, 1983 3

  4. Fit into AMC Vision for RDM Based on NFU Data4Lifesciences WP2 An NGS repository that is: ● Part of an ecosystem ● Controlled by AMC ● Distributed ● Scalable ● FAIR compliant ● Easy to use 4

  5. System Design ● iRODS 4.1.10 ○ Middleware ○ Data virtualization ● Virtuoso 7.2 ○ Triplestore ○ Supports ontologies ● User interfaces: ○ Metalnx web ○ Davrods 4.1 ○ iCommands 5

  6. System Architecture 6

  7. Stewardship: Ontologies ● EDAM Ontology for bioinformatics operations, types of data, data identifiers, data formats, and topics ● OMIABIS Ontologized Minimum Information About Biobank data Sharing (MIABIS) ● OBI Ontology for Biomedical Investigations ● EFO Experimental Factor Ontology 7

  8. Workflow: Data Ingestion 8

  9. Workflow: (meta)data Registration 9

  10. Workflow: (meta)data Retrieval 10

  11. Access and Security 11

  12. Status 12

  13. Report file 13

  14. nmon read KB/s 14

  15. nmon write KB/s 15

  16. nmon IOPs 16

  17. Qualitative & Quantitative questions ● (meta)data preparation? Clear, doable, easy, ... ● (meta)data upload? Type, size, quantity, integrity, ... ● Rule processing? Report file clear and easy, system delay feedback, ... ● (meta)data retrieval? Findable, Accessible, Organized, Interoperable, Reusable, .. ● Concurrent users, variation on the number and size of files. 17

  18. Acknowledgements KEBB: •Barbera van Schaik •Allard van Altena ADICT: Hans van den Berg UvA ICTS: Joyce Nijkamp Medical Library: Lieuwe Kool Clinical Research Unit: Rudy Scholte Reproductive medicine: Sjoerd Repping Genetic Metabolic Diseases: Frédéric Vaz Immunogenomics: Niek de Vries

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend