ELIXIR Recommended Interoperability Resources, Carole Goble

ELIXIR Recommended Interoperability Resources. Carole Goble, ELIXIR-UK Interoperability Platform ExCo. ELIXIR Fifth Anniversary, 11 December 2018. www.elixir-europe.org. ELIXIR-EXCELERATE is funded by the European Commission.


SLIDE 1

www.elixir-europe.org

@ELIXIREurope

ELIXIR-EXCELERATE is funded by the European Commission within the Research Infrastructures programme of Horizon 2020, grant agreement number 676559.

ELIXIR Recommended Interoperability Resources

Carole Goble, ELIXIR-UK Interoperability Platform ExCo
ELIXIR Fifth Anniversary, 11 December 2018

SLIDE 2

Turning FAIR Data into reality: Final Report and Action Plan, European Commission, Nov 2018

SLIDE 3

Building a suitable FAIR infrastructure for finding, exchanging, comparing, aggregating and interlinking biological information across Europe

SLIDE 4

Rare Disease research

Combine more of the same data type
Link up different data types for a more complete picture

Images courtesy of Marco Roos and RD-CONNECT

SLIDE 5

Rare Disease research

Harmonise database formats and models
Map between the terms used in the databases
Link to reference knowledge bases

Images courtesy of Marco Roos

Retrieval and analysis across resources

SLIDE 6

Genotypic and Phenotypic data for Crop and Forest Plants

High throughput genomics
Large scale automated phenotyping

Standards for representation of genotypic and phenotypic data
Make data discoverable and interoperable by common APIs
Annotate datasets to deposit into public archives

SLIDE 7

Arabidopsis Leaf Length

?

Identifier CO_322: 0000994

Maize Plant Height

Identifier CO_322: 0000007

Thanks to Frederik Coppens

SLIDE 8

Icons courtesy of FAIRsharing.org

Is the same identifier being used for X? Can they be linked?
Are terms being used consistently? Are terms being used in common?
Are the formats the same? Can they be mapped?
Are the same things being reported in the same way?
Do (micro) services have the same or compatible APIs?

Common Agreements for IDs & Descriptions Standards, Link Points

700 types 224 754 122

Data from Bioportal.bioontology.org, FAIRsharing.org and Identifiers.org

SLIDE 9

Mappings across ontologies for NCIt (Retinoblastoma)*

Mappings across databases for the same entity**

Figure labels: BOLD (Barcode of Life), NCBI Taxonomy, GRIN plant taxonomy, Arabian tea plant, Cedar waxwing, Taxon:9606

* Courtesy of Simon Jupp, ** Courtesy of openPHACTS
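
One way to picture mappings across databases for the same entity is as pairwise cross-references that are merged into equivalence groups. The sketch below does this with a small union-find in Python. The function name `group_equivalent` and the identifiers in the usage note (`dbA:1`, `dbB:x` and so on) are hypothetical placeholders, not records from any real mapping service.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def group_equivalent(pairs: Iterable[Tuple[str, str]]) -> List[List[str]]:
    """Merge pairwise 'same entity' mappings into sorted equivalence groups."""
    parent: Dict[str, str] = {}

    def find(x: str) -> str:
        # Register unseen identifiers as their own root.
        parent.setdefault(x, x)
        # Walk up to the root, shortening the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # union the two groups

    groups = defaultdict(set)
    for identifier in list(parent):
        groups[find(identifier)].add(identifier)

    # Sort members and groups so the result is deterministic.
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])
```

For example, `group_equivalent([("dbA:1", "dbB:x"), ("dbB:x", "dbC:9")])` places all three placeholder identifiers in one group, even though `dbA:1` and `dbC:9` were never mapped to each other directly.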

SLIDE 10

Making connections across fragmented resources

SLIDE 11

Hence… FAIR Data Principles

Registration and search
Persistent and reused identifiers
Common, structured, interlinked metadata
Open access protocols
Machine processing

SLIDE 12

Turning FAIR Data Principles into Reality

Open Standards
Services & Resources
Machine processable

SLIDE 13

Interoperability Resources

Validata

SLIDE 14

Genotypic and Phenotypic data for Crop and Forest Plants Interoperability Resources

ELIXIR Plant Data Lookup Service

Registries for the standards and ontologies
Lookup services for the identifiers and concepts
Services to help annotate and validate databases and data submissions against reporting guidelines and formats
Services to harvest, map and search metadata
Services to map between different concepts and identifiers for the same thing

SLIDE 15

Standards: formats, reporting guidelines, ontologies
Search engine for datasets
Metadata services: ontology, annotation, validation, harvesting, indexing
Register services and datasets
Best practice: harmonisation of tools and pipelines
Describing and sharing workflows between different systems
Common Programmable Interfaces
Identifier resolution & management
Identifier mapping services

Resource Markup Workflows Identifiers Services and Resources Framework FAIR Metrics Registries Linked Data Knowledge Hub BYODs Metadata Standards APIs Workflow Marine Plants Rare Disease Human Data Metadata Services Id Services

What Interoperability Resources are needed?

SLIDE 16

ELIXIR Interoperability Resources Framework

Workflows Aggregators Applications Search

Tool & API Resources (Bioschemas) Workflows (CWL) Data type specific

Ontologies, formats, reporting guidelines, APIs

Authorities Identifier Metadata Annotation Markup Citation Harvesting Metadata Validation Ontology Mapping Identifier Mapping Indexing Search Ontology Lookup Identifier resolution Ontology Management Identifier minting

Extract Transform Load

Type specific mapping and resolution Type specific integration

Standards

Standards Registry Ontologies Registry Tools Workflows Identifiers Registry

SLIDE 17

Standards Registry Ontologies Registry Identifiers Registry

Interoperable ELIXIR Interoperability Resources Framework

Workflows Aggregators Applications Search

Tool & API Resources (Bioschemas) Workflows (CWL) Data type specific

Ontologies, formats, reporting guidelines, APIs

Standards

Authorities Identifier Tools Workflows Metadata Annotation Markup Citation Metadata Validation Ontology Mapping Identifier Mapping Ontology Lookup Identifier resolution Ontology Management Identifier minting

Extract Transform Load

Harvesting Harvesting Indexing Search Type specific mapping and resolution Type specific integration

SLIDE 18

Example: Identifier Resolution of Data on the Web

Multiple URLs for the same collection make object unification challenging

NCBITaxon:9606

http://www.ebi.ac.uk/ols/ontologies/ncbitaxon/terms?short_form=NCBITaxon_9606
http://www.ebi.ac.uk/ena/data/view/Taxon:9606
http://purl.uniprot.org/taxonomy/9606

Resolution Services keep track of and handle the different locations and different identifier systems.
The Resolution Services themselves are harmonised.
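
As a rough sketch of the unification problem on this slide, the Python below maps the three URLs shown above back to the single compact identifier NCBITaxon:9606. The regular expressions and the function names `to_compact_id` and `same_entity` are illustrative assumptions. They do not describe how any ELIXIR resolution service is implemented.

```python
import re
from typing import Optional

# URL shapes copied from the slide; each captures the numeric taxon ID.
# Treating "NCBITaxon" as the canonical prefix is an assumption of this sketch.
_TAXON_URL_PATTERNS = [
    re.compile(r"^https?://www\.ebi\.ac\.uk/ols/ontologies/ncbitaxon/terms\?short_form=NCBITaxon_(\d+)$"),
    re.compile(r"^https?://www\.ebi\.ac\.uk/ena/data/view/Taxon:(\d+)$"),
    re.compile(r"^https?://purl\.uniprot\.org/taxonomy/(\d+)$"),
]


def to_compact_id(url: str) -> Optional[str]:
    """Return 'NCBITaxon:<id>' for a recognised URL, else None."""
    for pattern in _TAXON_URL_PATTERNS:
        match = pattern.match(url)
        if match:
            return f"NCBITaxon:{match.group(1)}"
    return None


def same_entity(url_a: str, url_b: str) -> bool:
    """True only if both URLs are recognised and denote the same taxon."""
    id_a, id_b = to_compact_id(url_a), to_compact_id(url_b)
    return id_a is not None and id_a == id_b
```

With these patterns, `same_entity` treats the OLS, ENA and UniProt URLs from the slide as one object, while any unrecognised URL is never unified with anything.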

Thanks to Nick Juty and Sarala Wimalaratne

SLIDE 19

International Interoperability Resources

ELIXIR is part of a global ecosystem

SLIDE 20

What are Recommended Interoperability Resources?

An ELIXIR Service supplied by one or more Nodes

High quality of service and support
Plays an important role in our interoperability framework
Are FAIR and interoperate in a resource ecosystem

https://www.elixir-europe.org/platforms/interoperability/rir-selection

SLIDE 21

What are Recommended Interoperability Resources?

An ELIXIR Service supplied by one or more Nodes that helps:
establish connections between data (and other) resources
acquire and expose metadata of data (and other) resources
create infrastructure needed to build integrable data collections
use interoperability resources to support delivery of FAIR principles

Plays an important role in our interoperability framework

https://www.elixir-europe.org/platforms/interoperability/rir-selection

SLIDE 22

An Interoperability Resource for findability & metadata exchange for all of ELIXIR’s Resources

Metadata about web based resources using a widely adopted web standard in a community agreed way

MarRef Database

SLIDE 23

An Interoperability Resource for findability & metadata exchange for all of ELIXIR’s Resources

Metadata about web based resources using a widely adopted web standard in a community agreed way

aggregators registries search engines applications
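
To make "metadata about web based resources" concrete, the sketch below builds a JSON-LD snippet that a resource page could embed for aggregators, registries and search engines to harvest. It assumes the "widely adopted web standard" is schema.org markup, as used by Bioschemas (named on the framework slide). The function name `resource_markup`, the choice of the `Dataset` type and the example.org URL in the usage note are assumptions made for illustration.

```python
import json


def resource_markup(name: str, url: str, description: str) -> str:
    """Return a JSON-LD <script> element describing a web resource."""
    document = {
        "@context": "https://schema.org",
        "@type": "Dataset",  # assumed type; a real profile may require another
        "name": name,
        "url": url,
        "description": description,
    }
    body = json.dumps(document, indent=2)
    return '<script type="application/ld+json">\n' + body + "\n</script>"
```

Calling `resource_markup("MarRef Database", "https://example.org/marref", "Placeholder description")` yields a script element that can be pasted into the page's HTML. The URL and description here are placeholders, not the real MarRef metadata.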

SLIDE 24

An Interoperability Resource for findability and exchange of ELIXIR’s workflows and pipelines

Pioneered by Marine Metagenomics

Courtesy Rob Finn, Nils P. Willassen and Michael Crusoe

SLIDE 25

Resources gap
FAIR metadata at source
“The first and last mile”

Image courtesy of Sansone, McQuilton et al, FAIRsharing.org

SLIDE 26

First round of Recommended Interoperability Resources completes process ….

SLIDE 27

First round of Recommended Interoperability Resources completes process ….

tomorrow!

SLIDE 28

RIRs are ELIXIR added value to enable FAIR Core Data Resources

(and other ELIXIR resources)

Oversee quality and reliability
Develop an integrated portfolio
Support sustainability

RIR

SLIDE 29

Acknowledgements

Special Thanks: Michael Crusoe, Rafael Jimenez, Alasdair Gray, Stian Soiland-Reyes, Susanna Sansone, Simon Jupp, Tony Burdett, Sira Sarntivijai, Jerry Lanfear, Nick Juty, Sarala Wimalaratne, Frederik Coppens, Justin Clark-Casey, Peter McQuilton, Robert Finn, Marco Roos

And many more!

SLIDE 30

Thank you!

SLIDE 31

ELIXIR Interoperability Resources Framework

Workflows Aggregators Applications Search

Tool & API Resources (Bioschemas) Workflows (CWL) Data type specific

Ontologies, formats, reporting guidelines, APIs

Authorities Identifier Metadata Annotation Markup Citation Harvesting Metadata Validation Ontology Mapping Identifier Mapping Indexing Search Ontology Lookup Identifier resolution Ontology Management Identifier minting

Extract Transform Load

Type specific mapping and resolution Type specific integration

Standards

Standards Registry Ontologies Registry Tools Workflows Identifiers Registry

SLIDE 32

The 2018 Recommendations of RIRs

Resource: Description
FAIRsharing: Registry of curated metadata of DBs, policies, standards
g:Profiler: Gene-centric data integrator, with web UI and API services
Identifiers.org: Persistent URL provider and identifier resolver
InterMine: DIY database portal builder for model organism of choice
ISA Framework: Curation for metadata of experiments (Project -> Study -> Assay)
Ontology Lookup Service (OLS): Google-like ontology term search
3DBIONOTES API*: API aiding protein annotation by calling info from reference resources
BridgeDb: Identifier mapping for cheminformatics domain
DisGeNET API*: API, SPARQL endpoint for genetic variant - human disease data
MOLGENIS: Bioinformatics data integrator suite (explore/annotate/exchange)