Do you want to find out MORe ? Dimitris Gavrilis Digital Cura:on - - PowerPoint PPT Presentation
Do you want to find out MORe ? Dimitris Gavrilis Digital Cura:on - - PowerPoint PPT Presentation
Do you want to find out MORe ? Dimitris Gavrilis Digital Cura:on Unit IMIS, Athena Research Center hCp://more.dcu.gr What is MORe Metadata aggregator Easy to use Flexible Scalable Valida:on Metadata quality
- Metadata aggregator
– Easy to use – Flexible – Scalable – Valida:on – Metadata quality – Enrichment
- Used in various projects & infrastructures
(CARARE, 3D-ICONS, LoCloud, ARIADNE,…)
What is MORe
Workflow
OAI-PMH Omeka Wikimedia MINT Harvest Ingest Transform Enrich Publish OAI-PMH Archive RDF Store Elas:c Search Validate Index Delete Reject
Micro-services architecture
Valida:on service mgmt Valida&on micro-services Input sources Structure Schema Linking Schematron rules Data access layer OAI-PMH MINT mapping tool Storage nodes Core services layer Input service mgmt Publish serv. mgmt Publish services Archive Elas:c Search RDF Store OAI-PMH … Wikimedia Omeka Enrichment service mgmt … Thesauri collec:ons Vocabulary matching Background links Geo normaliza:on Geo coding Language iden:fica:on Historic place names Enrichment micro-services File-Upload … …
- Valida:on schemes
– Integrity checking – XSD valida:on – Broken links – Schematron rule based valida:on
Flexible Valida:on
- Get completeness graphs
for every package and
– schema – element – Per mandatory/ recommended set
Metadata Quality
- On the fly indexing, analysis and intui:ve presenta:on of:
– Thema:c informa:on – Spa:al informa:on – Temporal informa:on
Metadata Quality
Enrichment Micro-services
- The enrichment process:
– normalizes your content – creates links among resources – adds new informa:on
- And thus improves:
– Interoperability – Discoverability
Why enrichment ?
- Thema:c
– Thesauri collec:ons – Vocabulary matching – Background links
- Spa:al
– Geo normaliza:on – Geo coding – Reverse geo-coding – Historic place names
- Other
– Language iden:fica:on
Access to resources
SKOS Thesauri Geo-Names DBPedia Wikipedia
- We access to over 30
SKOSified thesauri
Subject
Author Name of vocabulary University of California, Santa Barbara Alexandria Digital Library Feature Type Thesaurus Royal Commission on the Ancient and Historical Monuments of Scotland (RCAHMS) Archeological Objects Thesaurus Scotland English Heritage Archeological Sciences Thesaurus English Heritage Building Materials Thesaurus English Heritage Components Thesaurus American Folklore Society Ethnographic Thesaurus English Heritage Event Type Thesaurus English Heritage Evidence Thesaurus English Heritage FISH Archeological Objects Thesaurus Eionet European Environment Information and Observation Network General Multilingual Environmental Thesaurus GEMET Federation Internationale des Archives du Film (FIAF) General Subject headings for Film Archives The Discovery Programme Irish Monuments The Discovery Programme Irish Periods Royal Commission on the Ancient and Historical Monuments of Scotland (RCAHMS) Maritime Craft Thesaurus Scotland English Heritage Maritime Craft Type Thesaurus English Heritage and Royal Commission on the Historical Monuments of England MDA Archaeological Objects Thesaurus Royal Commission on the Ancient and Historical Monuments of Wales (RCAHMW) Monument Thesaurus Wales Royal Commission on the Ancient and Historical Monuments of Scotland (RCAHMS) Monument Type Thesaurus English Heritage Period Thesaurus Royal Commission on the Ancient and Historical Monuments of Wales (RCAHMW) Period Thesaurus Wales Bibliographic Standards Committee of the Rare Books and Manuscripts Section (ACRL/ALA) Relator Terms for Use in Rare Book and Special Collections Cataloguing Universidad de León Tesauro de Ciencias de la Documentación Library of Congress. Prints and Photographs Division Thesaurus for Graphic Materials 1: Subject Terms Library of Congress. Prints and Photographs Division Thesaurus for Graphic Materials 2: Genre and Physical Characteristic Terms Ministero per i Beni e le Attività Culturali Thesaurus PICO 4.1 UKAT UK Archival Thesaurus (UKAT) UNESCO UNESCO thesaurus
- We have own Geo-names server
Space
- We have our own Perio.do database
Time
Enrichment Plan
- Enrichment plans provide a simple way
- f streamlining the execu:on of the
above enrichment services in order to create powerful
Enrichment plans
Language iden:fica:on Vocabulary matching Geo-normaliza:on Geo-coding Add subject collec:on A only if term X or Y are matched
An enrichment plan can pass on content specific parameters that match your data
A simple use case
Na:ve record (OAI_DC)
XSLT Mapping
EDM Record
Missing language aCributes Place label is a concat string of coordinates
Enriched EDM Record
Language iden:fica:on Vocabulary matching Geo-normaliza:on Geo-coding
Enrichment Plan
- MORe API allows to run the
en:re aggrega:on engine through REST
- Developers area
– API key genera:on – API documenta:on with examples – Example Java projects for NetBeans & Eclipse IDEs
Developers & Crea:ve Industries API Integra:on
- Free for small ins:tu:ons
– Up to 1000 items – Up to 3 publica:ons / year
- Small & medium size ins:tu:ons