Do you want to find out MORe? Dimitris Gavrilis



SLIDE 1

Do you want to find out MORe?

Dimitris Gavrilis
Digital Curation Unit – IMIS, Athena Research Center

http://more.dcu.gr

SLIDE 2

What is MORe

  • Metadata aggregator
    – Easy to use
    – Flexible
    – Scalable
    – Validation
    – Metadata quality
    – Enrichment
  • Used in various projects & infrastructures (CARARE, 3D-ICONS, LoCloud, ARIADNE, …)

SLIDE 3

Workflow

[Diagram labels]
  • Inputs: OAI-PMH, Omeka, Wikimedia, MINT
  • Steps: Harvest, Ingest, Transform, Enrich, Publish
  • Outputs: OAI-PMH, Archive, RDF Store, Elastic Search
  • Other actions: Validate, Index, Delete, Reject
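The Harvest step over OAI-PMH can be illustrated with a short sketch. This is not MORe's own code: it only uses the standard OAI-PMH request arguments (`verb`, `metadataPrefix`, `resumptionToken`) and the protocol's XML namespace. The function names and the sample response are assumptions made for illustration.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 protocol.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL (function name is hypothetical)."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument:
        # it replaces metadataPrefix on follow-up requests.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None) from a response."""
    root = ET.fromstring(xml_text)
    ids = [
        el.text
        for el in root.findall(".//oai:record/oai:header/oai:identifier", OAI_NS)
    ]
    token_el = root.find(".//oai:resumptionToken", OAI_NS)
    token = token_el.text if token_el is not None and token_el.text else None
    return ids, token


# Hand-written sample response, trimmed to the parts the parser reads.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example:1</identifier></header></record>
    <record><header><identifier>oai:example:2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""
```

A harvester would call `list_records_url` again with the returned token until no token comes back.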

SLIDE 4

Micro-services architecture

[Diagram labels]
  • Core services layer, Data access layer, Storage nodes
  • Input service mgmt
    – Input sources: OAI-PMH, MINT mapping tool, Wikimedia, Omeka, File-Upload, …
  • Validation service mgmt
    – Validation micro-services: Structure, Schema, Linking, Schematron rules
  • Enrichment service mgmt
    – Enrichment micro-services: Thesauri collections, Vocabulary matching, Background links, Geo normalization, Geo coding, Language identification, Historic place names, …
  • Publish serv. mgmt
    – Publish services: Archive, Elastic Search, RDF Store, OAI-PMH, …

SLIDE 5

Flexible Validation

  • Validation schemes
    – Integrity checking
    – XSD validation
    – Broken links
    – Schematron rule based validation
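A layered validation pass of this kind can be sketched as below. The sketch uses only the Python standard library, which has no XSD or Schematron engine, so a well-formedness check stands in for integrity checking and a single hand-written rule stands in for a Schematron assertion. The rule, the function names, and the `is_reachable` callback are all assumptions for illustration, not MORe's validation services.

```python
import xml.etree.ElementTree as ET

# Dublin Core element namespace in ElementTree's {uri}tag form.
DC = "{http://purl.org/dc/elements/1.1/}"


def check_integrity(xml_text):
    """Integrity check: the record must at least be well-formed XML."""
    try:
        return ET.fromstring(xml_text), []
    except ET.ParseError as exc:
        return None, [f"not well-formed: {exc}"]


def check_rules(root):
    """Rule-based check in the spirit of Schematron (hypothetical rule)."""
    errors = []
    if not any((el.text or "").strip() for el in root.iter(DC + "title")):
        errors.append("missing dc:title")
    return errors


def check_links(root, is_reachable):
    """Report dc:identifier URLs for which is_reachable(url) is false."""
    errors = []
    for el in root.iter(DC + "identifier"):
        url = (el.text or "").strip()
        if url.startswith(("http://", "https://")) and not is_reachable(url):
            errors.append(f"broken link: {url}")
    return errors


def validate(xml_text, is_reachable):
    """Run the checks in order; stop early if the record cannot be parsed."""
    root, errors = check_integrity(xml_text)
    if root is None:
        return errors
    return check_rules(root) + check_links(root, is_reachable)
```

Passing the reachability test in as a function keeps the sketch free of network access; a real checker would issue HTTP requests there.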

SLIDE 6

Metadata Quality

  • Get completeness graphs for every package and
    – schema
    – element
    – per mandatory/recommended set
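The numbers behind a per-element completeness graph can be computed along these lines. This is a minimal sketch assuming Dublin Core records; the mandatory and recommended sets shown are hypothetical placeholders, not the sets MORe applies to any schema.

```python
import xml.etree.ElementTree as ET
from collections import Counter

DC = "{http://purl.org/dc/elements/1.1/}"

# Hypothetical element sets, chosen only for the example.
MANDATORY = ["title", "identifier"]
RECOMMENDED = ["description", "subject"]


def present_elements(xml_text):
    """Names of Dublin Core elements that carry non-empty text in a record."""
    root = ET.fromstring(xml_text)
    return {
        el.tag[len(DC):]
        for el in root.iter()
        if el.tag.startswith(DC) and (el.text or "").strip()
    }


def completeness(records, element_set):
    """Fraction of records (0.0 to 1.0) that fill each element of the set."""
    counts = Counter()
    for xml_text in records:
        present = present_elements(xml_text)
        counts.update(name for name in element_set if name in present)
    total = len(records)
    return {
        name: (counts[name] / total if total else 0.0) for name in element_set
    }
```

Calling `completeness(package, MANDATORY)` and `completeness(package, RECOMMENDED)` on the same package gives the two series a completeness graph would plot.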

SLIDE 7

Metadata Quality

  • On the fly indexing, analysis and intuitive presentation of:
    – Thematic information
    – Spatial information
    – Temporal information

SLIDE 8

Enrichment Micro-services

SLIDE 9

Why enrichment?

  • The enrichment process:
    – normalizes your content
    – creates links among resources
    – adds new information
  • And thus improves:
    – Interoperability
    – Discoverability

SLIDE 10

  • Thematic
    – Thesauri collections
    – Vocabulary matching
    – Background links
  • Spatial
    – Geo normalization
    – Geo coding
    – Reverse geo-coding
    – Historic place names
  • Other
    – Language identification

Access to resources: SKOS Thesauri, Geo-Names, DBPedia, Wikipedia

SLIDE 11

Subject

  • We have access to over 30 SKOSified thesauri

Author | Name of vocabulary
University of California, Santa Barbara | Alexandria Digital Library Feature Type Thesaurus
Royal Commission on the Ancient and Historical Monuments of Scotland (RCAHMS) | Archeological Objects Thesaurus Scotland
English Heritage | Archeological Sciences Thesaurus
English Heritage | Building Materials Thesaurus
English Heritage | Components Thesaurus
American Folklore Society | Ethnographic Thesaurus
English Heritage | Event Type Thesaurus
English Heritage | Evidence Thesaurus
English Heritage | FISH Archeological Objects Thesaurus
Eionet European Environment Information and Observation Network | General Multilingual Environmental Thesaurus GEMET
Federation Internationale des Archives du Film (FIAF) | General Subject headings for Film Archives
The Discovery Programme | Irish Monuments
The Discovery Programme | Irish Periods
Royal Commission on the Ancient and Historical Monuments of Scotland (RCAHMS) | Maritime Craft Thesaurus Scotland
English Heritage | Maritime Craft Type Thesaurus
English Heritage and Royal Commission on the Historical Monuments of England | MDA Archaeological Objects Thesaurus
Royal Commission on the Ancient and Historical Monuments of Wales (RCAHMW) | Monument Thesaurus Wales
Royal Commission on the Ancient and Historical Monuments of Scotland (RCAHMS) | Monument Type Thesaurus
English Heritage | Period Thesaurus
Royal Commission on the Ancient and Historical Monuments of Wales (RCAHMW) | Period Thesaurus Wales
Bibliographic Standards Committee of the Rare Books and Manuscripts Section (ACRL/ALA) | Relator Terms for Use in Rare Book and Special Collections Cataloguing
Universidad de León | Tesauro de Ciencias de la Documentación
Library of Congress. Prints and Photographs Division | Thesaurus for Graphic Materials 1: Subject Terms
Library of Congress. Prints and Photographs Division | Thesaurus for Graphic Materials 2: Genre and Physical Characteristic Terms
Ministero per i Beni e le Attività Culturali | Thesaurus PICO 4.1
UKAT | UK Archival Thesaurus (UKAT)
UNESCO | UNESCO thesaurus

SLIDE 12

Space

  • We have our own Geo-names server

SLIDE 13

Time

  • We have our own Perio.do database

SLIDE 14

Enrichment Plan

  • Enrichment plans provide a simple way of streamlining the execution of the above enrichment services in order to create powerful …

Enrichment plans

  • Language identification
  • Vocabulary matching
  • Geo-normalization
  • Geo-coding
  • Add subject collection A only if term X or Y is matched

An enrichment plan can pass on content-specific parameters that match your data
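One way to picture an enrichment plan is as an ordered list of steps, each with its own parameters and an optional condition. The sketch below is an assumption about how such a plan could be modelled, not MORe's actual plan format. `Step`, `run_plan`, and the two toy services are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

Record = dict  # a record is modelled as a plain dict for this sketch


@dataclass
class Step:
    name: str
    run: Callable[[Record, dict], None]          # mutates the record in place
    params: dict = field(default_factory=dict)   # content-specific parameters
    condition: Optional[Callable[[Record], bool]] = None


def run_plan(record, plan):
    """Run each step in order; skip steps whose condition is not met."""
    applied = []
    for step in plan:
        if step.condition is None or step.condition(record):
            step.run(record, step.params)
            applied.append(step.name)
    return applied


def match_vocabulary(record, params):
    # Toy matcher: keep the subjects that appear in the supplied term list.
    terms = set(params["terms"])
    record["matched"] = [s for s in record.get("subjects", []) if s in terms]


def add_collection(record, params):
    record.setdefault("collections", []).append(params["collection"])


# "Add subject collection A only if term X or Y is matched"
plan = [
    Step("vocabulary-matching", match_vocabulary, {"terms": ["X", "Y", "Z"]}),
    Step(
        "add-subject-collection",
        add_collection,
        {"collection": "A"},
        condition=lambda r: bool({"X", "Y"} & set(r.get("matched", []))),
    ),
]
```

Keeping the condition on the step, rather than inside the service, lets the same service be reused in different plans with different triggers.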

SLIDE 15

A simple use case

SLIDE 16

Native record (OAI_DC)

XSLT Mapping

SLIDE 17

EDM Record

  • Missing language attributes
  • Place label is a concatenated string of coordinates

SLIDE 18

Enriched EDM Record

Enrichment Plan

  • Language identification
  • Vocabulary matching
  • Geo-normalization
  • Geo-coding
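The geo-normalization step, applied to a place label that is just a concatenated coordinate string, might look like the sketch below. The label format (latitude first, then longitude, in decimal degrees) is an assumption, since the slides do not show the actual string, and `normalize_place_label` is a hypothetical name.

```python
import re

# Signed decimal numbers such as 37.9715 or -3.2
_NUMBER = re.compile(r"[-+]?\d+(?:\.\d+)?")


def normalize_place_label(label):
    """Split a 'lat<sep>long' label into decimal degrees, or return None.

    Assumes latitude comes first; labels that do not contain exactly two
    numbers, or whose values fall outside valid ranges, are left alone.
    """
    numbers = _NUMBER.findall(label)
    if len(numbers) != 2:
        return None
    lat, lon = (float(n) for n in numbers)
    if not (-90 <= lat <= 90 and -180 <= lon <= 180):
        return None
    return {"lat": lat, "long": lon}
```

Returning `None` for anything ambiguous leaves the original label untouched, so a later geo-coding step can still try to resolve it as a place name.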

SLIDE 19

Developers & Creative Industries API Integration

  • The MORe API lets you run the entire aggregation engine through REST
  • Developers area
    – API key generation
    – API documentation with examples
    – Example Java projects for NetBeans & Eclipse IDEs

SLIDE 20

How can I get it?

  • Free for small institutions
    – Up to 1000 items
    – Up to 3 publications / year
  • Small & medium size institutions
    – Request a quote

Find out more at: http://more.dcu.gr

SLIDE 21

Thank you

d.gavrilis@dcu.gr