Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Valentine Charles, Richard Wallis, Antoine Isaac, Nuno Freire and Hugo Manguinhas | SWIB 2017
publishing and harvesting metadata at Europeana Valentine Charles, - - PowerPoint PPT Presentation
Perspectives on using Schema.org for publishing and harvesting metadata at Europeana Valentine Charles, Richard Wallis, Antoine Isaac, Nuno Freire and Hugo Manguinhas | SWIB 2017 European Cultural Heritage on the Web The main goal of Europeana
Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Valentine Charles, Richard Wallis, Antoine Isaac, Nuno Freire and Hugo Manguinhas | SWIB 2017
The main goal of Europeana is to provide access to cultural heritage and encourage people to engage with culture.
a trusted and authoritative repository of cultural heritage by the search engines.
CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana CC BY-SA
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Publication of data on the Web supported by the Europeana Data Model (EDM)
DBpedia, Wikidata)
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
principles
collaboration is the W3C Schema.org Community Group.
embedded in many different encodings (e.g. RDFa, Microdata and JSON-LD).
describe bibliographic resources.
properties to describe archives and their contents.
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Denmark, CC0 1885, Statens Museum for Kunst L.A Ring Harvest
Mapping EDM to Schema.org
Objective: a Schema.org representation of Europeana EDM, being as rich as possible and tailored to Europeana’s realities and user needs
schema:VisualArtwork, schema:Book, schema:Painting, schema:Sculpture, and schema:Product can be matched to edm:ProvidedCHO
schema:artMedium for schema:VisualArtwork.
schema:VideoObject, schema:AudioObject can be matched to edm:WebResource
match the semantics of EDM contextual classes edm:Agent, edm:Place and foaf:Organization.
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
schema:Book, schema:Painting, schema:Sculpture, schema:ImageObject) will require a mapping with dc:type.
(e.g. schema:ImageObject, schema:AudioObject, schema:VideoObject) will require a mapping between MimeTypes, file extensions, etc. to ascertain the correct type.
artwork such as sculpture, painting, drawing, etc.
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
A minimal requirement is to expand strings into an entity description
CC BY-SA CC BY-SA
different strategies to expand strings into entities
Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
1.Implicit Blank Nodes (nested
CC BY-SA CC BY-SA
URI of the Web Page
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Practicalities for publishing Schema.org at Europeana.eu
University Of Edinburgh, CC BY Roslin Glass Slides, creator unknown Photograph of two men step cutting on the ice face of the Tasman Glacier, New Zealand in the late 19th or early 20th century.
Objective: to enable external organizations in general, and Search Engines in particular, to consume the data into their Knowledge Graphs of resources on the web.
supporting human interaction, we therefore recommend to separate the interface concerns
underlying data structures.
Europeana.
does not impact on its visual output.
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
On-the-fly
mapping/conversion process. + no extra data is stored to support Schema.org; also changes to mapping rules are instantly available.
Batch creation
+ not needing processing to extract data for display.
Combined approach (on-the-fly & batch creation)
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Objective: get search engines to crawl and consume data from the pages describing Europeana resources.
some additional information that will enable the website to be crawled more effectively.
site not being fully crawled and data not being consumed.
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Europeana as a harvester
Slovakia, CC-BY 1990, Slovak National Gallery Felician Moczik Zapad Slnka
crawling ordinary web pages.
In the particular case of digital library websites, sitemaps help dealing with some typical discovery problems faced by CH institutions:
the browsable interface.
updated content.
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
Schema.org vocabulary.
enable the provision of Schema.org metadata interoperable with EDM. More details in the Code4Lib paper
CC BY-SA CC BY-SA Perspectives on using Schema.org for publishing and harvesting metadata at Europeana
05 December 2017