Publishing linked data on data.bibliotheken.nl Ren Voorburg - - PowerPoint PPT Presentation

publishing linked data on data bibliotheken nl
SMART_READER_LITE
LIVE PREVIEW

Publishing linked data on data.bibliotheken.nl Ren Voorburg - - PowerPoint PPT Presentation

Publishing linked data on data.bibliotheken.nl Ren Voorburg rene.voorburg@kb.nl By Koninklijke Bibliotheek CC BY-SA 2.0, h8ps:// image CC BY-SA Koninklijke Bibliotheek KB metadata infrastructure > 4.5 mln catalogue records ~ 1.5 mln


slide-1
SLIDE 1

By Koninklijke Bibliotheek CC BY-SA 2.0, h8ps:// Publishing linked data on data.bibliotheken.nl

René Voorburg
 rene.voorburg@kb.nl

image CC BY-SA Koninklijke Bibliotheek

slide-2
SLIDE 2

KB metadata infrastructure

> 4.5 mln catalogue records
 ~ 1.5 mln digiPzed newspapers
 ~ 135 mln newspaper arPcles
 ~ 300.000 digiPzed periodicals
 ~ 900.000 digiPzed books ‘Organically grown’ metadata Provided through OAI-PMH, SRU (SOLR), h8p-proxy (resolver).

slide-3
SLIDE 3

Why start publishing linked data?

“To be in the web, instead of just on the web” FacilitaPng others to use, reuse and link to KB metadata (and digital objects).

image CC BY-SA Koninklijke Bibliotheek

slide-4
SLIDE 4

Core semanPc & design principles

  • Start with bibliographic metadata, the foundaPon of all
  • IFLA LRM ontology (Work - Expression - ManifestaPon - Item) defines the horizon
  • Map to schema.org as much as possible / when sensible
  • De-conceptualize (/de-SKOS) whenever beneficial
  • URI policy: disPnguish between the thing and its descripPon (‘303 see other’)

image Public Domain

slide-5
SLIDE 5

Technical foundaPons

  • LodView RDF viewer
  • Virtuoso Open Source triple store


backend for LodView
 provides SPARQL interface

  • Apache webserver


for configuring redirects, content negoPaPon, etc

image CC BY-SA Vera de Kok

slide-6
SLIDE 6

Some of the approaches followed in creaPng the RDF

schema-RDF Pica+ ‘Pica+’-RDF SPARQL Pica+ skos-RDF XSLT schema-RDF Pica+ relaPonal db stored procedure

Thesauri: Bibliographic records:

XSLT XSLT

Improved approach:

image CC BY-SA Koninklijke Bibliotheek

relaPonal db schema-RDF PHP

Set ‘DBNL’:

schema-RDF SPARQL XSLT

slide-7
SLIDE 7
slide-8
SLIDE 8
slide-9
SLIDE 9
slide-10
SLIDE 10

Some things we ‘ve learned and want to improve

CreaPng and updaPng RDF is a lot of work


  • More automaPon is wanted

  • Ideally, core metadata systems should be linked data compaPble

Ideally, all KB metadata should adhere to one generic enPty relaPonship model


  • to answer the core quesPon “exactly what are we describing here”

For different uses, ‘views’ with different levels of detail are wanted

image Public Domain

slide-11
SLIDE 11

Intel._EnPty Item ManifestaPon Expression Work RepresentaPon Bitstream File

  • wl:sameAs

Towards a unified KB metadata model

IFLA - LRM/RDA PREMIS

image CC BY-SA Vera de Kok

slide-12
SLIDE 12

LRM/RDA enPty clustering

Experimental LRM /RDA clustering at LRM:Work and LRM:Expression level using the ‘RDA enPty finder’. Part of ongoing work to map Pica+ to RDA.

slide-13
SLIDE 13

Data on the move, a case for mulPple representaPons

a schema:Book

schema:author

a schema:Person a schema:Book a schema:Role a schema:Person

schema:author schema:author schema:name “some pseudonym” schema:name “Real Name” schema:name “Real Name”

image Public Domain

a lrm:Manifest. a 
 lrm:Nomen a lrm:Person

‘related_nomen’ rda: rda:’nomenstring’ “some pseudonym” rda: ‘alt_idenPty_of’

slide-14
SLIDE 14

Modelling mulPple representaPons

a schema:Book

described by

a foaf:Doc. a foaf:Doc. a foaf:Doc.

/id/123 /doc/123 /schemaplus/123 /rda/123

Default representaPon of
 /id/123 RepresentaPon of /id/123, more detailed schema.org RepresentaPon of /id/123, source RDA metadata.

described by described by

image CC BY-SA René Voorburg

slide-15
SLIDE 15

By Koninklijke Bibliotheek CC BY-SA 2.0, h8ps://

image CC BY-SA Koninklijke Bibliotheek

To conclude

  • The linked data effort that started off with an outward focus (‘the user’),


has now turned inwards (‘how to improve our metadata and metadata infrastructure`).

  • Linked data has worked as a catalyst for cooperaPon between the

tradiPonal metadata department (the catalogue) and the more IT-related metadata-services departments.

  • With regards to tools, standards, vocabularies & ontologies, there is sPll

much work to be done to reach maturity.