SKOS
COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk
SKOS COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk - - PowerPoint PPT Presentation
SKOS COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk Ontologies Metadata Resources marked-up with descriptions of their content. No good unless everyone speaks the same language ; Terminologies Provide shared and
COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk
2
– Resources marked-up with descriptions of their content. No good unless everyone speaks the same language;
– Provide shared and common vocabularies of a domain, so search engines, agents, authors and users can communicate. No good unless everyone means the same thing;
– Provide a shared and common understanding of a domain that can be communicated across people and applications, and will play a major role in supporting information exchange and discovery.
3
a community
used to describe a certain conceptualisation of objects in a domain of interest
assumptions made in interpreting those terms
classification (is-a)
description of their characteristics
4
Catalogue Terms/ glossary Thesauri Informal is-a Formal is-a Frames Value Restrictions Expressive Logics
applications
5
architecture – Separating link and document – Explicit navigation around a domain vocabulary
documents based on the
those documents.
6
7
HTML Document in Linked HTML Document out
DLS Agent Knowledge Service Resource Service
Ontology SKOS Search Engine Annotation DB
8
Link Service Linkbase Document Linked Document
9
Document Link Service Thesaurus Linkbase Linked Document
10
Link Service Document Linked Document Ontology Linkbase
11
support the conceptual models.
vocabularies were perhaps more appropriate.
Y. Yesilada, R. Stevens, S. Jupp, and B.
for Dynamic Linking IEEE Internet Computing12(3), p.32--39 2008 http://dx.doi.org/ 10.1109/MIC.2008.68
http://www.flickr.com/photos/buildscharacter/443708336/
12
thesauri
semantics)
13
retrieve objects
vocabulary
retrieve objects
14
representation framework for Knowledge Organisation Systems (KOS)…
variation found in KOS idioms…
KOS within a decentralised, distributed, information environment such as the world wide (semantic) web.
15
schemes”
controlled vocabularies – Many of these already exist and are in use in cultural heritage, library sciences, medicine etc. – A wide range of knowledge sources that can potentially provide value for Semantic Web applications
representation of such schemes. – A migration path bringing such resources “into the Semantic Web”.
16
statements about relationships between those concepts – Semantic Relationships
– Lexical Labels
– Additional documentation
17
Controlled Vocabulary Synonym Ring Authority File Taxonomy Thesaurus Collection of Terms Equivalent Terms Preferred Terms Hierarchy Related Terms Controlled vocabularies: designed for use in classifying or indexing documents and for searching them. Thesaurus: Controlled vocabulary in which concepts are represented by preferred terms, formally organised so that paradigmatic relationships between the concepts are made explicit, and the preferred terms are accompanied by lead-in entries for synonyms or quasi-synonyms.
18
19
purposes
that, e.g. OWL ontologies are formal ontologies.
hierarchies (broader/narrower)represented in SKOS.
interpretation (in terms of sets of instances).
20
Classes are instantiated – Leo is an instance of Lion – Born Free is a book about Lions
aren’t necessarily strictly true of everything – It’s useful to be able to navigate from Cell to Nucleus, even though it’s not the case that all Cells have a Nucleus – Relationships between Polio and Polio virus, Polio vaccine, Polio disease… – Relationships between Accident and Accident Prevention, Accidents in the Home, Radiation Accidents…
hierarchies. – Broader hierarchy is not transitive.
closure of the hierarchy.
21
– E.g. SKOS Concept is a Class, particular concepts are instances of that class
properties of SKOS (e.g. the querying of the transitive closure
vocabularies
22
characteristics of classes – Necessary/sufficient conditions etc. – Model theory/semantics provides interpretations of the assertions involving the properties
– Decoration of concepts/properties/individuals with information which is useful, but does not impact on the formal semantics or logical interpretations
as a Good Thing.
23
General
– Human readable
– Scope notes
– authorship
Application Specific
the model
OntoClean
24
OWL Annotation Properties – Preferred/Alternate/Hidden Labels – Documentation/Notes
OWL ontologies
– OWL API – Protégé
25
– Presenting OWL ontologies as SKOS vocabularies
– Enriching SKOS vocabularies as OWL ontologies.
– Use of SKOS as annotation vocabulary
26
express relationships between concepts in different schemes – broadMatch/narrowMatch – closeMatch – exactMatch
– Indiscriminate use of properties such as owl:sameAs can lead to undesirable consequences.
– URIs for identification – Provide useful information when dereferenced – Link to other URIs
27
SKOS LD Indexing/Retrieval Discovery Semantic Relations Navigation Mapping Linking and Integration beyond URI matching
28
– Protégé 4 plugin exploiting OWL definition of SKOS vocabulary – Reasoning support for classification
– Alternate language support
domain relationships
29
– http://www.ivoa.net/Documents/latest/Vocabularies.html
– http://www.umbel.org
– http://e-culture.multimedian.nl/ – Europeana – ONKI (Finland)
– http://id.loc.gov
30
– http://www.w3.org/TR/skos-reference/
– http://code.google.com/p/skoseditor/
– http://cohse.cs.manchester.ac.uk
Sean Bechhofer sean.bechhofer@manchester.ac.uk
6
– Associating metadata with resources
– Integrating information sources
– Reasoning over the information we have. – Could be light-weight (taxonomy) – Could be heavy-weight (logic-style)
8
related data that wasn’t previously linked.
data – The Web as database
*Linked data slides based on material from Ian Davis and Tom Heath: http://www.slideshare.net/iandavis/30-minute-guide-to-rdf-and-linked-data
9
isbn title author publisherId pages 0743267478 Q&A Vikas Swarup 1435 336 014029466X The Rotters’ Club Jonathan Coe 1546 416 … … … … … .. … … … …
10
isbn title author publisherId pages 0743267478 Q&A Vikas Swarup 1435 336 014029466X The Rotters’ Club Jonathan Coe 1546 416 … … … … … .. … … … …
11
isbn title author publisherId pages 0743267478 Q&A Vikas Swarup 1435 336 014029466X The Rotters’ Club Jonathan Coe 1546 416 … … … … … .. … … … …
12
isbn title author publisherId pages 0743267478 Q&A Vikas Swarup 1435 336 014029466X The Rotters’ Club Jonathan Coe 1546 416 … … … … … .. … … … …
13
book The Rotters’ Club title
subject value property
more generally:
14
isbn title author publisherId pages 0743267478 Q&A Vikas Swarup 1435 336 014029466X The Rotters’ Club Jonathan Coe 1546 416 … … … … … .. … … … …
15
book 014029466X isbn Jonathan Coe author The Rotters’ Club title
16
book 014029466X isbn Jonathan Coe author The Rotters’ Club title publisher Penguin Books name publisher
17
18
http://example.com/person/176 Jonathan Coe http://example.com/name
URIs as names for nodes URIs as names for relations
19
they are describing the same thing.
– Although introduces issues of URI control.
20
http://example.com/person/176 Jonathan Coe http://example.com/book/014029466X http://example.com/person/176 http://example.com/place/xyz765 Birmingham http://example.com/birthplace http://example.com/name http://example.com/name http://example.com/author
21
http://example.com/person/176 Jonathan Coe http://example.com/book/014029466X http://example.com/place/xyz765 Birmingham http://example.com/birthplace http://example.com/name http://example.com/name http://example.com/author
22
information can be retrieved.
retrieve more information about the URI.
formats (e.g. another graph)
23
can be discovered.
– Largely browsers up to now….
describes them. – Use of content negotiation to supply “appropriate” representations – Use of microformats/RDFa to publish data
24
– http://www.w3.org/RDF
– for representing metadata – for describing the semantics of information in a machine- accessible way
25
– ! <Sean,hasColleague,Uli>
Sean Uli
hasColleague
26
Sean Uli
hasColleague
Carole
http://www.cs.man.ac.uk/~sattler hasColleague hasHomePage “Sean K. Bechhofer” hasName
27
property of that Resource
<Description about="some.uri/person/sean_bechhofer"> <hasColleague resource="some.uri/person/uli_sattler"/> <hasName rdf:datatype="&xsd;string">Sean K. Bechhofer</hasName> </Description> <Description about="some.uri/person/uli_sattler"> <o:hasHomePage>http://www.cs.mam.ac.uk/~sattler</o:hasHomePage> </Description> <Description about="some.uri/person/carole_goble"> <o:hasColleague resource="some.uri/person/uli_sattler"/> </Description>
28
29
down in XML, but it doesn’t give any special meaning to vocabulary such as subClassOf or type – Interpretation is an arbitrary binary relation
to define basic vocabulary terms and the relations between those terms – Class, Property – type, subClassOf – range, domain
30
to create vocabularies: – <Person,type,Class> – <hasColleague,type,Property> – <Professor,subClassOf,Person> – <Carole,type,Professor> – <hasColleague,range,Person> – <hasColleague,domain,Person>
resources – specifies how terms should be interpreted
31
Lecturer Academic
Person
rdfs:subClassOf rdf:subClassOf rdfs:subClassOf rdf:type
rdfs:Class
rdf:type rdf:type
32
Sean Lecturer
rdf:type
rdfs:Class
Academic
rdfs:subClassOf rdf:type rdf:type rdfs:type
33
<Species,type,Class> <Lion,type,Species> <Leo,type,Lion>
vocabulary, so constructors can be applied to themselves/each other <type,range,Class> <Property,type,Class> <type,subPropertyOf,subClassOf>
model theory.
34
resources.
35
– No localised range and domain constraints
! Can’t say that the range of hasChild is Person when applied to Persons and Elephant when applied to Elephants
– No existence/cardinality constraints
! Can’t say that all instances of Person have a mother that is also a Person, or that Persons have exactly 2 parents
– No transitive, inverse or symmetrical properties
! Can’t say that isPartOf is a transitive property, that hasPart is the inverse of isPartOf or that touches is symmetrical
– No “native” reasoners for non-standard semantics – May be possible to reason via FO axiomatisation
36
– Such as XML, RDF, RDFS
– Based on familiar KR idioms
– Possible to provide automated reasoning support
– URIs for identification – Provide useful information when dereferenced – Link to other URIs
32
SKOS LD Indexing/Retrieval Discovery Semantic Relations Navigation Mapping Linking and Integration beyond URI matching
– E.g. LCSH: http://id.loc.gov/authorities/subjects/sh85052522
33
can dereference and access definitions
– Heterogeneity of Web APIs
– Applications not implemented against fixed set of data sources.
34
37
– http://data.open.ac.uk
! http://lucero-project.info/
– http://data.southampton.ac.uk
– RDFised version of wikipedia – Scraping structured information from info-boxes. – Quality?
– Programme catalogues published as Linked Data – Crosslinking to resources like dbpedia and MusicBrainz
– Geographical data – Lat/long, postal codes etc.
– SKOS
36
37
38
39
40
41
42
43
– But pushback against formats
– Hash vs slash
– Centralised vs Distributed – Rules vs Guidance – Trust in the Web
! “Darwinian Evolution”
38
– http://inkdroid.org/journal/2010/06/04/the-5-stars-of-open-linked- data/ ! Make data available ! Make it available as structured data ! Use non-proprietary formats ! Use URLs to identify things ! Link your data
– http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/
45
39
– How do we handle the fact that different URIs may be used to refer to the same things? – Use of owl:sameAs may be too strong (can result in all information, including annotations, metadata etc.) being merged.
– Version information in URLs? – Versioning at architectural level (Memento) – How does versioning play with a “follow your nose paradigm”?
– Distributed query across data sets – LD applications tend to use an “extract, transform, load” approach.
40
– Vocabularies used to facilitate integration – Little deep semantics. – “Big O vs little o”
! Role of SKOS and RDF(S)
– Lots of work on “recipes”, mangling relation sources into RDF etc. – What do you actually do with the stuff? – Applications (so far) mainly browsing/CV generation/introspective apps – End user applications? – Build it and they will come….???