Describing Linked Datasets: On the Design and Usage of voiD



slide-1
SLIDE 1

Describing Linked Datasets
On the Design and Usage of voiD, the ‘Vocabulary Of Interlinked Datasets’

Linked Data Workshop at WWW09, 2009-04-20, Madrid, Spain

Keith Alexander (Talis), Richard Cyganiak (DERI), Michael Hausenblas (DERI) and Jun Zhao (University of Oxford)


slide-2
SLIDE 2



Agenda

  • The Problem
  • Our Proposal - voiD
  • Applications
  • Next Steps

slide-3
SLIDE 3



The Problem

[Two figures, labelled 2008 and 2007; images not preserved]


slide-4
SLIDE 4



The Problem

[Two figures, labelled 2009 and 2008; images not preserved]


slide-5
SLIDE 5



The Problem

  • The Linking Open Data (LOD) cloud is currently gathering roughly the same momentum as the Web in the early 1990s
  • How did people deal with the consequences of having a decentralized system, back then?

slide-6
SLIDE 6



The Problem


slide-7
SLIDE 7



The Problem

  • From 2007 on, we have been doing it Yahoo!-catalog style: manually collecting and representing data about the Linking Open Data cloud:
      – In the LOD cloud diagram, we give a qualitative view in the form of a visual graph
      – In various ESW Wiki pages, we create HTML tables:
          • http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
          • http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStatistics


slide-8
SLIDE 8



The Problem

http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStatistics


slide-9
SLIDE 9



The Problem

  • Currently, only human-comprehensible descriptions (the LOD cloud, Wiki pages) are available
  • We can’t automate tasks, such as
      – Efficient & effective search
      – Selection of a dataset (for apps, interlinking targets)
      – Generation of maps, etc.


slide-10
SLIDE 10



The Problem

  • We can’t apply the tools and methods we have experience with, such as editors, engines, stores, etc.
  • Even worse, it doesn’t scale
      – We’d need a Google-style approach that scales like hell and is powerful enough to enable the above-mentioned tasks
      – Providing metadata about the LOD cloud in a machine-comprehensible way


slide-11
SLIDE 11



Agenda

  ✓ The Problem
  • Our Proposal - voiD
  • Applications
  • Next Steps

slide-12
SLIDE 12



Our Proposal - voiD

  • Solution: providing a formal description of
      – What a dataset is about (topic, technical details)
      – How and under which conditions to access it
      – How the dataset is interlinked with other datasets
          • Qualitative level: type of interlinking
          • Quantitative level: number of links, resources, etc.
      – How to discover the metadata
  • voiD, the “Vocabulary of Interlinked Datasets”, provides precisely this

slide-13
SLIDE 13



Our Proposal - voiD

  • A dataset is a set of RDF triples that are published, maintained or aggregated by a single provider.
  • A dataset is authoritative with respect to a certain URI namespace if it contains information about resources named by URIs in this namespace, and is published by the URI owner (URI ownership as of the AWWW1)
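
The authoritativeness condition can be read as a simple predicate with two parts. The sketch below is only an approximation under stated assumptions: namespaces are compared by string prefix, and URI ownership is reduced to a caller-supplied `owner_of` mapping, a hypothetical stand-in for whatever ownership evidence is actually available.

```python
def is_authoritative(triples, publisher, namespace, owner_of):
    """Check both conditions of the definition for one namespace.

    triples   -- iterable of (subject, predicate, object) strings
    publisher -- identifier of the dataset's publisher
    namespace -- URI prefix, e.g. "http://example.org/people/"
    owner_of  -- dict mapping a namespace to its owner (hypothetical
                 stand-in for URI ownership)
    """
    # Condition 1: the dataset describes at least one resource in the namespace.
    describes = any(s.startswith(namespace) for s, _, _ in triples)
    # Condition 2: the dataset is published by the owner of that namespace.
    published_by_owner = owner_of.get(namespace) == publisher
    return describes and published_by_owner


ns = "http://example.org/people/"
data = [(ns + "alice", "http://xmlns.com/foaf/0.1/name", "Alice")]
owners = {ns: "example.org"}

print(is_authoritative(data, "example.org", ns, owners))     # published by the owner
print(is_authoritative(data, "mirror.example", ns, owners))  # published by a third party
```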

slide-14
SLIDE 14



Our Proposal - voiD

  • A linkset $LS$ is a set of RDF triples where, for all triples $t_i = \langle s_i, p_i, o_i \rangle \in LS$, the subject is in one dataset, i.e. all $s_i$ are described in $DS_1$, and the object is in another dataset, i.e. all $o_i$ are described in $DS_2$.
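
The linkset definition translates fairly directly into a membership test. The following Python sketch assumes that a resource counts as "described in" a dataset when it appears as the subject of at least one triple there; that reading, the helper names and the example URIs are assumptions made for illustration.

```python
def subjects(dataset):
    """Resources described in a dataset, taken here to be its triple subjects."""
    return {s for s, _, _ in dataset}


def is_linkset(candidate, ds1, ds2):
    """True if every triple has its subject described in ds1 and its object in ds2.

    An empty candidate is vacuously a linkset under this definition.
    """
    described1, described2 = subjects(ds1), subjects(ds2)
    return all(s in described1 and o in described2 for s, _, o in candidate)


LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

ds1 = [("http://a.example/x", LABEL, "X")]
ds2 = [("http://b.example/y", LABEL, "Y")]
links = [("http://a.example/x", SAME_AS, "http://b.example/y")]

print(is_linkset(links, ds1, ds2))  # subjects in ds1, objects in ds2
print(is_linkset(links, ds2, ds1))  # the same triples with the roles swapped
```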



slide-15
SLIDE 15



Our Proposal - voiD


slide-16
SLIDE 16



Our Proposal - voiD

voiD offers two orthogonal interlinking types:

  • classic LOD vs. 3rd-party, differing in where the interlinking statements are kept. In the first case the interlinking triples, i.e. a linkset, are hosted in one of the two involved datasets, while in the latter case there is a third dataset involved that contains the interlinking triples, i.e. the linkset;
  • non-directed vs. directed, which addresses whether someone is interested in stating the direction of the interlinking or not (for example with owl:sameAs)

[Figure panels: classic LOD, non-directed; 3rd-party, non-directed; classic LOD, directed; 3rd-party, directed]
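
Because the two distinctions are orthogonal, each can be decided independently and the results combined into one of the four cases. The Python sketch below uses a hypothetical `Linkset` record whose field names are illustrative and are not voiD terms: `hosted_in` is the dataset that holds the link triples, `targets` is the pair of linked datasets, and the two optional fields record a stated direction.

```python
from collections import namedtuple

# Hypothetical record; the field names are illustrative, not voiD terms.
Linkset = namedtuple("Linkset", "hosted_in targets subjects_target objects_target")


def hosting(linkset):
    """'classic LOD' if the link triples live in one of the linked datasets."""
    return "classic LOD" if linkset.hosted_in in linkset.targets else "3rd-party"


def direction(linkset):
    """'directed' only if the publisher stated both ends of the links."""
    stated = linkset.subjects_target and linkset.objects_target
    return "directed" if stated else "non-directed"


def classify(linkset):
    """Combine the two independent axes into one label."""
    return "%s, %s" % (hosting(linkset), direction(linkset))


# Links between A and B kept inside A, no direction stated.
print(classify(Linkset("A", {"A", "B"}, None, None)))
# Links from A to B kept in a separate dataset C, direction stated.
print(classify(Linkset("C", {"A", "B"}, "A", "B")))
```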


slide-17
SLIDE 17



Our Proposal - voiD

[Figure: classic LOD, non-directed]


slide-18
SLIDE 18



Our Proposal - voiD

[Figure: classic LOD, directed]


slide-19
SLIDE 19



Our Proposal - voiD

[Figure: 3rd-party, non-directed]


slide-20
SLIDE 20



Our Proposal - voiD

[Figure: 3rd-party, directed]


slide-21
SLIDE 21



Our Proposal - voiD

  • Reusing terms from other vocabularies
      – foaf:homepage (an inverse functional property, IFP)
      – dcterms:subject along with DBpedia URIs http://dbpedia.org/resource/XXX
      – SCOVO for statistics about triples, links, etc.


slide-22
SLIDE 22



Our Proposal - voiD

  • Publication & discovery via sitemaps and/or backlinks (dcterms:isPartOf)
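
Discovery via backlinks can be pictured as following dcterms:isPartOf from a resource to the dataset it belongs to. The Python sketch below works on triples that are assumed to have been fetched already; the `datasets_of` helper and the example.org URIs are hypothetical, and whether the backlink is attached to the resource or to the document describing it is left open here.

```python
DCT_IS_PART_OF = "http://purl.org/dc/terms/isPartOf"


def datasets_of(resource, triples):
    """Follow dcterms:isPartOf backlinks from a resource to its dataset(s)."""
    return sorted(o for s, p, o in triples
                  if s == resource and p == DCT_IS_PART_OF)


# Hypothetical triples, as they might be found when dereferencing a resource.
found = [
    ("http://example.org/people/alice", DCT_IS_PART_OF,
     "http://example.org/void#myDataset"),
    ("http://example.org/people/alice", "http://xmlns.com/foaf/0.1/name", "Alice"),
]
print(datasets_of("http://example.org/people/alice", found))
```

A client that lands on any resource in the dataset can then look up the dataset URI to reach the voiD description.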


slide-23
SLIDE 23



Our Proposal - voiD

  • Once dataset providers have published their voiD description in RDF along with their dataset, one can address the following issues:
      – How to find some datasets?
      – How to efficiently find a specific dataset?
      – How to effectively find datasets?
      – How to dynamically select datasets?
      – How to select datasets based on certain preferences?
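
Once descriptions are available in machine-readable form, selecting datasets by preference becomes an ordinary filtering step. The Python sketch below assumes the voiD descriptions have already been parsed into a dictionary; that dictionary layout, the `select_datasets` helper and the example URIs are hypothetical.

```python
def select_datasets(descriptions, topic=None, needs_sparql=False):
    """Filter dataset descriptions by topic and access preferences.

    descriptions -- dict mapping a dataset URI to a dict with the keys
                    "subjects" (set of topic URIs) and "sparql_endpoint"
                    (URL or None); this layout is an assumption.
    """
    selected = []
    for uri, props in descriptions.items():
        if topic is not None and topic not in props.get("subjects", ()):
            continue  # dataset is not about the requested topic
        if needs_sparql and not props.get("sparql_endpoint"):
            continue  # dataset offers no SPARQL access
        selected.append(uri)
    return sorted(selected)


descriptions = {
    "http://example.org/void#people": {
        "subjects": {"http://dbpedia.org/resource/Person"},
        "sparql_endpoint": "http://example.org/sparql",
    },
    "http://example.org/void#places": {
        "subjects": {"http://dbpedia.org/resource/Place"},
        "sparql_endpoint": None,
    },
}

print(select_datasets(descriptions, topic="http://dbpedia.org/resource/Place"))
print(select_datasets(descriptions, needs_sparql=True))
```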


slide-24
SLIDE 24



Agenda

  ✓ The Problem
  ✓ Our Proposal - voiD
  • Applications
  • Next Steps

slide-25
SLIDE 25



Applications

  • Generation (ve, lipSSM, NX parser)
  • Vocabulary Management (Talis)
  • Explorer (RKB, LDE)
  • Query Federation (Clark & Parsia, OpenLink)
  • Dataset ranking (DING! talk)
  • Potential Applications
      – Map of data (Sindice)
      – Dynamic Meshups for Application

slide-26
SLIDE 26



Applications

http://ld2sd.deri.org/ve



slide-27
SLIDE 27



Applications

http://ld2sd.deri.org/lde



slide-28
SLIDE 28



Applications

http://dblp.rkbexplorer.com/models/void.ttl



slide-29
SLIDE 29



Applications

http://linkeddata.uriburner.com/



slide-30
SLIDE 30



Agenda

  ✓ The Problem
  ✓ Our Proposal - voiD
  ✓ Applications
  • Next Steps

slide-31
SLIDE 31



Next Steps

  • voiD 2.0: see issues at http://code.google.com/p/void-impl/issues/list
  • statistics module (fix/extend re SCOVO)
  • SPARQL endpoints
  • provenance, trust (?)
  • Assist people in publishing voiD