Provenance Information in the Web of Data



SLIDE 1

Provenance Information in the Web of Data

Olaf Hartig

Humboldt-Universität zu Berlin

http://olafhartig.de/foaf.rdf#olaf

SLIDE 2

Olaf Hartig - Provenance Information in the Web of Data 2

  • Provenance of a data item: information about the history
SLIDE 3

  • Provenance of a data item: information about the history
SLIDE 4

  • Provenance of a data item: information about the history
SLIDE 5

Outline

  • Towards a model of Web data provenance
  • Provenance information in the Web of data today
  • Upcoming tasks

SLIDE 6

Existing Provenance Research

  • Main research areas: (scientific) workflows, DBMSs
  • General focus: data creation

SLIDE 7

SLIDE 8

SLIDE 9

SLIDE 10

SLIDE 11

Web data provenance comprises two dimensions:

  • Data Creation
  • Data Access

SLIDE 12

Basics of the Provenance Model

  • Provenance graph describes provenance of a data item
  • Nodes: provenance elements – pieces of provenance info
  • Edges: relate provenance elements to each other
  • Subgraphs for related data items possible
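
The graph described in the bullets above can be sketched as a small data structure. This is only an illustration, not the model from the talk: the class names, the edge labels ("created by", "performed by", "used") and the `subgraph_of` helper are all assumptions introduced here.

```python
from collections import defaultdict, deque
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Element:
    """A provenance element, i.e. one piece of provenance information."""
    name: str


@dataclass
class ProvenanceGraph:
    """Directed graph whose nodes are provenance elements."""
    edges: dict = field(default_factory=lambda: defaultdict(list))

    def relate(self, source: Element, label: str, target: Element) -> None:
        """Add a labelled edge relating two provenance elements."""
        self.edges[source].append((label, target))

    def subgraph_of(self, item: Element) -> set:
        """Return every element reachable from `item` (breadth-first)."""
        seen, queue = {item}, deque([item])
        while queue:
            current = queue.popleft()
            for _label, target in self.edges.get(current, []):
                if target not in seen:
                    seen.add(target)
                    queue.append(target)
        return seen


# Hypothetical example: a data item derived from one source data item.
item = Element("data item")
creation = Element("data creation")
creator = Element("data creator")
source = Element("source data item")

graph = ProvenanceGraph()
graph.relate(item, "created by", creation)
graph.relate(creation, "performed by", creator)
graph.relate(creation, "used", source)

print(sorted(e.name for e in graph.subgraph_of(item)))
```

Calling `graph.subgraph_of(source)` would return the subgraph for the related source data item, which is where its own provenance could be attached.
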
SLIDE 13

Basics of the Provenance Model

  • Provenance model defines:
  • Types of provenance elements
  • Relationships
SLIDE 14

Basics of the Provenance Model

  • Provenance model defines:
  • Types of provenance elements
  • Relationships
  • High level of abstraction (only main element types)
SLIDE 15

Basics of the Provenance Model

  • General differentiation: Actors, Executions, Artifacts

SLIDE 16

Data Access Dimension

Diagram labels:

  • Elements: Data Item; Information Resource; Data Access; Access Time; Data Accessor (Non-Human); Data Providing Service (Non-Human); Data Publisher (Human); Service Provider; Relation to the provided Information Resource
  • Edge labels: contains; uses; controls

SLIDE 17

Data Access Dimension cont.

Diagram labels:

  • Elements: Public Key; (Signed) Artifact; Integrity Assurance; Relation to the signed Data; Signer; Verification Result; Digital Signature
  • Edge labels: owns; signs

SLIDE 18

Data Creation Dimension

Diagram labels:

  • Elements: Data Creator (Human or Non-human); Data Creating Service (e.g. Software Agent); Data Creating Entity (e.g. Person, Group, Orga.); Data Creating Device (e.g. Sensor); Data Creation; Creation Time; Creation Guidelines; Relation to the created Data; Source Data; Data Item; (Encompassing) Data Item; Provenance Information
  • Edge labels: responsible for; part of
  • Specialization constraint: {complete,disjoint}

SLIDE 19

Provenance information in the Web of data today

SLIDE 20

Provenance-related Vocabularies

  • DC – Dublin Core Metadata Terms
  • FOAF – Friend of a Friend
  • SIOC – Semantically-Interlinked Online Communities
  • SWP – Semantic Web Publishing vocabulary
  • WOT – Web of Trust schema
  • OMV – Ontology Metadata Vocabulary
  • PML – Proof Markup Language
  • Changeset vocabulary
  • Ouzo Provenance Ontology
SLIDE 21

Main Issues Today

  • Vocabularies:
  • Partly unsuitable
  • Lack of certain features
  • Coverage of provenance model impossible
SLIDE 22

Provenance-related Vocabularies

DC – Dublin Core Metadata Terms

Property          Occurrences*
dc:creator        about 24,284
dc:contributor    476
dc:source         about 3,631
dc:created        about 82,720
dc:modified       about 12,020
dc:provenance     7

*Measured by querying Sindice; Feb. 7, 2009 (by that time Sindice indexed about 48.99 million documents)
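
To put these counts in proportion, the following sketch relates each one to the size of the index, taking it as roughly 48.99 million documents. The slide does not say what exactly counts as one occurrence, so the "per 1,000 documents" ratio is only a rough indicator; the function and variable names are introduced here for illustration.

```python
# Figures copied from the slide (Sindice, Feb. 7, 2009); several are approximate.
OCCURRENCES = {
    "dc:creator": 24_284,
    "dc:contributor": 476,
    "dc:source": 3_631,
    "dc:created": 82_720,
    "dc:modified": 12_020,
    "dc:provenance": 7,
}
INDEXED_DOCUMENTS = 48_990_000  # "about 48.99 million documents"


def per_thousand_documents(count, documents=INDEXED_DOCUMENTS):
    """Occurrences of a property per 1,000 indexed documents."""
    return 1000 * count / documents


# Print the properties from most to least frequent.
for prop, count in sorted(OCCURRENCES.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{prop:<15} {count:>7,}  {per_thousand_documents(count):.4f}")
```

Even the most frequent property, dc:created, comes out at fewer than two occurrences per 1,000 indexed documents.
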

SLIDE 23

Main Issues Today

  • Vocabularies:
  • Partly unsuitable
  • Lack of certain features
  • Coverage of provenance model impossible
  • General lack of provenance-related metadata in the Web of data
SLIDE 24

Possible Reasons

  • Lack of suitable vocabularies
  • Lack of usable tools
  • Ignorance / lack of sensitization
SLIDE 25

Upcoming tasks

SLIDE 26

Address the Issues

  • Let's develop a vocabulary for Web data provenance
  • Proposal: refine the presented provenance model
  • Integrate existing vocabularies for specific types of provenance elements

SLIDE 27

Address the Issues

  • Let's develop a vocabulary for Web data provenance
  • Proposal: refine the presented provenance model
  • Integrate existing vocabularies for specific types of provenance elements
  • Let's develop usable tools for data providers
  • Edit and publish provenance-related metadata
  • Automatic generation if possible
SLIDE 28

Address the Issues

  • Let's develop a vocabulary for Web data provenance
  • Proposal: refine the presented provenance model
  • Integrate existing vocabularies for specific types of provenance elements
  • Let's develop usable tools for data providers
  • Edit and publish provenance-related metadata
  • Automatic generation if possible
  • Let's raise awareness of data providers
  • Probably the hardest task
  • Maybe voiD can help
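
As a rough idea of what automatic generation of provenance-related metadata could look like, the sketch below emits N-Triples for a dataset. The proposed provenance vocabulary is not defined in these slides, so Dublin Core terms (creator, created, source) stand in for it; the function name and all example.org URIs are hypothetical.

```python
from datetime import datetime, timezone

DCTERMS = "http://purl.org/dc/terms/"
XSD_DATETIME = "http://www.w3.org/2001/XMLSchema#dateTime"


def provenance_triples(dataset_uri, creator_uri, created, source_uris=()):
    """Return N-Triples lines stating who created a dataset, when, and from what."""
    lines = [
        f"<{dataset_uri}> <{DCTERMS}creator> <{creator_uri}> .",
        f'<{dataset_uri}> <{DCTERMS}created> "{created.isoformat()}"^^<{XSD_DATETIME}> .',
    ]
    for source in source_uris:
        lines.append(f"<{dataset_uri}> <{DCTERMS}source> <{source}> .")
    return lines


# Hypothetical dataset, creator and source, for illustration only.
triples = provenance_triples(
    "http://example.org/dataset",
    "http://example.org/people#alice",
    datetime(2009, 2, 7, 12, 0, tzinfo=timezone.utc),
    source_uris=["http://example.org/source"],
)
print("\n".join(triples))
```

A publishing tool could call such a function whenever a dataset is exported, so the data provider does not have to write the metadata by hand.
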
SLIDE 29

Olaf Hartig

Humboldt-Universität zu Berlin

http://olafhartig.de/foaf.rdf#olaf

Thank you!
SLIDE 30

These slides have been created by Olaf Hartig
http://olafhartig.de

This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/)

Attribution:

  • http://www.flickr.com/photos/adrenalin/3032734/
  • http://www.hasslefreeclipart.com
  • http://www.flickr.com/photos/dullhunk/428079229/
  • http://www.flickr.com/photos/darwinbell/1337963794/
  • http://www.flickr.com/photos/alandd/2780700767/
  • http://www.flickr.com/photos/simeon_barkas/2872099696/
  • http://www.flickr.com/photos/robinh00d/122544491/
  • http://www.flickr.com/photos/adrenalin/3032747/