enabling completeness aware querying in sparql
play

Enabling Completeness-aware Querying in SPARQL Luis Galrraga, Katja - PowerPoint PPT Presentation

Enabling Completeness-aware Querying in SPARQL Luis Galrraga, Katja Hose, Simon Razniewski May 14 th , 2017 WebDB, Chicago 1 Outline Completeness in RDF knowledge bases Completeness oracles Our vision Representations for


  1. Enabling Completeness-aware Querying in SPARQL Luis Galárraga, Katja Hose, Simon Razniewski May 14 th , 2017 WebDB, Chicago 1

  2. Outline ● Completeness in RDF knowledge bases ● Completeness oracles ● Our vision – Representations for completeness oracles – Reasoning with completeness oracles – Enabling completeness in SPARQL ● Summary & conclusions 2

  3. Outline ● Completeness in RDF knowledge bases ● Completeness oracles ● Our vision – Representations for completeness oracles – Reasoning with completeness oracles – Enabling completeness in SPARQL ● Summary & conclusions 3

  4. Outline ● Completeness in RDF knowledge bases ● Completeness oracles ● Our vision – Representations for completeness oracles – Reasoning with completeness oracles – Enabling completeness in SPARQL ● Summary & conclusions 4

  5. RDF Knowledge Bases (KBs) Collection of structured knowledge offjcialLanguage Français family offjcialLanguage Italiano Romance family Switzerland citizenOf Leonhard Euler 5

  6. Plenty of KBs out there! 6

  7. Plenty of KBs out there! 7

  8. KBs in action 8

  9. Outline ● Completeness in RDF knowledge bases ● Completeness oracles ● Our vision – Representations for completeness oracles – Reasoning with completeness oracles – Enabling completeness in SPARQL ● Summary & conclusions 9

  10. Completeness in RDF KBs ● KBs are highly incomplete – 1% of people have a citizenship in YAGO 10

  11. Completeness in RDF KBs ● KBs are highly incomplete – 1% of people have a citizenship in YAGO ● We do not know where the incompleteness lies 11

  12. Completeness in RDF KBs ● KBs are highly incomplete – 1% of people have a citizenship in YAGO ● We do not know where the incompleteness lies – A single person in the KB could be actually single or the KB may be incomplete 12

  13. Completeness in RDF KBs ● KBs are highly incomplete – 1% of people have a citizenship in YAGO ● We do not know where the incompleteness lies – A single person in the KB could be actually single or the KB may be incomplete ● Problems for data producers and consumers 13

  14. Completeness in RDF KBs ● KBs are highly incomplete – 1% of people have a citizenship in YAGO ● We do not know where the incompleteness lies – A single person in the KB could be actually single or the KB may be incomplete ● Problems for data producers and consumers – Consumers: no completeness guarantees for queries. – Producers: which parts of the KB need to be populated? 14

  15. Completeness ● Defjned with respect to a query q via a complete hypothetical KB K*. 15

  16. Completeness ● Defjned with respect to a query q via a complete hypothetical KB K*. – A query q is complete in K, if q(K*) ⊆ q(K). 16

  17. Completeness ● Defjned with respect to a query q via a complete hypothetical KB K*. – A query q is complete in K, if q(K*) ⊆ q(K). SELECT ?x WHERE { Switzerland offjcialLang ?x } offjcialLanguage Français offjcialLanguage Italiano Switzerland 17

  18. Completeness ● Defjned with respect to a query q via a complete hypothetical KB K*. – A query q is complete in K, if q(K*) ⊆ q(K). SELECT ?x WHERE { Switzerland offjcialLang ?x } offjcialLanguage Are these all the offjcial Français languages of offjcialLanguage Switzerland? Italiano Switzerland 18

  19. Completeness ● Defjned with respect to a query q via a complete hypothetical KB K*. – A query q is complete in K, if q(K*) ⊆ q(K). SELECT ?x WHERE { Switzerland offjcialLang ?x } offjcialLanguage Are these all the offjcial Français languages of offjcialLanguage Switzerland? Italiano Switzerland [Incomplete query] 19

  20. Completeness in RDF data ● Wikidata provides no value annotations 20

  21. Completeness in RDF data ● Wikidata provides no value annotations SELECT ?x WHERE { USA offjcialLang ?x } offjcialLanguage 21

  22. Completeness in RDF data ● Wikidata provides no value annotations SELECT ?x WHERE { USA offjcialLang ?x } offjcialLanguage [Complete query] 22

  23. Completeness in RDF data ● Wikidata provides no value annotations SELECT ?x WHERE { USA offjcialLang ?x } offjcialLanguage [Complete query] ● Not applicable if we know some offjcial language 23

  24. Completeness in RDF data ● Wikidata provides no value annotations SELECT ?x WHERE { USA offjcialLang ?x } offjcialLanguage [Complete query] ● Not applicable if we know some offjcial language offjcialLanguage Français offjcialLanguage Italiano Switzerland 24

  25. Outline ● Completeness in RDF knowledge bases ● Completeness oracles ● Our vision – Representations for completeness oracles – Reasoning with completeness oracles – Enabling completeness in SPARQL ● Summary & conclusions 25

  26. Completeness oracle ● Boolean function ɷ(q, K) that guesses the completeness of a query q in a KB K . 26

  27. SR completeness oracle ● Function ɷ that guesses the completeness of queries of the form [Galárraga et. al, 2017]: SELECT ?x WHERE { subject relation ?x } 27

  28. SR completeness oracle ● Function ɷ that guesses the completeness of queries of the form [Galárraga et. al, 2017]: SELECT ?x WHERE { subject relation ?x } ● We use the notation ɷ(subject, relation) 28

  29. SR completeness oracle ● Function ɷ that guesses the completeness of queries of the form [Galárraga et. al, 2017]: SELECT ?x WHERE { subject relation ?x } ● We use the notation ɷ(subject, relation) ● ɷ = pca(s, r) = partial completeness assumption 29

  30. SR completeness oracle ● Function ɷ that guesses the completeness of queries of the form [Galárraga et. al, 2017]: SELECT ?x WHERE { subject relation ?x } ● We use the notation ɷ(subject, relation) ● ɷ = pca(s, r) = partial completeness assumption – Query is complete in KB if at least one answer is known 30

  31. Evaluating SR oracles ɷ = pca(s, r) = partial completeness assumption Gold standard: Complete instances in the domain of o ffjcialLanguage Italiano Français Dansk Français Français Italiano 31

  32. Evaluating SR oracles ɷ = pca(s, r) = partial completeness assumption Gold standard: Complete instances in the domain of o ffjcialLanguage Italiano PCA oracle Français Dansk Français Français Italiano 32

  33. Evaluating SR oracles ɷ = american-country-oracle(s, r) Gold standard: Complete instances in the domain of o ffjcialLanguage Italiano PCA oracle Français Dansk Français Français Italiano American country 33 oracle

  34. Evaluating SR oracles PCA oracle American country oracle Precision = 3/5 Precision = 1/2 Recall = 3/4 Recall = 1/4 Gold standard: Complete instances in the domain of o ffjcialLanguage Italiano PCA oracle Français Dansk Français Français Italiano American country 34 oracle

  35. SR completeness oracles ● Closed World Assumption: cwa(s, r) = true ● PCA: pca(s, r) = o : r(s, o) ∃ ● Cardinality: card(s, r) = #(o : r(s, o)) ≥ k ● Popular entities: popularity pop (s, r) = pop(s) ● No-chg over time: nochange chg (s, r) = ∼ chg(s, r) ● Star : star r 1 ,..,r n (s, r) = i {1,..,n} : o : r ∀ ∊ ∃ i (s, o) ● Class: class c (s, r) = type(s, c) ● Rule mining oracle 35

  36. Rule mining SR oracle ● Based on completeness rules ⇒ notype(x, Adult), type(x, Person) complete(x, hasChild) ⇒ dateOfDeath(x, y), lessThan 1 (x, placeOfDeath) incomplete(x, placeOfDeath) 36

  37. Rule mining SR oracle ● Based on completeness rules ⇒ notype(x, Adult), type(x, Person) complete(x, hasChild) ⇒ dateOfDeath(x, y), lessThan 1 (x, placeOfDeath) incomplete(x, placeOfDeath) ● Learned using the AMIE [Galárraga et. al, 2013] rule mining system – On gold standard built via crowdsourcing 37

  38. Rule mining SR oracle ● Based on completeness rules ⇒ notype(x, Adult), type(x, Person) complete(x, hasChild) ⇒ dateOfDeath(x, y), lessThan 1 (x, placeOfDeath) incomplete(x, placeOfDeath) ● Learned using the AMIE [Galárraga et. al, 2013] rule mining system – On gold standard built via crowdsourcing – 100% F1-measure for functional relations, quite good for relations hasChild , graduatedFrom 38

  39. Outline ● Completeness in RDF knowledge bases ● Completeness oracles ● Our vision – Representations for completeness oracles – Reasoning with completeness oracles – Enabling completeness in SPARQL ● Summary & conclusions 40

  40. Representing completeness oracles ● Extensional approach [Darari, et. Al, 2013] – An oracle is a collection of completeness statements about queries 41

  41. Representing completeness oracles ● Extensional approach [Darari, et. Al, 2013] – An oracle is a collection of completeness statements about queries SELECT DISTINCT ?y WHERE { ?x hasOffjcialLang ?y } is complete in the KB 42

  42. Representing completeness oracles ● Extensional approach [Darari, et. Al, 2013] – An oracle is a collection of completeness statements about queries SELECT DISTINCT ?y WHERE { ?x hasOffjcialLang ?y } is complete in the KB Variable a ?x subject hasPattern statement pattern p hasOffjcialLang r e d i c a t e object distinct h a s P r o j e c t i o n V a r i a b l e a ?y true 43

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend