Semantic Wikipedia [[enhances::Wikipedia]] Wikipedia today A free - - PowerPoint PPT Presentation

semantic wikipedia
SMART_READER_LITE
LIVE PREVIEW

Semantic Wikipedia [[enhances::Wikipedia]] Wikipedia today A free - - PowerPoint PPT Presentation

Max Vlkel, Markus Krtzsch, Denny Vrandecic, Heiko Haller, Rudi Studer AIFB and FZI Karlsruhe, Germany @WWW2006, 26.05.2006 Semantic Wikipedia [[enhances::Wikipedia]] Wikipedia today A free online encyclopdia 16th most accessed


slide-1
SLIDE 1

Semantic Wikipedia

Max Völkel, Markus Krötzsch, Denny Vrandecic, Heiko Haller, Rudi Studer AIFB and FZI Karlsruhe, Germany @WWW2006, 26.05.2006

[[enhances::Wikipedia]]

slide-2
SLIDE 2

2

Wikipedia today

A free online encyclopædia 16th most accessed web site on earth

According to Alexa.com

> 4 mio articles

  • ver 30.000 active contributors

contributed 5 times or more per month in Nov 2005

slide-3
SLIDE 3

3

Wikipedia today

slide-4
SLIDE 4

4

Wikipedia soon: An article about the RuleML conference in 2006

slide-5
SLIDE 5

5

Wikipedia today: Consume

slide-6
SLIDE 6

6

Wikipedia today: Consume and Contribute

Everybody can edit (almost) every page

slide-7
SLIDE 7

7

Using Wikipedia Where can I publish my paper

  • n Semantic Web query

languages?

slide-8
SLIDE 8

8

Using Wikipedia Where can I publish my paper

  • n Semantic Web query

languages?

slide-9
SLIDE 9

9

Using Wikipedia Where can I publish my paper

  • n Semantic Web query

languages?

slide-10
SLIDE 10

10

Using Wikipedia Where can I publish my paper

  • n Semantic Web query

languages?

Category Read page Read page Read page Read page Read page Read page Read page Read page Read page Read page Read page Read page Read page

Yellow = index pages Green = cont ent pages

slide-11
SLIDE 11

11

Using Wikipedia Where can I publish my

paper on Semantic Web query languages?

slide-12
SLIDE 12

12

Using Wikipedia Where can I publish my

paper on Semantic Web query languages?

slide-13
SLIDE 13

13

Using Wikipedia Where can I publish my

paper on Semantic Web query languages?

slide-14
SLIDE 14

14

Using Wikipedia Where can I publish my paper

  • n Semantic Web query

languages?

New Conference Update list Update list Update list Update list Update list Update list Update list Update list Create page Update list Update list Update list Update list

slide-15
SLIDE 15

15

Wikipedia is not perfect

Using Wikipedia means reading articles

Manual indexes are no real solution (List of coffee companies,

European cities, List of asteroids named after people, …)

Inconsistencies between different language versions

Find inconsistencies between different language versions, e.g.

Population of Edinburgh (as of 17.05.2006)

En:

448,624, no date

De:

435.790 in 2005

Fr:

448 624 in 2001

Dk:

453.670 in 2004

Problem: No access to data in articles

slide-16
SLIDE 16

16

Can the Semantic Web help?

slide-17
SLIDE 17

17

Analysis

Wikipedia has many users and data,

but: many manual processes, only text-based search

Semantic Web has tools for information processing,

sophisticated queries, but: few data

Wikipedia Semantic Web

Semantic data Queries, Automation Users Tools

?

slide-18
SLIDE 18

18

Goal: Marrying Wikipedia and the Semantic Web

Wikipedia has many users and data,

but: many manual processes, only text-based search

Semantic Web has tools for information processing,

sophisticated queries, but: few data

Wikipedia Semantic Web

Semantic data Queries, Automation Users Tools

slide-19
SLIDE 19

19

Requirements for

Wikipedia:

Must be very easy to use Must have immediate benefit for the users Lists, Inconsistencies, Better Search Must be efficiently implemented Currently 12,000 hits/second

Semantic Web:

Must have export of semantic data Nice to have SPARQL access Must integrate with existing vocabularies

slide-20
SLIDE 20

20

We go to the article on the RuleML2006 conference …

slide-21
SLIDE 21

21

… and edit it

slide-22
SLIDE 22

22

Editing RuleML2006 (non semantic version)

RuleML2006 is the Second International Conference on Rules and Rule Markup Languages for the Semantic Web. It is held from November 9 2006 to November 10 2006 in [[Athens, Georgia]], [[USA]]. For more information, see http://2006.ruleml.org/.

There is already an

  • rdinariy link to the article
  • f „Athens, Georgia“
slide-23
SLIDE 23

23

Editing RuleML2006 (semantic version)

RuleML2006 is the Second International Conference on Rules and Rule Markup Languages for the Semantic Web. It is held from November 9 2006 to November 10 2006 in [[located in::Athens, Georgia]], [[USA]]. For more information, see http://2006.ruleml.org/.

Just say what the relation between this page (RuleML2006) and „Athens, Georgia“ is.

slide-24
SLIDE 24

24

From links …

… in [[Athens, Georgia]], [[USA]]. … … in [[located in::Athens, Georgia]], [[USA]]. …

… to typed links

slide-25
SLIDE 25

25

From values …

… It is held from November 9 2006 to November 10 2006 in… … It is held from [[start date:=November 9 2006]] to [[end date:=November 10 2006]] in…

… to attributes

slide-26
SLIDE 26

26

Save.

slide-27
SLIDE 27

27

I t looks exactly the same as before

slide-28
SLIDE 28

28

What the humans see, when they scroll down

slide-29
SLIDE 29

29

What the humans see, when they scroll down

slide-30
SLIDE 30

30

What the machines see

http://wiki.ontoworld.org/index.php/Special:ExportRDF/RuleML2006

slide-31
SLIDE 31

31

I nformation resources vs. abstract concepts (http range-14) RuleML2006 RDF document for RuleML2006

HTML Link/header

slide-32
SLIDE 32

32

I nformation resources vs. abstract concepts (http range-14) RuleML2006 RuleML2006 RDF document for RuleML2006

HTML Link/header smw:hasArticle rdfs:isDefinedBy http redirect http get

slide-33
SLIDE 33

33

I nformation resources vs. abstract concepts (http range-14) RuleML2006 RuleML2006 RDF document for RuleML2006

http://wiki.ontoworld.org/wiki/Special:ExportRDF/RuleML2006 http://wiki.ontoworld.org/wiki/RuleML2006 http://wiki.ontoworld.org/wiki/_RuleML2006 HTML Link/header smw:hasArticle rdfs:isDefinedBy http redirect http get

slide-34
SLIDE 34

34

Did I get that right?

Everybody can create any relation or attribute?

Yes

Each relation and each attribute have their own wiki-article

Relation:located in Attribute:population

Can I have class hierachies, datatypes and all this?

slide-35
SLIDE 35

35

Mapping Wiki Concepts to OWL

[[Category:Monarchies]] [[population:= 5,062,011]] [[length:= 17 km]] [[located in::Scotland]] [[contains::Caffeine]] [[Scotland]] Syntax

  • Link
  • wl:Class

Category

  • wl:DatatypeProperty

Attribute

  • wl:ObjectProperty

Relation OWL Semantic MediaWiki

slide-36
SLIDE 36

36

Why typed links?

Cheap annotations

Content (links, values) is already there, just mark them up

Content and annotations are one

DRY – don‘t repeat yourself Metadata doens‘t get out of sync

Defined annotation process

Authors are annotators

Locality of annotation User interface

No new tool to learn Total control over annotations

Wiki-style: No structure imposed

All relations and attributes can be used

slide-37
SLIDE 37

37

Benefits for the Semantic Web

Wikipedia as URI source (see Workshop IRW2006)

Simple social process: Prefix article name with an underscore

RDF/OWL Data export

Header links to RDF file (PiggyBank, OINK, Tabulator) Per page or bulk Everything has an rdf:label All properties have rdfs:isDefinedBy XSD datatypes fully supported

Semantic Web food chain

SPARQL endpoint SPARQL tools (SNORQL)

slide-38
SLIDE 38

38

Benefits for Wikipedians: < ask> for your data

Inline queries allow for questions like …

… movies from the 70s starring Sean Connery … list of events (all conferences and workshops)

<ask format="ul" link="all"> [[Category:Event]] </ask>

slide-39
SLIDE 39

39

Benefits for Wikipedians: < ask> for your data

Inline queries allow for questions like …

… movies from the 70s starring Sean Connery … list of events with their deadline

<ask format="ul" link="all"> [[Category:Event]] [[paper deadline:=*]] </ask>

slide-40
SLIDE 40

40

Benefits for Wikipedians: < ask> for your data

<ask format="ul" link="all"> [[Category:Event]] [[paper deadline:=>June 1 2006]] [[paper deadline:=<December 31 2006]] [[title:=*]] [[paper deadline:=*]] [[Category:Topic Semantic Web query languages]] </ask>

slide-41
SLIDE 41

41

Applications

Automatic tables and lists

E.g. Countries sorted by area, population, alphabet, …

Maintenance with hand crafted checks

Does every country have one capital?

Integration in applications

latte = wikipedia.get(“Latte Macchiatto”); print latte[“contains”]

Visualization and browsing … And many unexpected ones

slide-42
SLIDE 42

42

Who is using Semantic MediaWiki?

Our research group

slide-43
SLIDE 43

43

Features

Linking of concepts with external URIs [[equivalent URI:= … ]]

  • wl:equivalentClass, owl:equivalentProperty, owl:sameAs

OWL import (generates pages from ontology) All relations and attributes can be documented Re-use of existing Category-system Annotating templates for a quick win Correct RDF export Flexible system for collaboratively creating

content with semantic annotations

slide-44
SLIDE 44

44

Future Work

Performance Performance Scalability More Expressiveness

Transitive, symmetric, and inverse relations

User Interface

Suggested relations and attributes

Evaluation

slide-45
SLIDE 45

45

Conclusions Annotation for the masses “Soft” introduction

People are often scared about Semantic Web Semantic features can be ignored

Content and metadata are one Linking to existing ontologies Create the egg of the chicken-and-egg problem

slide-46
SLIDE 46

46

Semantic Wikipedia [[enhances::Wikipedia]]

Thank You!