Old Wine in New Bottles? The Semantic Web COMP34512 Sebastian - - PowerPoint PPT Presentation

old wine in new bottles the semantic web comp34512
SMART_READER_LITE
LIVE PREVIEW

Old Wine in New Bottles? The Semantic Web COMP34512 Sebastian - - PowerPoint PPT Presentation

Old Wine in New Bottles? The Semantic Web COMP34512 Sebastian Brandt brandt@cs.manchester.ac.uk (Slides by Bijan Parsia, bparsia@cs.man.ac.uk) Tuesday, 6 May 2014 From Web to Semantic Web The Semantic Web is not a separate Web but an


slide-1
SLIDE 1

Old Wine in New Bottles? The Semantic Web COMP34512

Sebastian Brandt brandt@cs.manchester.ac.uk

(Slides by Bijan Parsia, bparsia@cs.man.ac.uk)

Tuesday, 6 May 2014

slide-2
SLIDE 2

From Web to Semantic Web

The Semantic Web is not a separate Web but an extension of the current one, in which information is given well-defined meaning, better enabling computers and people to work in cooperation. “The Semantic Web” by Tim Berners-Lee, James Hendler, and Ora Lassila, Scientific American, 2001

2

Tuesday, 6 May 2014

slide-3
SLIDE 3

4

The Basic Web

  • We have

–URLs/URIs/IRIs –HTTP –HTML (and XML based formats, JSON, etc.)

  • People interact with the web

–Via hypermedia (the web browser)

  • Web sites wrap databases, mail, chat, message boards,

stores, management systems, etc. etc. etc.

–Programmatically

  • Via “raw” HTTP
  • Via Web Services

–Other interfaces

  • Esp. in mobile

Tuesday, 6 May 2014

slide-4
SLIDE 4

Programs vs. People

  • Access to the information on the (HTML) Web
  • Primarily designed for people
  • People read web pages
  • People click on links
  • People read more pages
  • What’s on those pages?
  • “Information”, data, knowledge
  • How do we
  • write programs
  • that manipulate that info?
  • 2 choices
  • Scrape
  • Structure

5

Tuesday, 6 May 2014

slide-5
SLIDE 5

Interlude: Web Applications

  • The HTML web has not stood still!
  • Web 2.0/AJAX
  • XMLHttpRequest

– JSON!

  • HTML 5
  • Applications, not documents or data!
  • Mobile Apps
  • Not even HTML!
  • Attenuated linking
  • Lots of data floating around
  • Public and private
  • Lots of data silos
  • But lots of APIs/endpoints as well

6

Tuesday, 6 May 2014

slide-6
SLIDE 6

The KR Vision

Knowledge representation is a field which is currently seems to have the reputation of being initially interesting, but which did not seem to shake the world to the extent that some of its proponents hoped. It made sense but was of limited use on a small scale, but never made it to the large scale. This is exactly the state which the hypertext field was in before the Web. Each field had made certain centralist assumptions -- if not in the philosophy, then in the implementations, which prevented them from spreading globally. But each field was based on fundamentally sound ideas about the representation of

  • knowledge. The Semantic Web is what we will get if we perform

the same globalization process to Knowledge Representation that the Web initially did to Hypertext. We remove the centralized concepts of absolute truth, total knowledge, and total provability, and see what we can do with limited knowledge.

http://www.w3.org/DesignIssues/RDFnot.html

7

Tuesday, 6 May 2014

slide-7
SLIDE 7

Knowledge Representation

  • “Knowledge”

– Comes in different forms

  • Knowing that vs. knowing how (or “know-how)
  • “Propositional” vs. “procedural”
  • Representation

– Anything with “aboutness” – Different representations have different properties

  • We want “computable” represenations
  • Knowledge + Representation?

– (“KR” or sometimes “KRep) – Computable representations of human propositional knowledge

  • A working definition!
  • For our purposes, generally based on a logic

8

Tuesday, 6 May 2014

slide-8
SLIDE 8

The Sorry-State-of-KR Argument

  • Knowledge representation is a field which

– currently seems to have the reputation of being initially interesting, but – did not seem to shake the world to the extent that some of its proponents hoped.

  • It made sense

– but was of limited use on a small scale, – but never made it to the large scale.

  • This is exactly the state which the hypertext field

was in before the Web.

9

Tuesday, 6 May 2014

slide-9
SLIDE 9
  • Each field [e.g., hypertext] had made certain centralist

assumptions -- –if not in the philosophy, then in the implementations, –which prevented them from spreading globally. –But each field was based on fundamentally sound ideas about the representation of knowledge.

  • The Semantic Web is what we will get

–if we perform the same globalization process to Knowledge Representation –that the Web initially did to Hypertext. –(Call this the Argumentum ad WebSuccessam)

  • What is this globalization process?
  • What is the value we hope for?

The Hypertext Analogy

10

Tuesday, 6 May 2014

slide-10
SLIDE 10

The Value

  • Having A Web Sized KR

–All knowledge captured in a computable form –Who knows?

  • Human level or greater AI?
  • Better cognitive prosthetics

–Computing at that scale is a...challenge

  • Having A Web Of KRs

–Ad hoc integration becomes easier –Less code and effort on understanding data

  • and more on using it
  • E.g., Semantic Search

11

Tuesday, 6 May 2014

slide-11
SLIDE 11

Process: Decentralization?

  • Not very directive!
  • Do these things even exist in KR?
  • What does it take to remove them?
  • What are the problems of centralization?

“We remove the centralized concepts of absolute truth, total knowledge, and total provability, and see what we can do with limited knowledge.”

12

Tuesday, 6 May 2014

slide-12
SLIDE 12

13

How to Combine Web & KR

  • 1. Uploading a KR to the Web
  • 2. Mining the Web to Generate a KR
  • 3. Publishing a Web Based KR App
  • 4. Something distinctive

Tuesday, 6 May 2014

slide-13
SLIDE 13

1) Uploading a KR to the Web

  • Still a popular mechanism!

– The Web is effective at sharing files – And more!

14

Tuesday, 6 May 2014

slide-14
SLIDE 14

Profound effects on KR Research

  • 2002

≈200 ontologies on the Web

  • 2006

≈ 1000 ontologies on the Web

  • 2011

≈ 30,000 ontologies on the Web

  • Size and complexity of ontologies growing
  • Tools can handle them!

15

Tuesday, 6 May 2014

slide-15
SLIDE 15

Effects on Research: Depth

  • Publishing systematically yields benefits

– NCIt published monthly for 10 years – Can study the evolution of ontologies

16

Tuesday, 6 May 2014

slide-16
SLIDE 16

Effects on Practice?

  • Harder to say

– Reuse and linking not esp. high

  • Designing for reuse is hard
  • Reusing is hard
  • Reinvention is the norm
  • “Standards” make progress on reuse

– But usually high level and pushed hard (e.g., BFO)

– Web based editing still in infancy

  • Most work is done with offline tools
  • See software engineering
  • Standard Web Benefits

– Examples – Feedback – Standardization of formalisms – Indirect (tools get better)

17

Tuesday, 6 May 2014

slide-17
SLIDE 17

2) Mining the Web

  • The Web is an information resource

– let’s use it

  • Use the content

– DBpedia comes to mind

  • Use the people!

– E.g. ConceptNet 2

18

Tuesday, 6 May 2014

slide-18
SLIDE 18

Using People

  • Direct crowd sourcing

– Let The People curate

  • Indirect crowd sourcing

– Feedback and use – Map data!

  • Excellent for concrete facts

– Unproven for more complex things

19

Tuesday, 6 May 2014

slide-19
SLIDE 19

3) Publishing a Web Based KR App

http://www.isi.edu/isd/LOOM/PowerLoom/documentation/ontosaurus-screenshot.gif 20

Tuesday, 6 May 2014

slide-20
SLIDE 20

What is Wolfram Alpha?

Wolfram|Alpha's long-term goal is to make all systematic knowledge immediately computable and accessible to everyone. We aim to collect and curate all objective data; implement every known model, method, and algorithm; and make it possible to compute whatever can be computed about anything.

http://www.wolframalpha.com/about.html

21

Tuesday, 6 May 2014

slide-21
SLIDE 21

What is Wolfram Alpha?

Wolfram|Alpha's long-term goal is to make all systematic knowledge immediately computable and accessible to everyone. We aim to collect and curate all objective data; implement every known model, method, and algorithm; and make it possible to compute whatever can be computed about anything.

http://www.wolframalpha.com/about.html

A (Private) Semantic Web!

22

Tuesday, 6 May 2014

slide-22
SLIDE 22

Interlude: Other Examples

  • Wolfram Alphaesqe

– Evi/TrueKnowledge

  • Lighter understanding, narrower scope

– Garlik

  • Deeper understanding, narrower scope

– Watson

  • Even lighter understanding, even greater scale

– Semantic Search in Google (Knowledge Graph)

  • 500 million objects
  • 3.5 billion facts
  • All fairly centralized!

– Although they pull a lot from other sources – Share results but not data

23

Tuesday, 6 May 2014

slide-23
SLIDE 23

Do we still need decentralization?

  • In 1993, the Web >>> any 1 org’s capacity
  • Today?

–Google, Amazon, Facebook, etc. –Can handle both the data and the people “We remove the centralized concepts of absolute truth, total knowledge, and total provability, and see what we can do with limited knowledge.”

24

Tuesday, 6 May 2014

slide-24
SLIDE 24

25

4) Something Distinctive

  • 1-3 are not fundamentally distinctive

–Not a new web (really) –Not a new kind of KR

  • What would be a Webesque KR?

–BTW: I’ve no idea

  • (OK, I’ve some ideas...)

–Do we need such a thing?

  • Web distinctiveness might be overstated
  • Technology may have caught up

–Google (or Facebook) could run the Web

Tuesday, 6 May 2014

slide-25
SLIDE 25

URIs: Distinctive? (Uniform Resource Identifier)

Tuesday, 6 May 2014

slide-26
SLIDE 26

27

Global Communication

In order to communicate internally, a community agrees…on a set of terms and their meanings. One goal of the Web…has been to build a global community in which any party can share information with any other party.

! To achieve this goal, the Web makes use of a single global identification system: the URI. URIs are a cornerstone of Web architecture, providing identification that is common across the Web. The global scope of URIs promotes large-scale "network effects": the value of an identifier increases the more it is used consistently…

http://www.w3.org/TR/webarch/#identification

Tuesday, 6 May 2014

slide-27
SLIDE 27

The Magic of Linking?

  • In HTML

– URLs/URIs/IRIs are key – URL design within a Website – URL design between sites

  • What can they do for us?

– Linked data? – Web ontologies? – URIs are everywhere

  • But what are they doing?

28

Tuesday, 6 May 2014

slide-28
SLIDE 28

Linked Data “Principles”

  • Linked Data is the new cool fad!
  • 1. Use URIs as names for things
  • 2. Use HTTP URIs so that people can look up those names.
  • 3. When someone looks up a URI, provide useful information,

using the standards (RDF*, SPARQL)

  • 4. Include links to other URIs. so that they can discover more

things.

http://www.w3.org/DesignIssues/LinkedData.html

29

Tuesday, 6 May 2014

slide-29
SLIDE 29

Linked Data “Principles”

  • Linked Data is the new cool fad!
  • 1. Use URIs as names for things
  • 2. Use HTTP URIs so that people can look up those names.
  • 3. When someone looks up a URI, provide useful information,

using the standards (RDF*, SPARQL)

  • 4. Include links to other URIs. so that they can discover more

things.

  • Still not very directive!

– Some direction

  • More than said here: E.g., “reuse URIs”

– Entity and attribute reconciliation?!

– “Providing useful information” is tricky

  • Esp. for programs!
  • Meaning of terms....ontologies?!

– (In the broadest sense of the term)

http://www.w3.org/DesignIssues/LinkedData.html

30

Tuesday, 6 May 2014

slide-30
SLIDE 30

Where are we?

  • What’s the linking?

– Ask yourself what sorts of links there are – Ask yourself what do the links support

  • And who (or what) exploits them

– What makes a linked datum?

  • And what value add over an unlinked one?
  • What’s the useful information?

– What sort of information could there be? – What makes it useful?

  • For what task?
  • Many open questions

– Which may be closing negatively!

31

Tuesday, 6 May 2014