OWL Simplified English* Richard Power Open University, UK * A - - PowerPoint PPT Presentation

owl simplified english
SMART_READER_LITE
LIVE PREVIEW

OWL Simplified English* Richard Power Open University, UK * A - - PowerPoint PPT Presentation

OWL Simplified English* Richard Power Open University, UK * A finite-state language for ontology editing Semantic Web Authoring Tool (EPSRC 2009-2012) Open University (Department of Computing) Richard Power Sandra Williams Allan Third Tu


slide-1
SLIDE 1

OWL Simplified English*

Richard Power Open University, UK

* A finite-state language for ontology editing

slide-2
SLIDE 2

Open University (Department of Computing)

Richard Power

Sandra Williams Allan Third Tu Anh Nguyen

Manchester University (School of Computer Science) Robert Stevens

Alan Rector Fennie Liang

Sussex University (Department of Informatics)

Donia Scott

Semantic Web Authoring Tool

(EPSRC 2009-2012)

slide-3
SLIDE 3

Objectives

Clarify relationship of formal languages (OWL) to natural languages (English) Develop tools for viewing and editing OWL ontologies in natural language

Theoretical Practical

slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7

Download editing tool

http://mcs.open.ac.uk/rp3242/editor/ Requires Java runtime environment

slide-8
SLIDE 8

Outline

  • Motivation
  • Demonstration
  • Language
  • Coverage
  • Conclusion
slide-9
SLIDE 9

Previous work

Attempto Controlled English (ACE) Sydney OWL Syntax (SOS) Rabbit ACE Wiki ROO (Rabbit to OWL Ontology construction) RoundTrip Ontology Authoring

Controlled Natural Languages Ontology Editing Tools

slide-10
SLIDE 10

OWL Simplified English

  • Very simple rules for forming sentences
  • Little or no effort required to build lexicon
  • Disallows structurally ambiguous sentences
  • Can be interpreted by finite-state transducer
  • Coverage limited in theory, adequate in practice

SIMPLICITY COVERAGE

slide-11
SLIDE 11

Editing tool

  • User edits text as in predictive authoring
  • Patterns for a complete sentence are
  • ffered as in WYSIWYM
  • Patterns contain anchors for entity names

(individual, class, property) for which

  • ptions are computed from the current text
  • Efficient implementation is much easier if

the grammar is finite-state

slide-12
SLIDE 12

Outline

  • Motivation
  • Demonstration
  • Language
  • Coverage
  • Conclusion
slide-13
SLIDE 13

Editing pane Options pane Message pane

slide-14
SLIDE 14
slide-15
SLIDE 15
slide-16
SLIDE 16
slide-17
SLIDE 17
slide-18
SLIDE 18
slide-19
SLIDE 19
slide-20
SLIDE 20
slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23

Outline

  • Motivation
  • Demonstration
  • Language
  • Coverage
  • Conclusion
slide-24
SLIDE 24

Axiom in OSE and OFS

London is a city that is capital of the United Kingdom and is divided into at least 30 boroughs.

ClassAssertion(Class(#London), ObjectIntersectionOf(Class(#city), ObjectHasValue(ObjectProperty(#capitalOf), NamedIndividual(#UK)) ObjectMinCardinality(30, ObjectProperty(#dividedInto), Class(#borough))))

slide-25
SLIDE 25

Restricted words

London is a city that is capital of the United Kingdom and is divided into at least 30 boroughs. Individual name Class name Property name ENTITY NAMES a/an, and, or, that, not, anything, something, every, no, least, most, only, exactly, ... Some words are used only as scaffolding, and cannot be included in an entity name

slide-26
SLIDE 26

Word categories

Syntactic sugar

every, no, a/an, and, or, that, ...

Number

two, 365, 3.14, ...

String

“Pride and Prejudice”, “XY123”, ...

Verb (present)

is, has, takes, participates, ...

Preposition

  • f, by, in, from, ...

Proper noun

John, X23, London, ...

Noun/other

person, taken, yellow, slowly, ...

slide-27
SLIDE 27

How words are categorised

Syntactic sugar

a/an, and,...

Number

two, 365, ...

String

“XY123”, ...

Verb (present)

takes, ...

Preposition

  • f, by, ...

Proper noun

John, ...

Noun/other

person, ... Listed in program Number words, digits Double quotes Listed by USER Upper-case letter Listed in program Lower-case letter

slide-28
SLIDE 28

Entity names

Individual

Proper noun, ‘the’ Proper noun, ‘the’, Number, String, Preposition, Noun/other

Class

Proper name, ‘the’, Number, String, Preposition, Noun/other Proper name, ‘the’, Number, String, Preposition, Noun/other

Property

‘is’, ‘has’, Verb (present) Noun/other, Preposition

Literal

Number, String

Entity Opening Continuation

slide-29
SLIDE 29

Reason for these rules

The author is not required to define names for individuals, classes and properties in advance, so the system must infer when they start and end.

The queen is a woman that lives in Buckingham Palace and is married to a Greek that is named “Phillip”. Individual Class Property Literal

slide-30
SLIDE 30

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation (PN, the, Number, String, Prep, Noun/other) VC Verb-phrase continuation (Prep, Noun/other) FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

1

Unspecified(null,null)

slide-31
SLIDE 31

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

2

ClassAssertion(NamedIndividual(#Tony),null)

slide-32
SLIDE 32

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

3

ClassAssertion(NamedIndividual(#Tony_Blair),null)

slide-33
SLIDE 33

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

4

ClassAssertion(NamedIndividual(#Tony_Blair),null)

slide-34
SLIDE 34

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

5

ClassAssertion(NamedIndividual(#Tony_Blair), UnspecifiedRestriction(ObjectProperty(#is_married),null))

slide-35
SLIDE 35

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

6

ClassAssertion(NamedIndividual(#Tony_Blair), UnspecifiedRestriction(ObjectProperty(#is_married_to),null))

slide-36
SLIDE 36

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

7

ClassAssertion(NamedIndividual(#Tony_Blair), ObjectSomeValuesFrom(ObjectProperty(#is_married_to),Class()))

slide-37
SLIDE 37

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

8

ClassAssertion(NamedIndividual(#Tony_Blair), ObjectSomeValuesFrom(ObjectProperty(#is_married_to),Class(#lawyer)))

slide-38
SLIDE 38

OBJECT INDEF COMPLEMENT FINISH FS FS START SUBJECT ISUB CSUB the PN a/an every NC NC NC COPULAR VPLIST AUX VC has is VC a/an NC NC COBJ IOBJ NC NC a/an NC PN the VC FV

PN Proper noun NC Noun-phrase continuation VC Verb-phrase continuation FV Tensed verb FS Full stop

Tony Blair is married to a lawyer.

9

ClassAssertion(NamedIndividual(#Tony_Blair), ObjectSomeValuesFrom(ObjectProperty(#is_married_to),Class(#lawyer)))

slide-39
SLIDE 39

Basic sentence patterns

slide-40
SLIDE 40

Sentence continuations

slide-41
SLIDE 41

Sentence structure

Sentence = Subject Predicate Subject = [Individual] Subject = A|Every|No [Class] Predicate = is NPList that VPList that VPChain NPList = a [Class] and a [Class] … VPList = [Props] a [Class] and [Props] … VPChain = [Props] a [Class] that [Props] …

slide-42
SLIDE 42

Outline

  • Motivation
  • Demonstration
  • Language
  • Coverage
  • Conclusion
slide-43
SLIDE 43

Complex axiom patterns

Results from corpus of

  • ver 550 ontologies

99.8% of axioms had simple subject term All the top 20 complex predicate patterns are within the constraints of OWL Simplified English If axiom patterns were created randomly we would expect just 2-3 to lie within our constraints

slide-44
SLIDE 44

Three fundamental patterns

  • Genus-Differentia (Aristotle)

– A pet-owner is a person that owns a pet

  • Restriction list

– A pet-owner owns a pet and cleans a cage

  • Alternative role-fillers

– A pet-owner owns a cat or a dog or a canary

slide-45
SLIDE 45

Measuring practical coverage

  • Enumerate all possible complex class

expressions up to a given complexity level

  • Apply a criterion to determine which expressions

yield ambiguous sentences

  • Count the expected frequency of ambiguous

sentences if all complex class expressions were equally likely

  • Compare with the observed frequency for

complex class expressions in an ontology corpus

slide-46
SLIDE 46

Structural ambiguity

A child has as parent a mother and a father.

A N Vs a N and a N. A N [Vs a N that Vs an N] and Vs a N. A N Vs a N that [Vs an N and Vs a N].

A queen appoints a minister that governs a country and wears a crown.

slide-47
SLIDE 47

Enumerating complex classes (1)

P C C

Complexity = 2 (number of non-terminal nodes)

slide-48
SLIDE 48

Enumerating complex classes (2)

P C

Complexity = 5 (normalised to binary tree)

P P C C

slide-49
SLIDE 49

Observed vs Expected

5439 84 3107 2416

Ambiguous Non-ambiguous Observed Expected The expected frequency of complex axioms yielding ambiguous verbalisations was 2416/5523 or 43.7%. The obtained frequency was 84/5523 or 1.5%.

slide-50
SLIDE 50

Outline

  • Motivation
  • Demonstration
  • Language
  • Coverage
  • Conclusion
slide-51
SLIDE 51

Conclusions on complexity

  • Overwhelmingly ontology authors favour

complex class constructions that are not structurally ambiguous when verbalised

  • Therefore, if we restrict sentence patterns

to avoid structural ambiguity, almost all axioms found in our corpus could be formulated

  • Probably many of the remaining axioms

could be refactored

slide-52
SLIDE 52

Main ideas in OSE

  • Editing tool combines predictive authoring

and WYSIWYM

  • Finite-state controlled language favours

efficient implementation and prevention of structural ambiguity

  • Language requires minimal lexical input

from user (verb list)

  • Language allows but does not impose

correct English

slide-53
SLIDE 53

Will it work

User studies still pending ...

slide-54
SLIDE 54

Questions?

... but thanks for your attention