Configuring Domain Knowledge for Natural Language Understanding - - PowerPoint PPT Presentation

configuring domain knowledge for natural language
SMART_READER_LITE
LIVE PREVIEW

Configuring Domain Knowledge for Natural Language Understanding - - PowerPoint PPT Presentation

Introduction Parsing Process Experimental Results Conclusion Configuring Domain Knowledge for Natural Language Understanding Matt Selway, Wolfgang Mayer, Markus Stumptner Advanced Computing Research Centre (ACRC) School of IT &


slide-1
SLIDE 1

Introduction Parsing Process Experimental Results Conclusion

Configuring Domain Knowledge for Natural Language Understanding

Matt Selway, Wolfgang Mayer, Markus Stumptner

Advanced Computing Research Centre (ACRC) School of IT & Mathematical Sciences University of South Australia

ConfWS 2013, Vienna, AT 29 August, 2013

slide-2
SLIDE 2

Introduction Parsing Process Experimental Results Conclusion Case Study (extract)

Introduction Goals: Translate natural language specifications and requirements into formal models Support software development process (MDE)

improve quality of requirements reduce manual effort and ambiguities identify and correct inconsistencies and errors

Problems: Requires semantic processing or understanding of texts Our Solution: Apply knowledge-based configuration to fragments of a semantic model comprising the domain knowledge

slide-3
SLIDE 3

Introduction Parsing Process Experimental Results Conclusion Case Study (extract)

SBVR Core Concepts

Thing

  • bjectName String

Meaning ThingImpl StateOfAffairs Expression ReferenceScheme SbvrSet SemanticFormulation BindableTarget RoleBinding Concept Question Proposition

isFalse Boolean isObligatedT

  • BeT

rue Boolean isPermittedT

  • BeT

rue Boolean isNecessarilyT rue Boolean isT rue Boolean isPossiblyT rue Boolean

NounConcept FactType IndividualConcept ObjectType Role Quantity

quantity Number

ObjectTypeImpl ConceptType SituationalRole SituationalRole_RolePart FactTypeRole SbvrNumber

/quantity Number number Number

FactTypeImpl Characteristic BinaryFactType PropositionImpl Fact StateOfAffairsImpl Actuality ActualityImpl Representation Designation Definition FactTypeForm Statement DesignationImpl Placeholder

startingCharacterPosition Number

SententialForm NounForm SbvrT ext

text String

ClosedSemanticFormulation LogicalFormulation Projection ClosedSemanticFormulationImpl ClosedLogicalFormulation_ClosedPart ClosedLogicalFormulation AtomicFormulation ModalFormulation LogicalOperation Quantification NecessityFormulation ObligationFormulation PermissibilityFormulation PossibilityFormulation LogicalNegation BinaryLogicalOperation Conjunction Disjunction Equivalence ExclusiveDisjunction Implication NandFormulation NorFormulation WhetherornotFormulation UniversalQuantification AtleastnQuantification

minimumCardinality Number

NumericRangeQuantification

maximumCardinality Number minimumCardinality Number

AtmostnQuantification

maximumCardinality Number

ExactlynQuantification

cardinality Number

AtleastnQuantificationImpl ExistentialQuantification

minimumCardinality = 1 Number

AtmostnQuantificationImpl AtmostoneQuantification

maximumCardinality = 1 Number

ExactlynQuantificationImpl ExactlyoneQuantification

cardinality = 1 Number

Variable

isUnitary Boolean

Thing1CatchesThing2

slide-4
SLIDE 4

Introduction Parsing Process Experimental Results Conclusion Case Study (extract)

SBVR Core Concepts

Thing

  • bjectName String

Meaning ThingImpl StateOfAffairs Expression ReferenceScheme SbvrSet SemanticFormulation BindableTarget RoleBinding Concept Question Proposition

isFalse Boolean isObligatedT

  • BeT

rue Boolean isPermittedT

  • BeT

rue Boolean isNecessarilyT rue Boolean isT rue Boolean isPossiblyT rue Boolean

NounConcept FactType IndividualConcept ObjectType Role Quantity

quantity Number

ObjectTypeImpl ConceptType SituationalRole SituationalRole_RolePart FactTypeRole SbvrNumber

/quantity Number number Number

FactTypeImpl Characteristic BinaryFactType PropositionImpl Fact StateOfAffairsImpl Actuality ActualityImpl Representation Designation Definition FactTypeForm Statement DesignationImpl Placeholder

startingCharacterPosition Number

SententialForm NounForm SbvrT ext

text String

ClosedSemanticFormulation LogicalFormulation Projection ClosedSemanticFormulationImpl ClosedLogicalFormulation_ClosedPart ClosedLogicalFormulation AtomicFormulation ModalFormulation LogicalOperation Quantification NecessityFormulation ObligationFormulation PermissibilityFormulation PossibilityFormulation LogicalNegation BinaryLogicalOperation Conjunction Disjunction Equivalence ExclusiveDisjunction Implication NandFormulation NorFormulation WhetherornotFormulation UniversalQuantification AtleastnQuantification

minimumCardinality Number

NumericRangeQuantification

maximumCardinality Number minimumCardinality Number

AtmostnQuantification

maximumCardinality Number

ExactlynQuantification

cardinality Number

AtleastnQuantificationImpl ExistentialQuantification

minimumCardinality = 1 Number

AtmostnQuantificationImpl AtmostoneQuantification

maximumCardinality = 1 Number

ExactlynQuantificationImpl ExactlyoneQuantification

cardinality = 1 Number

Variable

isUnitary Boolean

Thing1CatchesThing2

slide-5
SLIDE 5

Introduction Parsing Process Experimental Results Conclusion Case Study (extract)

SBVR Core Concepts

Thing

  • bjectName String

Meaning ThingImpl StateOfAffairs Expression ReferenceScheme SbvrSet SemanticFormulation BindableTarget RoleBinding Concept Question Proposition

isFalse Boolean isObligatedT

  • BeT

rue Boolean isPermittedT

  • BeT

rue Boolean isNecessarilyT rue Boolean isT rue Boolean isPossiblyT rue Boolean

NounConcept FactType IndividualConcept ObjectType Role Quantity

quantity Number

ObjectTypeImpl ConceptType SituationalRole SituationalRole_RolePart FactTypeRole SbvrNumber

/quantity Number number Number

FactTypeImpl Characteristic BinaryFactType PropositionImpl Fact StateOfAffairsImpl Actuality ActualityImpl Representation Designation Definition FactTypeForm Statement DesignationImpl Placeholder

startingCharacterPosition Number

SententialForm NounForm SbvrT ext

text String

ClosedSemanticFormulation LogicalFormulation Projection ClosedSemanticFormulationImpl ClosedLogicalFormulation_ClosedPart ClosedLogicalFormulation AtomicFormulation ModalFormulation LogicalOperation Quantification NecessityFormulation ObligationFormulation PermissibilityFormulation PossibilityFormulation LogicalNegation BinaryLogicalOperation Conjunction Disjunction Equivalence ExclusiveDisjunction Implication NandFormulation NorFormulation WhetherornotFormulation UniversalQuantification AtleastnQuantification

minimumCardinality Number

NumericRangeQuantification

maximumCardinality Number minimumCardinality Number

AtmostnQuantification

maximumCardinality Number

ExactlynQuantification

cardinality Number

AtleastnQuantificationImpl ExistentialQuantification

minimumCardinality = 1 Number

AtmostnQuantificationImpl AtmostoneQuantification

maximumCardinality = 1 Number

ExactlynQuantificationImpl ExactlyoneQuantification

cardinality = 1 Number

Variable

isUnitary Boolean

Thing1CatchesThing2

slide-6
SLIDE 6

Introduction Parsing Process Experimental Results Conclusion Case Study (extract)

SBVR-based Specification (1) EU-Rent Vocabulary rental organisation unit Definition: organisational unit that operates part of EU-Rent’s car rental business rental organisation unit having rental responsibility Definition: .... the rental organisation unit is responsible for the

  • peration of customer-facing rental business

rental organisation unit having area responsibility Definition: .... the rental organisation unit includes organisation units for which it has the responsibility to coordinate operations and ensure resources

slide-7
SLIDE 7

Introduction Parsing Process Experimental Results Conclusion Case Study (extract)

SBVR-based Specification (2) EU-Rent Vocabulary (cont.) local area Definition: rental organisation unit .... that has area responsibility branch Definition: rental organisation unit .... that has rental responsibility branch is included in local area Synonymous Form: local area includes branch EU-Rent Rules ..... Each branch is included in ....... exactly.....

  • ne local area.

.... The country of . a branch is .... the country .... that includes ... the

  • perating company ....

that includes ... the local area .... that includes ... the branch.

slide-8
SLIDE 8

Introduction Parsing Process Experimental Results Conclusion

Cognitive Grammar Semantic Structures Sentence Accommodation Evokes

slide-9
SLIDE 9

Introduction Parsing Process Experimental Results Conclusion

Syntactic Analysis (1)

. [ ..... Each]* [branch] *[is included in]* [ ....... exactly.....

  • ne]*

[local area] BE LC GC 1 1 1 1 1 1 1 1 3/4 4/5 1 1 1 1 4/5 2/5 3/5 2/5 3/5 4/5 4/5 1 2/5 3/5

slide-10
SLIDE 10

Introduction Parsing Process Experimental Results Conclusion

Syntactic Analysis (2) Process is iterative As catches occur, the combination is sent to the accommodation process If accommodation is successful, the resulting partial configuration can be re-used in larger combinations If unsuccessful, the combination is discarded

slide-11
SLIDE 11

Introduction Parsing Process Experimental Results Conclusion

Syntactic Analysis (3) Traditional lexicons include: word syntactic categories (noun, verb, etc.) plurality voice (passive, active) transitivity Our Lexicon includes: word (link to semantic model) expectations (linked to sites of semantic structure) expectation direction (possibly derived from model)

slide-12
SLIDE 12

Introduction Parsing Process Experimental Results Conclusion

Semantic Accommodation Performed using component-based configuration SBVR Model to configuration mapping:

Class hierarchy → Component type taxonomy Objects → Components Attributes and Data Types → Attributes and Attribute Types Relationships → Ports and Port Types Constraints → Constraints

Syntactic information encoded as a Port Type and a pair of Ports (catches and caughtby) A constraint forces the configurator to try to connect two components if they are connected through catches–caughtby

slide-13
SLIDE 13

Introduction Parsing Process Experimental Results Conclusion

Partial Configuration

slide-14
SLIDE 14

Introduction Parsing Process Experimental Results Conclusion

Complete Configuration

slide-15
SLIDE 15

Introduction Parsing Process Experimental Results Conclusion

Configuration Results Using a slightly different KB, so numbers differ

R1 R1 Iterative R2 R2 Iterative

  • Ave. Time (seconds)

1.8 4.5 11.8 128 # Constraints 107 259 405 4467 # Variables Assigned 2090 5824 6627 82782 # Assigned Unconn. 2078 5812 6579 82734

  • Min. # Backtracks

56 56 442 434

  • Max. # Backtracks

59 58 462 454

  • Ave. # Backtracks

57.6 57.0 452 443 # Comps. Created 6 6 24 24

slide-16
SLIDE 16

Introduction Parsing Process Experimental Results Conclusion

Changes and New Results Added new port type based on constrained set variables Multiple ports representing a relation → a single port

R1 R1 Iterative R2 R2 Iterative

  • Ave. Time (seconds)

0.16 0.41 0.67 6.11 # Constraints 102 244 390 4263 # Variables Assigned 258 699 824 9830 # Assigned Unconn. 246 687 776 9782

  • Min. # Backtracks

10 10 97 82

  • Max. # Backtracks

11 11 114 108

  • Ave. # Backtracks

10.6 10.5 105 94 # Comps. Created 6 6 24 24

slide-17
SLIDE 17

Introduction Parsing Process Experimental Results Conclusion

Contribution Combined Cognitive Grammar and Configuration to perform NL Processing/Understanding Semantic analysis of text

improves mappings between NL and the semantic model allows semantic disambiguation of terms

Iterative construction of a domain model

ensures consistency improves identification of erroneous, ambiguous, and inconsistent statements

Simplifies lexicon for our target application

slide-18
SLIDE 18

Introduction Parsing Process Experimental Results Conclusion

Future Work Optimise the Configuration process:

Additional ordering heuristics to minimise backtracks for more complex sentences Implement better handling of correct partial configurations

Perform analysis of entire specification to evaluate scalability as domain knowledge (i.e the model) grows