Multilingual Verbalisation of Modular Ontologies using GF and lemon
Brian Davis, Ramona Enache, Jeroen van Grondelle and Laurette Pretorius CNL 2012 August 29, 2012
Multilingual Verbalisation of Modular Ontologies using GF and lemon - - PowerPoint PPT Presentation
Multilingual Verbalisation of Modular Ontologies using GF and lemon Brian Davis, Ramona Enache, Jeroen van Grondelle and Laurette Pretorius CNL 2012 August 29, 2012 Structure WHY? Be Informed use case as context The meta-model/model
Brian Davis, Ramona Enache, Jeroen van Grondelle and Laurette Pretorius CNL 2012 August 29, 2012
WHY? Be Informed use case as context The meta-model/model separation - meta-model semantics WHAT? Verbalisation and … — Modularisation — Label variants and their manipulation — Multilingualism — lemon-GF mapping HOW? Achieving these four aspects in GF (and lemon) WHAT NEXT? Ideas about future work
Challenges: Adoption of ontologies -> new audiences (knowledge engineers and ontologists, business users, end users, etc.) -> access via verbalisation in multiple languages Dealing with complexity -> many constraints; changing rules; contextual rules, e.g. customer, time, …; rules from many sources that may cause conflict and overlap Be Informed Business Process Ontology: Captures all relevant activities, artifacts, involved roles etc. and the relations between these in a modularised way: Meta-model, using pre- and post-condition semantics, and Models of specific business process applications Verbalisation: Based on pattern sentences
MOLTO is funded by the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement FP7-ICT-247914.
throughout policy lifecycle
interaction
immigrations, Dutch government in the Caribbean)
Europe, emission trading)
makers, citizens)
documents and letters
CNL 2012: GF, lemon, multilingualism, label variants, modularisation
Van Grondelle, J.C., Gülpers, M.: Specifying flexible business processes using pre and post conditions. In PoEM, Volume 92 of Lecture Notes in Business Information Processing, Springer (2011) 38–51:
(a) a document of type DOCUMENT WITH DETAILS is available.
(a) a document of type SUBMISSION FORM has been created. Clumsy grammar and lack of fluency Non-scalability in terms of number of supported languages
Meta-model Models Concepts and relations chosen based on consensus Individual parties (no consensus) Determined once, fixed Introduced over time, frequent changes Ontology formalism Various information sources/formalisms/styles Created by knowledge engineers Created by a wide range of people BI default meta models (stable) Resource intensive development (changes/updates) Lexicalisation and verbalisation Labels follow the ontology according to guidelines (e.g. case, activity, etc.) Labels exhibit large variation Complexity at lexical and syntax/grammatical levels (pattern sentences) Complexity at lexical level
Sources of Variation: Non-linguistic
Linguistic
e.g. “Intake” or “Equality principle”
result is published”
Manipulation :
verbalisation of label variants that refer to the same concept in the
patterns, with increased fluency (L, C and N)
underlying application
Ontology verbalisation: exploits the complementary strengths of GF and lemon (modularisation, mapping …) GF: captures ontological information as well as the required sentence structure for multiple languages lemon: provides concrete label information in multiple languages Specification of BI business processes in terms of pre- and post-conditions requires verbalisation of such conditions, in accordance with the sentence patterns Concept labels are to be verbalised as propositional statements. Triples (activities with pre-conditions and/or post-conditions) are verbalised as conditional statements (“A if B”), where A and B are simple propositional statements with modalities, as appropriate.
(Activity, Requires_Available, Artifact subtyped as Document)
Artifact → Fragment ;
available_A));
This work is funded in part by the European Community's Seventh Framework Program (FP7/2007-2013) under Grant Agreement no.: FP7-ICT-248458 and FP7-ICT-247914
Brian Davis Ramona Enache Jeroen van Grondelle Laurette Pretorius