Dagstuhl-Seminar on Rule Markup Techniques 7th of February, 2002 - PowerPoint PPT Presentation

Dagstuhl-Seminar on Rule Markup Techniques 7th of February, 2002 XL: A rule-based query and transformation language for XML and SSD Fran¸ cois Bry and Sebastian Schaffert Ludwig-Maximilians-Universit¨ at M¨ unchen http://www.pms.informatik.uni-muenchen.de Sebastian Schaffert Page 1

Outline 1. Motivation 2. XML and Terms 3. Elements of a Query and Transformation Language • Construct-Query Rules • Database Terms • Query Terms • Construct Terms 4. Simulation Unification 5. Conclusion Sebastian Schaffert Page 2

Motivation Imagine two online bookstores that provide a list of books with, among other things, their titles and prices. Bookstore A: Example <bib> 1 <book> 2 <title>Cryptonomicon</title> 3 <authors> 4 <author>Alice</author> 5 <author>Bob</author> 6 </authors> 7 <price>39.95</price> 8 </book> 9 <book> 10 <title>Applied Cryptography</title> 11 <author>Alice</author> 12 <price>34.95</price> 13 </book> 14 ... 15 </bib> 16 Sebastian Schaffert Page 3

Motivation – cont. Bookstore B, on the other hand could provide a list in the style of the following excerpt: Example <reviews> 1 <entry> 2 <title>Applied Cryptography</title> 3 <price>36.95</price> 4 <comment>A good book on cryptography</comment> 5 </entry> 6 <entry> 7 <title>Cryptonomicon</title> 8 <price>31.95</price> 9 <comment>A must-have for your private intelligence service</comment> 10 </entry> 11 ... 12 </reviews> 13 Sebastian Schaffert Page 4

Motivation – cont. A common query for such heterogenous sources could be: Give me a list of all books with a comparison of its price at store A and B The result for the example databases would look as follows: Example <books-with-prices> 1 <book-with-prices> 2 <title>Applied Cryptography</title> 3 <price-A>34.95</price-A> 4 <price-B>36.95</price-B> 5 </book-with-prices> 6 <book-with-prices> 7 <title>Cryptonomicon</title> 8 <price-A>39.95</price-A> 9 <price-B>31.95</price-B> 10 </book-with-prices> 11 ... 12 </books-with-prices> 13 Sebastian Schaffert Page 5

Motivation – cont. In a “navigational” query language like the XPath-based XQuery [2] and XSLT [1] this query would consist of several independant “subqueries” for each of the databases: 1. find all entries in the database that have a title and a price ( /bib/book[title and price] ) 2. for each of the entries, retrieve the title ( ./title ) 3. for each of the entries, retrieve the price ( ./price ) It is easy to observe that there is no (immediate) connection between these subqueries other than the sequence in which they are evaluated. Furthermore, the construction part and the query part are tightly integrated . Sebastian Schaffert Page 6

Motivation – cont. In XQuery, the query would look like this: Example <books-with-prices> 1 { FOR $a in document("A/bib.xml")//book, 2 $b in document("B/reviews.xml")//entry 3 WHERE $b/title = $a/title 4 RETURN 5 <book-with-prices> 6 { $b/title } 7 <price-A> 8 { $a/price/text() } 9 </price-A> 10 <price-B> 11 { $b/price/text() } 12 </price-B> 13 </book-with-prices> 14 } 15 </books-with-prices> 16 Sebastian Schaffert Page 7

Motivation – cont. In contrast to such a navigational way of querying, we propose a rule-based approach – similar to languages like Prolog – with the following two main features: • term-based (“positional”) querying with a template of the data in the database • rule-based programs with a clear separation between construction- and query part Sebastian Schaffert Page 8

Motivation – cont. It is our conviction that the declarativeness of such a language . . . • will make it easier to use in many cases (it may even be possible to create a visual interface for it) • will make complex transformations more obvious (and thus lead to easier maintainability) Sebastian Schaffert Page 9

Motivation – cont. In our XL-approach, the query could look like this: Example construct 1 <book-with-prices> 2 <title>T</title> 3 <price-A>Pa</price-A> 4 <price-B>Pb</price-B> 5 </book-with-prices> 6 where 7 in A/bib.xml: 8 <book> 9 <title>T</title> 10 <price>Pa</price> 11 </book> 12 and 13 in B/reviews.xml: 14 <entry> 15 <title>T<title> 16 <price>Pb</price> 17 </entry> 18 Sebastian Schaffert Page 10

XML and Terms Term representations of XML data are straightforward: Example Example <bib> bib( 1 1 <book> book( 2 2 <title>Cryptonomicon</title> title(’Cryptonomicon’), 3 3 <authors> authors( 4 4 <author>Alice</author> author(’Alice’), 5 5 <author>Bob</author> author(’Bob’) 6 6 </authors> ), 7 7 <price>39.95</price> price(39.95) 8 8 </book> ), 9 9 <book> book( 10 10 <title>Applied Cryptography</title> title(’Applied Cryptography’), 11 11 <author>Alice</author> author(’Alice’), 12 12 <price>34.95</price> price(34.95) 13 13 </book> ), 14 14 ... ... 15 15 </bib> ) 16 16 Sebastian Schaffert Page 11

XML and Terms – cont. However, it is not so easy to apply the methods of logic programming to such terms: 1. XML is semi -structured: • structure may be incomplete • a given structure (DTD, XML Schema) may be ignored • several entries of similar kind may have differing structure 2. Data is organized in a different way than in traditional term-based approaches: • alternatives are nested within the same term instead of using several terms • order may or may not be of relevance, depending on the application Sebastian Schaffert Page 12

XML and Terms – cont. A term-based language has to cope with these properties. In the XL -project, we propose: • a term language that provides constructs for dealing with unknown and flexible structure • a non-standard unification algorithm that makes use of these constructs for querying flexible data with nested alternatives • a rule language building on top of the two latter concepts Sebastian Schaffert Page 13

Elements of a Query and Transformation Language Construct-Query Rules A program in the language XL consists of one or more rules of the style t q t c 1 ∧ · · · ∧ t q ← n Head Body where each term in the body is evaluated against a (possibly different) database or head of another rule. The head is used to “construct” the answer. Both backward and forward chaining of rules is possible in the current approach. Sebastian Schaffert Page 14

Elements of a Query and Transformation Language Database Terms Database Terms are an abstraction of XML documents. • l [ t 1 , . . . , t n ] is a database term with the root labelled l and the sequence of children t 1 , . . . , t n is ordered • l { t 1 , . . . , t n } is a database term with the root labelled l and the bag of children t 1 , . . . , t n is unordered Instead of l [] and l {} (i.e. n = 0 ), we write simply l . Sebastian Schaffert Page 15

Elements of a Query and Transformation Language Query Terms Query Terms . . . • are a pattern for the data in the database • contain variables in order to retrieve information from the database Sebastian Schaffert Page 16

Elements of a Query and Transformation Language Query Terms In contrast to Prolog goals, however, Query Terms have the following additional properties: • subterms with additonal structure might be answers • subterms with different subterm ordering might be answers • the query term might specify subterms at an unspecified depth Sebastian Schaffert Page 17

Elements of a Query and Transformation Language Query Terms In our abstract syntax, we write Query Terms similarly to Database Terms, with the following additional properties: • double parentheses ( [[]] and {{}} ) are used to specify a total matching, while single parentheses express partial matching • the descendant construct allows to represent subterms at an unspecified depth ( desc t ) • variables refer to subterms in the Query Term ( X ❀ t , read X “as” t ) Obviously, such a flexible structure implies that there might be several alternative answers for a query. Sebastian Schaffert Page 18

Elements of a Query and Transformation Language Query Terms – Example bib { book { T ❀ title, desc author { A ❀ ·}}} . might be an appropriate query term for a structure where the depth of the author elements below a book is not known: Example <bib> 1 <book> 2 <title>Applied Cryptography</title> 3 <author>Alice</author> 4 </book> 5 <book> 6 <title>Cryptonomicon</title> 7 <authors> 8 <author>Alice</author> 9 <author>Bob</author> 10 </authors> 11 </book> 12 </bib> 13 Sebastian Schaffert Page 19

Dagstuhl-Seminar on Rule Markup Techniques 7th of February, 2002 - PowerPoint PPT Presentation

Dagstuhl-Seminar on Rule Markup Techniques 7th of February, 2002 XL: A rule-based query and transformation language for XML and SSD Fran cois Bry and Sebastian Schaffert Ludwig-Maximilians-Universit at M unchen

Document Markup Document Markup - Reveal Codes Wikipedia Uses Wikitext markup Example

Hypertext Markup Language Drawing on the Web Hypertext Markup Language Drawing on the Web A

Hypertext Markup Language Introduction to Web Design Hypertext Markup Language Introduction to

An XML Markup Language An XML Markup Language Framework for Lexical Databases Framework for

Markup - Why? How? Espen S. Ore University of Oslo What is markup? (My working

Pattern Markup-Language Pattern Markup-Language A tool for simplifying data extraction A tool

HTML 1 Markup Languages A Markup Language is a computer language in which data and

CS276B Text Retrieval and Mining Winter 2005 Lecture 12 What is XML? n eXtensible Markup

Lets make HTML5 & JavaScript Games! credit: w3.or g HTML is a markup lan g ua g e Markup

The Role of Markup in the Digital Humanities Desmond Schmidt Information Security Institute

PB138 Markup Languages Tom a s Pitner March 22, 2013 Tom a s Pitner PB138

HTML? HyperText Markup Language is the standard markup language used to create web pages.

Rule Changes - Non rule change year Review of 2017 rule changes - just the easy to forgot

Common Rule Advanced Notice of Proposed Rulemaking (ANPRM) IRB Investigator Advanced Notice

2nd RULE: You MUST TALK about BOOK CLUB. 2nd RULE: You DO NOT talk about 3rd RULE: PERSEVERE -- If

Rule #1: Have a takeaway. Rule #2: Keep It Simple. Rule #3: Repetition is Good. Rule #4: Be

From the Tagless-Final Cookbook Embedding and Optimizing (Hardware) Domain-Specific Languages in

Documentation Production using Open Source Tools Elena Zannoni, elena.zannoni@oracle.com Stefan

PA153 Natural Language Processing 08 - Lexicographic tools and computational lexicography Karel

Define-XML Creation & Validation Anran Xie Agenda 1. Define-XML

COMP6037 Read Blackboards Announcements Read Blackboards Discussions

Extensible Programming and Modeling Languages Ted Kaminski, Yogesh Mali, August Schwerdfeger and

M. Bagic Babac, M. Kunstic, D. Jevtic University of Zagreb, Croatia Faculty of Electrical

8. XSL Transformations (XSLT) XSL = Extensible Stylesheet Language Task: Transform the XML

Dagstuhl-Seminar on Rule Markup Techniques 7th of February, 2002 - PowerPoint PPT Presentation

Dagstuhl-Seminar on Rule Markup Techniques 7th of February, 2002 XL: A rule-based query and transformation language for XML and SSD Fran cois Bry and Sebastian Schaffert Ludwig-Maximilians-Universit at M unchen

Document Markup Document Markup - Reveal Codes Wikipedia Uses Wikitext markup Example

Hypertext Markup Language Drawing on the Web Hypertext Markup Language Drawing on the Web A

Hypertext Markup Language Introduction to Web Design Hypertext Markup Language Introduction to

An XML Markup Language An XML Markup Language Framework for Lexical Databases Framework for

Markup - Why? How? Espen S. Ore University of Oslo What is markup? (My working

Pattern Markup-Language Pattern Markup-Language A tool for simplifying data extraction A tool

HTML 1 Markup Languages A Markup Language is a computer language in which data and

CS276B Text Retrieval and Mining Winter 2005 Lecture 12 What is XML? n eXtensible Markup

Lets make HTML5 &amp; JavaScript Games! credit: w3.or g HTML is a markup lan g ua g e Markup

The Role of Markup in the Digital Humanities Desmond Schmidt Information Security Institute

PB138 Markup Languages Tom a s Pitner March 22, 2013 Tom a s Pitner PB138

HTML? HyperText Markup Language is the standard markup language used to create web pages.

Rule Changes - Non rule change year Review of 2017 rule changes - just the easy to forgot

Common Rule Advanced Notice of Proposed Rulemaking (ANPRM) IRB Investigator Advanced Notice

2nd RULE: You MUST TALK about BOOK CLUB. 2nd RULE: You DO NOT talk about 3rd RULE: PERSEVERE -- If

Rule #1: Have a takeaway. Rule #2: Keep It Simple. Rule #3: Repetition is Good. Rule #4: Be

From the Tagless-Final Cookbook Embedding and Optimizing (Hardware) Domain-Specific Languages in

Documentation Production using Open Source Tools Elena Zannoni, elena.zannoni@oracle.com Stefan

PA153 Natural Language Processing 08 - Lexicographic tools and computational lexicography Karel

Define-XML Creation &amp; Validation Anran Xie Agenda 1. Define-XML

COMP6037 Read Blackboards Announcements Read Blackboards Discussions

Extensible Programming and Modeling Languages Ted Kaminski, Yogesh Mali, August Schwerdfeger and

M. Bagic Babac, M. Kunstic, D. Jevtic University of Zagreb, Croatia Faculty of Electrical

8. XSL Transformations (XSLT) XSL = Extensible Stylesheet Language Task: Transform the XML

Lets make HTML5 & JavaScript Games! credit: w3.or g HTML is a markup lan g ua g e Markup

Define-XML Creation & Validation Anran Xie Agenda 1. Define-XML