Storing XML Data In a Native Repository
Kamil Toman ktoman@ksi.mff.cuni.cz
- Dept. of Software Engineering
Storing XML Data In a Native Repository Kamil Toman - - PowerPoint PPT Presentation
Storing XML Data In a Native Repository Kamil Toman ktoman@ksi.mff.cuni.cz Dept. of Software Engineering Faculty of Mathematics and Physics Charles University Introduction Since 1998 XML has become a very popular standard for electronic
– Text values of elements or attributes are represen-
– Mixed contents elements are modeled as trees
1 2 4 5 1 1 1 1 1 1 1 3
– This id can be used as a reference in an index or
– Can be used as on document updates
– Allows effective query evaluation utilizing structur-
(1,100,1) (10,5,2) (20,50,2) (11,0,3) (25,10,3) ( (40,10,3) (30,0,4) (45,0,4) 3 4 5 6 9 12 21 18
– the information is mostly short-lived – Every node must be first looked up in an index
Symbols Syntactic tree Canonic Tree Document Information Data Model
– Besides common word or value indices, additional
– Not only data but even the structure of XML
– Expensive index updates may be needed
– Both XPath and XQuery are Turing-complete
– XML tree algebra (TAX) approach – XQuery subset translated to TAX operations
– Lightweight, can manage only small to medium
– XPath subset + fulltext extensions
– Using B-trees, fully updatable – Navigational approach + large indices
– XPath fully implemented, navigational approach – XUpdate supported
– Chosen data model is sufficient for implementation
– Managing dynamic XML data is much harder than
– Finding a more general way how to express and
– Reducing space needed for structural and term