OrientX: an Integrated, Schema-Based Native XML Database System
Meng Xiaofeng, Wang Xiaofeng, Xie Min, Zhang Xin, Zhou Junfeng
School of Information, Renmin University of China
WISA2006
Introduction
OrientX means: Original RUC IDKE Native XML Database
– RUC: Renmin University of China
– IDKE: Institute of Data and Knowledge Engineering
– Native XML database: exposing a logical model
  (non-native XML database: for example, one built on a relational database)
Document-based
– Does not decompose the document; builds an index on it to navigate the structure.
– Query complexity and efficiency are limited by the power of the index.
SubTree-based
– Decomposes the document into sub-trees according to the storage space partition.
– Persists the structure within each sub-tree.
– Saves space.
Element-based
– Decomposes the document into a sequence of nodes, each node corresponding to a type (element, attribute, …).
– May need too many links to persist the relationships between nodes.
                 Document-based   SubTree-based   Element-based
Clustered                         CSB             CEB
Breadth-first                     BSB             BEB
Depth-first      DB               DSB             DEB
Implemented techniques are marked in red
– DEB: one node is one record; records are stored in the order of a preorder traversal of the tree.
– DSB: like DEB, but each record is a sub-tree. The size of a sub-tree is close to the physical page size.
– CEB: one element is one record, but all nodes with the same tag name are stored clustered together.
– CSB: akin to DSB, each record is a sub-tree, but all sub-trees with the same structure are stored clustered together.
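The difference between depth-first and clustered element storage can be illustrated with a minimal sketch. It is not OrientX code: the sample document, the `id` attributes, and the function names `deb_order` and `ceb_order` are assumptions made for illustration, and the sketch only shows the order in which element records would be laid out, not the physical page format.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical document; the ids loosely follow the node names in the slides' figure.
DOC = (
    '<r id="r">'
    '<a id="a1"><f id="f1"/><l id="l1"/></a>'
    '<a id="a2"><f id="f2"/><l id="l2"/></a>'
    '<t id="t1"/>'
    '</r>'
)

def deb_order(root):
    """DEB-like layout: one record per element, in preorder (depth-first) order."""
    return [elem.get("id") for elem in root.iter()]

def ceb_order(root):
    """CEB-like layout: one record per element, clustered by tag name."""
    clusters = defaultdict(list)
    for elem in root.iter():
        clusters[elem.tag].append(elem.get("id"))
    return dict(clusters)

root = ET.fromstring(DOC)
print(deb_order(root))
print(ceb_order(root))
```

In the depth-first layout the records of one author sub-tree sit next to each other, which favours reconstructing a fragment; in the clustered layout all records with the same tag sit together, which favours scanning every element of one type.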
[Figure: a source document tree (nodes r, a1, a2, t1, l1, l2, f1, f2) and its record layouts under DEB and CEB]
[Figure: the same document (DOC) stored under DSB (depth-first sub-tree based) and CSB (clustered sub-tree based). DSB links sub-trees through proxy nodes (virtual nodes); CSB also has proxy nodes.]
Path index SUPEX: Index Architecture
(bib//author)
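How a path index can answer an expression such as bib//author may be sketched as follows. This is a generic label-path index, not the SUPEX structure itself, whose details are not given in the slides; the sample document and the names `build_path_index` and `query_descendants` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

def build_path_index(root):
    """Map each distinct root-to-node label path to the elements reached by it."""
    index = defaultdict(list)

    def visit(elem, prefix):
        path = prefix + (elem.tag,)
        index[path].append(elem)
        for child in elem:
            visit(child, path)

    visit(root, ())
    return index

def query_descendants(index, first, last):
    """Answer first//last by scanning the index keys only, not the document."""
    hits = []
    for path, elems in index.items():
        if len(path) >= 2 and path[0] == first and path[-1] == last:
            hits.extend(elems)
    return hits

# Hypothetical sample document, for illustration only.
SAMPLE = (
    "<bib>"
    "<book><author>A</author><title>T</title></book>"
    "<article><author>B</author></article>"
    "<author>C</author>"
    "</bib>"
)

index = build_path_index(ET.fromstring(SAMPLE))
print(sorted(e.text for e in query_descendants(index, "bib", "author")))
```

The query touches only the distinct label paths, so its cost depends on the number of different paths in the document rather than on the number of nodes.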
Currently, Navigation contains 13 operators, including:
1. Step
2. CondTreeNode
3. Path
4. ForVarBind
5. LetVarBind
6. FLWR
7. EleConstructor
8. AttrConstructor
9. BuiltInFun
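To make the idea of navigational operators concrete, here is a minimal sketch of how a Step operator and a Path operator might compose. Only the operator names come from the slides; the class interfaces, the two supported axes, and the evaluation semantics are assumptions and do not describe the actual OrientX implementation.

```python
import xml.etree.ElementTree as ET

class Step:
    """Hypothetical Step operator: follow one axis from each context node."""

    def __init__(self, axis, tag):
        if axis not in ("child", "descendant"):
            raise ValueError("unsupported axis: " + axis)
        self.axis = axis
        self.tag = tag

    def evaluate(self, context):
        out, seen = [], set()
        for node in context:
            if self.axis == "child":
                found = [c for c in node if c.tag == self.tag]
            else:
                # iter() also yields the node itself, so exclude it.
                found = [d for d in node.iter(self.tag) if d is not node]
            for elem in found:
                if id(elem) not in seen:  # drop duplicates, keep first-seen order
                    seen.add(id(elem))
                    out.append(elem)
        return out

class Path:
    """Hypothetical Path operator: feed each step's output into the next step."""

    def __init__(self, *steps):
        self.steps = steps

    def evaluate(self, context):
        for step in self.steps:
            context = step.evaluate(context)
        return context

# Hypothetical sample document, for illustration only.
BIB = ET.fromstring(
    "<bib>"
    "<book><author>A</author><title>T</title></book>"
    "<article><author>B</author></article>"
    "<author>C</author>"
    "</bib>"
)

plan = Path(Step("child", "book"), Step("descendant", "author"))
print([e.text for e in plan.evaluate([BIB])])
```

Each operator consumes a list of context nodes and produces a list of result nodes, so larger plans can be assembled by nesting operators in the same way.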
[Figure: query processing. An XQuery query enters the Parser and Translator, which produces an initial query plan for the Evaluator Engine.]