1
Approaching the Skyline in Z Order
Pennsylvania State University, USA Singapore Management University, Singapore
Ken C. K. Lee Baihua Zheng Huajing Li Wang-Chien Lee
1 2 1 1
Presented in VLDB 2007, University of Vienna, Austria
1 2
Approaching the Skyline in Z Order 1 2 Ken C. K. Lee Baihua Zheng - - PowerPoint PPT Presentation
Approaching the Skyline in Z Order 1 2 Ken C. K. Lee Baihua Zheng 1 1 Huajing Li Wang-Chien Lee 1 Pennsylvania State University, USA 2 Singapore Management University, Singapore Presented in VLDB 2007, University of Vienna, Austria 1
1
1 2 1 1
Presented in VLDB 2007, University of Vienna, Austria
1 2
2
3
4
5
Dominance test and Candidate Admission
1 2 3
4
Organization of source dataset can facilitate data access (I/O cost) and eliminate candidate reexam Organization of skyline candidate set can improve dominance test efficiency (CPU-cost) Block-level dominance test can improve dominance test efficiency (CPU-cost)
6
7
1 2 3 4 5 6 7 1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8
p9 1 3 2 4
8
1 2 3 4 5 6 7 1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8
p9 dominance region of p1 maximal point
1) The nearest neighboring point (e.g. p1) should be a skyline 2) Other points behind it should be dominated. 3) The remaining points are incomparable and possibly other skyline points. R-tree is used to index data points as it is good to support NN search. BBS: use iterative NN search to reduce the repeated access of R-tree.
9
R-tree: indexes data points to support NN search. BBS: iterative NN search to reduce the repeated access of R-tree.
1 2 3 4 5 6 7 1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8
p9
a heap orders accessed data points a main memory R-tree (mmR-tree) stores candidate skylines’ dominance regions for dominance tests.
High main memory contention to maintain a heap Inefficient to support dominance tests
P9 has to against Ba and Bb as it is enclosed by their MBBs.
10
– Partitioning a 2D space into 4 equi-sized subspaces – Data points in Region IV
possibly dominated by those in Region II and Region III
– Region I Region II Region III Region IV, or – Region I Region III Region II Region IV
1 2 3 4 5 6 7 1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8
x y
p9 I II III IV 1 2 3 4 5 6 7 1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8 p9
11
1 2 3 4 5 6 7 1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8
x y
p9 I II III IV
v
12
13
1 2 3 4 5 6 7 p1 p3 p2 p4 p5 p6 p7 p8 p9 1 2 3 4 5 6 7
p
1
p
5 p 6 p 7
p
8 p 9
p
2 p 3 p 4
[ p
1 , p 1 ] [ p 2 , p 4 ]
[ p
5 , p 7 ] [ p 8 , p 9 ]
[ p
1 , p 4 ] [ p 5 , p 9 ]
14
maxpt minpt curve segment Z-region RZ-region p9 p8
15
1 2 3 4 5 6
16
17
Skyline point ZBtree ZBtree nodes {} N1, N2 {} N3, N4, N2 {} N7, N4, N2 {p1} N8, N2 {p1},{p2,p3} N2 {p1},{p2,p3} N5, N6 {p1},{p2,p3},{p5,p6} N6
18
19
20
21
22
23
24
property is no longer valid)
– P2 (1, 6), and P8 (4,5)
dominated are removed.
dominant skyline.
25
26
27