Improving Spatial Data Processing by Clipping Minimum Bounding Boxes - - PowerPoint PPT Presentation

improving spatial data processing by clipping minimum
SMART_READER_LITE
LIVE PREVIEW

Improving Spatial Data Processing by Clipping Minimum Bounding Boxes - - PowerPoint PPT Presentation

Improving Spatial Data Processing by Clipping Minimum Bounding Boxes Darius Sidlauskas Sean Chester EPFL NTNU Eleni Tzirita Zacharatou Anastasia Ailamaki EPFL EPFL Br Brain mo model el (axons) 97% of the Minimum Bounding Box is empty


slide-1
SLIDE 1

Improving Spatial Data Processing by Clipping Minimum Bounding Boxes

Anastasia Ailamaki

EPFL

Darius Sidlauskas

EPFL

Sean Chester

NTNU

Eleni Tzirita Zacharatou

EPFL

slide-2
SLIDE 2

2

97% of the Minimum Bounding Box is empty

Br Brain mo model el (axons)

slide-3
SLIDE 3

3

Up to 64% of the accessed leaf nodes are false hits

Empty space ce è unnece cessary I/Os Os

Query 20 40 60 80 100 High Medium Low Query Selectivity Optimal/Actual #leafAcc (%)

slide-4
SLIDE 4

4

Tighter struct cture (convex hull)

Empty space from 97% to 37%, but requires 49+ points

slide-5
SLIDE 5

5

Ho How t w to r

  • red

educe d e dea ead sp space wi e with th on

  • nly f

few e w extr tra p poi

  • ints

ts

slide-6
SLIDE 6

6

“Li “Light cuts” ” using only few extra points

slide-7
SLIDE 7

7

45% reduction in empty space with just 3 extra points

“Li “Light cut cuts” us using o ng onl nly f few e w extra po points

slide-8
SLIDE 8
  • Relevant to a corner of the Minimum Bounding Box.
  • The rectangular area between the clip point and the

corner is dead.

8

Cl Clip point

  • 2
  • 1
  • 3
  • 4
  • 5

<p2,00> <p3,00>

Low representation overhead for clipped areas

R11 <p1,11> R00

slide-9
SLIDE 9
  • Augments the Minimum Bounding Box with a set of

clip points.

  • The lesser the retained volume, the better the

approximation.

9

Cl Clipped ed Bo Bounding g Bo Box (CBB) CBB)

  • 2
  • 1
  • 3
  • 4
  • 5

<q,11> <t,00> R11 R00 <p,11>

slide-10
SLIDE 10

10

Challenge: Choice ce of cl clip points

  • 2
  • 1
  • 3
  • 4
  • 5

Choose ≤ k clip points that maximize the eliminated volume

slide-11
SLIDE 11

11

Candidate cl clip points

  • 3
  • 1
  • 2
  • 4
  • 5

R00

  • For given corner Rb:
  • Consider only points in the outer surface of the objects oi.
  • Consider only the closest corner oib.
slide-12
SLIDE 12

12

Candidate cl clip points

  • 3
  • 1
  • 2
  • 4
  • 5

R00

  • For given corner Rb:
  • Consider only points in the outer surface of the objects oi.
  • Consider only the closest corner oib.
slide-13
SLIDE 13

Sk Skyline e cl clip points

  • 3
  • 1
  • 2
  • 4
  • 5

R00

  • For given corner Rb:
  • Consider only points in the outer surface of the objects oi.
  • Consider only the closest corner oib.
  • Only the clip points in the Skyline of {oib} are valid clip

points!

slide-14
SLIDE 14

14

Sk Skyline-ba based d CBB

  • 3
  • 1
  • 2
  • 4
  • 5

R00 R11 R10 R01

  • Get skyline points with respect to each corner Rb.
  • Choose up to k points.
slide-15
SLIDE 15
  • 5
  • 1
  • 4

15

Sk Skyline-ba based d CBB (k k = 1 = 1)

  • 2
  • 3
slide-16
SLIDE 16

16

Sk Skyline-ba based d CBB (k = 2 = 2)

  • 5
  • 4
  • 1
  • 2
  • 3
slide-17
SLIDE 17

17

Sk Skyline-ba based d CBB (k = 3 = 3)

  • 5
  • 4
  • 1
  • 2
  • 3
slide-18
SLIDE 18
  • “Between” two skyline points.
  • Retain the “best” value in each dimension.
  • Clip away significantly more dead space.
  • Require more expensive pre-processing.

18

St Stairline cl clip po points ts

  • 3
  • 1
  • 2
  • 4
  • 5

R11 q p

slide-19
SLIDE 19

19

St Stairline-ba based d CBB CBB

  • 3
  • 1
  • 2
  • 4
  • 5

R11 R00 R10 R01

  • Get stairline points that are valid clip points with

respect to each corner Rb.

  • Choose up to k points.
slide-20
SLIDE 20
  • 3
  • 1
  • 2
  • 4
  • 5

20

St Stairline-ba based d CBB (k k = 1 = 1)

slide-21
SLIDE 21
  • 3
  • 1
  • 2
  • 4
  • 5

21

St Stairline-ba based d CBB (k k = 2 = 2)

slide-22
SLIDE 22
  • 3
  • 1
  • 2
  • 4
  • 5

22

St Stairline-ba based d CBB (k k = 3 = 3)

slide-23
SLIDE 23
  • 3
  • 1
  • 2
  • 4
  • 5

23

St Stairline-ba based d CBB (k k = 4 = 4)

slide-24
SLIDE 24
  • 3
  • 1
  • 2
  • 4
  • 5

24

St Stairline-ba based d CBB (k k = 5 = 5)

slide-25
SLIDE 25
  • R-tree variants Quadratic [QR-tree], Hilbert [HR-tree], R*-tree,

Revised R*-tree [RR*-tree]

  • Range queries
  • High: ≈ 1 object per query
  • Medium: ≈ 10 objects per query
  • Low: ≈ 100 objects per query
  • Hardware Quad-core Intel Core i7-3770 CPU @ 3.4GHz, 16GB RAM,

500GB HDD - 7200RPM

25

Ex Experimental Setup

rea02 ~2M elements axo03 ~2.5 M elements par02/par03 230 elements

  • Spatial Join
slide-26
SLIDE 26

26

Dead space ce el elimi mination

CBBs remove 27% - 60% of dead space

Stairline clipping Dead space / node volume (%) QR-tree HR-tree R*-tree RR*-tree Dead space:

slide-27
SLIDE 27

27

Range query performance ce

Selectivity: high medium low

≈26% I/O reduction across all R-trees/workloads

  • Avg. #leafAcc w.r.t. original (%)

QR-tree HR-tree R*-tree RR*-tree Stairline clipping

slide-28
SLIDE 28

Querying 1B spatial object cts

28

  • Avg. query time (sec)

par02: 71 GB par03: 91 GB

Enabling interactive times for 1B objects

Selectivity: medium RR*-tree Original Skyline-clipped Stairline-clipped HR-tree

slide-29
SLIDE 29
  • The Minimum Bounding Box (MBB) is ubiquitous
  • Compact
  • Cheap intersection tests
  • Poor approximation of real data: can be > 90 % empty

è up to 64% unnecessary I/Os!

  • The Clipped Bounding Box
  • Augments the MBB with few additional clip points
  • Retains the simplicity of the MBB
  • Eliminates up to 60 % of dead space
  • Enables interactive exploration of 1B objects

29

Ta Take home message Th Than ank you

  • u!