CS133 Computational Geometry Computational Geometry on Big Data 1 - - PowerPoint PPT Presentation

β–Ά
cs133
SMART_READER_LITE
LIVE PREVIEW

CS133 Computational Geometry Computational Geometry on Big Data 1 - - PowerPoint PPT Presentation

CS133 Computational Geometry Computational Geometry on Big Data 1 Big Geometric Data Geotagged Satellite Imagery Check ins Tweets Billions of check ins More than 17 PB Billions of tweets Millions more every day 2 MapReduce Map Reduce


slide-1
SLIDE 1

CS133

Computational Geometry

Computational Geometry on Big Data

1

slide-2
SLIDE 2

Big Geometric Data

2

More than 17 PB

Satellite Imagery Geotagged Tweets

Billions of tweets Billions of check ins Millions more every day

Check ins

slide-3
SLIDE 3

MapReduce

3

Input Big Data Shuffle Map Map Map Map Reduce Reduce Reduce Reduce Output Split Input

slide-4
SLIDE 4

CG Algorithms on big data

Utilize divide and conquer algorithms

  • 1. Partition the input across machines
  • 2. (Optional) prune partitions that do not

contribute to answer

  • 3. Apply the algorithm locally in each partition
  • 4. Combine the partial answers to compute

the final result

slide-5
SLIDE 5

Examples

Convex hull algorithm Closest pair Farthest pair Voronoi diagram/Delaunay triangulation

5

slide-6
SLIDE 6

Data Partitioning

6

slide-7
SLIDE 7

Spatial Partitioning

7

slide-8
SLIDE 8

Skyline (Maximal Vectors)

Select all non-dominated points

8

Input Output

𝑦1, 𝑧1 ≻ 𝑦2, 𝑧2 ⟺ 𝑦1 β‰₯ 𝑦2 ∧ 𝑧1 β‰₯ 𝑧2

slide-9
SLIDE 9

Skyline in MapReduce

Non-spatial partitioning Spatial partitioning ο‚„Global skyline Local skyline ο‚‚Pruning Partition

slide-10
SLIDE 10

Skyline Pruning

10

π‘ž2 π‘ž1 π‘ž1 π‘ž2 π‘ž3

π‘ž1. π‘¦π‘›π‘—π‘œ, π‘ž1. π‘§π‘›π‘—π‘œ ≻ π‘ž2. 𝑦𝑛𝑏𝑦, π‘ž2. 𝑧𝑛𝑏𝑦 π‘ž1. π‘¦π‘›π‘—π‘œ, π‘ž1. 𝑧𝑛𝑏𝑦 ≻ π‘ž3. 𝑦𝑛𝑏𝑦, π‘ž3. 𝑧𝑛𝑏𝑦 π‘ž1. 𝑦𝑛𝑏𝑦, π‘ž1. π‘§π‘›π‘—π‘œ ≻ π‘ž2. 𝑦𝑛𝑏𝑦, π‘ž2. 𝑧𝑛𝑏𝑦

Partition domination rules

slide-11
SLIDE 11

Convex Hull

11

slide-12
SLIDE 12

Convex Hull in MapReduce

Non-spatial partitioning Spatial partitioning Partition Local hull ο‚„Global hull ο‚‚Pruning

slide-13
SLIDE 13

Pruning

13

The intersection of the four skyline pruning rules with all directions

slide-14
SLIDE 14

Closest Pair

Find the pair of points that have the shortest Euclidean distance

Input Output

slide-15
SLIDE 15

Closest Pair in MapReduce

Non-spatial partitioning Spatial partitioning  Global closest pair ο‚‚ Local closest pair Partition

slide-16
SLIDE 16

Farthest Pair

Find the pair of points that have the largest Euclidean distance

Input Output

slide-17
SLIDE 17

Farthest Pair in MapReduce

Non-spatial partitioning Spatial partitioning Partition Local farthest pair ο‚„Global farthest pair ο‚‚Pruning

slide-18
SLIDE 18

Voronoi Diagram

Partitioning Local VD Pruning Vertical Merge Pruning Horizontal Merge Final output

slide-19
SLIDE 19

Voronoi Diagram Pruning

19

slide-20
SLIDE 20

Conclusion

Computational geometry algorithms can be parallelized Both non-spatial and spatial partitioning can be used Spatial partitioning enables some pruning techniques This method applies to several computational geometry algorithms