GPU ACCELERATED SELF-JOIN FOR THE DISTANCE SIMILARITY METRIC
MIKE GOWANLOCK
NORTHERN ARIZONA UNIVERSITY SCHOOL OF INFORMATICS, COMPUTING & CYBER SYSTEMS
BEN KARSIN
UNIVERSITY OF HAWAII AT MANOA DEPARTMENT OF INFORMATION AND COMPUTER SCIENCES
GPU ACCELERATED SELF-JOIN FOR THE DISTANCE SIMILARITY METRIC MIKE - - PowerPoint PPT Presentation
GPU ACCELERATED SELF-JOIN FOR THE DISTANCE SIMILARITY METRIC MIKE GOWANLOCK NORTHERN ARIZONA UNIVERSITY SCHOOL OF INFORMATICS, COMPUTING & CYBER SYSTEMS BEN KARSIN UNIVERSITY OF HAWAII AT MANOA DEPARTMENT OF INFORMATION AND COMPUTER
NORTHERN ARIZONA UNIVERSITY SCHOOL OF INFORMATICS, COMPUTING & CYBER SYSTEMS
UNIVERSITY OF HAWAII AT MANOA DEPARTMENT OF INFORMATION AND COMPUTER SCIENCES
parallelism for indexing multi-dimensional datasets
Distributed Systems, vol. 26, no. 8, pp. 2258–2271, 2015.
parallel index for multi-dimensional datasets” JPDC, 113, pp. 195–203, 2018.
parallelism for indexing multi-dimensional datasets
Distributed Systems, vol. 26, no. 8, pp. 2258–2271, 2015.
parallel index for multi-dimensional datasets” JPDC, 113, pp. 195–203, 2018.
parallelism for indexing multi-dimensional datasets
Distributed Systems, vol. 26, no. 8, pp. 2258–2271, 2015.
parallel index for multi-dimensional datasets” JPDC, 113, pp. 195–203, 2018.
Searched point Data points ε-bounded search space (9 cells in 2-D)
Searched point Data points ε-bounded search space (9 cells in 2-D)
6 8 14 18 21 22 30 34 44 36 40
B: · · · G: · · · Ch = 6
Amin
h
= 14 Amax
h
= 15
Ch = 7
Amin
h
= 16 Amax
h
= 18
Ch = |G|
Amin
h
= · · · Amax
h
= |D| 1 2 3 4 5 6 7 8 |G| = 11 9 10 18 · · · 1 36 7 2 31 19 30
A:
3 |D| 1 2 13 14 15 16 17 18 · · · · · · · · · |D| − 1 55 p1 · · ·
D:
p2 p3 p18 p19 · · · p30 p31 p|D| p55 · · · · · · p36 · · · 1 2 3 4 5 6 1 2 3 4 5 6 8 14 18 21 22 30 36 34 40 44 ✏ 6 B[1] p36 p7
(a) (b)
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6 1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 5 1 2 3 4 5 6 6
1 2 3 4 Y 1 2 3 4 X 2 1 Z 3 4 1 2 3 4 Y 1 2 3 4 X 4 3 2 1 X 1 2 3 4 Y 2 1 Z
1 2 3 4 5 1 2 3 4 5 6 6
GPU: Brute Force R-Tree SuperEGO GPU GPU: unicomp
0.1 0.2 0.3 0.4 0.5
100 101 102 103 104 105
2 4 6 8 10
10−1 100 101 102 103 104 105
GPU: Brute Force R-Tree SuperEGO GPU GPU: unicomp
10−2 10−1 100 101
20 40 60 80 100 120 140
Avg: All SW2DA SW3DA SW2DB SW3DB SDSS2DA SDSS2DB Syn2D2M Syn3D2M Syn4D2M Syn5D2M Syn6D2M Syn2D10M Syn3D10M Syn4D10M Syn5D10M Syn6D10M
10−2 10−1 100 101
1 2 3 4 5 6 7
Speedup
Avg: All Avg: Real SW2DA SW3DA SW2DB SW3DB SDSS2DA SDSS2DB Syn2D2M Syn3D2M Syn4D2M Syn5D2M Syn6D2M Syn2D10M Syn3D10M Syn4D10M Syn5D10M Syn6D10M
0.00 0.25 0.50 0.75 1.00 1.25 1.50
0.0 0.5 1.0 1.5 2.0 2.5 3.0
SW2DA SW3DA SW2DB SW3DB SDSS2DA SDSS2DB
1 2 3 4 5
0.0 0.5 1.0 1.5 2.0 2.5 3.0
Syn2D10M Syn3D10M Syn4D10M Syn5D10M Syn6D10M