Query Execution in Column-Stores
Atte Hinkka
Seminar on Columnar Databases, Fall 2012
1
Query Execution in Column-Stores Atte Hinkka Seminar on Columnar - - PowerPoint PPT Presentation
Query Execution in Column-Stores Atte Hinkka Seminar on Columnar Databases, Fall 2012 1 Central concepts Column (query) operators Compression considerations Materialization strategies Vectorized operations 2 Query what?
Atte Hinkka
Seminar on Columnar Databases, Fall 2012
1
2
3
4
scan.next() => {“Virtanen”, “Veijo”, 2011-02-01, 3}
4
scan.next() => {“Virtanen”, “Veijo”, 2011-02-01, 3} scan.next() => {“Meikäläinen”, “Matti”, 2012-06-01, 3}
4
scan.next() => {“Virtanen”, “Veijo”, 2011-02-01, 3} select.next() => {“Meikäläinen”, “Matti”, 2012-06-01, 3} scan.next() => {“Meikäläinen”, “Matti”, 2012-06-01, 3}
4
5
aggregation operators (AVG, SUM, ...) may spend only 10% of time averaging and summing, i.e. doing actual work
double multiplication in 3 cycles, MySQL takes 49 to do that; no loop pipelining!
6
7
8
9
predicates
where joins of facts & dimensions
10
11
12
13
Query operators
materialization
array manipulation
vectorized processing and high CPU efficiency
14
15
based on vectors, not tuples or values
efficiency
Instruction-Multiple- Data (SIMD) instructions
16
can handle predicates
17