P o s t g r e S Q L a s a C o l u m n a r S t o r e DCPUG May - PowerPoint PPT Presentation

P o s t g r e S Q L a s a C o l u m n a r S t o r e DCPUG May 2014 Reston, VA Stephen Frost sfrost@snowman.net Resonate, Inc. • Digital Media • PostgreSQL • Hadoop • techjobs@resonateinsights.com • http://www.resonateinsights.com

S t e p h e n F r o s t •PostgreSQL •Major Contributor, Committer •Implemented Roles in 8.3 •Column-Level Privileges in 8.4 •Contributions to PL/pgSQL, PostGIS •Resonate, Inc. •Principal Database Engineer •Online Digital Media Company •We're Hiring! - techjobs@resonateinsights.com

D o y o u r e a d . . . •planet.postgresql.org

W h y C o l u m n a r ? •Reduced overhead •High compressability •Min/Max indexes

P G C o l u m n a r C h a l l e n g e s •Large per-row overhead •Page header •Line pointers •24 bytes per row of visibility information •226 rows/page with single 4-byte integer •No Min/Max indexes •No compression

P a g e H e a d e r s p e c i f i c s •pd_lsn - Identifies xlog record for last change •pd_checksum - Page checksum, if set •pd_flags - Set of flags •pd_lower - Offset to start of free space •pd_upper - Offset to end of free space •pd_special - Offset to special area •pd_pagesize_version - Size (bytes) & layout version •pd_prune_xid - Oldest prunable XID in tuples

P a g e H e a d e r S i z e s •pd_lsn - 8 bytes •pd_checksum - 2 bytes •pd_flags - 2 bytes •pd_lower - 2 bytes •pd_upper - 2 bytes •pd_special - 2 bytes •pd_pagesize_version - 2 bytes •pd_prunce_xid - 4 bytes • •Total: 24 bytes

L i n e p o i n t e r s •One pointer per tuple •4 bytes each •15 bits for offset to start of tuple •2 bits for flags (tuple state) •15 bits for length of tuple (in bytes)

T u p l e H e a d e r •xmin - inserting XID - 4 bytes •xmax - deleteing XID - 4 bytes •ctid - command ID - 4 bytes •tid - current TID - 4 bytes •infomask - 4 bytes •header size / offset - 1 byte •variable length NULL bitmap - 1+ bytes • •Total: 24 bytes

T u p l e s •Data: Single integer - 4 bytes •Padding: 4 bytes (would get bigint for free...) •Tuple length total: 32 bytes •Stored from the end of the page •First tuple at 8160 (8192-32) •Last tuple at 960 (960 - 991)

O v e r a l l P a g e •Page header •0 - 24 •Line pointers •24 - 927 (226x 4 bytes each) •Tuples •960 - 8192 (226x 32 bytes each) •Free space •928 - 960 (32 bytes) •Can't fit another line # + tuple •Density: 11% (904 bytes of data in 8192 page)

W h y P G ? •ACID hotness •Checksums (tho we don't use them, yet) •Already use it for many other things •Great platform for development •Can "hide" the columnar reality

A r r a y s i n P G •An array adds another header to the mix •After the header, tightly packed integers •Arrays can be compressed via TOAST •Can store mixed arrays / non-arrays in a tuple

A r r a y H e a d e r •vl_len - Array length (req'd for var-length types) •ndims - Number of dimensions •dataoffset - Offset to array data (0 if no NULLs) •elemtype - Oid for array element (eg: integer) •(per dimension) •ARR_DIMS - Number of entries in the dimension •ARR_LBOUND - Lower bound of the dimension

A r r a y H e a d e r S i z e s •vl_len - 4 bytes •ndims - 4 bytes •dataoffset - 4 bytes •elemtype - 4 bytes •ARR_DIMS - 4 bytes •ARR_LBOUND - 4 bytes • •Total: 24 •Tuple + Array headers: 24 + 24 = 48 bytes

A r r a y T u p l e s •Data: Single integer - 4 bytes •Padding: 4 bytes (would get bigint for free...) •Tuple length total: 56 bytes •Stored from the end of the page •First tuple at 8136 (8192-56) •Last tuple at 576 (576 - 631)

O v e r a l l P a g e •Page header •0 - 24 •Line pointers •24 - 567 (136x 4 bytes each) •Tuples •576 - 8192 (136x 56 bytes each) •Free space •568 - 576 (8 bytes) •Can't fit another line # + tuple •Density: 7% (544 bytes of data in 8192 page)

W h y A r r a y s ? •After array header, the values are tightly packed •Consider arrays of 200 integers •Big array header, but • Much less overhead overall •Far fewer line pointers •Am I for real? Let's check it out-

A r r a y T u p l e s x 2 0 0 •Data: 200 integers - 800 bytes •Padding: NONE •Tuple length total: 848 bytes •Stored from the end of the page •First tuple at 7344 (8192-848) •Last tuple at 560 (560 - 1407)

O v e r a l l P a g e •Page header •0 - 24 •Line pointers •24 - 59 (9x 4 bytes each) •Tuples •560 - 8192 (9x 848 bytes each) •Free space •60 - 560 (500 bytes) •Could fit another array if we tried.. •Density: 88% (7200 bytes of data in 8192 page)

A r r a y T u p l e s x 2 1 2 •Squeeze it out... •Data: 212 integers - 848 bytes •Padding: NONE •Tuple length total: 896 bytes •Stored from the end of the page •First tuple at 7296 (8192-896) •Last tuple at 128 (128 - 1023)

O v e r a l l P a g e •Page header •0 - 24 •Line pointers •24 - 59 (9x 4 bytes each) •Tuples •128 - 8192 (9x 896 bytes each) •Free space •60 - 128 (68 bytes) - Lots of room... •Not that much lost •Density: 93% (7632 bytes of data in 8192 page)

C a v e a t s •Have to use unnest() •PG may have more difficulty planning •Have to unnest() an entire array to extract data •No visibility info on each value •Stored in individual tables, can be awkward •Requires mapping tables

C o l u m n a r A d v a n t a g e s ? •Compression via TOAST •Will try to compress arrays > 2k •May be unable to •Requires more CPU to decompress •Min/Max •Add 'min_value' and 'max_value columns •Use regular btree indexes •Include in queries

Q u e s t i o n s ?

T h a n k y o u ! Stephen Frost sfrost@snowman.net @net_snow

P o s t g r e S Q L a s a C o l u m n a r S t o r e DCPUG May - PowerPoint PPT Presentation

P o s t g r e S Q L a s a C o l u m n a r S t o r e DCPUG May 2014 Reston, VA Stephen Frost sfrost@snowman.net Resonate, Inc. Digital Media PostgreSQL Hadoop techjobs@resonateinsights.com http://www.resonateinsights.com

An In-Depth Analysis of Data Aggregation Cost Factors in a Columnar In-Memory Database Stephan

Dremel: Interactice Analysis of Web-Scale Datasets By Sergey Melnik, Andrey Gubarev, Jing Jing

Deep Learning and Hardware: Matching the Demands from the Machine Learning Community Ekapol

Classifying local four gluon S-matrices Subham Dutta Chowdhury November 20, 2020 YITP Strings

Check MIB <draft-nunzi-check-mib-00.txt> Giorgio Nunzi, Juergen Quittek, Marcus Brunner,

Intro to Oracle TimesTen --- By Sima Zhu Why in-memory? Basic Architecture TimesTen

Todays Class Carnegie Mellon Univ. Storage Models Dept. of Computer Science System

CS 744: Big Data Systems Shivaram Venkataraman Fall 2018 ADMINISTRIVIA - Assignment 2 grades:

- Lessons learned - What is ? Project of Museum fr Naturkunde Berlin + AWI + University

reionisation: in the context of small-scale structure Sownak Bose sownak.bose@cfa.harvard.edu

Traps and Faults Traps and Faults Review: Mode and Space Review: Mode and Space C A B data

C R E S A N E W M E T H O D T O WA R D S M E A S U R I N G T H E - M A S S S E

The Strategist: Summary Adam Brandenburger J.P . Valles Professor, NYU Stern School of

Tackling Extreme Poverty through Programmes Targeting the World's Ultra-Poor Professor Oriana

4 th Grade PSI The History of Planet Earth 2015-11-10 www.njctl.org Slide 3 / 107 Slide 4 /

Symplectic map description of Halleys comet dynamics P. Haag 1 , G. Rollin 1 , J. Lages 1 , D.

Muon Beam Line for COMET Introduction - Updates for the Superconducting Magnet R&D Magnets

On Vizings edge-colouring question Marthe Bonamy July 28, 2020 Marthe Bonamy On Vizings

Invariant Manifolds Near L 1 and L 2 Points in the Restricted Three-Body Problem Gladston Duarte

Outline 1. (Meta-) Heuristics for Local Search Local Search (Meta-) Heuristics Heuristics for

Buy a Feature Adventure in Immutability and Actors David Pollak CUFP September 26 th , 2008

Predicting structures: Practical concerns CS 6355: Structured Prediction 1 So far What are

The promise and perils of jacksonian America 1836 1 1815 Timeline 1816 - 1836 1836 1815

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

P o s t g r e S Q L a s a C o l u m n a r S t o r e DCPUG May - PowerPoint PPT Presentation

P o s t g r e S Q L a s a C o l u m n a r S t o r e DCPUG May 2014 Reston, VA Stephen Frost sfrost@snowman.net Resonate, Inc. Digital Media PostgreSQL Hadoop techjobs@resonateinsights.com http://www.resonateinsights.com

An In-Depth Analysis of Data Aggregation Cost Factors in a Columnar In-Memory Database Stephan

Dremel: Interactice Analysis of Web-Scale Datasets By Sergey Melnik, Andrey Gubarev, Jing Jing

Deep Learning and Hardware: Matching the Demands from the Machine Learning Community Ekapol

Classifying local four gluon S-matrices Subham Dutta Chowdhury November 20, 2020 YITP Strings

Check MIB &lt;draft-nunzi-check-mib-00.txt&gt; Giorgio Nunzi, Juergen Quittek, Marcus Brunner,

Intro to Oracle TimesTen --- By Sima Zhu Why in-memory? Basic Architecture TimesTen

Todays Class Carnegie Mellon Univ. Storage Models Dept. of Computer Science System

CS 744: Big Data Systems Shivaram Venkataraman Fall 2018 ADMINISTRIVIA - Assignment 2 grades:

- Lessons learned - What is ? Project of Museum fr Naturkunde Berlin + AWI + University

reionisation: in the context of small-scale structure Sownak Bose sownak.bose@cfa.harvard.edu

Traps and Faults Traps and Faults Review: Mode and Space Review: Mode and Space C A B data

C R E S A N E W M E T H O D T O WA R D S M E A S U R I N G T H E - M A S S S E

The Strategist: Summary Adam Brandenburger J.P . Valles Professor, NYU Stern School of

Tackling Extreme Poverty through Programmes Targeting the World's Ultra-Poor Professor Oriana

4 th Grade PSI The History of Planet Earth 2015-11-10 www.njctl.org Slide 3 / 107 Slide 4 /

Symplectic map description of Halleys comet dynamics P. Haag 1 , G. Rollin 1 , J. Lages 1 , D.

Muon Beam Line for COMET Introduction - Updates for the Superconducting Magnet R&amp;D Magnets

On Vizings edge-colouring question Marthe Bonamy July 28, 2020 Marthe Bonamy On Vizings

Invariant Manifolds Near L 1 and L 2 Points in the Restricted Three-Body Problem Gladston Duarte

Outline 1. (Meta-) Heuristics for Local Search Local Search (Meta-) Heuristics Heuristics for

Buy a Feature Adventure in Immutability and Actors David Pollak CUFP September 26 th , 2008

Predicting structures: Practical concerns CS 6355: Structured Prediction 1 So far What are

The promise and perils of jacksonian America 1836 1 1815 Timeline 1816 - 1836 1836 1815

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

Check MIB <draft-nunzi-check-mib-00.txt> Giorgio Nunzi, Juergen Quittek, Marcus Brunner,

Muon Beam Line for COMET Introduction - Updates for the Superconducting Magnet R&D Magnets