NoSQL Postgres Oleg Bartunov Ivan Panchenko Postgres Professional - PowerPoint PPT Presentation

NoSQL Postgres Oleg Bartunov Ivan Panchenko Postgres Professional Moscow University PGDay, Saint Petersburg, July 6, 2017

NoSQL (предпосылки) • Relatjonal DBMS - integratjonal • All APPs communicatjes through RDBMS • SQL — universal language to work with data • All changes in RDBMS are available to all • Changes of the scheme are diffjcult, so → slow releases • Mostly for interactjve work • Aggregates are mostly interested, not the data itself, SQL is needed • SQL takes cares about transactjons, consistency … instead of human

The problem • The world of data and applicatjons is changing • BIG DATA (Volume of data,Velocity of data in-out, Variety of data) • Web applicatjons are service-oriented (SQL → HTTP) • No need for the monolithic database • Service itself can aggregate data and check consistency of data • High concurrency, simple queries • Simple database (key-value) is ok • Eventual consistency is ok, no ACID overhead (ACID → BASE) • Applicatjon needs faster releases, «on-fmy» schema change • NoSQL databases match all of these — scalable, effjcient, fault-tolerant, no rigid schema, ready to accept any data.

NoSQL databases (wikipedia) …+++ Document store Eventually-consistent key-value store Key/value store on disk * Dynamo * Lotus Notes * Cassandra * Tuple space * CouchDB * Project Voldemort * Memcachedb * MongoDB * Redis * Apache Jackrabbit Ordered key-value store * SimpleDB * Colayer * NMDB * fmare * XML databases * Luxio * Tokyo Cabinet o MarkLogic Server * Memcachedb * BigTable o eXist * Berkeley DB Graph Key/value cache in RAM * Neo4j Object database * AllegroGraph * Db4o * memcached Tabular * InterSystems Caché * Velocity * BigTable * Objectjvity/DB * Redis * Mnesia * ZODB * Hbase * Hypertable

The problem • What if NoSQL functjonality is not enough ? • What if applicatjon needs ACID and fmexibility of NoSQL ? • Relatjonal databases work with data with schema known in advance • One of the major complaints to relatjonal databases is rigid schema. It's not easy to change schema online (ALTER TABLE … ADD COLUMN...) • Applicatjon should wait for schema changing, infrequent releases • NoSQL uses json format, why not have it in relatjonal database ?

Challenge to PostgreSQL ! • Full support of semi-stuctured data in PostgreSQL • Storage • Operators and functjons • Effjciency (fast access to storage, indexes) • Integratjon with CORE (planner, optjmizer) • Actually, PostgreSQL is schema-less database since 2003 — hstore, one of the most popular extension !

Introductjon to Hstore id col1 col2 col3 col4 col5 Hstore key1=>val1, key2=>val2,..... ● Easy to add key=>value pair ● No need change schema, just change hstore. ● Schema-less PostgreSQL in 2003 !

NoSQL Postgres briefmy • 2003 — hstore (sparse columns, schema-less) • 2006 — hstore as demo of GIN indexing, 8.2 release • 2012 (sep) — JSON in 9.2 (verify and store) • 2012 (dec) — nested hstore proposal • 2013 — PGCon, Otuawa: nested hstore • 2013 — PGCon.eu: binary storage for nested data jsonb vs hstore • 2013 (nov) — nested hstore & jsonb (betuer/binary) • 2014 (feb-mar) — forget nested hstore for jsonb • Mar 23, 2014 — jsonb commitued for 9.4 • Autumn, 2018 — SQL/JSON for 10.X or 11 ?

JSONB - 2014 ● Binary storage ● Nestjng objects & arrays ● Indexing JSON - 2012 ● Textual storage ● JSON verifjcatjon HSTORE - 2003 ● Perl-like hash storage ● No nestjng ● Indexing

Two JSON data types !!!

Jsonb vs Json SELECT j::json AS json, j::jsonb AS jsonb FROM (SELECT '{"cc":0, "aa": 2, "aa":1,"b":1}' AS j) AS foo; json | jsonb ----------------------------------+---------------------------- {"cc":0, "aa": 2, "aa":1,"b":1} | {"b": 1, "aa": 1, "cc": 0} (1 row) • json: textual storage «as is» • jsonb: no whitespaces • jsonb: no duplicate keys, last key win • jsonb: keys are sorted by (length, key) • jsonb has a binary storage: no need to parse, has index support

Very detailed talk about JSON[B] htup://thebuild.com/presentatjons/json2015-pgconfus.pdf

JSONB is great, BUT there is No good query language — jsonb is a «black box» for SQL

Find something «red» • Table "public.js_test" Column | Type | Modifiers --------+---------+----------- id | integer | not null value | jsonb | select * from js_test; id | value ----+----------------------------------------------------------------------- 1 | [1, "a", true, {"b": "c", "f": false}] 2 | {"a": "blue", "t": [{"color": "red", "width": 100}]} 3 | [{"color": "red", "width": 100}] 4 | {"color": "red", "width": 100} 5 | {"a": "blue", "t": [{"color": "red", "width": 100}], "color": "red"} 6 | {"a": "blue", "t": [{"color": "blue", "width": 100}], "color": "red"} 7 | {"a": "blue", "t": [{"color": "blue", "width": 100}], "colr": "red"} 8 | {"a": "blue", "t": [{"color": "green", "width": 100}]} 9 | {"color": "green", "value": "red", "width": 100} (9 rows)

Find something «red» • VERY COMPLEX SQL QUERY VERY COMPLEX SQL QUERY WITH RECURSIVE t(id, value) AS ( SELECT * FROM SELECT js_test js_test.* UNION ALL FROM ( (SELECT id FROM t WHERE value @> '{"color": SELECT "red"}' GROUP BY id) x t.id, JOIN js_test ON js_test.id = x.id; COALESCE(kv.value, e.value) AS value FROM t LEFT JOIN LATERAL jsonb_each( id | value CASE WHEN jsonb_typeof(t.value) = ----+----------------------------------------------------------------------- 'object' THEN t.value ELSE NULL END) kv ON true 2 | {"a": "blue", "t": [{"color": "red", "width": 100}]} LEFT JOIN LATERAL 3 | [{"color": "red", "width": 100}] jsonb_array_elements( CASE WHEN 4 | {"color": "red", "width": 100} jsonb_typeof(t.value) = 'array' THEN t.value 5 | {"a": "blue", "t": [{"color": "red", "width": 100}], "color": "red"} ELSE NULL END) e ON true 6 | {"a": "blue", "t": [{"color": "blue", "width": 100}], "color": "red"} WHERE kv.value IS NOT NULL OR e.value IS (5 rows) NOT NULL ) )

Find something «red» • WITH RECURSIVE t(id, value) AS ( SELECT * FROM SELECT js_test js_test.* UNION ALL FROM ( (SELECT id FROM t WHERE value @> '{"color": SELECT "red"}' GROUP BY id) x t.id, JOIN js_test ON js_test.id = x.id; COALESCE(kv.value, e.value) AS value FROM • Jsquery t LEFT JOIN LATERAL jsonb_each( CASE WHEN jsonb_typeof(t.value) = SELECT * FROM js_test SELECT * FROM js_test 'object' THEN t.value WHERE WHERE ELSE NULL END) kv ON true value @@ '*.color = "red"'; value @@ '*.color = "red"'; LEFT JOIN LATERAL jsonb_array_elements( CASE WHEN jsonb_typeof(t.value) = 'array' THEN t.value ELSE NULL END) e ON true https://github.com/postgrespro/jsquery WHERE • A language to query jsonb data type kv.value IS NOT NULL OR e.value IS NOT NULL • Search in nested objects and arrays ) • ) More comparison operators with indexes support

JSON in SQL-2016

JSON in SQL-2016 • ISO/IEC 9075-2:2016(E) - htups://www.iso.org/standard/63556.html • BNF htups://github.com/elliotchance/sqltest/blob/master/standards/2016/bnf .txt • Discussed at Developers meetjng Jan 28, 2017 in Brussels • Post -hackers, Feb 28, 2017 (March commitgest) «Atuached patch is an implementatjon of SQL/JSON data model from SQL-2016 standard (ISO/IEC 9075-2:2016(E)), which was published 2016- 12-15 ...» • Patch was too big (now about 16,000 loc) and too late for Postgres 10 :(

SQL/JSON in PostgreSQL • It‘s not a new data type, it‘s a JSON data model for SQL • PostgreSQL implementatjon is a subset of standard: • JSONB - ORDERED and UNIQUE KEYS • jsonpath data type for SQL/JSON path language • nine functjons, implemented as SQL CLAUSEs

SQL/JSON in PostgreSQL • Jso Jsonpa path provides an ability to operate (in standard specifjed way) with json structure at SQL-language level • Dot notatjon — $.a.b.c • Array - [*] • Filter ? - $.a.b.c ? (@.x > 10) • Methods - $.a.b.c.x.type() SELECT * FROM js WHERE JSON_EXISTS(js, 'strict $.tags[*] ? (@.term == "NYC")'); SELECT * FROM js WHERE js @> '{"tags": [{"term": "NYC"}]}';

NoSQL Postgres Oleg Bartunov Ivan Panchenko Postgres Professional - PowerPoint PPT Presentation

NoSQL Postgres Oleg Bartunov Ivan Panchenko Postgres Professional Moscow University PGDay, Saint Petersburg, July 6, 2017 NoSQL () Relatjonal DBMS - integratjonal All APPs communicatjes through RDBMS SQL

Viva, the NoSQL Postgres ! Oleg Bartunov Lomonosov Moscow University, Postgres Professional

Deep Postgres Extensions in Rus t: postgres-extension.rs Jefg Davis <jdavis@postgresql.org>

NoSQL and MongoDB 1 2 Introduction to NoSQL Based on a presentation by Traversy Media 3 What

NoSQL Source: Pramod J. Sadalage and Martin Fowler NoSQL Distilled: A Brief Guide to the

NoSQL like There is No Tomorrow Khawaja Head of Engineering, NoSQL Swaminathan Sivasubramanian

How to Use NoSQL in Enterprise Java Applications Patrick Baumgartner NoSQL Roadshow | Zrich |

NoSQL Terje Gjster, Ph.D. UiA, Grimstad 16. November 2015 Overview Introduction and

How to Use NoSQL in Enterprise Java Applications Patrick Baumgartner NoSQL Roadshow | Basel |

Using Prometheus and Grafana to build a Postgres Dashboard Gregory Stark October 25, 2018

Postgres As Your New DevStack Postgres as a tool for software developers Susanne Schmidt

Jsonpath in examples and roadmap Nikita Glukhov, Oleg Bartunov Postgres Professional SQL/JSON

The NoSQL Ecosystem 7-21-10 Wednesday, July 21, 2010 Executive summary NoSQL is about using

1 2 What is covered in this presentation? A brief history of databases NoSQL WHY, WHAT

NoSQL Concepts, Techniques & Systems Part 1 Valentina Ivanova IDA, Linkping University

NoSQL CS226 Big-data Management 1 Based on a presentation by Traversy Media 2 What is

Tarantool - a NoSQL Tarantool - a NoSQL database with SQL database with SQL Pavel Lapaev,

Revealing Elasticsearch Implementation, Integration, and Execution Objective: Get access to a

High Performance Computing in Web Browsers CE Seminar WT14/15 Henning Lohse High Performance

Online Condition Monitoring in CalemEAM Michael Bchner Erfurt University of Applied Sciences

MongoDB Thomas Schwarz, SJ MongoDB History 2007 Developed by 10gen as a Platform as a Service

BUILDING BLOCKS UML & more.... banerjee@cs.queensu.ca 1 Main Sections UML Use Case

Software Development About us 100% Mexican Software development company. Partners won ExperTIC

Automation & Testing Saved a Project from the Brink of Collapse Jonathan Solrzano-Hamilton

Algebra Based Physics Electromagnetic Waves 2015-12-01 www.njctl.org Slide 3 / 108 Slide 4 /

NoSQL Postgres Oleg Bartunov Ivan Panchenko Postgres Professional - PowerPoint PPT Presentation

NoSQL Postgres Oleg Bartunov Ivan Panchenko Postgres Professional Moscow University PGDay, Saint Petersburg, July 6, 2017 NoSQL () Relatjonal DBMS - integratjonal All APPs communicatjes through RDBMS SQL

Viva, the NoSQL Postgres ! Oleg Bartunov Lomonosov Moscow University, Postgres Professional

Deep Postgres Extensions in Rus t: postgres-extension.rs Jefg Davis &lt;jdavis@postgresql.org&gt;

NoSQL and MongoDB 1 2 Introduction to NoSQL Based on a presentation by Traversy Media 3 What

NoSQL Source: Pramod J. Sadalage and Martin Fowler NoSQL Distilled: A Brief Guide to the

NoSQL like There is No Tomorrow Khawaja Head of Engineering, NoSQL Swaminathan Sivasubramanian

How to Use NoSQL in Enterprise Java Applications Patrick Baumgartner NoSQL Roadshow | Zrich |

NoSQL Terje Gjster, Ph.D. UiA, Grimstad 16. November 2015 Overview Introduction and

How to Use NoSQL in Enterprise Java Applications Patrick Baumgartner NoSQL Roadshow | Basel |

Using Prometheus and Grafana to build a Postgres Dashboard Gregory Stark October 25, 2018

Postgres As Your New DevStack Postgres as a tool for software developers Susanne Schmidt

Jsonpath in examples and roadmap Nikita Glukhov, Oleg Bartunov Postgres Professional SQL/JSON

The NoSQL Ecosystem 7-21-10 Wednesday, July 21, 2010 Executive summary NoSQL is about using

1 2 What is covered in this presentation? A brief history of databases NoSQL WHY, WHAT

NoSQL Concepts, Techniques &amp; Systems Part 1 Valentina Ivanova IDA, Linkping University

NoSQL CS226 Big-data Management 1 Based on a presentation by Traversy Media 2 What is

Tarantool - a NoSQL Tarantool - a NoSQL database with SQL database with SQL Pavel Lapaev,

Revealing Elasticsearch Implementation, Integration, and Execution Objective: Get access to a

High Performance Computing in Web Browsers CE Seminar WT14/15 Henning Lohse High Performance

Online Condition Monitoring in CalemEAM Michael Bchner Erfurt University of Applied Sciences

MongoDB Thomas Schwarz, SJ MongoDB History 2007 Developed by 10gen as a Platform as a Service

BUILDING BLOCKS UML &amp; more.... banerjee@cs.queensu.ca 1 Main Sections UML Use Case

Software Development About us 100% Mexican Software development company. Partners won ExperTIC

Automation &amp; Testing Saved a Project from the Brink of Collapse Jonathan Solrzano-Hamilton

Algebra Based Physics Electromagnetic Waves 2015-12-01 www.njctl.org Slide 3 / 108 Slide 4 /

Deep Postgres Extensions in Rus t: postgres-extension.rs Jefg Davis <jdavis@postgresql.org>

NoSQL Concepts, Techniques & Systems Part 1 Valentina Ivanova IDA, Linkping University

BUILDING BLOCKS UML & more.... banerjee@cs.queensu.ca 1 Main Sections UML Use Case

Automation & Testing Saved a Project from the Brink of Collapse Jonathan Solrzano-Hamilton