Advanced SQL 03 Arrays and User-Defined Functions Torsten Grust - PowerPoint PPT Presentation

Advanced SQL 03 — Arrays and User-Defined Functions Torsten Grust Universität Tübingen, Germany

� � 1 ┆ Arrays: Aliens(?) Inside Table Cells SQL tables adhere to the First Normal Form (1NF): values v inside table cells are atomic w.r.t. the tabular data model: A ⋯ v ⋯ Let us now discuss the array data type: v may hold an ordered array of elements {x ₁ ,...,x ₙ } . 1 SQL treats v as an atomic unit, but ""# ""# array functions and operators also enable SQL to query the x ᵢ individually (there's some ⭍ with 1NF here). 1 To the PostgreSQL developer who decided to use { ⋯ } to denote arrays : No dessert for you today!

� � 2 ┆ Array Types For type τ , τ [] (or τ array ) is the type of homogenous arrays of elements of τ . τ may be built-in or user-defined (enums, row types). Array size is unspecified — the array is dynamic. (PostgreSQL accepts τ [n] but the n is ignored.) T A "# int[] ⋯ {10,20,30} ⋯ ⋯ {30,20,10} ⋯ ⋯ {} ⋯ ⋯ {NULL} ⋯

“Simulating” Arrays (Tabular Array Semantics) T τ -Arrays ┌───┬──────┬───┐ ┌──────────┬───┬─────────┐ │ ⋯ │ A :: κ│ ⋯ │ │ array :: κ│ idx │ elem :: τ | ├───┼──────┼───┤ ├╌╌╌╌╌╌╌╌╌╌┼╌╌╌┼─────────┤ │ │ α₁ │ │ │ α₁ │ 1 │ 10 │ │ │ α₂ │ α₁ │ 2 │ 20 │ │ │ α₃ │ │ │ α₁ │ 3 │ 30 │ │ │ α₄ │ │ │ ⎧ α₂ │ 1 │ 30 │ └───┴──────┴───┘ ⎨ α₂ │ 2 │ 20 │ │ ⎩ α₂ │ 3 │ 10 │ ⚠ empty array α₃ absent │ α₄ │ 1 │ NULL │ └──────────┴───┴─────────┘ κ denotes a suitable key data type. Arrays indexes are of type int and 1-based.

3 ┆ Array Literals One-dimensional array literals of type τ [] : array[] :: τ [] empty array of elements of type τ array[‹x ₁ ›,…,‹x ₙ ›] ⎱ all ‹x ᵢ › of type τ '{‹x ₁ ›,…,‹x ₙ ›}' :: τ [] ⎰ Multi-dimensional rectangular array literals of type τ [][] : ⚠ all sub-arrays need to agree in size ₁◼◼◼◼ >────────?────────@ >────────?────────@ ⋮◼◼◼◼ array[array[‹x ₁₁ ›,…,‹x ₁ₙ ›],…,array[‹x ₖ₁ ›,…,‹x ₖₙ ›]] ᵏ◼◼◼◼ '{{‹x ₁₁ ›,…,‹x ₁ₙ ›},…,{‹x ₖ₁ ›,…,‹x ₖₙ ›}}' :: τ [][] ¹⋯⋯ⁿ

Example: Tree Encoding ( parents[i] ≡ parent of node i ) ✎ t ₁ t ₂ t ₃ ¹ ᵃ ⁶ ᵍ ¹ ³ ᵃ ╷ᵈ ² ⁵ ᵇ ᶜ ⁴ ⁷ ᵇ ᶜ      ╵    ╵ ¹ ⁵ ᵈ ᵉ ² ⁴ ⁵ ᵇ ᶜ ᵉ ³ ⁴⁶ ᵈ ᵉᶠ     ² ³ ᶠ ᵃ Tree shape and node labels held in separate in-sync arrays: Trees tree parents labels t ₁ {NULL,1,2,2,1,5} {'a','b','d','e','c','f'} t ₂ {4,1,1,6,5,NULL,6} {'d','f','a','b','e','g','c'} t ₃ {NULL,1,NULL,1,3} {'a','b','d','c','e'} ¹ ² ³ ⁴ ⁵ ¹ ² ³ ⁴ ⁵ ⤎ @idx

Constructing Arrays Append / prepend element ★ to array or con cat enate arrays: array_append (array[x ₁ ,…,x ₙ ], ★ ) ≡ array[x ₁ ,…,x ₙ , ★ ] array_prepend(array[x ₁ ,…,x ₙ ], ★ ) ≡ array[ ★ ,x ₁ ,…,x ₙ ] array_cat(array[x ₁ ,…,x ₙ ], array[y ₁ ,…,y ₘ ]) ≡ array[x ₁ ,…,x ₙ ,y ₁ ,…,y ₘ ] Overloaded operator || embraces all of the above: xs || ★ ≡ array_append(xs, ★ ) ★ || xs ≡ array_prepend(xs, ★ ) xs || ys ≡ array_cat(xs,ys)

Accessing Arrays: Indexing / Slicing Array indexes i are 1-based (let xs ≡ array[x ₁ ,…,x ₙ ] ): xs[i] ≡ x ᵢ i ∉ {1,…,n}: NULL (NULL)[i] ≡ NULL xs[NULL] ≡ NULL xs[i:j] ≡ array[x ᵢ ,…,x ⱼ ] i > j: array[] xs[i: ] ≡ array[x ᵢ ,…,x ₙ ] ⎱ ⚠ requires xs[ :j] ≡ array[x ₁ ,…,x ⱼ ] ⎰ PostgreSQL 9.6 Access last element x ₙ : xs[array_length(xs, f )] # of elements in dimension f : n xs[cardinality(xs)] + ∑ (# of elements) in all dimensions

Searching for Elements in Arrays ✎ Indexing accesses array by position. Instead, searching accesses arrays by contents . Let xs ≡ array[x ₁ ,…,x ᵢ₋₁ , ★ ,x ᵢ₊₁ ,…,x ⱼ₋₁ , ★ ,x ⱼ₊₁ ,…,x ₙ ] and comparison operator θ ∈ { = , < , > , <> , <= , >= }: x θ ANY(xs) ≡ ∃ i ∈ {1, ⋯ ,n}: x θ xs[i] x θ ALL(xs) ≡ ∀ i ∈ {1, ⋯ ,n}: x θ xs[i] array_position(xs, ★ ) ≡ i if ★ not found: NULL array_positions(xs, ★ ) ≡ array[i,j] if ★ not found: array[] array_replace(xs, ★ , ⬟ ) ≡ array[x ₁ ,…, ⬟ ,…, ⬟ ,…,x ₙ ] ⁱ ʲ

4 ┆ A Bridge Between Arrays and Tables: unnest & array_agg SELECT t.elem ┌────┐ Table t FROM unnest(array[x ₁ ,…,x ₙ ]) AS t(elem) │ elem │ q──────r─────s ≡ ├────┤ ≡ xs │ x ₁ │ │ ⋮ │ │ x ₙ │ └────┘ SELECT array_agg (t.elem) AS xs ┌─────────┐ FROM ( VALUES (x ₁ ), │ xs │ ⋮ ≡ ├─────────┤ (x ₙ )) AS t(elem) │ {x ₁ ,…,x ₙ } │ └─────────┘ unnest( ･ ) : a set-returning function . More on that soon. ⚠ Preservation of order of the x ᵢ is not guaranteed ""#

Representing Order (Indices) As First-Class Values ✎ SELECT t.* ┌────┬───┐ FROM unnest(array[x ₁ ,…,x ₙ ]) │ elem │ idx| WITH ORDINALITY AS t(elem,idx) ≡ ├────┼───┤ + │ x ₁ │ 1 │ │ ⋮ │ ⋮ │ recall ordered aggregates │ x ₙ │ n │ >───────────────?──────────────@ └────┴───┘ SELECT array_agg (t.elem ORDER BY t.idx) AS xs ┌─────────┐ FROM ( VALUES (x ₁ ,1), │ xs │ ⋮ ≡ ├─────────┤ (x ₙ ,n)) AS t(elem,idx) │ {x ₁ ,…,x ₙ } │ └─────────┘ ‹f›( ⋯ ) WITH ORDINALITY adds a trailing column ( + ) of ascending indices 1 , 2 , ""# to the output of function ‹f› .

A Relational Array Programming Pattern Availability of unnest( ･ ) and ordered array_agg( ･ ) suggests a pattern for relational array programming : Array Table Table Array ┌─┬─┐ ┌─┬─┐ ➊ │◼│ 1 │ ➋ │▲│ 2 │ ➌ { ◼ , ◼ , ◼ } │◼│ 2 │ │▲│ 3 │ { ▲ , ▲ , ▲ } ┊ │◼│ 3 │ ┊ │▲│ 1 │ ┊ ┊ └─┴─┘ SQL └─┴─┘ ┊ ┊ ┊ unnest( ･ ) WITH ORDINALITY array_agg( ･ ORDER BY ･ ) At ➋ use the full force of SQL, read/transform/generate elements and their positions at will. ➊ + ➌ constitute overhead : an RDBMS is not an array PL.

5 ┆ Table-Generating Functions What is the type of unnest( ･ ) ? unnest( ･ ) establishes a bridge between arrays and SQL's tabular data model: 2 unnest :: τ [] → SETOF τ In SQL, functions of type τ₁ → SETOF τ₂ are known as set- returning or table(-generating) functions . May be invoked wherever a query expects a table ( FROM clause). Several built-in — may also be defined by the user . 2 Unfortunate naming again: SETOF should probably read BAGOF or TABLE OF .

Series and Subscript Generators Built-in table-generating functions that generate tables of consecutive numbers : generate_series(‹f›,‹t›,‹s›) ┊ generate_subscripts(‹xs›,‹d›) ┌───────────┐ ┊ ┌───────┐ │ □ │ ┊ │ □ │ ├───────────┤ ┊ ├───────┤ │ ‹f› │ ┊ │ 1 │ │ ‹f›+1 × ‹s› │ ┊ │ 2 │ │ ‹f›+2 × ‹s› │ ┊ ┊ ⋮ ┊ ┊ ⋮ ┊ ┊ │ } │ ≤ ‹t› 2 │ ‹f›+ }× ‹s› │ ┊ └───────┘ └───────────┘ ┊ ‹s› ≡ 1, if absent ┊ } ≡ array_length(‹xs›,‹d›) ‹f›,‹t›: numbers/timestamps ┊ can also enumerate } ,…,1

Text Generators (Regular Expression Matching) Use regular expression 3 ‹re› to extract matched substrings from ‹t› or split text ‹t› at defined positions: 1. regexp_matches(‹t›,‹re›,'g') , yields SETOF text[] : Generates one array xs per match of ‹re› in ‹t› . Element xs[i] holds the match of the i ᵗʰ capture group (in ( ⋯ ) ). 2. regexp_split_to_table(‹t›,‹re›) , yields SETOF text : Uses the matches of ‹re› in ‹t› as separators to split ‹t› . Yields table of 5 +1 rows if ‹re› matches 5 times. 3 See regexr.com for tutorials and an interactive playground, for example.

Advanced SQL 03 Arrays and User-Defined Functions Torsten Grust - PowerPoint PPT Presentation

Advanced SQL 03 Arrays and User-Defined Functions Torsten Grust Universitt Tbingen, Germany 1 Arrays: Aliens(?) Inside Table Cells SQL tables adhere to the First Normal Form (1NF): values v inside table cells are atomic

SQL SQL SQL = Structured Query Language Standard query language for relational

A1 (Part 2): Injection SQL Injection SQL injection is prevalent SQL injection is impactful Why a

What is SQL? SQL stands for Structured Query Language SQL lets you access and manipulate

BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E) 1 CHAPTER 4 OUTLINE SQL Data Definition and

This Lecture SQL The SQL language SQL, the relational model, and E/R diagrams SQL Data

Intermezzo: A typical database architecture 136 A typical database architecture SQL SQL SQL

Basic SQL Lecture 2 1 Outline Data in SQL Simple Queries in SQL Queries with more

Advanced SQL 01 The Core of SQL Torsten Grust Universitt Tbingen, Germany 1 The Core

Lecture 3: Advanced SQL 1 / 64 Advanced SQL Relational Language Relational Language User

The SQL Procedure Language (SQL PL) Tony Andrews Themis Education tandrews@themisinc.com

SQL & MySQL Jeff Siarto - TC 361 Whats the Difference? MySQL is a database SQL is

Outline Background SQL history and terminology Introduction SAS seminar Proc

SQL Developed by IBM (for System R) in the 1970s. Standard used by many vendors.

SQL and JS Pitfalls Assignment 2 Preparation SQL Concepts SQL vs. NoSQL

SQL , the Structured Query Language Overview Introduction DDL Commands DML Commands SQL

What is SQL Database Managed Instance? SQL Database (DBaaS) A flavor of SQL DB that designed to

An Interactive System Level Simulation Environment for Systems-On-Chip Daniel Knorreck,

Emergent Phenomena in High-Energy Particle Collisions Peter Skands (Monash University) Image

STATISTICAL PARSING 23.04.19 Statistical Natural Language Processing 1 Statistical Parsing

gumtree Nick Hauser NOBUGS 2014 Sept 24-26 Tsukuba Adding spokes Tried not to reinvent the

Modeling Heterogeneous Embedded Systems with TTool Daniela Genius, Ludovic Apvrille, Letitia W.

Summary of the NuSTEC Workshop on Neutrino-Nucleus Pion Production in the Resonance Region

SysML Model Transformation for Safety and Security Florian Lugou, Raba Ameur-Boulifa, Ludovic

Session 05 Robust and Resistant Regression V&R 6.5, p. 156 ff A radical change of view

Advanced SQL 03 Arrays and User-Defined Functions Torsten Grust - PowerPoint PPT Presentation

Advanced SQL 03 Arrays and User-Defined Functions Torsten Grust Universitt Tbingen, Germany 1 Arrays: Aliens(?) Inside Table Cells SQL tables adhere to the First Normal Form (1NF): values v inside table cells are atomic

SQL SQL SQL = Structured Query Language Standard query language for relational

A1 (Part 2): Injection SQL Injection SQL injection is prevalent SQL injection is impactful Why a

What is SQL? SQL stands for Structured Query Language SQL lets you access and manipulate

BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E) 1 CHAPTER 4 OUTLINE SQL Data Definition and

This Lecture SQL The SQL language SQL, the relational model, and E/R diagrams SQL Data

Intermezzo: A typical database architecture 136 A typical database architecture SQL SQL SQL

Basic SQL Lecture 2 1 Outline Data in SQL Simple Queries in SQL Queries with more

Advanced SQL 01 The Core of SQL Torsten Grust Universitt Tbingen, Germany 1 The Core

Lecture 3: Advanced SQL 1 / 64 Advanced SQL Relational Language Relational Language User

The SQL Procedure Language (SQL PL) Tony Andrews Themis Education tandrews@themisinc.com

SQL &amp; MySQL Jeff Siarto - TC 361 Whats the Difference? MySQL is a database SQL is

Outline Background SQL history and terminology Introduction SAS seminar Proc

SQL Developed by IBM (for System R) in the 1970s. Standard used by many vendors.

SQL and JS Pitfalls Assignment 2 Preparation SQL Concepts SQL vs. NoSQL

SQL , the Structured Query Language Overview Introduction DDL Commands DML Commands SQL

What is SQL Database Managed Instance? SQL Database (DBaaS) A flavor of SQL DB that designed to

An Interactive System Level Simulation Environment for Systems-On-Chip Daniel Knorreck,

Emergent Phenomena in High-Energy Particle Collisions Peter Skands (Monash University) Image

STATISTICAL PARSING 23.04.19 Statistical Natural Language Processing 1 Statistical Parsing

gumtree Nick Hauser NOBUGS 2014 Sept 24-26 Tsukuba Adding spokes Tried not to reinvent the

Modeling Heterogeneous Embedded Systems with TTool Daniela Genius, Ludovic Apvrille, Letitia W.

Summary of the NuSTEC Workshop on Neutrino-Nucleus Pion Production in the Resonance Region

SysML Model Transformation for Safety and Security Florian Lugou, Raba Ameur-Boulifa, Ludovic

Session 05 Robust and Resistant Regression V&amp;R 6.5, p. 156 ff A radical change of view

SQL & MySQL Jeff Siarto - TC 361 Whats the Difference? MySQL is a database SQL is

Session 05 Robust and Resistant Regression V&R 6.5, p. 156 ff A radical change of view