Comprehending Monadic Queries Jeremy Gibbons (joint work with Fritz - PowerPoint PPT Presentation

Comprehending Monadic Queries Jeremy Gibbons (joint work with Fritz Henglein, Ralf Hinze, Nicolas Wu) WG2.11#15, November 2015

Comprehending Monadic Queries 2 1. Comprehensions • ZF axiom schema of specification: { x 2 | x ∈ Nat ∧ x < 10 ∧ x is even } • SETL set-formers: { x ∗ x : x in { 0 . . 9 } | x mod 2 = 1 } • Eindhoven Quantifier Notation: ( x : 0 � x < 10 ∧ x is even : x 2 ) • Haskell (NPL, Python, . . . ) list comprehensions: [ x ∧ 2 | x ← [ 0 . . 9 ], even x ]

Comprehending Monadic Queries 3 2. Relational algebra vs calculus Consider two database tables: customers : cid , name , address invoices : iid , customer , amount , due A query in relational algebra (‘point-free’, on relations): π name , amount , address (σ due < today ( customers ⋈ cid = customer invoices )) The same query in relational calculus (‘point-wise’, on tuples): SELECT name, amount, address FROM customers, invoices WHERE cid = customer AND due < today The algebraic style may be convenient for formal manipulation, but the calculus style is much more accessible for readers. DBMSs typically translate from calculus-style input to algebra-style intermediate representation.

Comprehending Monadic Queries 4 3. Comprehending queries Trinder (1991) argued for comprehensions as a query notation: [ ( c . name , c . address , i . amount ) | c ← customers , i ← invoices , c . cid == i . customer , i . due < today ] Very influential observation in the DBPL community. Formed the basis of languages such as Buneman’s Kleisli , Microsoft LINQ , Wadler’s Links , as well as querying for objects ( OQL ) and XML ( XQuery ).

Comprehending Monadic Queries 5 4. Comprehending monads (Wadler 1992) The necessary structure is that of a monad ( T , > > = , return ) : (> > = ) :: T a → ( a → T b ) → T b ( x > > = f ) > > = k = x > > = (λ a → f a > > = k ) return :: a → T a return a > > = k = k a x > > = return = x with additionally mzero :: T a . Comprehensions can then be generalized to other monads: D [ e | ] = return e D [ e | p ← e ′ , Q ] = e ′ > > = λ p → D [ e | Q ] = guard e ′ > D [ e | e ′ , Q ] > D [ e | Q ] D [ e | let d , Q ] = let d in D [ e | Q ] (where guard b = if b then return () else mzero ). Hence monad comprehensions for sets, bags, maps-to-monad-zeroes, etc.

Comprehending Monadic Queries 6 5. The problem with joins The comprehension yields a terrible query plan! Constructs entire cartesian product, then discards most of it: cp customers invoices ⊲ filter (λ( c , i ) → c . cid == i . customer ) ⊲ filter (λ( c , i ) → i . due < today ) ⊲ fmap (λ( c , i ) → ( c . name , c . address , i . amount ) (where ⊲ is reverse function application). Better to group by customer identifier, then handle groups separately: ( indexBy cid customers ) ‘ merge ‘ ( indexBy customer invoices ) ⊲ fmap ( id × filter (λ i → i . due < today )) ⊲ fmap ( fmap (λ c → ( c . name , c . address )) × fmap (λ i → i . amount )) (where indexBy partitions, and merge pairs on common index). But this doesn’t correspond to anything expressible in comprehensions.

Comprehending Monadic Queries 7 6. Comprehensive comprehensions Various extensions to the comprehension syntax: • parallel (‘zip’) comprehensions (since GHC 5.0, 2001): [( x , y ) | x ← [ 1 , 2 , 3 ] | y ← [ 4 , 5 , 6 ]] • ‘order by’ and ‘group by’ (Wadler & Peyton Jones, 2007): [ ( the dept , sum salary ) | ( name , dept , salary ) ← employees , then group by dept using groupWith , then sortWith by sum salary , then take 5 ] (NB group by rebinds the variables bound earlier!) Initially just for lists, but. . .

Comprehending Monadic Queries 8 Generalized comprehensive comprehensions . . . generalizes nicely to other monads (Giorgidze et al, 2011): D [ e | ( Q | R ), S ] = mzip ( D [ vQ | Q ]) ( D [ vR | R ]) > > = λ( vQ , vR ) → D [ e | S ] D [ e | Q , then f by b , R ] = f (λ vQ → b ) ( D [ vQ | Q ]) > = λ vQ → D [ e | R ] > D [ e | Q , then group by b using f , R ] = f (λ vQ → b ) ( D [ vQ | Q ]) > > = λ ys → case ( fmap vQ 1 ys , ..., fmap vQ n ys ) of vQ → D [ e | R ] where vQ is the tuple of variables bound by Q (and used subsequently), and vQ i is a selector mapping vQ to its i th component.

Comprehending Monadic Queries 9 7. Solving the problem with (equi-)joins Maps-to-bags form a monad-with-zero—roughly: type Map k v = k → v type Table k v = Map k ( Bag v ) Now define indexBy :: Eq k ⇒ ( v → k ) → Bag v → Table k v indexBy f xs k = filter (λ v → f v == k ) xs merge :: Table k v → Table k w → Table k ( v , w ) merge f g = λ k → cp ( f k ) ( g k ) Can use merge for parallel comprehensions: instance MonadZip ( Table k ) where mzip = merge and indexBy for grouping.

Comprehending Monadic Queries 10 Given input tables customers :: Bag ( CID , Name , Address ) invoices :: Bag ( IID , CID , Amount , Date ) evaluate our example query as: query :: Map Int ( Name , Address , Bag Amount ) query = [ ( the name , the addr , amount ) | ( cid , name , addr ) ← customers , then group by cid using indexBy | ( iid , customer , amount , due ) ← invoices , due < today , then group by customer using indexBy ] Avoids expanding the whole cartesian product.

Comprehending Monadic Queries 11 8. Aggregation For database queries, want to aggregate collections: count , sum , some , . . . Problem: maps may be infinite. Solution: restrict to finite maps. Problem: not a monad— return a = λ k → a yields a non-finite map. Solution? semi-monads (with bind but no return). Problem: semi-monad comprehensions—base case uses return : D [ e | ] = return e This is surmountable. . . but we prefer: Solution: graded (indexed, parametric) monads

Comprehending Monadic Queries 12 9. Graded monads Monad ( T , > = , return ) has endofunctor T : C → C , polymorphic functions > (> > = ) :: T a → ( a → T b ) → T b return :: a → T a such that ( x > > = f ) > > = k = x > > = (λ a → f a > > = k ) return a > > = k = k a x > > = return = x Katsumata’s M-graded monad ( T , > > = , return ) for monoid ( M , · , ε) has (non-endo-)functor T : M → [ C , C ] and (> > = ) :: T m a → ( a → T n b ) → T ( m · n ) b return :: a → T ε a with same laws. We use T = Table over monoid ( K , × , 1 ) of finite key types.

� � Comprehending Monadic Queries 13 10. Adjunctions, and query optimization Optimizations depend on a body of meaning-preserving transformations , all arising from algebraic properties of the datatypes— adjunctions : L C ⊥ D with ⌊·⌋ : C ( L X , Y ) ≃ D ( X , R Y ) : ⌈·⌉ R Currying yields indexing; products yield projection and merge; coproducts yield filters; free commutative monoids yield selection and aggregation. Monads famously arise from adjunctions; graded monads do too, albeit in a slightly more complicated way. Work in progress: justifying standard query optimizations via these correspondences.

Comprehending Monadic Queries 14 11. Comprehending semi-monads Prohibit comprehensions with no qualifiers; multiple base cases instead. D [ε | p ← e ′ ] = fmap (λ p → e ′ ) ε D [ε | e ′ ] = ... -- not allowed D [ε | let d ] = ... -- not allowed D [ε | ( Q | R )] = fmap (λ( vQ , vR ) → ε) ( mzip ( D [ vQ | Q ]) ( D [ vR | R ])) D [ε | Q , then f by b ] = fmap (λ vQ → ε) ( f (λ vQ → b ) ( D [ vQ | Q ])) D [ε | Q , then group by b using f ] = fmap (λ ys → case ( fmap vQ 1 ys , ..., fmap vQ n ys ) of vQ → ε) ( f (λ vQ → b ) ( D [ vQ | Q ])) Also, we can’t define guard if we don’t have return , so desugaring of guards needs to change: = if e ′ then D [ε | Q ] else mzero D [ε | e ′ , Q ]

Comprehending Monadic Queries Jeremy Gibbons (joint work with Fritz - PowerPoint PPT Presentation

Comprehending Monadic Queries Jeremy Gibbons (joint work with Fritz Henglein, Ralf Hinze, Nicolas Wu) WG2.11#15, November 2015 Comprehending Monadic Queries 2 1. Comprehensions ZF axiom schema of specification: { x 2 | x Nat x <

Predicting and Comprehending Predicting and Comprehending Asteroid Impacts Asteroid Impacts

Comprehending Ringads Jeremy Gibbons WadlerFest, April 2016 Comprehending Ringads 2 1.

Monadic I/O in Haskell Jim Royer CIS 352 March 5, 2019 Jim Royer Monadic I/O in [1ex]

Queries in PSM The following rules apply to the use of queries: CS 235: 1. Queries

Range Minimum and Lowest Common Ancestor Queries Slides by Solon P. Pissis November 15, 2019

Top- -k k Queries Queries on SQL on SQL Databases Databases Top Top-k Queries on SQL

Middleware Queries Queries Middleware Middleware Queries Prof. Paolo Ciaccia Prof. Paolo

SYMBOLIC LOGIC UNIT 10: SINGULAR SENTENCES Singular Sentences (monadic) Paris is beautiful

Probabilistic Termination by Monadic Affine Sized Typing Ugo dal Lago Charles Grellois FOCUS

From L owenheim to Pnueli, from Pnueli to PSL and SVA Moshe Y. Vardi Rice University Thread

Characterization of Logics on Infinite Linear Orderings Thomas Colcombet ACTS 9-13 February

Monadic Reflection in Haskell Andrzej Filinski DIKU, University of Copenhagen, Denmark

Arithmetic interpretation of the monadic fragment of intuitionistic predicate logic and

Monadic Predicate ukasiewicz Logic. Standard versus General Tautologies Flix Bou University

Topological spaces of monadic MV-algebras R. Grigolia (Tbilisi), A. Di Nola and G. Lenzi (Salerno)

CANADIAN IMMIGRATION LAW EVENT Comprehending the Incomprehensible September

Lecture 3: Homotopical models of type theory Nicola Gambino School of Mathematics University of

THURSTON COUNTY COMPREHENSIVE PLAN UPDATE PLANNING COMMISSION JANUARY 16, 2019 Chapter 3

The Biequivalence of Locally Cartesian Closed Categories and Martin-L of Type Theory with ,

NExIOM, the NASA Constellation Program Ontologies How they are supporting NASA Constellation

Proofs with Feasible Computational Content Helmut Schwichtenberg Mathematisches Institut der

A Mini Course on Martin-L of Type Theory Algebras, Coalgebras, and Interactive Theorem Proving

Computing and Big Data M Goals, Basics, and Models Antonio Corradi Academic year 2018/2019

A Waterfall in the City Centre Richard Huws Last Surviving Fountain Richard Moore, B.Arch