Database Management Systems, R. Ramakrishnan and J. Gehrke 1
Relational Algebra and SQL
Johannes Gehrke johannes@cs.cornell.edu http://www.cs.cornell.edu/johannes Slides from Database Management Systems, 3rd Edition, Ramakrishnan and Gehrke.
Relational Algebra and SQL Johannes Gehrke johannes@cs.cornell.edu - - PowerPoint PPT Presentation
Relational Algebra and SQL Johannes Gehrke johannes@cs.cornell.edu http://www.cs.cornell.edu/johannes Slides from Database Management Systems, 3 rd Edition, Ramakrishnan and Gehrke. Database Management Systems, R. Ramakrishnan and J. Gehrke 1
Database Management Systems, R. Ramakrishnan and J. Gehrke 1
Johannes Gehrke johannes@cs.cornell.edu http://www.cs.cornell.edu/johannes Slides from Database Management Systems, 3rd Edition, Ramakrishnan and Gehrke.
Database Management Systems, R. Ramakrishnan and J. Gehrke 2
– Strong formal foundation based on logic. – Allows for much optimization.
– QLs not expected to be “Turing complete”. – QLs not intended to be used for complex calculations. – QLs support easy, efficient access to large data sets.
Database Management Systems, R. Ramakrishnan and J. Gehrke 3
Database Management Systems, R. Ramakrishnan and J. Gehrke 4
– Schemas of input relations for a query are fixed (but
– The schema for the result of a given query is also
– Positional notation easier for formal definitions,
– Both used in SQL
Database Management Systems, R. Ramakrishnan and J. Gehrke 5
v “Sailors” and “Reserves”
v We’ll use positional or
Database Management Systems, R. Ramakrishnan and J. Gehrke 6
Database Management Systems, R. Ramakrishnan and J. Gehrke 7
– Selection ( ) Selects a subset of rows from relation. – Projection ( ) Deletes unwanted columns from relation. – Cross-product ( ) Allows us to combine two relations. – Set-difference ( ) Tuples in reln. 1, but not in reln. 2. – Union ( ) Tuples in reln. 1 and in reln. 2.
– Intersection, join, division, renaming: Not essential, but
Database Management Systems, R. Ramakrishnan and J. Gehrke 8
v Deletes attributes that are not in
v Schema of result contains exactly
v Projection operator has to
– Note: real systems typically
Database Management Systems, R. Ramakrishnan and J. Gehrke 9
v Selects rows that satisfy
v No duplicates in result!
v Schema of result
v Result relation can be
Database Management Systems, R. Ramakrishnan and J. Gehrke 10
v All of these operations take
– Same number of fields. – `Corresponding’ fields
v What is the schema of result?
Database Management Systems, R. Ramakrishnan and J. Gehrke 11
(sid) sname rating age (sid) bid day 22 dustin 7 45.0 22 101 10/10/96 22 dustin 7 45.0 58 103 11/12/96 31 lubber 8 55.5 22 101 10/10/96 31 lubber 8 55.5 58 103 11/12/96 58 rusty 10 35.0 22 101 10/10/96 58 rusty 10 35.0 58 103 11/12/96
Database Management Systems, R. Ramakrishnan and J. Gehrke 12
Database Management Systems, R. Ramakrishnan and J. Gehrke 13
Database Management Systems, R. Ramakrishnan and J. Gehrke 14
– i.e., A/B contains all x tuples (sailors) such that for every y
– Or: If the set of y values (boats) associated with an x value
Database Management Systems, R. Ramakrishnan and J. Gehrke 15
Database Management Systems, R. Ramakrishnan and J. Gehrke 16
– (Also true of joins, but joins are so common that systems
– x value is disqualified if by attaching y value from B, we
Database Management Systems, R. Ramakrishnan and J. Gehrke 17
Database Management Systems, R. Ramakrishnan and J. Gehrke 18
Database Management Systems, R. Ramakrishnan and J. Gehrke 19
What happens if is replaced by in this query?
Database Management Systems, R. Ramakrishnan and J. Gehrke 20
Database Management Systems, R. Ramakrishnan and J. Gehrke 21
Database Management Systems, R. Ramakrishnan and J. Gehrke 22
Database Management Systems, R. Ramakrishnan and J. Gehrke 23
SELECT [DISTINCT] target-list FROM
[WHERE condition] SELECT S.Name FROM
WHERE S.Age > 25 SELECT DISTINCT S.Name FROM
WHERE S.Age > 25
Database Management Systems, R. Ramakrishnan and J. Gehrke 24
SELECT S.sname FROM Sailors S, Reserves R WHERE S.sid=R.sid AND R.bid=103
Database Management Systems, R. Ramakrishnan and J. Gehrke 25
– Compute the cross-product of relation-list – Discard resulting tuples if they fail condition. – Delete attributes that are not in target-list – If DISTINCT is specified, eliminate duplicate rows.
Database Management Systems, R. Ramakrishnan and J. Gehrke 26
SELECT S.sname FROM Sailors S, Reserves R WHERE S.sid=R.sid AND R.bid=103
Database Management Systems, R. Ramakrishnan and J. Gehrke 27
SELECT S.sid FROM Sailors S, Reserves R WHERE S.sid=R.sid AND R.bid=103
Database Management Systems, R. Ramakrishnan and J. Gehrke 28
SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND (B.color=‘red’ OR B.color=‘green’) SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND B.color=‘red’ UNION SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND B.color=‘green’
Database Management Systems, R. Ramakrishnan and J. Gehrke 29
SELECT S.sid FROM Sailors S, Boats B1, Reserves R1, Boats B2, Reserves R2 WHERE S.sid=R1.sid AND R1.bid=B1.bid AND
S.sid=R2.sid AND R2.bid=B2.bid AND B1.color=‘red’ AND B2.color=‘green’
Database Management Systems, R. Ramakrishnan and J. Gehrke 30
SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND B.color=‘red’ INTERSECT SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND B.color=‘green’
– EXCEPT is set difference
Database Management Systems, R. Ramakrishnan and J. Gehrke 31
SELECT S.age, S.age-5 AS age2, 2*S.age AS age2 FROM
WHERE S.sname LIKE ‘B_%B’
Database Management Systems, R. Ramakrishnan and J. Gehrke 32
SELECT S.sname FROM Sailors S WHERE EXISTS (SELECT * FROM Reserves R WHERE R.bid=103 AND S.sid=R.sid)
Database Management Systems, R. Ramakrishnan and J. Gehrke 33
SELECT S.sname FROM
WHERE NOT EXISTS (SELECT * FROM Reserves R WHERE R.bid=103 AND S.sid=R.sid)
Database Management Systems, R. Ramakrishnan and J. Gehrke 34
SELECT S.sname FROM
Sailors S
WHERE NOT EXISTS ((SELECT B.bid FROM Boats B) EXCEPT
(SELECT R.bid
FROM Reserves R WHERE R.sid=S.sid))
Database Management Systems, R. Ramakrishnan and J. Gehrke 35
SELECT S.sname FROM Sailors S WHERE NOT EXISTS (SELECT B.bid FROM Boats B WHERE NOT EXISTS (SELECT R.bid FROM Reserves R WHERE R.bid=B.bid AND R.sid=S.sid))
Database Management Systems, R. Ramakrishnan and J. Gehrke 36
SELECT * FROM Sailors S WHERE S.rating > ALL (SELECT S2.rating FROM Sailors S2 WHERE S2.sname=‘Horatio’)
Database Management Systems, R. Ramakrishnan and J. Gehrke 37
COUNT (*) COUNT ( [DISTINCT] A) SUM ( [DISTINCT] A) AVG ( [DISTINCT] A) MAX (A) MIN (A) SELECT AVG (S.age) FROM Sailors S WHERE S.rating=10 SELECT COUNT (*) FROM Sailors S
SELECT COUNT (DISTINCT S.rating) FROM Sailors S WHERE S.sname=‘Bob’
Database Management Systems, R. Ramakrishnan and J. Gehrke 38
SELECT S.sname, S.age FROM
WHERE S.Rating > 7 AND
FROM Sailors S2 WHERE S2.Rating > 7)
Database Management Systems, R. Ramakrishnan and J. Gehrke 39
– If rating values go from 1 to 10; we can write 10
Database Management Systems, R. Ramakrishnan and J. Gehrke 40
SELECT [DISTINCT] target-list FROM
[WHERE condition] GROUP BY grouping-list
SELECT S.rating, MIN(S.Age) FROM
GROUP BY S.rating
Database Management Systems, R. Ramakrishnan and J. Gehrke 41
– Compute the cross-product of relation-list – Discard resulting tuples if they fail condition. – Delete attributes that are not in target-list – Remaining tuples are partitioned into groups by the
– One answer tuple is generated per group
Database Management Systems, R. Ramakrishnan and J. Gehrke 42
SELECT S.rating, MIN (S.age) FROM Sailors S WHERE S.age >= 18 GROUP BY S.rating
Database Management Systems, R. Ramakrishnan and J. Gehrke 43
SELECT MIN(S.Age) FROM
GROUP BY S.rating SELECT S.name, S.rating, MIN(S.Age) FROM
GROUP BY S.rating
Database Management Systems, R. Ramakrishnan and J. Gehrke 44
SELECT B.bid, COUNT (*) AS scount FROM
WHERE R.bid=B.bid AND B.color=‘red’ GROUP BY B.bid
Database Management Systems, R. Ramakrishnan and J. Gehrke 45
SELECT Temp.rating, Temp.avgage FROM
FROM Sailors S GROUP BY S.rating) AS Temp WHERE Temp.avgage = (SELECT MIN (Temp2.avgage) FROM (SELECT AVG(S.age) as avgage FROM Sailors S GROUP BY S.rating) AS Temp2
Database Management Systems, R. Ramakrishnan and J. Gehrke 46
SELECT Temp.rating, Temp.minage FROM
FROM Sailors S WHERE S.age >= 18 GROUP BY S.rating) AS Temp WHERE Temp.cnt >= 2
Database Management Systems, R. Ramakrishnan and J. Gehrke 47
SELECT [DISTINCT] target-list FROM
[WHERE qualification] GROUP BY grouping-list HAVING group-qualification
SELECT S.rating, MIN(S.Age) FROM
WHERE
GROUP BY S.rating HAVING COUNT(*) >= 2
Database Management Systems, R. Ramakrishnan and J. Gehrke 48
– Compute the cross-product of relation-list – Discard resulting tuples if they fail condition. – Delete attributes that are not in target-list – Remaining tuples are partitioned into groups by the value
– The group-qualification is applied to eliminate some groups – One answer tuple is generated per qualifying group
Database Management Systems, R. Ramakrishnan and J. Gehrke 49
v Only S.rating and S.age are
GROUP BY or HAVING clauses;
v 2nd column of result is
SELECT S.rating, MIN (S.age) FROM Sailors S WHERE S.age >= 18 GROUP BY S.rating HAVING COUNT (*) > 1
Database Management Systems, R. Ramakrishnan and J. Gehrke 50
SELECT S.rating, MIN (S.age) FROM Sailors S WHERE S.age >= 18 GROUP BY S.rating HAVING 1 < (SELECT COUNT (*) FROM Sailors S2 WHERE S.rating=S2.rating)
Database Management Systems, R. Ramakrishnan and J. Gehrke 51
SELECT S.rating, AVG (S.age) AS avgage FROM Sailors S GROUP BY S.rating ORDER BY avgage
ORDER BY can only appear in top-most query
Database Management Systems, R. Ramakrishnan and J. Gehrke 52
Database Management Systems, R. Ramakrishnan and J. Gehrke 53
SELECT S.Name FROM
WHERE S.Age > 25
Database Management Systems, R. Ramakrishnan and J. Gehrke 54
SELECT S.Name FROM
WHERE NOT(S.Age > 25) OR S.rating > 7
A/B True False NULL True True True True False True False NULL NULL True NULL NULL A NOT(A) True False False True NULL NULL
Database Management Systems, R. Ramakrishnan and J. Gehrke 55
v Useful when
v Can use queries
v Constraints can
CREATE TABLE Reserves
PRIMARY KEY (bid,day), CONSTRAINT noInterlakeRes CHECK (`Interlake’ <>
FROM Boats B WHERE B.bid=bid)))
Database Management Systems, R. Ramakrishnan and J. Gehrke 56
CREATE ASSERTION smallClub CHECK
Database Management Systems, R. Ramakrishnan and J. Gehrke 57
v The relational model has rigorously defined query
v Relational algebra is more operational; useful as
v Several ways of expressing a given query; a query
v SQL is the lingua franca for accessing database