Exercise 11: Graph Databases and Path Queries Database Theory - PowerPoint PPT Presentation

Exercise 11: Graph Databases and Path Queries Database Theory 2020-07-06 Maximilian Marx, David Carral 1 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. 2 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. Solution. 3 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. Solution. ◮ Let G be some labelled hypergraph. 4 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. Solution. ◮ Let G be some labelled hypergraph. ◮ We construct G RDF by reifying hyperedges: for every p -labelled hyperedge ϕ = p ( t 1 , t 2 , . . . , t ℓ ) in G , 5 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. Solution. ◮ Let G be some labelled hypergraph. ◮ We construct G RDF by reifying hyperedges: for every p -labelled hyperedge ϕ = p ( t 1 , t 2 , . . . , t ℓ ) in G , ◮ we add labels p 1 , p 2 , . . . , p ℓ ; 6 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. Solution. ◮ Let G be some labelled hypergraph. ◮ We construct G RDF by reifying hyperedges: for every p -labelled hyperedge ϕ = p ( t 1 , t 2 , . . . , t ℓ ) in G , ◮ we add labels p 1 , p 2 , . . . , p ℓ ; ◮ a vertex v ϕ ; and 7 / 49

Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we encode arbitrary hypergraphs (relational databases) in RDF? RDF can be considered as a synonym for “labelled directed graph” here – the technical details of the RDF standard are not important for this exercise. Solution. ◮ Let G be some labelled hypergraph. ◮ We construct G RDF by reifying hyperedges: for every p -labelled hyperedge ϕ = p ( t 1 , t 2 , . . . , t ℓ ) in G , ◮ we add labels p 1 , p 2 , . . . , p ℓ ; ◮ a vertex v ϕ ; and ◮ edges p 1 ( c ϕ , t 1 ) , p 2 ( c ϕ , t 2 ) , . . . , p ℓ ( c ϕ , t ℓ ) to G RDF . 8 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 1. The “Same generation” Datalog program from the lecture: S ( x , x ) ← human ( x ) S ( x , y ) ← parent ( x , w ) ∧ S ( v , w ) ∧ parent ( y , v ) 9 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 1. The “Same generation” Datalog program from the lecture: S ( x , x ) ← human ( x ) S ( x , y ) ← parent ( x , w ) ∧ S ( v , w ) ∧ parent ( y , v ) Solution. 10 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 1. The “Same generation” Datalog program from the lecture: S ( x , x ) ← human ( x ) S ( x , y ) ← parent ( x , w ) ∧ S ( v , w ) ∧ parent ( y , v ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. 11 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 1. The “Same generation” Datalog program from the lecture: S ( x , x ) ← human ( x ) S ( x , y ) ← parent ( x , w ) ∧ S ( v , w ) ∧ parent ( y , v ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. ◮ This is not a regular language, and hence cannot be expressed as a 2RPQ. 12 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 1. The “Same generation” Datalog program from the lecture: S ( x , x ) ← human ( x ) S ( x , y ) ← parent ( x , w ) ∧ S ( v , w ) ∧ parent ( y , v ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. ◮ This is not a regular language, and hence cannot be expressed as a 2RPQ. ◮ Since the length of a matched path is not accessible in a C2RPQ, this can also not be expressed as a C2RPQ. 13 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 2. Ancestors born in the same city: AncCity ( x , y , x ′ , y ′ ) ← parent ( x , x ′ ) ∧ bornIn ( x , y ) ∧ bornIn ( x ′ , y ′ ) AncCity ( x , y , x ′′ , y ′′ ) ← AncCity ( x , y , x ′ , y ′ ) ∧ AncCity ( x ′ , y ′ , x ′′ , y ′′ ) Query ( x , x ′ , y ) ← AncCity ( x , y , x ′ , y ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. ◮ This is not a regular language, and hence cannot be expressed as a 2RPQ. ◮ Since the length of a matched path is not accessible in a C2RPQ, this can also not be expressed as a C2RPQ. 2. 14 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 2. Ancestors born in the same city: AncCity ( x , y , x ′ , y ′ ) ← parent ( x , x ′ ) ∧ bornIn ( x , y ) ∧ bornIn ( x ′ , y ′ ) AncCity ( x , y , x ′′ , y ′′ ) ← AncCity ( x , y , x ′ , y ′ ) ∧ AncCity ( x ′ , y ′ , x ′′ , y ′′ ) Query ( x , x ′ , y ) ← AncCity ( x , y , x ′ , y ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. ◮ This is not a regular language, and hence cannot be expressed as a 2RPQ. ◮ Since the length of a matched path is not accessible in a C2RPQ, this can also not be expressed as a C2RPQ. 2. The following C2RPQ expresses Query: ( parent ◦ parent ∗ )( x , x ′ ) ∧ bornIn ( x , y ) ∧ bornIn ( x ′ , y ) 15 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 3. Ancestors of Dresden-based family lines: DDAnc ( x , y ) ← parent ( x , y ) ∧ bornIn ( x , dresden ) ∧ bornIn ( y , dresden ) DDAnc ( x , z ) ← DDAnc ( x , y ) ∧ parent ( y , z ) ∧ bornIn ( z , dresden ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. ◮ This is not a regular language, and hence cannot be expressed as a 2RPQ. ◮ Since the length of a matched path is not accessible in a C2RPQ, this can also not be expressed as a C2RPQ. 2. The following C2RPQ expresses Query: ( parent ◦ parent ∗ )( x , x ′ ) ∧ bornIn ( x , y ) ∧ bornIn ( x ′ , y ) 3. 16 / 49

Exercise 2. Exercise. Can the following Datalog programs be encoded using a C2RPQ? In each case, give a suitable C2RPQ or explain why there is none. 3. Ancestors of Dresden-based family lines: DDAnc ( x , y ) ← parent ( x , y ) ∧ bornIn ( x , dresden ) ∧ bornIn ( y , dresden ) DDAnc ( x , z ) ← DDAnc ( x , y ) ∧ parent ( y , z ) ∧ bornIn ( z , dresden ) Solution. ◮ S matches paths of the form parent n ◦ human ◦ parent n , with n ≥ 0. 1. ◮ This is not a regular language, and hence cannot be expressed as a 2RPQ. ◮ Since the length of a matched path is not accessible in a C2RPQ, this can also not be expressed as a C2RPQ. 2. The following C2RPQ expresses Query: ( parent ◦ parent ∗ )( x , x ′ ) ∧ bornIn ( x , y ) ∧ bornIn ( x ′ , y ) 3. ◮ DDAnc matches paths where every node has a bornIn-connection to dresden. 17 / 49

Exercise 11: Graph Databases and Path Queries Database Theory - PowerPoint PPT Presentation

Exercise 11: Graph Databases and Path Queries Database Theory 2020-07-06 Maximilian Marx, David Carral 1 / 49 Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we

Top- -k k Queries Queries on SQL on SQL Databases Databases Top Top-k Queries on SQL

Inductive Inductive Inductive Inductive Databases Databases Databases Databases and

Queries in PSM The following rules apply to the use of queries: CS 235: 1. Queries

Neo4j and graph databases Presented By: Stephanie McIntyre Graph Databases: The Database Model

Exercise 4: Conjunctive Queries, CSP, and Hypergraphs Database Theory 2020-05-04 Maximilian

Creating Databases and Tables Introduction to Databases in Python Creating Databases

Lecture 11: Persistent Memory Databases 1 / 71 Persistent Memory Databases Recap

More On Paths Supplement to Chapter 4, Graph Theory Path definition What is a path? We

Three Graph Algorithms Shortest Distance Paths Distance/Cost of a path in weighted graph sum of

GRAPH TRAVERSAL PATH FINDING AND GRAPH TRAVERSAL Path finding refers to determining the shortest

Databases Picture by Jeremy Hiebert [http://www.flickr.com/photos/jeremyhiebert/] Graph Databases

An Analysis of the Feasibility of Graph Compression Techniques for Indexing Regular Path Queries

Exercise and Secondary Exercise and Secondary Exercise and Secondary Exercise and Secondary

Exercise 4: Fight Club Karl Gmeiner 2015 Exercise 4: Fight Club 1 Exercise 4: Fight Club The

Exercise 2: Materials Exercise 2: Materials FLUKA Beginners Course Exercise 2: Materials Aim

Exercise 12: Heavy ions beams Exercise 12: Heavy ions beams Beginners FLUKA Course Exercise

Modelling Word Similarity An Evaluation of Automatic Synonymy Extraction Algorithms Kris Heylen,

Synonyms and Antonyms Synonym: a word that means exactly the same as another word. Antonym: a

Design = To plan or organize Synonym = plan Design is essentially the opposite of chance.

Lesson 8 Vocabulary & Anti synonym Different words with synonym similar meanings

Using an Inverted Index Synopsis for Query Latency and Performance Prediction Nicola Tonellotto

Review of data aggregation Review of data aggregation Query distribution AVERAGE 1 1 2 2 3

CSE 258 Lecture 1.5 Web Mining and Recommender Systems Supervised learning Regression

An Introduction to Distributed Data Streaming Elements and Systems Paris

Exercise 11: Graph Databases and Path Queries Database Theory - PowerPoint PPT Presentation

Exercise 11: Graph Databases and Path Queries Database Theory 2020-07-06 Maximilian Marx, David Carral 1 / 49 Exercise 1 Exercise. It was explained in the lecture that RDF and Property Graph can encode the same graph structures. How could we

Top- -k k Queries Queries on SQL on SQL Databases Databases Top Top-k Queries on SQL

Inductive Inductive Inductive Inductive Databases Databases Databases Databases and

Queries in PSM The following rules apply to the use of queries: CS 235: 1. Queries

Neo4j and graph databases Presented By: Stephanie McIntyre Graph Databases: The Database Model

Exercise 4: Conjunctive Queries, CSP, and Hypergraphs Database Theory 2020-05-04 Maximilian

Creating Databases and Tables Introduction to Databases in Python Creating Databases

Lecture 11: Persistent Memory Databases 1 / 71 Persistent Memory Databases Recap

More On Paths Supplement to Chapter 4, Graph Theory Path definition What is a path? We

Three Graph Algorithms Shortest Distance Paths Distance/Cost of a path in weighted graph sum of

GRAPH TRAVERSAL PATH FINDING AND GRAPH TRAVERSAL Path finding refers to determining the shortest

Databases Picture by Jeremy Hiebert [http://www.flickr.com/photos/jeremyhiebert/] Graph Databases

An Analysis of the Feasibility of Graph Compression Techniques for Indexing Regular Path Queries

Exercise and Secondary Exercise and Secondary Exercise and Secondary Exercise and Secondary

Exercise 4: Fight Club Karl Gmeiner 2015 Exercise 4: Fight Club 1 Exercise 4: Fight Club The

Exercise 2: Materials Exercise 2: Materials FLUKA Beginners Course Exercise 2: Materials Aim

Exercise 12: Heavy ions beams Exercise 12: Heavy ions beams Beginners FLUKA Course Exercise

Modelling Word Similarity An Evaluation of Automatic Synonymy Extraction Algorithms Kris Heylen,

Synonyms and Antonyms Synonym: a word that means exactly the same as another word. Antonym: a

Design = To plan or organize Synonym = plan Design is essentially the opposite of chance.

Lesson 8 Vocabulary &amp; Anti synonym Different words with synonym similar meanings

Using an Inverted Index Synopsis for Query Latency and Performance Prediction Nicola Tonellotto

Review of data aggregation Review of data aggregation Query distribution AVERAGE 1 1 2 2 3

CSE 258 Lecture 1.5 Web Mining and Recommender Systems Supervised learning Regression

An Introduction to Distributed Data Streaming Elements and Systems Paris

Lesson 8 Vocabulary & Anti synonym Different words with synonym similar meanings