welcome
play

Welcome! A Case for Response Time Focused Query Processing Olaf - PowerPoint PPT Presentation

Welcome! A Case for Response Time Focused Query Processing Olaf Hartjg @olafiartjg 1 Informatjon in Dynamic Web Pages A Case for Response Time Focused Query Processing Olaf Hartjg


  1. Welcome! A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 1

  2. Informatjon in Dynamic Web Pages A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 2

  3. Informatjon in Dynamic Web Pages A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 3

  4. Informatjon in Dynamic Web Pages Support for such an incremental visualization has not received much attention in existing work on querying the Web of Data A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 4

  5. Let's rethink our optjmizatjon criteria for Web querying! A case for response tjme focused query processing Olaf Hartjg Dept. of Computer and Informatjon Science, Linköping University, Sweden @olafiartjg

  6. Terminology ● Web querying: queries directly over Web data sources – querying a federation of SPARQL endpoints – querying Linked Data on the Web (interface: URI lookups) – querying other types of Linked Data Fragment interfaces – etc. A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 6

  7. Terminology ● Web querying: queries directly over Web data sources – querying a federation of SPARQL endpoints – querying Linked Data on the Web (interface: URI lookups) – querying other types of Linked Data Fragment interfaces – etc. ● Query execution time (QET): time from issuing a query until the query execution process has been completed ● Response time (RT): time from issuing a query until a specific portion of the query result has been produced – may be measured in terms of a specific number of result elements (i.e., solution mappings in the context of SPARQL) – or in terms of a specific percentage of result elements A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 7

  8. Agenda Aiming to minimize QET is different from aiming to minimize RT ● – Evidence 1 – Evidence 2 Some of our work on RT-focused query processing ● – An attempt to optimize the response times of traversal-based query execution – An attempt to make the core fragment of SPARQL suitable for the task A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 8

  9. Minimizing QET ≠ Minimizing RT Evidence 1 Based on: Maribel Acosta, Maria-Esther Vidal, and York Sure-Vetter: Diefficiency Metrics: Measuring the Continuous Efficiency of Query Processing Approaches . ISWC 2017.

  10. Executjng a Query via a TPF Interface different client-side strategies to execute a given query over a dataset that can be accessed via a Triple Pattern Fragment (TPF) interface A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 10

  11. Minimizing QET ≠ Minimizing RT Evidence 2 Based on: Olaf Hartig and M. Tamer Özsu: Walking without a Map: Ranking- Based Traversal for Querying Linked Data . ISWC 2016. Olaf Hartig and M. Tamer Özsu: Optimizing Response Times of Traversal-Based Linked Data Queries (Extended Version). arXiv:1607.01046

  12. Linked Data Query Processing ● Focus: querying Linked Data live on the Web by relying only on the Linked Data principles – look up URIs to access original data at runtime A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 12

  13. Linked Data Query Processing ● Focus: querying Linked Data live on the Web by relying only on the Linked Data principles – look up URIs to access original data at runtime ● Queries – typically expressed using SPARQL (in practice, BGPs only) – reachability-based query semantics; i.e., scope of evaluation is virtual union of all data in a well-defined reachable subweb A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 13

  14. Linked Data Query Processing ● Focus: querying Linked Data live on the Web by relying only on the Linked Data principles – look up URIs to access original data at runtime ● Queries – typically expressed using SPARQL (in practice, BGPs only) – reachability-based query semantics; i.e., scope of evaluation is virtual union of all data in a well-defined reachable subweb ● Traversal-based query execution – intertwines local result construction with a recursive traversal of (specific) data links – natural support of reachability-based query semantics (discovers reachable subweb at runtime) A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 14

  15. Concrete Implementatjon Approach Data Retrieval Operator Triple pattern ( ?v1 , knows, ?v2 ) . . . Triple Triple Pattern Pattern Operator Operator Dispatcher A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 15

  16. Data Retrieval Operator RDF triple ( Bob, knows, Alice ) Triple pattern . . . . . . . . ( ?v1 , knows, ?v2 ) GET http://example.org/... . . . Triple Triple Pattern Pattern Operator Operator Dispatcher A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 16

  17. Triple Patuern Operator RDF triple ( Bob, knows, Alice ) Triple pattern . . . . . . . . ( ?v1 , knows, ?v2 ) . . . Intermediate Solution Timestamp: 1 Bindings: ?v1 → Bob, ?v2 → Alice Flags: [ ∙ | √ | ∙ | ∙ ] Dispatcher A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 17

  18. Dispatcher . . . . . . . . . . . Intermediate Solution Timestamp: 1 Bindings: ?v1 → Alice, ?v2 → Bob Output Flags: [ ∙ | √ | ∙ | ∙ ] A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 18

  19. Triple Patuern Operator cont’d . . . . . . . . . . . ? X Output A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 19

  20. Triple Patuern Operator cont’d Intermediate Solution Timestamp: 327 Intermediate Solution . . . . . . . . Bindings: ?v1 → Bob, ?v3 → Berlin Timestamp: 461 Flags: [ √ | ∙ | ∙ | ∙ ] Bindings: ?v1 → Bob, ?v2 → Steve . . . Flags: [ ∙ | √ | ∙ | ∙ ] ? Intermediate Solution Timestamp: 461 Bindings: ?v1 → Bob, ? v2 → Steve, ?v3 → Berlin Flags: [ √ | √ | ∙ | ∙ ] Output A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 20

  21. Propertjes ● Supports any reach- Data ability-based query Retrieval . . . . . . . . semantics ● Highly adaptive . . . – Routing of inter- mediate solutions TP Operator TP Operator – Inspired by “Eddies” (Anvur & Hellerstein, SIGMOD 2000) Dispatcher Output A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 21

  22. Hypothesis Query execution time (QET) and response time (RT) can be reduced by applying a suitable routing policy. A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 22

  23. Test with Difgerent Routjng Policies Data retrieval operator simply appends to its lookup queue ● Web simulation environment, diverse test queries (here one of them) ● A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 23

  24. Test with Difgerent Routjng Policies Data retrieval operator simply appends to its lookup queue ● Web simulation environment, diverse test queries (here one of them) ● Each bar represents geometric mean of 5 independent executions ● Response time for last reported solution, relative to overall QET Response time for first reported solution, relative to overall QET A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 24

  25. Test with Difgerent Routjng Policies Data retrieval operator simply appends to its lookup queue ● Web simulation environment, diverse test queries (here one of them) ● Each bar represents geometric mean of 5 independent executions ● Response time for last reported solution, relative to overall QET Response time for first reported solution, relative to overall QET … is essentially the same for all executions of the query A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 25

  26. Test with Difgerent Routjng Policies Data retrieval operator simply appends to its lookup queue ● Web simulation environment, diverse test queries (here one of them) ● Each bar represents geometric mean of 5 independent executions ● Response time for last reported solution, relative to overall QET Response time for first reported solution, relative to overall QET Routing policy has no impact! A Case for Response Time Focused Query Processing – Olaf Hartjg @olafiartjg 26

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend