rsp optimisation techniques
play

RSP Optimisation Techniques M.I. Ali http://intizarali.org - PowerPoint PPT Presentation

Tutorial on RDF Stream Processing 2016 M.I. Ali, J-P Calbimonte, D. Dell'Aglio, E. Della Valle, and A. Mauri http://streamreasoning.org/events/rsp2016 RSP Optimisation Techniques M.I. Ali http://intizarali.org @intizarali


  1. Tutorial on RDF Stream Processing 2016 M.I. Ali, J-P Calbimonte, D. Dell'Aglio, E. Della Valle, and A. Mauri http://streamreasoning.org/events/rsp2016 RSP Optimisation Techniques M.I. Ali http://intizarali.org @intizarali ali.intizar@insight- centre.org

  2. Data Streams are Everywhere Smart Cities and IoT are  leading to an era of streaming world Sensors and mobile  devices are producing an enormous amount of data Mostly in streaming  fashion http://streamreasoning.org/events/rsp2016

  3. Introducing Semantics in Data Streams Why RDF Data Streams?  • Interoperable (easy integration) • Machine Readable • Reasoning • On-demand discovery • Ideal for the web • Dereferencing http://streamreasoning.org/events/rsp2016

  4. The Goal 4 02/11/2016 http://streamreasoning.org/events/rsp2016

  5. CityPulse: Real-time IoT Data Analytics and Large Scale Data Analytics for Smart Cities Applications CityPulse aims to support the integration of dynamic data  sources and context-dependent on-demand adaptations of processing chains during run-time. CityPulse aims to bridge the gap between the application  technologies on the IoT and real world data streams. It will use Cyber-Physical and Social data and will employ big  data analytics and intelligent methods to aggregate, interpret and extract meaningful knowledge and perceptions from large sets of heterogeneous data streams. http://streamreasoning.org/events/rsp2016

  6. CityPulse: Real-time IoT Data Analytics and Large Scale Data Analytics for Smart Cities Applications http://streamreasoning.org/events/rsp2016

  7. Smart City Applications http://streamreasoning.org/events/rsp2016

  8. Is RSP Ready for Action? Available Engines  • CQELS • C-SPARQL • SPARQLStream • … Processing capabilities tests  • Benchmarks – LS – SR – CSR Performance and Scalability  http://streamreasoning.org/events/rsp2016

  9. Is RSP Ready for Action? RSP is still in its cradle  On-going work for query  language and semantics Existing RSP engines are  not more than prototypes Benchmarking for  performance and scalability testing in control environment http://streamreasoning.org/events/rsp2016

  10. Challenges for RSP Optimisation • Data Distribution – Data produced by streams is highly distributed • Unpredictable Data Rate – Stream observation rate is variable – Stream Bursts http://streamreasoning.org/events/rsp2016

  11. Challenges for RSP Optimisation • Number of Concurrent queries – A large number of audience or end users e.g. Citizens of a smart city • Background Data Integration – Streaming queries process a combination of streaming and static knowledge – Currently static knowledge base is processed in memory http://streamreasoning.org/events/rsp2016

  12. Challenges for RSP Optimisation • Quasi-static Data – Fetch and locally process can result into outdated results for quasi-static data • On-demand Discovery – Stream Processing operate in a frequently changing world – Data and applications change quite frequently • Adaptation – Streaming queries in dynamic environment need continuous monitoring http://streamreasoning.org/events/rsp2016

  13. How can we optimise RSP? Benchmarking  Resource Optimisation  Resource Sharing/Join  Optimiaiton Scalability  Load Balancing  Hybrid Reasoning  http://streamreasoning.org/events/rsp2016

  14. Benchmarks SR Bench  LS Bench  CSR Bench  Benchmarking Infrastructure CityBench  YABench  Heaven  http://streamreasoning.org/events/rsp2016

  15. CityBench Benchmarking Suite- CTI CityBench Queries Configurable T estbed Infrastructure (CTI) Smart City Applications Dataset Con fi guration Smart City Query Performance Configuration Data Streams Evaluator … Module Module … RSP Engine Benchmark Results Static Datastore http://streamreasoning.org/events/rsp2016

  16. CityBench Benchmarking Suite  CityBench is designed to evaluate RSP engines for Smart City Applications  It comprises of • 7 real time smart city data sets containing live RDF streams • Configurable Testbed Infrastructure with 6 parameters • 13 queries for 3 smart city applications e.g. Travel Planner, Parking Finder and CityDashboard http://streamreasoning.org/events/rsp2016

  17. CityBench Benchmarking Suite CityBench Datasets  • Vehicle Traffic • Parking • Weather • Pollution • Cultural Events • Library Events • User Location Stream http://streamreasoning.org/events/rsp2016

  18. CityBench Benchmarking Suite- CTI  Configuration Parameters • Changes in Input Streaming Rate • Play Back Time • Variable Background Data Sizes • Number of Concurrent Queries • Number of Streams within a Single Query • Selection of the RSP Engine http://streamreasoning.org/events/rsp2016

  19. CityBench Evaluation  We evaluated 2 state of the art RSP engines • CQELS • C-SPARQL  Both engines were test for their • Latency • Memory Consumption • Completeness  Different settings by fine tuning CTI Parameters • Number of queries, users, background data size etc. 19 http://streamreasoning.org/events/rsp2016 02/11/2016

  20. CityBench Evaluation : Latency  Latency over Increasing Number of Input Streams latency� (ms)� 6000� Q10_8-csparql� Q10_2-csparql� 5000� Q10_2-cqels� 1200� Q10_5-csparql� 4000� 1000� Q10_5-cqels� 800� 3000� 600� 400� 200� 0� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� experiment� me� (minutes)� http://streamreasoning.org/events/rsp2016

  21. CityBench Evaluation : Latency  Latency over Increasing Number of Concurrent Queries • CQELS: Q1, Q5 and Q8 Q5� Q5-10� Q1� latency� (ms)� latency� (ms)� Q5-20� Q8-20� 600� Q1-10� 7000� Q8-10� Q8� Q1-20� 6000� 500� 5000� 400� 4000� 300� 3000� 200� 2000� 100� 1000� 0� 0� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� experiment� me� (minute)� experiment� me� (minute)� http://streamreasoning.org/events/rsp2016

  22. CityBench Evaluation : Latency  Latency over Increasing Number of Concurrent Queries • C-SPARQL: Q1, Q5 and Q8 Q5� latency� (ms)� latency� (ms)� Q1� Q5-10� 3500� 2500� Q1-10� Q5-20� Q8� Q1-20� 3000� 2000� 2500� 1500� 2000� 1500� 1000� 1000� 500� 500� 0� 0� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� experiment� me� (minute)� experiment� me� (minute)� http://streamreasoning.org/events/rsp2016

  23. CityBench Evaluation : Memory Consumption  Memory Consumption over Increasing the Number of Concurrent Queries memory� memory� (MB)� (MB)� 180� 600� Q1� Q1-20� 160� 500� Q5-1� Q1� Q5-20� 140� 400� Q1-20� Q5� 120� 300� Q5-20� 100� 200� 80� 100� 60� 0� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� experiment� me� (minute)� experiment� me� (minute)� http://streamreasoning.org/events/rsp2016

  24. CityBench Evaluation : Memory Consumption  Memory Consumption over Increasing the Size of Background Data memory� 3MB-cqels� 20MB-cqels� (MB)� 30MB-cqels� 3MB-csparql� 250� 20MB-csparql� 30MB-csparql� 200� 150� 100� 50� 0� 1� 2� 3� 4� 5� 6� 7� 8� 9� 10� 11� 12� 13� 14� 15� experiment� me� (minutes)� http://streamreasoning.org/events/rsp2016

  25. CityBench Evaluation: Completeness  Memory Consumption over Increasing the Size of Background Data Completeness� cqels� csparql� (%)� 98� 97� 97� 96� 96� 100� 91.4� 90� 82.4� 74.2� 80� 73.2� 70� 54.4� 60� 50� 40� 30� 20� 10� 0� 30� 60� 90� 120� 150� stream� input� rate� (triple/s)� http://streamreasoning.org/events/rsp2016

  26. RDF Stream Processing (RSP) : Challenges • Optimal Data Source Discovery Streams are everywhere • Multiple data streams can answer the same • query Optimal data stream selection • Catering for user-defined constraints and • preferences • On-Demand Stream Federation Automated composition of primitive data streams • to answer complex queries Adaptation  Data source properties can change over time • Make sure selected sources remain “optimal” • throughout life cycle of the query http://streamreasoning.org/events/rsp2016

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend