probabilities and provenance on trees and treelike
play

Probabilities and Provenance on Trees and Treelike Instances Antoine - PowerPoint PPT Presentation

Probabilities and Provenance on Trees and Treelike Instances Antoine Amarilli 1 , Pierre Bourhis 2 , Pierre Senellart 1 , 3 , 4 September 7th, 2016 1 Tlcom ParisTech 2 CNRS CRIStAL 3 National University of Singapore 4 cole normale


  1. Probabilities and Provenance on Trees and Treelike Instances Antoine Amarilli 1 , Pierre Bourhis 2 , Pierre Senellart 1 , 3 , 4 September 7th, 2016 1 Télécom ParisTech 2 CNRS CRIStAL 3 National University of Singapore 4 École normale supérieure 1/7

  2. How to travel to Highlights from Paris? 2/7

  3. How to travel to Highlights from Paris? 2/7

  4. How to travel to Highlights from Paris? 2/7

  5. How to travel to Highlights from Paris? 2/7

  6. How to travel to Highlights from Paris? (Metro|RER)*|(Bus|Tram)* 2/7

  7. How to travel to Highlights from Paris? 2/7

  8. How to travel to Highlights from Paris? (Metro|RER)*|(Bus|Tram)* 2/7

  9. How to travel to Highlights from Paris? 50% (Metro|RER)*|(Bus|Tram)* 2/7

  10. How to travel to Highlights from Paris? 50% 2/7

  11. How to travel to Highlights from Paris? 50% 90% (Metro|RER)*|(Bus|Tram)* 2/7

  12. How to travel to Highlights from Paris? 72% 42% 37% 78% 90% 50% 83% 90% (Metro|RER)*|(Bus|Tram)* 2/7

  13. How to travel to Highlights from Paris? 72% What is the 42% probability that I can attend Highlights 2016? 37% 78% 90% 50% 83% 90% (Metro|RER)*|(Bus|Tram)* 2/7

  14. Problem statement Input: ? Query Q (Metro|RER)*|(Bus|Tram)* 3/7

  15. Problem statement Input: ? Query Q (Metro|RER)*|(Bus|Tram)* Database D or graph 3/7

  16. Problem statement Input: ? Query Q (Metro|RER)*|(Bus|Tram)* Database D or graph % Probabilities on facts or edges 50% 3/7

  17. Problem statement Input: ? Query Q (Metro|RER)*|(Bus|Tram)* Database D or graph % Probabilities on facts or edges 50% Output: the probability that the query is true under the distribution (assuming independence of all probabilistic events) 3/7

  18. Problem statement Input: ? Query Q (Metro|RER)*|(Bus|Tram)* Database D or graph % Probabilities on facts or edges 50% Output: the probability that the query is true under the distribution (assuming independence of all probabilistic events) Complexity: already #P-hard in the input database! (from #MONOTONE-SAT) 3/7

  19. Using treewidth to make the problem tractable 4/7

  20. Using treewidth to make the problem tractable 4/7

  21. Using treewidth to make the problem tractable 4/7

  22. Using treewidth to make the problem tractable Treewidth by example: 4/7

  23. Using treewidth to make the problem tractable Treewidth by example: 4/7

  24. Using treewidth to make the problem tractable Treewidth by example: 4/7

  25. Using treewidth to make the problem tractable Treewidth by example: 4/7

  26. Using treewidth to make the problem tractable Treewidth by example: 4/7

  27. Using treewidth to make the problem tractable Treewidth by example: 4/7

  28. Using treewidth to make the problem tractable Treewidth by example: 4/7

  29. Using treewidth to make the problem tractable Treewidth by example: 4/7

  30. Using treewidth to make the problem tractable Treewidth by example: 4/7

  31. Using treewidth to make the problem tractable Treewidth by example: 4/7

  32. Using treewidth to make the problem tractable Treewidth by example: 4/7

  33. Using treewidth to make the problem tractable Treewidth by example: 4/7

  34. Using treewidth to make the problem tractable Treewidth by example: 4/7

  35. Using treewidth to make the problem tractable Treewidth by example: 4/7

  36. Using treewidth to make the problem tractable Treewidth by example: • Trees have treewidth 1 • Cycles have treewidth 2 • k -cliques and ( k − 1 ) -grids have treewidth k − 1 4/7

  37. Using treewidth to make the problem tractable Treewidth by example: • Trees have treewidth 1 • Cycles have treewidth 2 • k -cliques and ( k − 1 ) -grids have treewidth k − 1 → Treelike : the treewidth is bounded by a constant 4/7

  38. Tractability on treelike instances Treelike data MSO query (RER|metro)* |(bus|tram)* 5/7

  39. Tractability on treelike instances Treelike data Tree automaton MSO query (RER|metro)* |(bus|tram)* 5/7

  40. Tractability on treelike instances Treelike data Tree encoding Tree automaton MSO query (RER|metro)* |(bus|tram)* 5/7

  41. Tractability on treelike instances Treelike data Tree encoding linear Query [Courcelle] answer TRUE Tree automaton MSO query (RER|metro)* |(bus|tram)* 5/7

  42. Tractability on treelike instances Treelike data Tree encoding Tree automaton MSO query (RER|metro)* |(bus|tram)* 5/7

  43. ∧ Tractability on treelike instances Treelike data Tree encoding Provenance circuit linear Tree automaton MSO query (RER|metro)* |(bus|tram)* 5/7

  44. ∧ Tractability on treelike instances Treelike data Tree encoding Provenance circuit linear linear Tree automaton MSO query (RER|metro)* 42% |(bus|tram)* Probability 5/7

  45. ∧ Tractability on treelike instances Treelike data Tree encoding Provenance circuit linear linear Tree automaton MSO query (RER|metro)* 42% |(bus|tram)* Probability Theorem For any fixed Boolean MSO query q and k ∈ N , given a database D of treewidth ≤ k with independent probabilities , we can compute in linear time the probability that D satisfies q 5/7

  46. Lower bound What can we do for unbounded-treewidth instances? 6/7

  47. Lower bound What can we do for unbounded-treewidth instances? ... not much. 6/7

  48. Lower bound Theorem For any graph signature σ , there is a first-order query q such that for any constructible unbounded-treewidth class I , probability evaluation of q on I is #P-hard under RP reductions 6/7

  49. Lower bound Theorem For any graph signature σ , there is a first-order query q such that for any constructible unbounded-treewidth class I , probability evaluation of q on I is #P-hard under RP reductions Proof idea: extract instances of a hard problem as topological minors using recent polynomial bounds [Chekuri and Chuzhoy, 2014] 6/7

  50. Lower bound Theorem For any graph signature σ , there is a first-order query q such that for any constructible unbounded-treewidth class I , probability evaluation of q on I is #P-hard under RP reductions 1 1 maps edges to vertex-disjoint paths 2 2 maps vertices to vertices 3 4 3 4 Proof idea: extract instances of a hard problem as topological minors using recent polynomial bounds [Chekuri and Chuzhoy, 2014] 6/7

  51. Future and ongoing work • Improving the lower bound: • From graphs to arbitrary arity databases • From FO down to unions of conjunctive queries with � = 7/7

  52. Future and ongoing work • Improving the lower bound: • From graphs to arbitrary arity databases • From FO down to unions of conjunctive queries with � = 2 2 ... 2 | Q | � � • Complexity in query and database — currently Ω × | D | → Which queries can efficiently be compiled to automata? 7/7

  53. Future and ongoing work • Improving the lower bound: • From graphs to arbitrary arity databases • From FO down to unions of conjunctive queries with � = 2 2 ... 2 | Q | � � • Complexity in query and database — currently Ω × | D | → Which queries can efficiently be compiled to automata? • Non-Boolean queries: efficient enumeration of query results? 7/7

  54. Future and ongoing work • Improving the lower bound: • From graphs to arbitrary arity databases • From FO down to unions of conjunctive queries with � = 2 2 ... 2 | Q | � � • Complexity in query and database — currently Ω × | D | → Which queries can efficiently be compiled to automata? • Non-Boolean queries: efficient enumeration of query results? • Other tasks: probabilistic conditioning “Knowing that I’m here, what’s the probability that RER B is up?” 7/7

  55. Future and ongoing work • Improving the lower bound: • From graphs to arbitrary arity databases • From FO down to unions of conjunctive queries with � = 2 2 ... 2 | Q | � � • Complexity in query and database — currently Ω × | D | → Which queries can efficiently be compiled to automata? • Non-Boolean queries: efficient enumeration of query results? • Other tasks: probabilistic conditioning “Knowing that I’m here, what’s the probability that RER B is up?” Thanks for your attention! 7/7

  56. References I Chekuri, C. and Chuzhoy, J. (2014). Polynomial bounds for the grid-minor theorem. In STOC . Courcelle, B. (1990). The monadic second-order logic of graphs. I. Recognizable sets of finite graphs. Inf. Comput. , 85(1).

  57. Image credits • Slide 2: • https://commons.wikimedia.org/wiki/File: Paris_Metro_map.svg (cropped), user Umx on Wikimedia Commons, public domain • http://www.parisvoyage.com/images/cartoon18.jpg , ParisVoyage, fair use • http://www.vianavigo.com/fileadmin/galerie/pdf/CGU_t_.pdf (cropped), RATP, fair use • Slides 4 and 5: https://commons.wikimedia.org/wiki/File: Carte_Transilien_RER_sch%C3%A9matique.svg (modified), user Benjamin Smith on Wikimedia Commons, license CC BY-SA 4.0 international.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend