Accessing the Deep Web with Keywords: A Foundational Approach
Andrea Calí and Martín Ugarte
IKC 2017
[Figure: the "Dish Pages" search form, with an input box labeled "country"]
If you search for a country, you get the typical dishes from that country, and the chefs who prepare them
If you search for a chef, you get his nationality and the number of Michelin stars he has earned
Example: successive searches on Dish Pages, starting from "Italy"

Search: Italy
  Dish        Nation    Chef
  risotto     Italy     Beck

Search: Beck
  Chef        Stars     Nation
  Beck        3         Germany

Search: Germany
  Dish        Nation    Chef
  spätzle     Germany   Passard

Search: Passard
  Chef        Stars     Nation
  Passard     2         France

Search: France
  Dish        Nation    Chef
  foie gras   France    Bottura
  raclette    France    Elverfield

Search: Bottura
  Chef        Stars     Nation
  Bottura     3         Italy

Search: Elverfield
  (no results)
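The repeated searches above can be sketched as a small crawl loop. This is only an illustration under stated assumptions: the `search` function is a hypothetical stand-in for submitting one keyword to the form, the two lists copy the toy data from the slides, and every returned value (including dish names and star counts) is tried as a new keyword.

```python
from collections import deque

# Toy copy of the data shown on the slides (assumed to be the hidden database).
DISHES = [  # (dish, nation, chef): returned when the keyword is a nation
    ("risotto", "Italy", "Beck"),
    ("spätzle", "Germany", "Passard"),
    ("foie gras", "France", "Bottura"),
    ("raclette", "France", "Elverfield"),
]
CHEFS = [  # (chef, stars, nation): returned when the keyword is a chef
    ("Beck", 3, "Germany"),
    ("Passard", 2, "France"),
    ("Bottura", 3, "Italy"),
]


def search(keyword):
    """Hypothetical stand-in for submitting one keyword to the search form."""
    by_nation = [row for row in DISHES if row[1] == keyword]
    by_chef = [row for row in CHEFS if row[0] == keyword]
    return by_nation + by_chef


def crawl(initial_keywords):
    """Search each known keyword once; every returned value becomes a keyword."""
    seen = set(initial_keywords)
    queue = deque(initial_keywords)
    retrieved = []
    while queue:
        keyword = queue.popleft()
        for row in search(keyword):
            if row not in retrieved:
                retrieved.append(row)
            for value in row:
                if value not in seen:
                    seen.add(value)
                    queue.append(value)
    return retrieved


rows = crawl(["Italy"])
for row in rows:
    print(row)
```

Keywords such as "risotto" or "Elverfield" simply return nothing in this sketch, so the loop stops once no new value appears.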
The same data as two relations with input and output attributes (initial constant: Italy)

  Chef (input)    Stars (output)   Nation (output)
  Beck            3                Germany
  Passard         2                France
  Bottura         3                Italy

  Dish (output)   Nation (input)   Chef (output)
  risotto         Italy            Beck
  spätzle         Germany          Passard
  foie gras       France           Bottura
  raclette        France           Elverfield

Columns with the same name (Chef, Nation) share the same abstract domain.
ρ1 : qa(C) ← r̂2(C, 3, italy).
ρ2 : r̂1(D, N, C) ← domN(N), r1(D, N, C).
ρ3 : r̂2(C, S, N) ← domC(C), r2(C, S, N).
ρ4 : domC(C) ← r̂1(D, N, C).
ρ5 : domN(N) ← r̂2(C, S, N).
ρ6 : domN(italy).
CQ answering under access limitations
Input: tuple t, initial constants I, CQ Q, DB D, access limitations
Question: is t in the answers to Q starting with constants I?
t ∈ ans(Q, I, D)
[Figure: the "Star Pages" search form, with an input box labeled "restaurant"]
If you input a chef and a restaurant, it will tell you how many stars that restaurant earned with that chef.
Search: chef = Beck, restaurant = La Pergola
  Chef   Restaurant   Stars
  Beck   La Pergola   3
Assume the initial set of constants is 100 chefs and 100 restaurants.
We need to try all pairs <chef, restaurant> to …
Even on this database!
  Chef   Restaurant   Stars
  Beck   La Pergola   3
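The cost of this scenario can be made concrete with a short sketch. It assumes that both inputs are mandatory and that the form gives no hint about which pairs exist, so all 100 × 100 = 10,000 combinations are submitted. The generated chef and restaurant names and the `star_pages` function are hypothetical placeholders.

```python
from itertools import product

# Hypothetical initial constants: 100 chef names and 100 restaurant names.
chefs = ["Beck"] + [f"chef{i}" for i in range(1, 100)]
restaurants = ["La Pergola"] + [f"restaurant{i}" for i in range(1, 100)]

# The single-tuple database from the slide.
STARS = {("Beck", "La Pergola"): 3}


def star_pages(chef, restaurant):
    """Hypothetical stand-in for the form: both inputs are required."""
    return STARS.get((chef, restaurant))


accesses = 0
found = []
for chef, restaurant in product(chefs, restaurants):
    accesses += 1
    stars = star_pages(chef, restaurant)
    if stars is not None:
        found.append((chef, restaurant, stars))

print(accesses, found)
```

The number of accesses depends only on the size of the initial constant set, not on how many tuples the database actually holds.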
I want to search this website starting from this set of keywords
Restricted case
What can a user retrieve from my database if he starts from this set of keywords?
Unrestricted case
But they are equivalent in the worst case…
Querying the Deep Web with keywords
Recursive extraction needed
Two scenarios: restricted and unrestricted
First results on computational complexity
Model the restricted case through oracles
Theoretical lower bounds
etc…