SLIDE 1
SLIDE 2 Query SELECT MIN(price) FROM flight WHERE depart=“MXP” AND arrive=“HYD”
R e f
m u l a t e d Q u e r y Reformulated Query
lowestFare(“MXP”,“HYD”)
calcPrice(“MXP”,“HYD”,”economy”) O r b i t z F l i g h t S e a r c h K L M O n l i n e Q a n t a s S p e c i a l s
N e w S e r v i c e :
Alitalia Source Definitions:
- Orbitz Flight Search
- KLM Online
- Qantas Specials
Mediator
Generate Model of Service?
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 3 zipcode distance
K n
n S
r c e 1 K n
n S
r c e 2 K n
n S
r c e 3 New Source 4
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 4 K n
n S
r c e 1 K n
n S
r c e 2 K n
n S
r c e 3
source1(zip1, lat1, long1), source1(zip2, lat2, long2), source2(lat1, long1, lat2, long2, dist2), source3(dist2, dist). centroid(zip1, lat1, long1), centroid(zip2, lat2, long2), greatCircleDist(lat1, long1, lat2, long2, dist2), convertKm2Mi(dist1, dist2).
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 5
80210 90266 842.37 843.65 60601 15201 410.31 410.83 10005 35555 899.50 899.21
match
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 6 Motivation Approach Search Scoring Experiments Related Work Conclusions
Target Tuples Candidate Tuples
N e w S
r c e Known Source Known Source
SLIDE 7 Motivation Approach Search Scoring Experiments Related Work Conclusions
P r e v i
s W
k !
Lerman, Plangprasopchok and Knoblock. Automatically labeling data used by web services. AAAI’06.
Target Tuples Candidate Tuples
N e w S
r c e Known Source Known Source
SLIDE 8 Invoke target with set of random inputs; Add empty clause to queue; while (queue not empty) v := best definition from queue; forall (v’ in Expand(v)) if ( Eval(v’) > Eval(v) ) insert v’ into queue;
new source
through space of candidate definitions Expressive Language
Sufficient for modeling most online sources
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 9
New Source 5
Input <zip1, dist1> Output <zip2, dist2>
Empty Result Non-empty Result randomly generated input tuples
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 10
source5(_,_,_,_). source5(zip1,_,_,_) :- source4(zip1,zip1,_). source5(zip1,_,zip2,dist2) :- source4(zip2,zip1,dist2). source5(_,dist1,_,dist2) :- <(dist2,dist1). …
Expand
New Source 5
source5( $zip1,$dist1,zip2,dist2)
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 11
source5(_,_,_,_). source5(zip1,_,_,_) :- source4(zip1,zip1,_). source5(zip1,_,zip2,dist2) :- source4(zip2,zip1,dist2). source5(_,dist1,_,dist2) :- <(dist2,dist1). …
Expand
source5(zip1,dist1,zip2,dist2) :- source4(zip2,zip1,dist2), source4(zip1,zip2,dist1). source5(zip1,dist1,zip2,dist2) :- source4(zip2,zip1,dist2), <(dist2,dist1). …
Expand Motivation Approach Search Scoring Experiments Related Work Conclusions
New Source 5
source5( $zip1,$dist1,zip2,dist2)
SLIDE 12
Standard techniques Non-standard technique
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 13
Input <$zip1, $dist1> Target Output <zip2, dist2> Clause Output <zip2, dist2>
No Overlap No Overlap Overlap!
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 14
return average(fitness) forall (tuple in InputTuples) T_target = invoke(target, tuple) T_clause = execute(clause, tuple) if not (|T_target|=0 and |T_clause|=0) fitness =
At least half of input tuples are non-empty invocations of target Jaccard similarity Average results only when output is returned
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 15
10.6 km ≈ 10.54 km Google Inc. ≈ Google Incorporated Mon, 31. July 2006 ≈ 7/31/06
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 16 Equality Approximations:
- 1% for distance, speed, temperature & price
- 0.002 degrees for latitude & longitude
- JaroWinkler > 0.85 for company, hotel & airport
- hand-written procedure for date.
Inductive search bias:
- Max clause length: 7
- Predicate repetition: 2
- Max variable level: 5
- Executable candidates
- No variable repetition
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 17
Distinguished forecast from current conditions current price = yesterday’s close + change
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 18
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 19
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 20 B l
b e r g C u r r e n c y R a t e s W
l d w i d e H
e l D e a l s 5 * H
e l s B y S t a t e D i s t a n c e B e t w e e n Z i p c
e s Government Hotel List Great Circle Distance Centroid
Hotels By Zipcode US Hotel Rates Yahoo Exchange Rates G
l e H
e l S e a r c h
Motivation Approach Search Scoring Experiments Related Work Conclusions
SLIDE 21
Motivation Approach Search Scoring Experiments Related Work Conclusions