Mediator y e r u Q d e t a l u Query m r o e f K L - - PowerPoint PPT Presentation

mediator
SMART_READER_LITE
LIVE PREVIEW

Mediator y e r u Q d e t a l u Query m r o e f K L - - PowerPoint PPT Presentation

Motivation Approach Search Scoring Experiments Related Work Conclusions O r b i t z F l i g h t lowestFare(MXP,HYD) S e a r c h Mediator y e r u Q d e t a l u Query m r o e


slide-1
SLIDE 1
slide-2
SLIDE 2

Query SELECT MIN(price) FROM flight WHERE depart=“MXP” AND arrive=“HYD”

R e f

  • r

m u l a t e d Q u e r y Reformulated Query

lowestFare(“MXP”,“HYD”)

calcPrice(“MXP”,“HYD”,”economy”) O r b i t z F l i g h t S e a r c h K L M O n l i n e Q a n t a s S p e c i a l s

N e w S e r v i c e :

Alitalia Source Definitions:

  • Orbitz Flight Search
  • KLM Online
  • Qantas Specials

Mediator

Generate Model of Service?

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-3
SLIDE 3

zipcode distance

K n

  • w

n S

  • u

r c e 1 K n

  • w

n S

  • u

r c e 2 K n

  • w

n S

  • u

r c e 3 New Source 4

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-4
SLIDE 4

K n

  • w

n S

  • u

r c e 1 K n

  • w

n S

  • u

r c e 2 K n

  • w

n S

  • u

r c e 3

source1(zip1, lat1, long1), source1(zip2, lat2, long2), source2(lat1, long1, lat2, long2, dist2), source3(dist2, dist). centroid(zip1, lat1, long1), centroid(zip2, lat2, long2), greatCircleDist(lat1, long1, lat2, long2, dist2), convertKm2Mi(dist1, dist2).

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-5
SLIDE 5

80210 90266 842.37 843.65 60601 15201 410.31 410.83 10005 35555 899.50 899.21

match

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-6
SLIDE 6

Motivation Approach Search Scoring Experiments Related Work Conclusions

Target Tuples Candidate Tuples

N e w S

  • u

r c e Known Source Known Source

slide-7
SLIDE 7

Motivation Approach Search Scoring Experiments Related Work Conclusions

P r e v i

  • u

s W

  • r

k !

Lerman, Plangprasopchok and Knoblock. Automatically labeling data used by web services. AAAI’06.

Target Tuples Candidate Tuples

N e w S

  • u

r c e Known Source Known Source

slide-8
SLIDE 8

Invoke target with set of random inputs; Add empty clause to queue; while (queue not empty) v := best definition from queue; forall (v’ in Expand(v)) if ( Eval(v’) > Eval(v) ) insert v’ into queue;

  • 1. Sample the

new source

  • 2. Best-first search

through space of candidate definitions Expressive Language

Sufficient for modeling most online sources

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-9
SLIDE 9

New Source 5

Input <zip1, dist1> Output <zip2, dist2>

Empty Result Non-empty Result randomly generated input tuples

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-10
SLIDE 10

source5(_,_,_,_). source5(zip1,_,_,_) :- source4(zip1,zip1,_). source5(zip1,_,zip2,dist2) :- source4(zip2,zip1,dist2). source5(_,dist1,_,dist2) :- <(dist2,dist1). …

Expand

New Source 5

source5( $zip1,$dist1,zip2,dist2)

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-11
SLIDE 11

source5(_,_,_,_). source5(zip1,_,_,_) :- source4(zip1,zip1,_). source5(zip1,_,zip2,dist2) :- source4(zip2,zip1,dist2). source5(_,dist1,_,dist2) :- <(dist2,dist1). …

Expand

source5(zip1,dist1,zip2,dist2) :- source4(zip2,zip1,dist2), source4(zip1,zip2,dist1). source5(zip1,dist1,zip2,dist2) :- source4(zip2,zip1,dist2), <(dist2,dist1). …

Expand Motivation Approach Search Scoring Experiments Related Work Conclusions

New Source 5

source5( $zip1,$dist1,zip2,dist2)

slide-12
SLIDE 12

Standard techniques Non-standard technique

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-13
SLIDE 13

Input <$zip1, $dist1> Target Output <zip2, dist2> Clause Output <zip2, dist2>

No Overlap No Overlap Overlap!

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-14
SLIDE 14

return average(fitness) forall (tuple in InputTuples) T_target = invoke(target, tuple) T_clause = execute(clause, tuple) if not (|T_target|=0 and |T_clause|=0) fitness =

At least half of input tuples are non-empty invocations of target Jaccard similarity Average results only when output is returned

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-15
SLIDE 15

10.6 km ≈ 10.54 km Google Inc. ≈ Google Incorporated Mon, 31. July 2006 ≈ 7/31/06

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-16
SLIDE 16

Equality Approximations:

  • 1% for distance, speed, temperature & price
  • 0.002 degrees for latitude & longitude
  • JaroWinkler > 0.85 for company, hotel & airport
  • hand-written procedure for date.

Inductive search bias:

  • Max clause length: 7
  • Predicate repetition: 2
  • Max variable level: 5
  • Executable candidates
  • No variable repetition

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-17
SLIDE 17

Distinguished forecast from current conditions current price = yesterday’s close + change

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-18
SLIDE 18

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-19
SLIDE 19

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-20
SLIDE 20

B l

  • m

b e r g C u r r e n c y R a t e s W

  • r

l d w i d e H

  • t

e l D e a l s 5 * H

  • t

e l s B y S t a t e D i s t a n c e B e t w e e n Z i p c

  • d

e s Government Hotel List Great Circle Distance Centroid

  • f Zipcode

Hotels By Zipcode US Hotel Rates Yahoo Exchange Rates G

  • g

l e H

  • t

e l S e a r c h

Motivation Approach Search Scoring Experiments Related Work Conclusions

slide-21
SLIDE 21

Motivation Approach Search Scoring Experiments Related Work Conclusions