Building Mashups
Craig Knoblock University of Southern California
Thanks to Rattapoom Tuchinda
Building Mashups Craig Knoblock University of Southern California - - PowerPoint PPT Presentation
Building Mashups Craig Knoblock University of Southern California Thanks to Rattapoom Tuchinda Whats a Mashup? A website or application that combines content from more than one source into an integrated experience [wikipedia] a) LA crime map
Thanks to Rattapoom Tuchinda
a) LA crime map c) Ski bonk b) zillow.com
Wrapper Wrapper
Clean Clean Attribute Attribute
Combine
Customize Display
300+ for MS) represents an
customize widget can be time consuming
issues and ignore others.
Based on the talk from http://www.cs.cmu.edu/~jasonh/presentations/chi2007-marmite.pdf
Embedded Browser Table Interaction Modes
{Restaurant name, address, phone, Review} {Restaurant name, address, phone, review, Date of Inspection, Score}
{Restaurant name, address, Date of Inspection, Score}
26
TBODY tr tr td td
1. 2. Japon Bistro
td a br br
970 E Colora.. Upscale yet affordabl..
td a br br
8400 Wilshir. Chic elegance….. Hokusai
27
TBODY tr tr td td
1. 2. Japon Bistro
td a br br
970 E Colora.. Upscale yet affordab
td a br br
8400 Wilshir. Chic elegance… Hokusai
Possible Attribute restaurant name (3) artist name (1) {a |a,s: a ∈ att (s) ∧ (val(a,s) ⊂ V)}
…
Sushi Sasabune Hokusai Japon Bistro
Newly extracted data Data repository
95 … 927 E.. Japon Bistro 99 … 8439.. Katana 90 … 8400.. Hokusai Health Rating … Address restaurant name … … … … … … French Renoir … … Japanese Hokusai … … nationality artist name … … 23 Katana … … 25 Sushi Roku … … 27 Sushi Sasabune … … zagat Rating restaurant name
Zagat Artist Info LA Health Rating
Sushi Roka Sushi Sasabune Hokusai Japon Bistro
Newly extracted data Data repository
95 … 927 E.. Japon Bistro 99 … 8439.. Katana 90 … 8400.. Hokusai Health Rating … Address restaurant name … … 23 Katana … … 25 Sushi Roku … … 27 Sushi Sasabune … … zagat Rating restaurant name
Zagat LA Health Rating
. . .
Predefined Rules
28 Reviews → 28 Subset Rule: (s1s2..sk) → (d1d2…dt) ∧ (k <= t) ∧ si ∈ {d1,d2,…,dt} ∧ di ≠ dj
Data repository
95 … 927 E.. Japon Bistro 99 ... 8439.. Katana 90 … 8400.. Hokusai Health Rating .. Address restaurant name … … 23 Katana … … 25 Sushi Roku … … 27 Sushi Sasabune … … zagat Rating restaurant name
Zagat LA Health Rating
{v} = val(a,s) where a {x} s is any source where att(s) {x} ≠ {}
{a}R = possible new attribute selection for row i. {x} = Set intersection({a}) over all the value rows.
34
2.22x 0.67x 4.16x 6.49x 3.32x
– Simile [Huynh 2005], Dapper, D.Mix [Hartman 2007], OpenKapow
– Potter’s Wheel [Raman 2001]
– Google MyMap
– Intel’s Mashmaker [Ennals 2007]
– Yahoo’s Pipes, Microsoft’s Popfly, IBM’s QED Wiki, Bungee Labs, Proto Software, Marmite [Wong 2007]
– Programming by Demonstration [Cypher 1993, Lau 2001] – Building Queries by Demonstration [Tuchinda 2007]