Zoetrope: Interacting with the Ephemeral Web


6/8/09. Eytan Adar, Mira Dontcheva, James Fogarty, Dan Weld. University of Washington & Adobe Systems.


SLIDE 1

Zoetrope: Interacting with the Ephemeral Web

Eytan Adar, Mira Dontcheva, James Fogarty, Dan Weld
University of Washington & Adobe Systems

The Dynamic Web

WWW

The Now Web


See the web through a browser of “now”
Access services through an index of “now”

Temporally insensitive (ignore temporal context and historical values)

Is this really a good price?


SLIDE 2

When is really the right time to buy?
What is the relationship between weather and traffic in Seattle?
What is the relationship between traffic and temperature in Seattle?

Now what?
Manually collect, page by page

an alternative…

[Figure: Temp and Travel time plotted against t]

SLIDE 3

Related Work

  • Evolution of the Web
    – [Fetterly et al., WWW ’03]
    – [Ntoulas et al., WWW ’04]
    – Internet Archives
  • Monitoring
    – [Sugiura & Koseki, UIST ’98]
    – [Greenberg & Boyle, GI ’06]
  • Clipping & Aggregating
    – [schraefel et al., WWW ’02]
    – [Dontcheva et al., UIST ’07]

Our Approach


SLIDE 4

Zoetrope Architecture

[Architecture diagram: Web, Crawler, Data Store (XML/Images/Text/…), Zoetrope Engine (O1, O2, O3, …), Zoetrope Front End (R1, R2, R3)]


Content Streams & Operators

{… <T_i, C_i>, <T_(i+1), C_(i+1)>, …}

T_i: Time stamp
C_i: Content (typed, but can be anything: HTML, images)

Input: Content stream {… <T_i, C_i> …}

  • Transform Operator. Output: Content stream with modified data payloads (C_i’)
  • Filter Operator. Output: Subset of content stream
  • Render Operator. Output: “Visual” output
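
The three operator kinds above can be illustrated with a minimal Python sketch. The names `Item`, `transform`, `filter_stream`, and `render` are assumptions made for this example and are not Zoetrope's actual API; the render step here only produces a text listing as a stand-in for visual output.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable, List


@dataclass(frozen=True)
class Item:
    """One <T_i, C_i> tuple: a time stamp and a typed content payload."""
    timestamp: float
    content: Any


Stream = List[Item]


def transform(stream: Iterable[Item], fn: Callable[[Any], Any]) -> Stream:
    """Transform operator: same time stamps, modified payloads (C_i')."""
    return [Item(item.timestamp, fn(item.content)) for item in stream]


def filter_stream(stream: Iterable[Item], keep: Callable[[Item], bool]) -> Stream:
    """Filter operator: returns a subset of the input stream."""
    return [item for item in stream if keep(item)]


def render(stream: Iterable[Item]) -> str:
    """Render operator: the "visual" output is a plain text listing here."""
    return "\n".join(f"{item.timestamp}: {item.content}" for item in stream)


# Hypothetical stream of three HTML payloads.
stream = [Item(1, "<b>a</b>"), Item(2, "<b>bb</b>"), Item(3, "<b>ccc</b>")]
lengths = transform(stream, len)                                   # payload -> its length
recent = filter_stream(lengths, lambda item: item.timestamp >= 2)  # keep T_i >= 2
output = render(recent)
```

Because each operator takes a stream and returns a stream (or a final rendering), the operators chain freely, which is the property the following slides rely on.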


Construct everything from operators

Lens, locked to coordinate:

{… <T_i, C_i> …} → Generate Images → Crop to Rectangle → Pick image based on slider → Render into lens
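
A rough sketch of this chain, assuming pages have already been rendered to images. To stay self-contained, an "image" here is a hypothetical stand-in (a list of text rows), and `crop`, `pick`, and `lens_view` are illustrative names rather than Zoetrope functions. Choosing the latest snapshot at or before the slider position is also an assumption.

```python
from bisect import bisect_right
from typing import List, Tuple

Image = List[str]               # hypothetical stand-in: rows of characters
Snapshot = Tuple[float, Image]  # <T_i, C_i> with an already rendered page


def crop(image: Image, left: int, top: int, width: int, height: int) -> Image:
    """Crop to the lens rectangle, given in fixed screen coordinates."""
    return [row[left:left + width] for row in image[top:top + height]]


def pick(snapshots: List[Snapshot], slider_time: float) -> Snapshot:
    """Pick the latest snapshot at or before the slider position.

    Assumes snapshots are sorted by time stamp; falls back to the first one.
    """
    times = [t for t, _ in snapshots]
    index = bisect_right(times, slider_time) - 1
    return snapshots[max(index, 0)]


def lens_view(snapshots: List[Snapshot], rect, slider_time: float) -> Image:
    """Crop every snapshot, then return the one selected by the slider."""
    cropped = [(t, crop(image, *rect)) for t, image in snapshots]
    return pick(cropped, slider_time)[1]


# Hypothetical rendered pages at three time stamps.
pages = [
    (1.0, ["header", "$49.10", "footer"]),
    (2.0, ["header", "$43.10", "footer"]),
    (3.0, ["header", "$44.10", "footer"]),
]
view = lens_view(pages, rect=(1, 1, 5, 1), slider_time=2.5)
```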


SLIDE 5

Construct everything from operators

{… <T_i, C_i> …} → Select DOM forest → Extract numerical value → Render into time series

<…>$49.10<…>  →  49.10
<…>$43.10<…>  →  43.10
<…>$44.10<…>  →  44.10
<…>$50.20<…>  →  50.20
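
The same chain can be sketched with the Python standard library. The page markup, the `price` class, and the helper names `select` and `extract_number` are assumptions for illustration, since the slide elides the surrounding tags as <…>. The snapshots are kept well-formed so that `xml.etree.ElementTree` can parse them.

```python
import re
import xml.etree.ElementTree as ET

NUMBER = re.compile(r"\d+(?:\.\d+)?")


def select(page: str, path: str):
    """Select the matching DOM elements (a small forest) from one snapshot."""
    return ET.fromstring(page).findall(path)


def extract_number(elements):
    """Return the first numerical value found in the selected elements."""
    for element in elements:
        match = NUMBER.search("".join(element.itertext()))
        if match:
            return float(match.group())
    return None


# Hypothetical snapshots <T_i, C_i>; only the prices come from the slide.
snapshots = [
    (1, "<html><body><span class='price'>$49.10</span></body></html>"),
    (2, "<html><body><span class='price'>$43.10</span></body></html>"),
    (3, "<html><body><span class='price'>$44.10</span></body></html>"),
    (4, "<html><body><span class='price'>$50.20</span></body></html>"),
]

# The time series a renderer would plot: (time stamp, value) pairs.
series = [
    (t, extract_number(select(page, ".//span[@class='price']")))
    for t, page in snapshots
]
```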



LENSES

Visual (same place on screen)
Structural (same place in HTML)

DOM Structure


SLIDE 6

Content (same text)

Filters

RENDERERS


SLIDE 7

Binding Lenses

Demo

SLIDE 8

Contributions

  • Visual programming toolkit for temporal Web content
  • Semantics for temporal data streams
  • Operators for fast processing of temporal data streams
  • Indexing structures for fast processing and interaction with Web content over time


Future Work

  • Evaluation
    – User Study
    – Performance
  • Data gathering

Information Arbitrage

  • Combine observations to improve granularity

[Figure: cnn.com and Weather.com observations along a time axis]
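
One way to read "combine observations" is to interleave two sparsely sampled streams of the same quantity by time stamp. The sketch below assumes exactly that; the readings, the sampling times, and the helper names `combine` and `largest_gap` are hypothetical and not taken from the talk.

```python
from heapq import merge

# Hypothetical (time in minutes, value) observations from two sources.
cnn = [(0, 51.0), (60, 53.0), (120, 54.0)]
weather = [(30, 52.0), (90, 53.5)]


def combine(*streams):
    """Interleave sorted streams by time stamp, keeping one value per time."""
    combined = []
    for t, value in merge(*streams, key=lambda obs: obs[0]):
        if combined and combined[-1][0] == t:
            continue  # same time stamp already covered by an earlier source
        combined.append((t, value))
    return combined


def largest_gap(stream):
    """Longest interval between consecutive observations."""
    return max(later[0] - earlier[0] for earlier, later in zip(stream, stream[1:]))


combined = combine(cnn, weather)
```

In this toy case each source alone leaves 60-minute gaps, while the combined stream has an observation every 30 minutes.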


SLIDE 9

Future Work

  • Scaling up

Possible Distributed Architecture

  • Server collects snapshots
  • Client collects finer resolution
    – Gaps?
  • DHT for collected data (good for intervals)
  • Query: operator chain + data
    – Release on similarity threshold
    – Repeat


Thanks!

Jaime Teevan, Susan Dumais, Rob Miller, GRAIL, DUB
NSF Graduate Fellowship, ARCS, WRF/TJ Cable Professorship

?