Using Latency-Recency Profiles for Data Delivery on the Web
Laura Bright, Louiqa Raschid
University of Maryland
August 22, 2002 VLDB 2002 2
Introduction
- Caching improves data delivery on the web
- Cached data may become stale
- Keeping data fresh adds overhead
– Latency
– Bandwidth
– Server load
- Existing techniques do not consider
client latency and recency preferences
Outline
- Web Technologies
- Existing Solutions
- Latency-Recency Profiles
- Experiments
- Conclusions
Proxy Caches
- Resides between clients and web servers
- Objects have a Time-to-Live (TTL); expired objects are validated at the server
- Validation adds overhead
- No server cooperation required
[Diagram: clients – proxy cache – Internet – servers]
Application Servers
- Offload functionality of database-backed web servers
- May perform caching to improve performance
- Servers may propagate updates to cache
[Diagram: clients – application server – Internet – database servers]
Web Portals
- Provide information gathered from multiple data sources
- Problem: updates to objects at sources
– Update propagation consumes bandwidth
– Objects at portal may be stale
[Diagram: clients – portal – Internet – remote servers]
Consistency Approaches
- Time-to-Live (TTL)
- Always-Use-Cache (AUC)
- Server-Side Invalidation (SSI)
Time-to-Live (TTL)
- Estimated lifetime of cached object
- When TTL expires, cache must validate object at server
- No server cooperation
- Used by proxy caches
[Diagram: client – cache – server]
Always Use Cache (AUC)
- Objects served from the cache
- Background prefetching keeps cached objects up to date
- No server cooperation
- Used by portals, web crawlers
[Diagram: client – cache – server]
Server Side Invalidation (SSI)
- Servers send updates to cache
- Guarantees freshness
- Increases workload at server
- Used by application servers
[Diagram: client – cache – server]
Summary
- Existing techniques do not consider client preferences
- May add unnecessary overhead (latency, bandwidth, or server load)
– TTL, SSI
- May not meet client recency preferences
– AUC
- Our goal: consider client preferences and reduce overhead
Outline
- Web Technologies
- Existing Solutions
- Latency-Recency Profiles
- Experiments
- Conclusions
Latency-Recency Profiles
- Used in download decision at cache
- Profile: set of application-specific parameters that reflect client latency and recency preferences
- Examples
– Stock trader tolerates latency of 5 seconds for the most recent stock quotes
– Casual web browser wants low latency; tolerates data with recency of two updates
Profile Parameters
- Set by clients
- Target Latency (T_L): acceptable latency of a request
- Target Age (T_A): acceptable recency of data
- Examples
– Stock trader: T_A = 0 updates, T_L = 5 seconds
– Casual browser: T_A = 2 updates, T_L = 2 seconds
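As a small illustration, the two example profiles above could be represented as follows (the class and field names are hypothetical, not from the paper):

```python
from dataclasses import dataclass

@dataclass
class Profile:
    """Client latency-recency preferences (hypothetical representation)."""
    target_latency: float  # T_L: acceptable latency, in seconds
    target_age: float      # T_A: acceptable staleness, in updates

# The two example profiles from the slides
stock_trader = Profile(target_latency=5.0, target_age=0)
casual_browser = Profile(target_latency=2.0, target_age=2)
```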
Profile-Based Downloading
- Parameters appended to requests
- Scoring function determines when to validate object and when to use cached copy
- Scales to multiple clients
- Minimal overhead at cache
Scoring Function Properties
- Tunability
– Clients control latency-recency tradeoff
- Guarantees
– Upper bounds with respect to latency or recency
- Ease of implementation
Example Scoring Function
- T = target value (T_L or T_A)
- x = actual or estimated value (Age or Latency)
- K = constant that tunes the rate at which the score decreases

Score(T, x, K) = 1,               if x ≤ T
Score(T, x, K) = K / (x − T + K), otherwise
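A minimal sketch of this scoring function: the score is 1.0 whenever the target is met, and decays toward 0 as the actual value exceeds the target, with K controlling how quickly.

```python
def score(target, actual, k):
    """Score(T, x, K): 1.0 when the actual value meets the target,
    otherwise K / (x - T + K); larger K gives a gentler decay."""
    if actual <= target:
        return 1.0
    return k / (actual - target + k)

print(score(2.0, 1.5, 2.0))  # target met: 1.0
print(score(2.0, 4.0, 2.0))  # 2 over target with K=2: 2/(2+2) = 0.5
```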
Combined Weighted Score
- Used by cache
- Age = estimated age of object
- Latency = estimated latency
- w = relative importance of meeting target latency
- (1 − w) = importance of meeting target recency

CombinedScore = w * Score(T_L, Latency, K_L) + (1 − w) * Score(T_A, Age, K_A)
Profile-Based Downloading
- DownloadScore: expected score of downloading a fresh object

DownloadScore = w * Score(T_L, Latency, K_L) + (1 − w) * 1.0

- CacheScore: expected score of using the cached object

CacheScore = w * 1.0 + (1 − w) * Score(T_A, Age, K_A)

- If DownloadScore > CacheScore, download the fresh object; otherwise use the cache
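The decision rule can be sketched as follows. Downloading yields a fresh object (age score 1.0) at the estimated latency; serving from cache is immediate (latency score 1.0) but the object may be stale. The K defaults below are illustrative, not values from the paper.

```python
def score(target, actual, k):
    """Scoring function: 1.0 if the target is met, else K/(x - T + K)."""
    if actual <= target:
        return 1.0
    return k / (actual - target + k)

def should_download(est_latency, est_age, t_l, t_a, w=0.5, k_l=2.0, k_a=2.0):
    """Profile-based download decision at the cache."""
    download_score = w * score(t_l, est_latency, k_l) + (1 - w) * 1.0
    cache_score = w * 1.0 + (1 - w) * score(t_a, est_age, k_a)
    return download_score > cache_score

# Stock trader (T_A = 0, T_L = 5s): a 1-update-old copy triggers a download
print(should_download(est_latency=1.0, est_age=1, t_l=5.0, t_a=0))  # True
# Casual browser (T_A = 2, T_L = 2s): a 1-update-old copy is served from cache
print(should_download(est_latency=3.0, est_age=1, t_l=2.0, t_a=2))  # False
```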
Tuning Profiles
[Plot: with w = 0.5 there is no firm upper bound; K values control the slope]
Upper Bounds
- w > 0.5 gives a firm latency upper bound
[Plot: download if Latency < 3 when w = 0.6, K_L = 2, K_A = 0.5]
[Plot: download if Latency < 2 when w = 0.6, K_L = 2, K_A = 2]
Outline
- Web Technologies
- Existing Solutions
- Latency-Recency Profiles
- Experiments
- Conclusions
Baseline Algorithms
- Time-to-Live (TTL)
– Estimated lifetime of object
– Can be estimated by a server or as a function of the time the object was last modified
– Provides most recent data
– When an object's TTL expires, a new object must be downloaded
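The TTL freshness check itself is simple; a minimal sketch (function name is an assumption):

```python
def is_fresh(age_seconds, ttl_seconds):
    """TTL baseline: serve from cache without validation until the
    object's estimated lifetime expires."""
    return age_seconds <= ttl_seconds

print(is_fresh(30, 60))  # True: within TTL, served from cache
print(is_fresh(90, 60))  # False: expired, must validate at server
```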
Baseline Algorithms
- AlwaysUseCache (AUC)
– Minimizes latency
– If object is in cache, always serve without validation
– Prefetch cached objects in round-robin manner to improve recency
– Prefetch rates of 60 objects/minute and 300 objects/minute
Baseline Algorithms
- Server-Side Invalidation (SSI)
- SSI-Msg
– Server sends invalidation messages only
– Cache must request updated object
- SSI-Obj
– Server sends updated objects to cache
– Reduces latency but consumes bandwidth
Trace Data
- Proxy cache trace data obtained from NLANR in January 2002
- 3.7 million requests over 5 days
- 1,365,545 distinct objects, avg size 2.1 KB
- Performed preprocessing
- Age estimated using last-modified time
- Latency is average over previous requests
- Profiles: T_L = 1 second, T_A = 1 update
- Cache size range: 1% of world size to infinite
Synthetic Data
- World of 100,000 objects
- Zipf-like popularity distribution
- Update intervals uniformly distributed from 10 minutes to 2 hours
- Workload of 8 requests/sec
- Object sizes 2-12 KB
- Infinite cache
Metrics
- Validations: messages between cache and servers
– Useful validations: object was modified
– Useless validations: object was not modified
- Downloads: objects downloaded from servers
- Stale hits: objects served from cache that were modified at server
Comparison: Trace Data

              | TTL    | AUC-60 | AUC-300 | Profile
Val msgs      | 252367 | 378312 | 1891560 | 92943
Useful vals   | 24898  | 933    | 2810    | 22896
Useless vals  | 122074 | 279349 | 327776  | 67601
Avg. est. age | –      | 18.4   | 11.1    | 0.87
Stale hits    | 4282   | 31285  | 22897   | 7704
Comparison: Trace Data
[Charts: useful validations, useless validations, and stale hits for TTL, AUC-60, AUC-300, and Profile]
Comparison: Synthetic Data
[Chart: validations, downloads, and stale hits for SSI-Msg, TTL, AUC-300, and Profile]
Effect of Cache Size
– X-axis: cache size; Y-axis: average latency
– Profile lies between the extremes of TTL and AUC
– Profile exploits increased cache size better than TTL
Effect of Cache Size
– X-axis: cache size; Y-axis: number of stale objects
– AUC must prefetch many objects when cache is large
– Profile can scale to large cache sizes
Effect of Surges
- Surge: client request workload exceeds capacity of server or network
- Two groups of clients:
– MostRecent: T_A = 0 updates, T_L = 1 sec
– LowLatency: T_A = 1 update, T_L = 0 sec
- 30-second surge period
- Capacity Ratio = available resources / required resources
Effect of Surges
Related Work
- Refreshing cached data
– Cho and Garcia-Molina 2000
- WebViews
– Labrinidis and Roussopoulos, 2000, 2001
- Caching Dynamic Content
– Candan et al. 2001
– Luo and Naughton 2001
- Caching Approximate Values
– Olston and Widom 2001, 2002
Conclusions and Future Work
- Latency-Recency Profiles can reduce overhead while meeting client preferences
- Future work: