Issues for Cache Hierarchies

Cooperative Caching

• Previous work has shown that hit rate increases with population size [Duska et al. 97, Breslau et al. 98]
• However, single proxy caches have practical limits
  ♦ Load, network topology, organizational constraints
• One technique to scale the client population is to have proxy caches cooperate.

Cooperative Web Caching

• Content sharing: local hit vs. remote hit
• Problem: how to locate objects and route requests?
• Lots of history here [Exodus: Franklin], [xFS: Dahlin], [GMS: Feeley & Voelker]

Cooperative Web Proxy Caching

• Sharing and/or coordination of cache state among multiple Web proxy cache nodes
• Effectiveness of proxy cooperation depends on:
  ♦ Inter-proxy communication distance
  ♦ Size of client population served
  ♦ Proxy utilization and load balance
[Diagram: clients, proxy, Internet] [Source: Geoff Voelker]

Hierarchical Caches

• Idea: place caches at exchange or switching points in the network, and cache at each level of the hierarchy.
• Resolve misses through the parent: requests flow upstream from clients toward the origin Web site (e.g., U.S. Congress); objects flow back downstream (see the sketch at the end of this page).
[Diagram: origin Web site, INTERNET, hierarchy of caches, downstream clients]

Content-Sharing Among Peers

• Idea: since siblings are "close" in the network, allow them to share their cache contents directly.

Harvest-Style ICP Hierarchies

• Examples: Harvest [Schwartz96], Squid (NLANR), NetApp NetCache
• Idea: multicast probes within each "family": pick the first hit response, or wait for all miss responses (sketched below).
[Diagram: client sends query (probe) to siblings, then an object request/response to the cache that answers with a hit]
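To make the parent-resolution path concrete, here is a minimal sketch, not any particular proxy's implementation: each cache checks its own store, forwards misses to its parent, and the root fetches from the origin. All names (HierCache, fetch_from_origin) are hypothetical. Note how the fetched object ends up replicated at every level on the way back down, a storage issue discussed on the next page.

```python
# Minimal sketch of hierarchical miss resolution; names are hypothetical,
# and a real proxy would also handle expiration, errors, and replacement.

def fetch_from_origin(url):
    return f"<contents of {url}>"       # stand-in for an HTTP fetch

class HierCache:
    def __init__(self, parent=None):
        self.store = {}                 # url -> cached object
        self.parent = parent            # None at the root of the hierarchy

    def get(self, url):
        if url in self.store:
            return self.store[url]          # local hit
        if self.parent is not None:
            obj = self.parent.get(url)      # resolve the miss upstream
        else:
            obj = fetch_from_origin(url)    # root: go to the origin site
        self.store[url] = obj   # the object is replicated at every level
        return obj

root = HierCache()
leaf = HierCache(parent=HierCache(parent=root))
leaf.get("http://www.congress.gov/")             # misses at every level
print("http://www.congress.gov/" in root.store)  # True: cached upstream too
```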

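The ICP-style sibling probing above might look like the following sketch. It models probes as parallel in-memory calls rather than real multicast datagrams, and the names (Sibling, icp_resolve) are assumptions for illustration: take the first HIT response, or give up once every sibling has answered MISS.

```python
# Sketch of Harvest/ICP-style sibling probing (not the real ICP wire
# protocol): probe all siblings, use the first HIT, otherwise fall back
# to the parent or origin after all MISS responses.

from concurrent.futures import ThreadPoolExecutor, as_completed

class Sibling:
    def __init__(self, name, contents):
        self.name = name
        self.contents = set(contents)   # URLs this sibling caches

    def probe(self, url):
        # Stand-in for an ICP_QUERY datagram and its HIT/MISS reply.
        return self.name, url in self.contents

def icp_resolve(url, siblings):
    """Return the first sibling answering HIT, or None after all MISS.
    A real implementation would also time out on silent siblings."""
    with ThreadPoolExecutor(max_workers=len(siblings)) as pool:
        futures = [pool.submit(s.probe, url) for s in siblings]
        for fut in as_completed(futures):
            name, hit = fut.result()
            if hit:
                return name
    return None

siblings = [Sibling("A", {"u1"}), Sibling("B", {"u2"})]
print(icp_resolve("u2", siblings))   # "B": remote hit at a sibling
print(icp_resolve("u3", siblings))   # None: resolve via parent/origin
```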
Issues for Cache Hierarchies

• With ICP: query traffic within "families" (size n)
  ♦ Inter-sibling ICP traffic (and aggregate overhead) is quadratic in n; query-handling overhead at each node grows linearly with n.
• Miss latency: an object passes through every cache on the path from origin to client; deeper hierarchies scale better, but impose higher latencies.
• Storage: a recently fetched object is replicated at every level of the tree.
• Effectiveness: interior cache benefits are limited by capacity if objects are not likely to live there long (e.g., under LRU replacement).

Hashing: Cache Array Routing Protocol (CARP)

• Used by Microsoft ISA Proxy Server.
• A hash function partitions the URL space among the proxies (e.g., a-f, g-p, q-u, v-z): hash "GET www.hotsite.com" to pick the responsible cache (see the sketch after this page).
• Advantages:
  1. single-hop request resolution
  2. no redundant caching of objects
  3. allows client-side implementation
  4. no new cache-cache protocols
  5. reconfigurable
[Diagram: clients hash requests directly to the responsible proxy]

Issues for CARP

• No way to exploit network locality at each level; e.g., relies on local browser caches to absorb repeats.
• Load balancing
  ♦ The hash can be balanced and/or weighted with a load factor reflecting the capacity/power of each server.
  ♦ Must rebalance on server failures: reassigns (1/n)th of cached URLs for array size n; URLs from the failed server are evenly distributed among the remaining n-1 servers.
• Miss penalty and cost to compute the hash: in CARP, hash cost is linear in n: hash with each node and pick the "winner".

Directory-based: Summary Cache for ICP

• Idea: each caching server replicates the cache directory ("summary") of each of its peers (e.g., siblings) [Cao et al., SIGCOMM 98].
• Query a peer only if its local summary indicates a hit.
• To reduce storage overhead, implement the summaries compactly using Bloom filters (a sketch follows this page).
  ♦ May yield false hits (e.g., 1%), but not false misses.
  ♦ Each summary is three orders of magnitude smaller than the cache itself, and can be updated by multicasting just the flipped bits.

A Summary-ICP Hierarchy

• e.g., Squid configured to use cache digests.
• Summary caches at each level of the hierarchy reduce inter-sibling miss queries by 95+%.
[Diagram: miss/hit paths, object request/response, query/response]

Issues for Directory-Based Caches

• Servers update their summaries lazily.
  ♦ Update when "new" entries exceed some threshold percentage.
  ♦ Update delays may yield false hits and/or false misses.
• Other ways to reduce directory size?
  ♦ Vicinity cache [Gadde/Chase/Rabinovich 98]
  ♦ Subsetting by popularity [Gadde/Chase/Rabinovich 97]
• What are the limits to scalability?
  ♦ If we grow the number of peers? If we grow the cache sizes?
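The "hash with each node and pick the winner" step is highest-random-weight (rendezvous) hashing. Below is a hedged sketch under that reading; the MD5-based score, the simple multiplicative load factor, and all names are illustrative assumptions, not the CARP specification.

```python
# Rendezvous-hashing sketch of CARP-style request routing. The hash and
# weighting here are illustrative; real CARP defines its own.

import hashlib

def score(url, proxy, load_factor=1.0):
    h = hashlib.md5((url + proxy).encode()).digest()
    return int.from_bytes(h[:8], "big") * load_factor

def route(url, proxies):
    """proxies: dict mapping proxy name -> load factor.
    Hash cost is linear in n: one score per proxy, then take the max."""
    return max(proxies, key=lambda p: score(url, p, proxies[p]))

proxies = {"proxy-a": 1.0, "proxy-b": 1.0, "proxy-c": 2.0}
print(route("http://www.hotsite.com/", proxies))

# On a failure, only the failed proxy's URLs move: every other URL keeps
# its winner, so roughly 1/n of the mappings are reassigned, spread
# evenly across the remaining n-1 proxies.
del proxies["proxy-a"]
print(route("http://www.hotsite.com/", proxies))
```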

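The Bloom-filter summaries described above admit a compact sketch. The parameters (m, k) and SHA-256 hashing are illustrative choices, not the paper's; the key property carries over: false hits are possible, false misses are not.

```python
# Sketch of a Bloom-filter cache summary in the spirit of Summary Cache
# [Cao et al. 98]; sizes and hash functions are illustrative.

import hashlib

class BloomSummary:
    def __init__(self, m=8192, k=4):
        self.m, self.k = m, k          # m bits, k hash functions
        self.bits = bytearray(m // 8)

    def _positions(self, url):
        for i in range(self.k):
            h = hashlib.sha256(f"{i}:{url}".encode()).digest()
            yield int.from_bytes(h[:4], "big") % self.m

    def add(self, url):                # a peer cached this URL
        for p in self._positions(url):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, url):      # query the peer only if True
        return all(self.bits[p // 8] & (1 << (p % 8))
                   for p in self._positions(url))

summary = BloomSummary()
summary.add("http://example.com/a")
print(summary.might_contain("http://example.com/a"))   # True
print(summary.might_contain("http://example.com/b"))   # False (w.h.p.)
```

Updating by "multicasting just the flipped bits" then amounts to shipping changed bit positions to peers; handling evictions requires a counting variant of the filter on the owner's side, as the Summary Cache paper describes.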
On the Scale and Performance.... [Wolman/Voelker/.../Levy 99]

• A key paper in this area over the last few years.
• First negative result in SOSP (?)
• Illustrates tools for evaluating wide-area systems: simulation and analytical modeling.
• Illustrates fundamental limits of caching: benefits dictated by reference patterns and object rate of change; forget about capacity, and assume ideal cooperation.
• Ties together previous work in the field: wide-area cooperative caching strategies, analytical models for Web workloads, best traces.

UW Trace Characteristics [Source: Geoff Voelker]

  Trace              UW
  Duration           7 days
  HTTP objects       18.4 million
  HTTP requests      82.8 million
  Avg. requests/sec  137
  Total bytes        677 GB
  Servers            244,211
  Clients            22,984

A Multi-Organization Trace

• The University of Washington (UW) is a large and diverse client population: approximately 50K people.
• The UW client population contains 200 independent campus organizations:
  ♦ Museums of Art and Natural History
  ♦ Schools of Medicine, Dentistry, and Nursing
  ♦ Departments of Computer Science, History, and Music
• A trace of UW is effectively a simultaneous trace of 200 diverse client organizations.
• Key: clients were tagged according to their organization in the trace.
[Source: Geoff Voelker]

Cooperation Across Organizations

• Treat each UW organization as an independent "company".
• Evaluate cooperative caching among these organizations: how much Web document reuse is there among them?
  ♦ Place a proxy cache in front of each organization.
  ♦ What is the benefit of cooperative caching among these 200 proxies?
[Source: Geoff Voelker]

Ideal Hit Rates for UW Proxies

• Ideal hit rate: infinite storage; ignore cacheability and expirations.
• Explores the benefits of perfect cooperation rather than a particular algorithm (a sketch of the computation follows this page).
• Average ideal local hit rate: 43%.
• Average ideal hit rate increases from 43% to 69% with cooperative caching.
[Source: Geoff Voelker]
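To pin down what "ideal" means here, a small sketch of the hit-rate computation, assuming a trace of (organization, URL) request pairs: with infinite storage and cacheability ignored, every re-reference is a hit. The trace format and names are assumptions for illustration, not the paper's simulator.

```python
# Hedged sketch of ideal local vs. cooperative hit rates over a tagged
# trace: first reference to a URL is a miss, every repeat is a hit.

def ideal_hit_rates(trace):
    """trace: iterable of (org, url) request pairs."""
    seen_by_org, seen_global = {}, set()
    local_hits = coop_hits = total = 0
    for org, url in trace:
        total += 1
        org_seen = seen_by_org.setdefault(org, set())
        if url in org_seen:
            local_hits += 1      # hit in the organization's own proxy
        if url in seen_global:
            coop_hits += 1       # hit somewhere among all the proxies
        org_seen.add(url)
        seen_global.add(url)
    return local_hits / total, coop_hits / total

trace = [("cs", "u1"), ("cs", "u1"), ("music", "u1"), ("music", "u2")]
local, coop = ideal_hit_rates(trace)
print(f"local {local:.2f}, cooperative {coop:.2f}")  # local 0.25, coop 0.50
```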

Cacheable Hit Rates for UW Proxies

• Cacheable hit rate: same as ideal, but does not ignore cacheability.
• Cacheable hit rates are much lower than ideal (average is 20%).
• Average cacheable hit rate increases from 20% to 41% with (perfect) cooperative caching.
[Source: Geoff Voelker]

Sharing Due to Affiliation

• UW organizational sharing vs. random organizations.
• The difference in weighted averages across all orgs is ~5%.
[Source: Geoff Voelker]

Hit Rate vs. Client Population

• Curves are similar to other studies [e.g., Duska 97, Breslau 98].
• Small organizations: significant increase in hit rate as the client population increases; this is the reason cooperative caching is effective for UW.
• Large organizations: marginal increase in hit rate as the client population increases.
[Source: Geoff Voelker]

Scaling Cooperative Caching

• Organizations of this size can benefit significantly from cooperative caching.
• But we don't need cooperative caching to handle the entire UW population:
  ♦ A single proxy (or small cluster) can handle this entire population!
  ♦ There is no technical reason to use cooperative caching for this environment.
  ♦ In the real world, decisions of proxy placement are often political or geographical.
• How effective is cooperative caching at scales where a single cache cannot be used?
[Source: Geoff Voelker]

In the Paper...

1. Do we believe this? What are some possible sources of error in this tracing/simulation study? What impact might they have?
2. Why are "ideal" hit rates so much higher for the MS trace, but the cacheable hit rates the same? What is the correlation between sharing and cacheability?
3. Why report byte hit rates as well as object hit rates? Is the difference significant? What does this tell us about reference patterns? (A small illustration follows this page.)

Trace-Driven Simulation: Sources of Error

1. End effects: is the trace interval long enough? Need adequate time for steady-state behavior to become apparent.
2. Sample size: is the population large enough? Is it representative?
3. Completeness: does the sample accurately capture the client reference streams? What about browser caches and lower-level proxies, and how would they affect the results?
4. Client subsets: how to select clients to represent a subpopulation?
5. Is the simulation accurate/realistic? Cacheability, capacity/replacement, expiration, latency.
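On question 3, a toy illustration of why object and byte hit rates can diverge (all numbers invented): if hits concentrate on small objects, a cache can absorb most requests while saving little bandwidth.

```python
# Object hit rate counts requests absorbed; byte hit rate measures
# bandwidth saved. Invented numbers, purely for illustration.

requests = [  # (hit?, object size in bytes)
    (True, 2_000), (True, 3_000), (True, 1_000),   # small objects hit
    (False, 500_000),                              # one big object misses
]

object_hit_rate = sum(hit for hit, _ in requests) / len(requests)
byte_hit_rate = (sum(size for hit, size in requests if hit)
                 / sum(size for _, size in requests))

print(f"object hit rate: {object_hit_rate:.0%}")   # 75%
print(f"byte hit rate:   {byte_hit_rate:.1%}")     # ~1.2%
```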
