SLIDE 1

Web Caching

based on:

✗ Web Caching, Geoff Huston
✗ Web Caching and Zipf-like Distributions: Evidence and Implications, L. Breslau, P. Cao, L. Fan, G. Phillips, S. Shenker
✗ On the scale and performance of cooperative Web proxy caching, A. Wolman, G. Voelker, N. Sharma, N. Cardwell, A. Karlin, H. Levy

SLIDE 2

Hypertext Transfer Protocol (HTTP)

based on an end-to-end model (the network is a passive element)

Pros:

✗ The server can modify the content;
✗ The server can track all content requests;
✗ The server can differentiate between different clients;
✗ The server can authenticate the client.

Cons:

✗ Popular servers are under continuous high stress
✗ high number of simultaneously opened connections
✗ high load on the surrounding network
✗ The network may not be efficiently utilized

SLIDE 3

Why Caching in the Network?

✔ reduce latency by avoiding slow links between client and origin server:
  ✗ low-bandwidth links
  ✗ congested links
✔ reduce traffic on links
✔ spread the load of overloaded origin servers to caches

SLIDE 4

Caching points

✔ content provider (server caching)
✔ ISP
  ✗ more than 70% of ISP traffic is Web-based;
  ✗ repeated requests make up about 50% of the traffic;
  ✗ ISPs are interested in reducing transmission costs;
  ✗ and in providing high quality of service to the end users.
✔ end client

SLIDE 5

Caching Challenges

✔ Cache consistency
  ✗ identify whether an object is stale or fresh
✔ Dynamic content
  ✗ caches should not store the output of CGI scripts (Active Cache?)
✔ Hit counts and personalization
✔ Privacy-concerned users
  ✗ how do you get a user to point to a cache?
✔ Access control
  ✗ legal and security restrictions
✔ Large multimedia files

SLIDE 6

Web cache access pattern (common assumptions)

✔ the distribution of page requests follows a Zipf-like distribution: the relative probability of a request for the i-th most popular page is proportional to 1/i^α, with α ≤ 1
✔ for an infinite-sized cache, the hit ratio grows in a log-like fashion as a function of the client population and of the number of requests
✔ the hit ratio of a web cache grows in a log-like fashion as a function of cache size
✔ the probability that a document will be referenced k requests after it was last referenced is roughly proportional to 1/k (temporal locality)
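The log-like growth of the infinite-cache hit ratio can be illustrated with a short simulation. This is only a sketch: the universe size of 10,000 pages, α = 0.8, the sample points, and the seed are illustrative choices, not values from the slides.

```python
import bisect
import random

def zipf_cdf(n_pages, alpha):
    """Cumulative weights for a Zipf-like popularity: P(i) ~ 1/i^alpha."""
    total, cdf = 0.0, []
    for i in range(1, n_pages + 1):
        total += 1.0 / i ** alpha
        cdf.append(total)
    return cdf

def hit_ratio_curve(n_pages=10_000, alpha=0.8, marks=(100, 1_000, 10_000), seed=1):
    """Feed a Zipf-like request stream into an unbounded ("infinite") cache
    and record the cumulative hit ratio at each mark in `marks`."""
    rng = random.Random(seed)
    cdf = zipf_cdf(n_pages, alpha)
    cached, hits, out = set(), 0, {}
    for r in range(1, max(marks) + 1):
        # inverse-CDF sampling: pick a page with probability ~ 1/i^alpha
        page = bisect.bisect_left(cdf, rng.random() * cdf[-1])
        if page in cached:
            hits += 1
        else:
            cached.add(page)          # infinite cache: never evict
        if r in marks:
            out[r] = hits / r
    return out
```

Printing `hit_ratio_curve()` shows the hit ratio climbing slowly (roughly logarithmically) as more requests arrive, matching the claim above.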

SLIDE 7

Web cache access pattern (P. Cao's observations)

✔ the distribution of web requests follows a Zipf-like distribution with α ∈ [0.64, 0.83] (fig. 1)
✔ the 10/90 rule does not apply to web accesses (70% of accesses go to about 25%-40% of the documents) (fig. 2)
✔ low statistical correlation between the frequency with which a document is accessed and its size (fig. 3)
✔ low statistical correlation between a document's access frequency and its average number of modifications per request
✔ the request distribution to servers does not follow a Zipf law; there is no single server contributing most of the popular pages (fig. 5)

SLIDE 8

Implications of Zipf-like behavior

✔ Model:
  ✗ a cache that receives a stream of requests
  ✗ N = total number of pages in the universe
  ✗ P_N(i) = probability that, given the arrival of a page request, it is made for page i (i = 1, ..., N)
  ✗ pages are ranked in order of their popularity; page i is the i-th most popular
  ✗ Zipf-like distribution for the arrivals: P_N(i) = Ω/i^α, with Ω = (Σ_{i=1..N} 1/i^α)^(-1) so that Σ_i P_N(i) = 1
  ✗ no pages are invalidated in the cache
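A minimal sketch of the normalization: Ω is simply the reciprocal of the truncated generalized harmonic sum, so the probabilities sum to 1 by construction (the function names are illustrative).

```python
def zipf_omega(n_pages, alpha):
    """Normalization constant Omega for P_N(i) = Omega / i**alpha:
    the reciprocal of sum_{i=1..N} 1/i**alpha, so the P_N(i) sum to 1."""
    return 1.0 / sum(1.0 / i ** alpha for i in range(1, n_pages + 1))

def page_probability(i, n_pages, alpha):
    """P_N(i): probability that an arriving request is for page i."""
    return zipf_omega(n_pages, alpha) / i ** alpha
```

With α = 0.8 and N = 1000 pages this puts a few percent of all requests on the single most popular page.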

SLIDE 9

Implications of Zipf-like behavior

H(R), H(C) = hit ratio

✔ infinite cache, finite request stream of R requests:
  H(R) = (Ω/(1-α))·(ΩR)^((1-α)/α)   (0 < α < 1)
  H(R) = Ω·log(ΩR)                  (α = 1)
✔ finite cache of size C, infinite request stream:
  H(C) = (Ω/(1-α))·C^(1-α)   (0 < α < 1)
  H(C) = Ω·log C             (α = 1)
✔ page request interarrival time (α = 1):
  d(k) = probability that a request for page i is followed by k-1 requests for pages other than i before the next request for i
  d(k) ≈ (1/(k·log N))·[(1 - 1/(N·log N))^k - (1 - 1/log N)^k]
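The finite-cache growth law H(C) ~ C^(1-α) can be sanity-checked against the exact sum of the top-C page probabilities. A sketch under illustrative parameters (a universe of 10^6 pages, α = 0.8, cache sizes 1,000 and 2,000): doubling the cache should multiply the hit ratio by roughly 2^(1-α) ≈ 1.15.

```python
def top_c_hit_ratio(n_pages, alpha, c):
    """Exact hit ratio of a cache that holds the C most popular pages,
    under P(i) = Omega / i**alpha and no invalidations: sum of the
    top-C probabilities."""
    weights = [1.0 / i ** alpha for i in range(1, n_pages + 1)]
    omega = 1.0 / sum(weights)
    return omega * sum(weights[:c])

# doubling C scales the hit ratio by roughly 2**(1 - alpha)
ratio = (top_c_hit_ratio(1_000_000, 0.8, 2_000)
         / top_c_hit_ratio(1_000_000, 0.8, 1_000))
```

The measured ratio lands near, though not exactly on, 2^0.2, because the C^(1-α) form is only the leading term of the sum.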

SLIDE 10

Cache Replacement Algorithms (fig. 9)

✔ GD-Size:
  ✗ performs best for small cache sizes
✔ Perfect-LFU:
  ✗ performs comparably to GD-Size in hit ratio and much better in byte hit ratio for large cache sizes
  ✗ drawback: increased complexity
✔ In-Cache-LFU:
  ✗ performs the worst
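GreedyDual-Size can be sketched in a few lines. This is a minimal version with uniform retrieval cost (i.e. GD-Size(1), which favors keeping many small objects); the class and method names are illustrative, not from any published implementation.

```python
class GDSizeCache:
    """Minimal GreedyDual-Size sketch with cost = 1 for every document."""

    def __init__(self, capacity_bytes):
        self.capacity = capacity_bytes
        self.used = 0
        self.inflation = 0.0        # L: raised to the H of each evicted object
        self.entries = {}           # url -> (H value, size in bytes)

    def access(self, url, size):
        """Return True on a hit; on a miss, insert after evicting the
        lowest-H objects until the new one fits."""
        if url in self.entries:
            _, sz = self.entries[url]
            self.entries[url] = (self.inflation + 1.0 / sz, sz)  # refresh H
            return True
        while self.used + size > self.capacity and self.entries:
            victim = min(self.entries, key=lambda u: self.entries[u][0])
            h, sz = self.entries.pop(victim)
            self.inflation = h      # aging: later inserts start with higher H
            self.used -= sz
        if size <= self.capacity:
            self.entries[url] = (self.inflation + 1.0 / size, size)
            self.used += size
        return False
```

Because H = L + 1/size, a large object has a lower H than a small one inserted at the same time, so it is evicted first.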

SLIDE 11

Web document sharing and proxy caching

✔ What is the best performance one could achieve with “perfect” cooperative caching?
✔ For what range of client populations can cooperative caching work effectively?
✔ Does the way in which clients are assigned to caches matter?
✔ What cache hit rates are necessary to achieve worthwhile decreases in document access latency?

SLIDE 12

For what range of client populations can cooperative caching work effectively?

✔ for smaller populations, the hit rate increases rapidly with population (cooperative caching can be used effectively)
✔ for larger populations, cooperative caching is unlikely to bring benefits

Conclusion:

✔ use cooperative caching to adapt to proxy assignments made for political or geographical reasons

[figure: request hit rate (%) vs. population, up to 30,000 clients; curves: Ideal (UW), Cacheable (UW), Ideal (MS), Cacheable (MS)]

SLIDE 13

Object Latency vs. Population

✔ latency stays essentially unchanged as the population increases
✔ caching will have little effect on mean and median latency beyond a very small client population (?!)

[figure: object latency (ms) vs. population; mean and median curves for No Cache, Cacheable, and Ideal]

SLIDE 14

Byte Hit Rate vs. Population

✔ same knee at about 2500 clients
✔ shared documents are smaller

[figure: byte hit rate (%) vs. population; Ideal and Cacheable]

SLIDE 15

Bandwidth vs. Population

✔ while caching reduces bandwidth consumption, there is no additional benefit from an increased client population

[figure: mean bandwidth (Mbits/s) vs. population; No Cache, Cacheable, Ideal]

SLIDE 16

Proxies and organizations

✔ there is some locality in organizational membership, but the impact is not significant

[figures: hit rate (%) across the 15 largest organizations (sizes 978 down to 192 clients); left: Ideal/Cacheable x Cooperative/Local; right: UW vs. Random assignment]

SLIDE 17

Impact of larger population size

✔ unpopular documents are universally unpopular => a miss in one large population group is unlikely to be a hit in another group

[figure: cooperative vs. local hit rate (%) for UW Ideal, UW Cacheable, MS Ideal, MS Cacheable]

SLIDE 18

Performance of large-scale proxy caching - model

✔ N = clients in population
✔ n = total number of documents (approx. 3.2 billion)
✔ p_i = fraction of all requests that are for the i-th most popular document (p_i ∝ 1/i^α, α = 0.8)
✔ client requests arrive with exponentially distributed interarrival times, aggregate rate λN, where λ = 590 reqs/day = average client request rate
✔ the time between document changes is exponentially distributed with parameter µ (unpopular documents change on average every 186 days, µ_u = 1/186; popular documents every 14 days, µ_p = 1/14)
✔ p_c = 0.6 = probability that a document is cacheable
✔ E(S) = 7.7 KB = avg. document size
✔ E(L) = 1.9 sec = last-byte latency to server
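The parameters above can be combined into a rough steady-state estimate: a request for document i hits if some client has re-requested it since its last change, approximated here by rate_i/(rate_i + µ_i). This is only a sketch under stated assumptions (a much smaller document universe than 3.2 billion, an assumed popularity cutoff of 1,000 for the µ_p/µ_u split, and a simple renewal-style hit term), not the paper's exact analysis.

```python
def model_hit_rate(N, n_docs=100_000, alpha=0.8, lam=590.0,
                   mu_pop=1/14.0, mu_unpop=1/186.0, p_cacheable=0.6,
                   popular_cutoff=1_000):
    """Rough steady-state hit rate for N clients: document i is a hit
    with probability rate_i / (rate_i + mu_i), where rate_i is the
    aggregate request rate for i (reqs/day) and mu_i its change rate."""
    weights = [1.0 / i ** alpha for i in range(1, n_docs + 1)]
    omega = 1.0 / sum(weights)
    hit = 0.0
    for i, w in enumerate(weights, start=1):
        p_i = omega * w                        # popularity of document i
        mu_i = mu_pop if i <= popular_cutoff else mu_unpop
        rate_i = N * lam * p_i                 # aggregate request rate
        hit += p_i * rate_i / (rate_i + mu_i)
    return p_cacheable * hit
```

The estimate is bounded above by p_c = 0.6 and increases with population, mirroring the hit-rate curves on the next slide.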

SLIDE 19

Hit rate, latency and bandwidth

Hit rate = f(Population) has three regions:

✔ 1. the request rate is too low to dominate the rate of change of unpopular documents
✔ 2. marks a significant increase in the hit rate of the unpopular documents
✔ 3. the request rate is high enough to cope with the document modifications

[figure: cacheable hit rate (%) vs. population (log scale, 10^2 to 10^8); change-rate scenarios: Slow (14 days, 186 days), Mid Slow (1 day, 186 days), Mid Fast (1 hour, 85 days), Fast (5 mins, 85 days)]

Latency: lat_req = (1 - H_N)·E(L) + H_N·lat_hit
Bandwidth: B_N = H_N·E(S)
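Both formulas translate directly into code. Two caveats: lat_hit (the latency of serving a hit) is left implicit on the slide, so it is a parameter here, and reading B_N = H_N·E(S) as the average kilobytes per request served from the cache is an interpretation, not something the slide states.

```python
def request_latency(hit_rate, e_l=1.9, lat_hit=0.0):
    """lat_req = (1 - H_N) * E(L) + H_N * lat_hit, in seconds.
    E(L) = 1.9 s is the slide's last-byte latency to the server."""
    return (1.0 - hit_rate) * e_l + hit_rate * lat_hit

def cache_bytes_per_request(hit_rate, e_s=7.7):
    """B_N = H_N * E(S): average KB per request served from the cache,
    under the interpretation noted above. E(S) = 7.7 KB."""
    return hit_rate * e_s
```

For example, a 50% hit rate with near-zero hit latency halves the mean last-byte latency from 1.9 s to 0.95 s.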

SLIDE 20

Document rate of change

The proxy cache hit rate is very sensitive to the change rates of both popular and unpopular documents; the time scales at which the hit rate is sensitive shrink as the population grows.

[figures: cacheable hit rate (%) vs. change interval (log scale, 1 min to 180 days), varying µ_p and µ_u, for population sizes A, B, C]

SLIDE 21

Comparing cooperative caching schemes

Three types of cache schemes: hierarchical, hash-based, and summary caches.

Conclusions:

✔ the achievable latency benefits are already obtained at the level of city-wide cooperative caching
✔ a flat cooperative scheme is no longer effective once the served area grows beyond a limit
✔ in a hierarchical caching scheme the higher levels may become a bottleneck in serving requests

SLIDE 22

Hierarchical Caching (Squid):

✔ a hierarchy of k levels
✔ a client request is forwarded up the hierarchy until a cache hit occurs
✔ a copy of the requested object is then stored in all caches on the request path
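The forward-up-then-copy-down behavior can be sketched with plain dicts standing in for caches. This is an illustrative toy, not Squid's actual ICP protocol; `hierarchy` is ordered from leaf (level 0) to root.

```python
def fetch(url, hierarchy, origin):
    """Walk the caches from leaf to root; on a hit (or, failing that, at
    the origin server) return the body and store a copy in every cache
    that missed along the request path."""
    path = []
    for cache in hierarchy:           # forward the request up the hierarchy
        if url in cache:
            body = cache[url]
            break
        path.append(cache)
    else:
        body = origin[url]            # miss at every level: ask the server
    for cache in path:                # copy into all caches on the path
        cache[url] = body
    return body
```

After one fetch, subsequent requests for the same URL hit at the lowest level, which is where the hierarchy saves latency.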

SLIDE 23

Hash-based caching system:

✔ assumes a total of m caches cumulatively serving a population of size N
✔ upon a request for a URL, the client determines the cache in charge and forwards the request to it
✔ if the cache has the URL, the page is returned to the client; otherwise the request is forwarded to the server

SLIDE 24

Directory-based system (Summary cache)

✔ a total of m caches cumulatively serving a population of size N
✔ the population is partitioned into subpopulations of size N/m, and each subpopulation is served by a single cache
✔ each cache maintains a directory that summarizes the documents stored at each of the other caches in the system
✔ when a client request cannot be served by the local cache, it is forwarded to a randomly chosen cache whose summary indicates it holds the document; if no cache holds the document, the request is forwarded to the origin server
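A toy version of the directory lookup, with plain Python sets standing in for the compact Bloom-filter summaries of the Summary Cache scheme (class and attribute names are illustrative; real summaries are updated lazily and may be stale, which the scheme tolerates):

```python
import random

class SummaryCacheNode:
    """One of m caches; `summaries` maps a peer's name to the set of
    URLs that peer has advertised (stand-in for a Bloom filter)."""

    def __init__(self, name):
        self.name = name
        self.store = {}        # locally cached documents
        self.summaries = {}    # peer name -> advertised URL set (may be stale)

    def lookup(self, url, peers, origin, rng=random):
        if url in self.store:                       # local hit
            return self.store[url]
        holders = [p for p in peers
                   if url in self.summaries.get(p.name, set())]
        if holders:                                 # ask one advertised holder
            peer = rng.choice(holders)
            if url in peer.store:                   # summary was accurate
                return peer.store[url]
        body = origin[url]                          # nobody has it: origin
        self.store[url] = body
        return body
```

The design trade-off: directories avoid the multicast queries of a flat scheme, at the cost of memory for the summaries and occasional false hits when a summary is stale.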