Today Memory hierarchy, caches, locality Cache organiza:on - PowerPoint PPT Presentation

University ¡of ¡Washington ¡ Today ¡ ¢ Memory ¡hierarchy, ¡caches, ¡locality ¡ ¢ Cache ¡organiza:on ¡ ¢ Program ¡op:miza:ons ¡that ¡consider ¡caches ¡ 1 ¡

University ¡of ¡Washington ¡ How will execution time grow with SIZE? int array[SIZE]; int A = 0; for (int i = 0 ; i < 200000 ; ++ i) { for (int j = 0 ; j < SIZE ; ++ j) { A += array[j]; TIME } Plot } SIZE 2 ¡

University ¡of ¡Washington ¡ Actual Data 45 40 35 30 25 Series1 20 15 10 5 0 0 2000 4000 6000 8000 10000 3 ¡

University ¡of ¡Washington ¡ Problem: ¡Processor-‑Memory ¡BoDleneck ¡ Processor ¡performance ¡ doubled ¡about ¡ ¡ Bus ¡bandwidth ¡ every ¡18 ¡months ¡ evolved ¡much ¡slower ¡ Main ¡ CPU ¡ Reg ¡ Memory ¡ Core ¡2 ¡Duo: ¡ Core ¡2 ¡Duo: ¡ Can ¡process ¡at ¡least ¡ Bandwidth ¡ 256 ¡Bytes/cycle ¡ 2 ¡Bytes/cycle ¡ Latency ¡ 100 ¡cycles ¡ Buff… ¡ ¡ 4 ¡

University ¡of ¡Washington ¡ Problem: ¡Processor-‑Memory ¡BoDleneck ¡ Processor ¡performance ¡ doubled ¡about ¡ ¡ Bus ¡bandwidth ¡ every ¡18 ¡months ¡ evolved ¡much ¡slower ¡ Main ¡ CPU ¡ Reg ¡ Memory ¡ Core ¡2 ¡Duo: ¡ Core ¡2 ¡Duo: ¡ Can ¡process ¡at ¡least ¡ Bandwidth ¡ 256 ¡Bytes/cycle ¡ 2 ¡Bytes/cycle ¡ Latency ¡ 100 ¡cycles ¡ Solu/on: ¡Caches ¡ 5 ¡

University ¡of ¡Washington ¡ Cache ¡ ¢ English ¡defini:on: ¡a ¡hidden ¡storage ¡space ¡for ¡provisions, ¡ weapons, ¡and/or ¡treasures ¡ ¡ ¢ CSE ¡Defini:on: ¡computer ¡memory ¡with ¡short ¡access ¡:me ¡ used ¡for ¡the ¡storage ¡of ¡frequently ¡or ¡recently ¡used ¡ instruc:ons ¡or ¡data ¡(i-‑cache ¡and ¡d-‑cache) ¡ ¡ more ¡generally, ¡ ¡ used ¡to ¡op:mize ¡data ¡transfers ¡between ¡system ¡elements ¡ with ¡different ¡characteris:cs ¡(network ¡interface ¡cache, ¡I/O ¡ cache, ¡etc.) ¡ 6 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Mechanics ¡ Smaller, ¡faster, ¡more ¡expensive ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ memory ¡caches ¡a ¡ ¡subset ¡of ¡ the ¡blocks ¡ Data ¡is ¡copied ¡in ¡block-‑sized ¡ transfer ¡units ¡ Larger, ¡slower, ¡cheaper ¡memory ¡ Memory ¡ viewed ¡as ¡par::oned ¡into ¡“blocks” ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 7 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Hit ¡ Request: ¡14 ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 8 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Hit ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡14 ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 9 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Hit ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡14 ¡ Block ¡b ¡is ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Hit! ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 10 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 11 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Miss! ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 12 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Miss! ¡ Block ¡b ¡is ¡fetched ¡from ¡ Request: ¡12 ¡ memory ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 13 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ 12 ¡ Miss! ¡ Block ¡b ¡is ¡fetched ¡from ¡ Request: ¡12 ¡ memory ¡ Block ¡b ¡is ¡stored ¡in ¡cache ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ • Placement ¡policy: ¡ determines ¡where ¡b ¡goes ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ • Replacement ¡policy: ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ determines ¡which ¡block ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ gets ¡evicted ¡(vicAm) ¡ 14 ¡

University ¡of ¡Washington ¡ Cache ¡Performance ¡Metrics ¡ ¢ Miss ¡Rate ¡ § FracAon ¡of ¡memory ¡references ¡not ¡found ¡in ¡cache ¡(misses ¡/ ¡accesses) ¡ = ¡1 ¡– ¡hit ¡rate ¡ § Typical ¡numbers ¡(in ¡percentages): ¡ § 3-‑10% ¡for ¡L1 ¡ § can ¡be ¡quite ¡small ¡(e.g., ¡< ¡1%) ¡for ¡L2, ¡depending ¡on ¡size, ¡etc. ¡ ¢ Hit ¡Time ¡ § Time ¡to ¡deliver ¡a ¡line ¡in ¡the ¡cache ¡to ¡the ¡processor ¡ § includes ¡Ame ¡to ¡determine ¡whether ¡the ¡line ¡is ¡in ¡the ¡cache ¡ § Typical ¡numbers: ¡ § 1-‑2 ¡clock ¡cycle ¡for ¡L1 ¡ § 5-‑20 ¡clock ¡cycles ¡for ¡L2 ¡ ¢ Miss ¡Penalty ¡ § AddiAonal ¡Ame ¡required ¡because ¡of ¡a ¡miss ¡ § typically ¡50-‑200 ¡cycles ¡for ¡main ¡memory ¡( trend: ¡increasing! ) ¡ 15 ¡

University ¡of ¡Washington ¡ Lets ¡think ¡about ¡those ¡numbers ¡ ¢ Huge ¡difference ¡between ¡a ¡hit ¡and ¡a ¡miss ¡ § Could ¡be ¡100x, ¡if ¡just ¡L1 ¡and ¡main ¡memory ¡ ¢ Would ¡you ¡believe ¡99% ¡hits ¡is ¡twice ¡as ¡good ¡as ¡97%? ¡ § Consider: ¡ ¡ cache ¡hit ¡Ame ¡of ¡1 ¡cycle ¡ miss ¡penalty ¡of ¡100 ¡cycles ¡ 16 ¡

University ¡of ¡Washington ¡ Lets ¡think ¡about ¡those ¡numbers ¡ ¢ Huge ¡difference ¡between ¡a ¡hit ¡and ¡a ¡miss ¡ § Could ¡be ¡100x, ¡if ¡just ¡L1 ¡and ¡main ¡memory ¡ ¢ Would ¡you ¡believe ¡99% ¡hits ¡is ¡twice ¡as ¡good ¡as ¡97%? ¡ § Consider: ¡ ¡ cache ¡hit ¡Ame ¡of ¡1 ¡cycle ¡ miss ¡penalty ¡of ¡100 ¡cycles ¡ § Average ¡access ¡Ame: ¡ ¡ ¡97% ¡hits: ¡ ¡1 ¡cycle ¡+ ¡0.03 ¡* ¡100 ¡cycles ¡= ¡ 4 ¡cycles ¡ ¡ ¡99% ¡hits: ¡ ¡1 ¡cycle ¡+ ¡0.01 ¡* ¡100 ¡cycles ¡= ¡ 2 ¡cycles ¡ ¡ ¢ This ¡is ¡why ¡“miss ¡rate” ¡is ¡used ¡instead ¡of ¡“hit ¡rate” ¡ 17 ¡

University ¡of ¡Washington ¡ Types ¡of ¡Cache ¡Misses ¡ ¢ Cold ¡(compulsory) ¡miss ¡ § Occurs ¡on ¡first ¡access ¡to ¡a ¡block ¡ 18 ¡

University ¡of ¡Washington ¡ Types ¡of ¡Cache ¡Misses ¡ ¢ Cold ¡(compulsory) ¡miss ¡ § Occurs ¡on ¡first ¡access ¡to ¡a ¡block ¡ ¢ Conflict ¡miss ¡ § Most ¡hardware ¡caches ¡limit ¡blocks ¡to ¡a ¡small ¡subset ¡(someAmes ¡just ¡one) ¡ of ¡the ¡available ¡cache ¡slots ¡ § if ¡one ¡(e.g., ¡block ¡i ¡must ¡be ¡placed ¡in ¡slot ¡(i ¡mod ¡size)), ¡direct-‑mapped ¡ § if ¡more ¡than ¡one, ¡n-‑way ¡set-‑associaAve ¡(where ¡n ¡is ¡a ¡power ¡of ¡2) ¡ § Conflict ¡misses ¡occur ¡when ¡the ¡cache ¡is ¡large ¡enough, ¡but ¡mulAple ¡data ¡ objects ¡all ¡map ¡to ¡the ¡same ¡slot ¡ § e.g., ¡referencing ¡blocks ¡0, ¡8, ¡0, ¡8, ¡... ¡would ¡miss ¡every ¡Ame= ¡ 19 ¡

Today Memory hierarchy, caches, locality Cache organiza:on - PowerPoint PPT Presentation

University of Washington Today Memory hierarchy, caches, locality Cache organiza:on Program op:miza:ons that consider caches 1 University of Washington

What is the League Today 1 1/23/2017 What is the League Today What is the League Today 2

Social/Network/Analysis mohamed.bouguessa@uqo.ca/ 1 Web/today 2

Lecture 15 Logistics HW4 is due today HW5 posted today HW5 posted today Exam

WIEMANN LAMPHERE ARCHITECTS MONTPELIER TODAY MONTPELIER TODAY PARKING! VEHICLES ARE

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Welcome back. Today. Welcome back. Today. Continue Sampling combinatorial structures. Welcome

1. Abertis today 2. 2016 Financial Year 3. Outlook 4. Conclusions Abertis today 2016

Matt Fisher EUA Coordinator Overview of Parramatta today Overview of Parramatta today Overview

Course Business New dataset on CourseWeb: bpd.csv Midterm project due today Today

Featherweight Scala Week 14 January 31 1 Today Previously: Featherweight Java Today:

Stuff New HW on the web later today No lab today Tests graded by Thurs Last Time

Welcome back. Today. Welcome back. Today. Review: Spectral gap, Edge expansion h ( G ) ,

Sorting 15-121 Fall 2020 Margaret Reid-Miller Today Margaret will have office hours today

Exceptions Announcements Exceptions Today's Topic: Handling Errors 4 Today's Topic: Handling

Today and Tomorrow HEARING LOSS TECHNOLOGY TODAY AND TOMORROW Laura E. Plummer, MA, CRC, ATP

Fr From om Aristoteles to A o AI Today Today Prof. of. Nikol ola K a Kasabov abov Fellow

Machine-checked Interpolation Theorems for Substructural Logics using Display Calculi Jeremy E.

Information Visualization 10-1 Ronald Peikert SciVis 2007 - Information Visualization Overview

Renormalization of Tensor Network States II. RG of Tensor Network States Tao Xiang Institute of

r t

Determining congruence n -permutability is hard ( n 3?) Jonah Horowitz Ryerson University

Chapter 1: Constraints What are they, what do they do and what can I use them for. 1

What do we know about intrinsic metric curvature of affine hypersurfaces? Udo Simon Leuven

Limit Theorems for Products of Large Random Matrices Alexander Tikhomirov Komi Research Center