$$$ $$$ Cache Memory $$$ 2 Schedule This week - PowerPoint PPT Presentation

ì ¡ Computer ¡Systems ¡and ¡Networks ¡ ECPE ¡170 ¡– ¡Jeff ¡Shafer ¡– ¡University ¡of ¡the ¡Pacific ¡ $$$ $$$ Cache ¡Memory ¡ $$$

2 ¡ Schedule ¡ ì This ¡week ¡ ì Chapter ¡6 ¡– ¡Memory ¡systems ¡ ì Next ¡Tuesday ¡ ì Exam ¡2 ¡– ¡Tuesday, ¡Nov ¡1 st ¡ ¡ ì Chapter ¡4 ¡ ì MARIE, ¡etc… ¡ ì Chapter ¡5 ¡ ì InstrucHon ¡sets, ¡memory ¡addressing ¡modes, ¡etc… ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

3 ¡ Review ¡– ¡Quiz ¡3 ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

4 ¡ Review ¡– ¡Quiz ¡4 ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

5 ¡ Objectives ¡ Star@ng ¡Chapter ¡6 ¡today ¡ ì No ¡longer ¡will ¡we ¡treat ¡memory ¡as ¡a ¡big ¡dumb ¡array ¡of ¡ ì bytes! ¡ Hierarchical ¡memory ¡organizaHon ¡ ì How ¡does ¡each ¡level ¡of ¡memory ¡contribute ¡to ¡system ¡ ì performance? ¡ How ¡do ¡we ¡measure ¡performance? ¡ ì New ¡concepts! ¡ ì Cache ¡memory ¡and ¡virtual ¡memory ¡ ì Memory ¡segmentaHon ¡and ¡paging ¡ ì Address ¡translaHon ¡ ì Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

6 ¡ Types ¡of ¡Memory ¡ ì RAM ¡versus ¡ROM? ¡ ì RAM ¡– ¡Random ¡access ¡memory ¡(read ¡& ¡write) ¡ ì ROM ¡– ¡Read-‑only ¡memory ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

7 ¡ Types ¡of ¡Memory ¡ ì DRAM ¡versus ¡SRAM? ¡ ì DRAM ¡– ¡ Dynamic ¡RAM ¡ ì Cheap ¡and ¡simple! ¡ ì Capacitors ¡that ¡slowly ¡leak ¡charge ¡over ¡Hme ¡ ì Refresh ¡every ¡few ¡milliseconds ¡to ¡preserve ¡data ¡ ì SRAM ¡– ¡ Sta@c ¡RAM ¡ ì Similar ¡to ¡D ¡Flip-‑flops ¡ ì No ¡need ¡for ¡refresh ¡ ì Fast ¡/ ¡expensive ¡(use ¡for ¡cache ¡memory, ¡registers, ¡…) ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

8 ¡ Memory ¡Hierarchy ¡ ì Goal ¡as ¡system ¡designers: ¡ ¡ Fast ¡performance ¡ and ¡low ¡cost! ¡ Tradeoff: ¡Faster ¡memory ¡is ¡more ¡expensive ¡than ¡slower ¡ ì memory ¡ ì To ¡provide ¡the ¡best ¡performance ¡at ¡the ¡lowest ¡cost, ¡ memory ¡is ¡organized ¡in ¡a ¡hierarchical ¡fashion ¡ Small, ¡fast ¡storage ¡elements ¡are ¡kept ¡in ¡the ¡CPU ¡ ì Larger, ¡less ¡fast ¡main ¡memory ¡is ¡accessed ¡through ¡the ¡ ì data ¡bus ¡ Largest, ¡slowest, ¡permanent ¡storage ¡(disks, ¡etc…) ¡is ¡even ¡ ì further ¡from ¡the ¡CPU ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

9 ¡ The ¡Memory ¡Hierarchy ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

10 ¡ Memory ¡Hierarchy ¡ This ¡chapter ¡just ¡focuses ¡on ¡the ¡part ¡of ¡the ¡memory ¡hierarchy ¡ ì that ¡involves ¡ registers , ¡ cache , ¡ main ¡memory , ¡and ¡ virtual ¡ memory ¡ What ¡is ¡a ¡register? ¡ ì Storage ¡locaHons ¡available ¡on ¡the ¡processor ¡itself ¡ ì Manually ¡ managed ¡by ¡the ¡assembly ¡programmer ¡or ¡ ì compiler ¡ What ¡is ¡main ¡memory? ¡RAM ¡ ì What ¡is ¡virtual ¡memory? ¡ ì Extends ¡the ¡address ¡space ¡from ¡RAM ¡to ¡the ¡hard ¡drive ¡ ì Provides ¡more ¡space ¡ ì Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

11 ¡ Cache ¡Memory ¡ ì What ¡is ¡a ¡cache? ¡ ì Speed ¡up ¡memory ¡accesses ¡by ¡storing ¡recently ¡used ¡ data ¡closer ¡to ¡the ¡CPU ¡ ì Closer ¡that ¡main ¡memory ¡– ¡on ¡the ¡CPU ¡itself! ¡ ì Although ¡cache ¡is ¡much ¡smaller ¡than ¡main ¡memory, ¡ its ¡access ¡Hme ¡is ¡a ¡fracHon ¡of ¡that ¡of ¡main ¡memory ¡ ì Cache ¡is ¡ automa@cally ¡ managed ¡by ¡the ¡memory ¡ system ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

12 ¡ Memory ¡Hierarchy ¡ ì CPU ¡wishes ¡to ¡access ¡data ¡for ¡an ¡instrucHon ¡ Does ¡the ¡instrucHon ¡say ¡it ¡is ¡in ¡a ¡register ¡or ¡memory? ¡ ì ì If ¡register, ¡go ¡get ¡it! ¡ If ¡in ¡memory, ¡send ¡request ¡to ¡nearest ¡memory ¡(the ¡ ì cache) ¡ If ¡not ¡in ¡cache, ¡send ¡request ¡to ¡main ¡memory ¡ ì If ¡not ¡in ¡main ¡memory, ¡send ¡request ¡to ¡virtual ¡memory ¡ ì (the ¡disk) ¡ ì Once ¡the ¡data ¡is ¡located ¡and ¡delivered ¡to ¡the ¡CPU, ¡it ¡will ¡ also ¡be ¡saved ¡into ¡cache ¡memory ¡for ¡future ¡access ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

13 ¡ (Cache) ¡Hits ¡versus ¡Misses ¡ Hit ¡– ¡When ¡data ¡is ¡found ¡at ¡a ¡given ¡memory ¡level. ¡ ì Miss ¡– ¡When ¡data ¡is ¡not ¡found ¡at ¡a ¡given ¡level ¡ ì Hit ¡rate ¡– ¡Percentage ¡of ¡Hme ¡data ¡is ¡found ¡at ¡a ¡given ¡memory ¡ ì level. ¡ Miss ¡rate ¡ – ¡Percentage ¡of ¡Hme ¡data ¡is ¡not ¡found ¡ ì Miss ¡rate ¡= ¡1 ¡-‑ ¡hit ¡rate ¡ ì Hit ¡@me ¡ – ¡Time ¡required ¡to ¡access ¡data ¡at ¡a ¡given ¡memory ¡ ì level ¡ Miss ¡penalty ¡ – ¡Time ¡required ¡to ¡process ¡a ¡miss ¡ ì Time ¡that ¡it ¡takes ¡to ¡replace ¡a ¡block ¡of ¡memory, ¡plus ¡ ì Time ¡it ¡takes ¡to ¡deliver ¡the ¡data ¡to ¡the ¡processor ¡ ì Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

14 ¡ Cache ¡Locality ¡ ì When ¡data ¡is ¡loaded ¡into ¡a ¡cache, ¡we ¡save ¡more ¡than ¡ just ¡the ¡specific ¡byte(s) ¡requested ¡ Oden, ¡save ¡neighboring ¡64 ¡bytes ¡or ¡more! ¡ ì ì Principle ¡of ¡locality ¡ – ¡Once ¡a ¡byte ¡is ¡accessed, ¡it ¡is ¡likely ¡ that ¡a ¡nearby ¡data ¡element ¡will ¡be ¡needed ¡soon ¡ ì There ¡are ¡three ¡forms ¡of ¡locality: ¡ Temporal ¡locality ¡ – ¡Recently-‑accessed ¡data ¡elements ¡ ì tend ¡to ¡be ¡accessed ¡again ¡ Spa@al ¡locality ¡ -‑ ¡Accesses ¡tend ¡to ¡cluster ¡in ¡memory ¡ ì Sequen@al ¡locality ¡ -‑ ¡InstrucHons ¡tend ¡to ¡be ¡accessed ¡ ì sequenHally ¡ (just ¡a ¡variant ¡of ¡Spa0al ¡locality) ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

15 ¡ ì ¡ Cache ¡Design ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

16 ¡ Cache ¡Memory ¡ ì First, ¡divide ¡main ¡memory ¡and ¡cache ¡memory ¡into ¡ blocks ¡ ¡ Cache ¡block ¡size ¡= ¡main ¡memory ¡block ¡size ¡ ì Example: ¡Core ¡i7: ¡ 64 ¡bytes ¡ per ¡cache ¡block ¡ ì ì If ¡data ¡is ¡loaded ¡into ¡the ¡cache, ¡we ¡load ¡in ¡the ¡enHre ¡ block, ¡even ¡if ¡we ¡only ¡needed ¡a ¡byte ¡of ¡it ¡ Allows ¡us ¡to ¡take ¡advantage ¡of ¡locality ¡ ì ì Main ¡memory ¡is ¡much ¡larger ¡than ¡the ¡cache ¡ Thus, ¡many ¡blocks ¡of ¡main ¡memory ¡must ¡map ¡to ¡a ¡ ì single ¡block ¡of ¡cache ¡ Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

17 ¡ Cache ¡Memory ¡ ì Main ¡memory ¡is ¡usually ¡accessed ¡by ¡address ¡ i.e. ¡“Give ¡me ¡the ¡byte ¡stored ¡at ¡address ¡0x2E3” ¡ ì ì If ¡the ¡data ¡is ¡copied ¡to ¡the ¡cache, ¡it ¡cannot ¡keep ¡the ¡ same ¡address ¡ Remember, ¡the ¡cache ¡is ¡ much ¡smaller ¡than ¡main ¡ ì memory! ¡ ì We ¡need ¡a ¡scheme ¡to ¡translate ¡between ¡a ¡main ¡ memory ¡address ¡and ¡a ¡cache ¡locaHon ¡ Engineers ¡have ¡devised ¡several ¡schemes… ¡ ì Direct ¡map, ¡fully ¡associaHve ¡map, ¡set-‑associaHve ¡map, ¡… ¡ ì Computer ¡Systems ¡and ¡Networks ¡ Fall ¡2011 ¡

$$$ $$$ Cache Memory $$$ 2 Schedule This week - PowerPoint PPT Presentation

Computer Systems and Networks ECPE 170 Jeff Shafer University of the Pacific $$$ $$$ Cache Memory $$$ 2 Schedule This week

What Is Memory Hierarchy A typical memory hierarchy today: Lecture 13: Cache Basics and Cache

Memory Hierarchy: Cache Memory hierarchy Cache basics Locality Cache organization Cache-aware

1 Classifying cache misses Cache Organization Classifying misses by causes (3Cs) Cache size,

General Cache Mechanics CPU Block: unit of data in cache and memory. (a.k.a. line) Memory

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Chapter 4 Cache Memory Contents Computer memory system overview Characteristics of

Cache Systems CPU Main Main CPU Memory Memory 400MHz 10MHz Cache 10MHz Memory Hierarchy

L09: Cache Name: ID: Question: Direct Mapping Cache Hit Rate Consider a 4-block empty Cache,

Virtual Memory 1 Memory Hierarchy Memory 4GB Cache 1M Registers 1K Question: What if

Lecture 23: Cache, Memory, Virtual Memory Todays topics: Cache examples, caching

Caches Electronic Computers M Caches 1 Cache LOCALITY PRINCIPLE (SPATIAL AND TEMPORAL)

Web Cache Consistency Web Cache Consistency Web Cache Consistency Web Cache Consistency

Cache Example Main memory: Byte addressable memory of size 4GB = 2 32 bytes Cache size: 64KB = 2 16

Generations of Cache 1980: no cache in proc; 1989 first Intel proc with a cache on chip.

Distributed Memory and Cache Consistency Distributed Memory and Cache Consistency (some slides

Detecting strategic moves in HearthStone matches BORIS DOUX, CLMENT GAUTRAIS AND BENJAMIN

Scheduling Optim al & Real Tim e using CORA CORA CORA April 2002 June 2005

Observation of Gravitational Waves from a Binary Black Hole Merger In LIGO Hanford and Livingston

Future silicon trackers: 4D tracking, very high fluences, very small pixels Nicol Cartiglia

Blueprint of a Mobile App Indy Khare - Tech Lead & Manager Google Photos Android 2008 - 2009

NODES 2019 Track #1, 4:00PM By Fanghua(Joshua) Yu, Oct. 2019 NODES 2019 Best Practices to Make

OR 62 E xpre ssway Vilas Inte rc hange T AC Me e ting Pre se nte d b y: Ka tie Bro wn Oc to

Preprocessing QBF: Failed Literals and Quantified Blocked Clause Elimination Florian Lonsing

$$$ $$$ Cache Memory $$$ 2 Schedule This week - PowerPoint PPT Presentation

Computer Systems and Networks ECPE 170 Jeff Shafer University of the Pacific $$$ $$$ Cache Memory $$$ 2 Schedule This week

What Is Memory Hierarchy A typical memory hierarchy today: Lecture 13: Cache Basics and Cache

Memory Hierarchy: Cache Memory hierarchy Cache basics Locality Cache organization Cache-aware

1 Classifying cache misses Cache Organization Classifying misses by causes (3Cs) Cache size,

General Cache Mechanics CPU Block: unit of data in cache and memory. (a.k.a. line) Memory

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Chapter 4 Cache Memory Contents Computer memory system overview Characteristics of

Cache Systems CPU Main Main CPU Memory Memory 400MHz 10MHz Cache 10MHz Memory Hierarchy

L09: Cache Name: ID: Question: Direct Mapping Cache Hit Rate Consider a 4-block empty Cache,

Virtual Memory 1 Memory Hierarchy Memory 4GB Cache 1M Registers 1K Question: What if

Lecture 23: Cache, Memory, Virtual Memory Todays topics: Cache examples, caching

Caches Electronic Computers M Caches 1 Cache LOCALITY PRINCIPLE (SPATIAL AND TEMPORAL)

Web Cache Consistency Web Cache Consistency Web Cache Consistency Web Cache Consistency

Cache Example Main memory: Byte addressable memory of size 4GB = 2 32 bytes Cache size: 64KB = 2 16

Generations of Cache 1980: no cache in proc; 1989 first Intel proc with a cache on chip.

Distributed Memory and Cache Consistency Distributed Memory and Cache Consistency (some slides

Detecting strategic moves in HearthStone matches BORIS DOUX, CLMENT GAUTRAIS AND BENJAMIN

Scheduling Optim al &amp; Real Tim e using CORA CORA CORA April 2002 June 2005

Observation of Gravitational Waves from a Binary Black Hole Merger In LIGO Hanford and Livingston

Future silicon trackers: 4D tracking, very high fluences, very small pixels Nicol Cartiglia

Blueprint of a Mobile App Indy Khare - Tech Lead &amp; Manager Google Photos Android 2008 - 2009

NODES 2019 Track #1, 4:00PM By Fanghua(Joshua) Yu, Oct. 2019 NODES 2019 Best Practices to Make

OR 62 E xpre ssway Vilas Inte rc hange T AC Me e ting Pre se nte d b y: Ka tie Bro wn Oc to

Preprocessing QBF: Failed Literals and Quantified Blocked Clause Elimination Florian Lonsing

Scheduling Optim al & Real Tim e using CORA CORA CORA April 2002 June 2005

Blueprint of a Mobile App Indy Khare - Tech Lead & Manager Google Photos Android 2008 - 2009