More $ (caches, yes) Trick or treat! Midterm - PowerPoint PPT Presentation

University ¡of ¡Washington ¡ More ¡$ ¡(caches, ¡yes) ¡ ¢ Trick ¡or ¡treat! ¡ ¢ Midterm ¡ques>ons? ¡ § Note: ¡prac+ce ¡midterms ¡posted ¡ ¢ HW ¡2 ¡due ¡today ¡ ¢ Lab ¡3 ¡will ¡be ¡released ¡soon ¡ § You ¡will ¡implement ¡a ¡buffer ¡overflow ¡a9ack! ¡Huahuahua! ¡ J ¡ ¡ 1 ¡

University ¡of ¡Washington ¡ Deja-vu int array[SIZE]; int A = 0; for (int i = 0 ; i < 200000 ; ++ i) { for (int j = 0 ; j < SIZE ; ++ j) { A += array[j]; } } Runtime Plot SIZE 2 ¡

University ¡of ¡Washington ¡ Not ¡to ¡forget… ¡ CPU A little of super fast memory (cache$) Lots of slower Mem 3 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Mechanics ¡ Smaller, ¡faster, ¡more ¡expensive ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ memory ¡caches ¡a ¡ ¡subset ¡of ¡ the ¡blocks ¡ Data ¡is ¡copied ¡in ¡block-‑sized ¡ transfer ¡units ¡ Larger, ¡slower, ¡cheaper ¡memory ¡ Memory ¡ viewed ¡as ¡par>>oned ¡into ¡“blocks” ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 4 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Hit ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡14 ¡ Block ¡b ¡is ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Hit! ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 5 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Miss! ¡ Oh ¡no! ¡What ¡now? ¡ Request: ¡12 ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 6 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ Miss! ¡ Block ¡b ¡is ¡fetched ¡from ¡ Request: ¡12 ¡ memory ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 7 ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ 12 ¡ Miss! ¡ Block ¡b ¡is ¡fetched ¡from ¡ Request: ¡12 ¡ memory ¡ Block ¡b ¡is ¡stored ¡in ¡cache ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ • Placement ¡policy: ¡ determines ¡where ¡b ¡goes ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ • Replacement ¡policy: ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ determines ¡which ¡block ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ gets ¡evicted ¡(vic+m) ¡ 8 ¡

University ¡of ¡Washington ¡ Cache ¡Performance ¡Metrics ¡ Miss ¡Rate ¡ ¢ § Frac+on ¡of ¡memory ¡references ¡not ¡found ¡in ¡cache ¡(misses ¡/ ¡accesses) ¡ = ¡1 ¡– ¡hit ¡rate ¡ § Typical ¡numbers ¡(in ¡percentages): ¡ CPU 3-‑10% ¡for ¡L1 ¡ § can ¡be ¡quite ¡small ¡(e.g., ¡< ¡1%) ¡for ¡L2, ¡depending ¡on ¡size, ¡etc. ¡ § Hit ¡Time ¡ ¢ § Time ¡to ¡deliver ¡a ¡line ¡in ¡the ¡cache ¡to ¡the ¡processor ¡ $ includes ¡+me ¡to ¡determine ¡whether ¡the ¡line ¡is ¡in ¡the ¡cache ¡ § § Typical ¡numbers: ¡ 1-‑2 ¡clock ¡cycle ¡for ¡L1 ¡ § 5-‑20 ¡clock ¡cycles ¡for ¡L2 ¡ § Miss ¡Penalty ¡ ¢ Memory § Addi+onal ¡+me ¡required ¡because ¡of ¡a ¡miss ¡ typically ¡50-‑200 ¡cycles ¡for ¡main ¡memory ¡( trend: ¡increasing! ) ¡ § 9 ¡

University ¡of ¡Washington ¡ Lets ¡think ¡about ¡those ¡numbers ¡ ¢ Huge ¡difference ¡between ¡a ¡hit ¡and ¡a ¡miss ¡ § Could ¡be ¡100x, ¡if ¡just ¡L1 ¡and ¡main ¡memory ¡ ¢ Would ¡you ¡believe ¡99% ¡hits ¡is ¡twice ¡as ¡good ¡as ¡97%? ¡ § Consider: ¡ ¡ cache ¡hit ¡+me ¡of ¡1 ¡cycle ¡ miss ¡penalty ¡of ¡100 ¡cycles ¡ 10 ¡

University ¡of ¡Washington ¡ Lets ¡think ¡about ¡those ¡numbers ¡ ¢ Huge ¡difference ¡between ¡a ¡hit ¡and ¡a ¡miss ¡ § Could ¡be ¡100x, ¡if ¡just ¡L1 ¡and ¡main ¡memory ¡ ¢ Would ¡you ¡believe ¡99% ¡hits ¡is ¡twice ¡as ¡good ¡as ¡97%? ¡ § Consider: ¡ ¡ cache ¡hit ¡+me ¡of ¡1 ¡cycle ¡ miss ¡penalty ¡of ¡100 ¡cycles ¡ § Average ¡access ¡+me: ¡ ¡ ¡97% ¡hits: ¡ ¡1 ¡cycle ¡+ ¡0.03 ¡* ¡100 ¡cycles ¡= ¡ 4 ¡cycles ¡ ¡ ¡99% ¡hits: ¡ ¡1 ¡cycle ¡+ ¡0.01 ¡* ¡100 ¡cycles ¡= ¡ 2 ¡cycles ¡ ¡ ¢ This ¡is ¡why ¡“miss ¡rate” ¡is ¡used ¡instead ¡of ¡“hit ¡rate” ¡ 11 ¡

University ¡of ¡Washington ¡ Why ¡do ¡caches ¡work? ¡ 12 ¡

University ¡of ¡Washington ¡ Why ¡Caches ¡Work ¡ ¢ Locality: ¡Programs ¡tend ¡to ¡use ¡data ¡and ¡instruc>ons ¡with ¡ addresses ¡near ¡or ¡equal ¡to ¡those ¡they ¡have ¡used ¡recently ¡ ¡ 13 ¡

University ¡of ¡Washington ¡ Why ¡Caches ¡Work ¡ ¢ Locality: ¡Programs ¡tend ¡to ¡use ¡data ¡and ¡instruc>ons ¡with ¡ addresses ¡near ¡or ¡equal ¡to ¡those ¡they ¡have ¡used ¡recently ¡ ¢ Temporal ¡locality: ¡ ¡ ¡ § Recently ¡referenced ¡items ¡are ¡ likely ¡ ¡ block ¡ to ¡be ¡referenced ¡again ¡in ¡the ¡near ¡future ¡ § Why ¡is ¡this ¡important? ¡ ¡ 14 ¡

University ¡of ¡Washington ¡ Why ¡Caches ¡Work ¡ ¢ Locality: ¡Programs ¡tend ¡to ¡use ¡data ¡and ¡instruc>ons ¡with ¡ addresses ¡near ¡or ¡equal ¡to ¡those ¡they ¡have ¡used ¡recently ¡ ¢ Temporal ¡locality: ¡ ¡ ¡ § Recently ¡referenced ¡items ¡are ¡ likely ¡ ¡ block ¡ to ¡be ¡referenced ¡again ¡in ¡the ¡near ¡future ¡ ¢ Spa>al ¡locality? ¡ ¡ ¡ ¡ 15 ¡

University ¡of ¡Washington ¡ Why ¡Caches ¡Work ¡ ¢ Locality: ¡Programs ¡tend ¡to ¡use ¡data ¡and ¡instruc>ons ¡with ¡ addresses ¡near ¡or ¡equal ¡to ¡those ¡they ¡have ¡used ¡recently ¡ ¢ Temporal ¡locality: ¡ ¡ ¡ § Recently ¡referenced ¡items ¡are ¡ likely ¡ ¡ block ¡ to ¡be ¡referenced ¡again ¡in ¡the ¡near ¡future ¡ ¢ Spa>al ¡locality: ¡ ¡ ¡ § Items ¡with ¡nearby ¡addresses ¡ tend ¡ ¡ to ¡be ¡referenced ¡close ¡together ¡in ¡+me ¡ block ¡ § How ¡do ¡caches ¡take ¡advantage ¡of ¡this? ¡ ¡ 16 ¡

University ¡of ¡Washington ¡ Example: ¡Locality? ¡ sum = 0; for (i = 0; i < n; i++) sum += a[i]; return sum; 17 ¡

University ¡of ¡Washington ¡ Example: ¡Locality? ¡ sum = 0; for (i = 0; i < n; i++) sum += a[i]; return sum; ¢ Data: ¡ § Temporal: ¡ sum ¡referenced ¡in ¡each ¡itera+on ¡ § Spa+al: ¡array ¡ a[] ¡ accessed ¡in ¡stride-‑1 ¡pa9ern ¡ 18 ¡

University ¡of ¡Washington ¡ Example: ¡Locality? ¡ sum = 0; for (i = 0; i < n; i++) sum += a[i]; return sum; ¢ Data: ¡ § Temporal: ¡ sum ¡referenced ¡in ¡each ¡itera+on ¡ § Spa+al: ¡array ¡ a[] ¡ accessed ¡in ¡stride-‑1 ¡pa9ern ¡ ¢ Instruc>ons: ¡ § Temporal: ¡cycle ¡through ¡loop ¡repeatedly ¡ § Spa+al: ¡reference ¡instruc+ons ¡in ¡sequence ¡ 19 ¡

University ¡of ¡Washington ¡ Example: ¡Locality? ¡ sum = 0; for (i = 0; i < n; i++) sum += a[i]; return sum; ¢ Data: ¡ § Temporal: ¡ sum ¡referenced ¡in ¡each ¡itera+on ¡ § Spa+al: ¡array ¡ a[] ¡ accessed ¡in ¡stride-‑1 ¡pa9ern ¡ ¢ Instruc>ons: ¡ § Temporal: ¡cycle ¡through ¡loop ¡repeatedly ¡ § Spa+al: ¡reference ¡instruc+ons ¡in ¡sequence ¡ ¢ Being ¡able ¡to ¡assess ¡the ¡locality ¡of ¡code ¡is ¡a ¡crucial ¡skill ¡ for ¡a ¡programmer ¡ ¡ 20 ¡

More $ (caches, yes) Trick or treat! Midterm - PowerPoint PPT Presentation

University of Washington More $ (caches, yes) Trick or treat! Midterm ques>ons? Note: prac+ce midterms posted HW 2 due today Lab

Exam Review 2 1 ROB: head/tail yes R1 B yes none no X5 R3 A none no no --- --- F

YES & YES! YES & YES! David Grimwade Dept. of Medical & Molecular Genetics,

Multicore Workshop Caches Mark Bull David Henty EPCC, University of Edinburgh Overview

Trace Caches and optimizations therein CSE 240C - Rushi Chakrabarti - Winter 2009 Trace Caches

Yes We Can Yes We Can Yes We Can Yes We Can From biomedical informatics to translational

SURVEY AREA WWW-YES-2009-France Water Survey Results 3 June 2009 WWW-YES-2009-France water

Marshalltown Dual Language Program Evaluation Group 1: DLP: Yes and Ever an ELL: Yes

Interference in Judgment Aggregation Dorothea Baumeister, Gbor Erdlyi, Olivia Erdlyi, and

Review: Why We Use Caches Caches Review Mechanism for transparent movement of Proc 1000

Say Goodbye to Off-heap Caches! On-heap Caches Using Memory-Mapped I/O Iacovos G. Kolokasis 1 ,

CSE 351: Week 7 Tom Bergan, TA 1 Today Cache geometries Lab 4 2 Caches they make

CS 136: Advanced Architecture Review of Caches 1 / 30 Introduction Why Caches? Basic goal:

CPUs Chapter 3.5 Caches. Memory management. Caches and CPUs address data cache

ECE232: Hardware Organization and Design Lecture 22: Introduction to Caches Adapted from Computer

What You Must Know about Memory, Caches, and Shared Memory Kenjiro Taura 1 / 67 Contents 1

Caches Electronic Computers M Caches 1 Cache LOCALITY PRINCIPLE (SPATIAL AND TEMPORAL)

SCIENTIFIC WRITING IN LINGUISTICS: WRITING ABSTRACTS Prof. Dr. Shanley Allen University of

More Mechanisms for Generating Optimization Power-Law Distributions Minimal Cost Mandelbrot vs.

Family Reunion Incognito Pit Stops In Life Joseph will God does not pay at experience

Does data security rule out high performance? Adam Huffman 2018-02-04 FOSDEM HPC & Big Data

OSPF Traffic Engineering (TE) Express Path draft-giacalone-ospf-te-express-path-00.txt

DMTCP Transparent Checkpointing for Cluster Computations and the Desktop Jason Ansel 1 Kapil Arya

Repeating Boom and Bust Cycles Characterize Oil Source: Medlock, K.B., Amy Jaffe, The price of

Disclosures Clinical trials research funding support from: Bristol-Meyer Squibb Recent

Sambuz

Useful Links

Newsletter

Mail Us

More $ (caches, yes) Trick or treat! Midterm - PowerPoint PPT Presentation

University of Washington More $ (caches, yes) Trick or treat! Midterm ques>ons? Note: prac+ce midterms posted HW 2 due today Lab

Exam Review 2 1 ROB: head/tail yes R1 B yes none no X5 R3 A none no no --- --- F

YES &amp; YES! YES &amp; YES! David Grimwade Dept. of Medical &amp; Molecular Genetics,

Multicore Workshop Caches Mark Bull David Henty EPCC, University of Edinburgh Overview

Trace Caches and optimizations therein CSE 240C - Rushi Chakrabarti - Winter 2009 Trace Caches

Yes We Can Yes We Can Yes We Can Yes We Can From biomedical informatics to translational

SURVEY AREA WWW-YES-2009-France Water Survey Results 3 June 2009 WWW-YES-2009-France water

Marshalltown Dual Language Program Evaluation Group 1: DLP: Yes and Ever an ELL: Yes

Interference in Judgment Aggregation Dorothea Baumeister, Gbor Erdlyi, Olivia Erdlyi, and

Review: Why We Use Caches Caches Review Mechanism for transparent movement of Proc 1000

Say Goodbye to Off-heap Caches! On-heap Caches Using Memory-Mapped I/O Iacovos G. Kolokasis 1 ,

CSE 351: Week 7 Tom Bergan, TA 1 Today Cache geometries Lab 4 2 Caches they make

CS 136: Advanced Architecture Review of Caches 1 / 30 Introduction Why Caches? Basic goal:

CPUs Chapter 3.5 Caches. Memory management. Caches and CPUs address data cache

ECE232: Hardware Organization and Design Lecture 22: Introduction to Caches Adapted from Computer

What You Must Know about Memory, Caches, and Shared Memory Kenjiro Taura 1 / 67 Contents 1

Caches Electronic Computers M Caches 1 Cache LOCALITY PRINCIPLE (SPATIAL AND TEMPORAL)

SCIENTIFIC WRITING IN LINGUISTICS: WRITING ABSTRACTS Prof. Dr. Shanley Allen University of

More Mechanisms for Generating Optimization Power-Law Distributions Minimal Cost Mandelbrot vs.

Family Reunion Incognito Pit Stops In Life Joseph will God does not pay at experience

Does data security rule out high performance? Adam Huffman 2018-02-04 FOSDEM HPC &amp; Big Data

OSPF Traffic Engineering (TE) Express Path draft-giacalone-ospf-te-express-path-00.txt

DMTCP Transparent Checkpointing for Cluster Computations and the Desktop Jason Ansel 1 Kapil Arya

Repeating Boom and Bust Cycles Characterize Oil Source: Medlock, K.B., Amy Jaffe, The price of

Disclosures Clinical trials research funding support from: Bristol-Meyer Squibb Recent

Sambuz

Useful Links

Newsletter

Mail Us

YES & YES! YES & YES! David Grimwade Dept. of Medical & Molecular Genetics,

Does data security rule out high performance? Adam Huffman 2018-02-04 FOSDEM HPC & Big Data