The Hardware/So<ware Interface CSE351 Winter 2013 Memory - PowerPoint PPT Presentation

University ¡of ¡Washington ¡ The ¡Hardware/So<ware ¡Interface ¡ CSE351 ¡Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Roadmap ¡ Data ¡& ¡addressing ¡ Integers ¡& ¡floats ¡ Machine ¡code ¡& ¡C ¡ C: ¡ Java: ¡ x86 ¡assembly ¡ Car c = new Car(); car *c = malloc(sizeof(car)); programming ¡ c.setMiles(100); c->miles = 100; Procedures ¡& ¡ c.setGals(17); c->gals = 17; stacks ¡ float mpg = float mpg = get_mpg(c); Arrays ¡& ¡structs ¡ c.getMPG(); free(c); Memory ¡& ¡caches ¡ Processes ¡ Assembly ¡ get_mpg: Virtual ¡memory ¡ pushq %rbp language: ¡ movq %rsp, %rbp Memory ¡allocaPon ¡ ... Java ¡vs. ¡C ¡ popq %rbp ret OS: ¡ Machine ¡ 0111010000011000 100011010000010000000010 code: ¡ 1000100111000010 110000011111101000011111 Computer ¡ system: ¡ 2 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Themes ¡of ¡CSE ¡351 ¡  Interfaces ¡and ¡abstracPons ¡  So ¡far: ¡data ¡type ¡abstrac:ons ¡in ¡C; ¡x86 ¡instruc:on ¡set ¡architecture ¡ (interface ¡to ¡hardware) ¡  Today: ¡abstrac:ons ¡of ¡ memory ¡  Soon: ¡process ¡and ¡virtual ¡memory ¡abstrac:ons ¡  RepresentaPon ¡  Integers, ¡floats, ¡addresses, ¡arrays, ¡structs ¡  TranslaPon ¡  Understand ¡the ¡assembly ¡code ¡that ¡will ¡be ¡generated ¡from ¡C ¡code ¡  Control ¡flow ¡  Procedures ¡and ¡stacks; ¡buffer ¡overflows ¡ 3 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Making ¡memory ¡accesses ¡fast! ¡  Cache ¡basics ¡  Principle ¡of ¡locality ¡  Memory ¡hierarchies ¡  Cache ¡organizaPon ¡  Program ¡opPmizaPons ¡that ¡consider ¡caches ¡ 4 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ How ¡does ¡execuPon ¡Pme ¡grow ¡with ¡SIZE? ¡ int array[SIZE]; int A = 0; for (int i = 0 ; i < 200000 ; ++ i) { for (int j = 0 ; j < SIZE ; ++ j) { A += array[j]; } } TIME Plot SIZE 5 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Actual ¡Data ¡ Time ¡ SIZE ¡ 6 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Problem: ¡Processor-‑Memory ¡Bo^leneck ¡ Processor ¡performance ¡ doubled ¡about ¡ ¡ Bus ¡bandwidth ¡ every ¡18 ¡months ¡ evolved ¡much ¡slower ¡ Main ¡ CPU ¡ Reg ¡ Memory ¡ Core ¡2 ¡Duo: ¡ Core ¡2 ¡Duo: ¡ Can ¡process ¡at ¡least ¡ Bandwidth ¡ 256 ¡Bytes/cycle ¡ 2 ¡Bytes/cycle ¡ Latency ¡ 100 ¡cycles ¡ Problem: ¡lots ¡of ¡wai4ng ¡on ¡memory ¡ 7 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Problem: ¡Processor-‑Memory ¡Bo^leneck ¡ Processor ¡performance ¡ doubled ¡about ¡ ¡ Bus ¡bandwidth ¡ every ¡18 ¡months ¡ evolved ¡much ¡slower ¡ Main ¡ CPU ¡ Reg ¡ Cache ¡ Memory ¡ Core ¡2 ¡Duo: ¡ Core ¡2 ¡Duo: ¡ Can ¡process ¡at ¡least ¡ Bandwidth ¡ 256 ¡Bytes/cycle ¡ 2 ¡Bytes/cycle ¡ Latency ¡ 100 ¡cycles ¡ Solu4on: ¡caches ¡ 8 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Cache ¡  English ¡definiPon: ¡a ¡hidden ¡storage ¡space ¡for ¡provisions, ¡ weapons, ¡and/or ¡treasures ¡ ¡  CSE ¡definiPon: ¡computer ¡memory ¡with ¡short ¡access ¡Pme ¡used ¡ for ¡the ¡storage ¡of ¡frequently ¡or ¡recently ¡used ¡instrucPons ¡or ¡ data ¡(i-‑cache ¡and ¡d-‑cache) ¡ ¡ more ¡generally, ¡ ¡ used ¡to ¡opPmize ¡data ¡transfers ¡between ¡system ¡elements ¡ with ¡different ¡characterisPcs ¡(network ¡interface ¡cache, ¡I/O ¡ cache, ¡etc.) ¡ 9 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Mechanics ¡ Smaller, ¡faster, ¡more ¡expensive ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ memory ¡caches ¡a ¡subset ¡of ¡ the ¡blocks ¡ Data ¡is ¡copied ¡in ¡block-‑sized ¡ transfer ¡units ¡ Larger, ¡slower, ¡cheaper ¡memory ¡ Memory ¡ viewed ¡as ¡parPPoned ¡into ¡“blocks” ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 10 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Hit ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡14 ¡ Block ¡b ¡is ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ 14 ¡ Hit! ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ 12 ¡ 13 ¡ 14 ¡ 15 ¡ 11 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ General ¡Cache ¡Concepts: ¡Miss ¡ Data ¡in ¡block ¡b ¡is ¡needed ¡ Request: ¡12 ¡ Block ¡b ¡is ¡not ¡in ¡cache: ¡ Cache ¡ 8 ¡ 9 ¡ 14 ¡ 3 ¡ 12 ¡ Miss! ¡ Block ¡b ¡is ¡fetched ¡from ¡ Request: ¡12 ¡ 12 ¡ memory ¡ Block ¡b ¡is ¡stored ¡in ¡cache ¡ Memory ¡ 0 ¡ 1 ¡ 2 ¡ 3 ¡ • Placement ¡policy: ¡ determines ¡where ¡b ¡goes ¡ 4 ¡ 5 ¡ 6 ¡ 7 ¡ • Replacement ¡policy: ¡ 8 ¡ 9 ¡ 10 ¡ 11 ¡ determines ¡which ¡block ¡ 12 12 ¡ ¡ 13 ¡ 14 ¡ 15 ¡ gets ¡evicted ¡(vic:m) ¡ 12 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Cost ¡of ¡Cache ¡Misses ¡  Huge ¡difference ¡between ¡a ¡hit ¡and ¡a ¡miss ¡  Could ¡be ¡100x, ¡if ¡just ¡L1 ¡and ¡main ¡memory ¡  Would ¡you ¡believe ¡99% ¡hits ¡is ¡twice ¡as ¡good ¡as ¡97%? ¡  Consider: ¡ ¡ ¡Cache ¡hit ¡:me ¡of ¡1 ¡cycle ¡ ¡Miss ¡penalty ¡of ¡100 ¡cycles ¡  Average ¡access ¡:me: ¡  97% ¡hits: ¡ ¡1 ¡cycle ¡+ ¡0.03 ¡* ¡100 ¡cycles ¡= ¡4 ¡cycles ¡  99% ¡hits: ¡ ¡1 ¡cycle ¡+ ¡0.01 ¡* ¡100 ¡cycles ¡= ¡2 ¡cycles ¡  This ¡is ¡why ¡“miss ¡rate” ¡is ¡used ¡instead ¡of ¡“hit ¡rate” ¡ 13 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Why ¡Caches ¡Work ¡  Locality: ¡Programs ¡tend ¡to ¡use ¡data ¡and ¡instrucPons ¡with ¡ addresses ¡near ¡or ¡equal ¡to ¡those ¡they ¡have ¡used ¡recently ¡  Temporal ¡locality: ¡ ¡ ¡  Recently ¡referenced ¡items ¡are ¡likely ¡ ¡ block ¡ to ¡be ¡referenced ¡again ¡in ¡the ¡near ¡future ¡  Why ¡is ¡this ¡important? ¡ ¡  SpaPal ¡locality: ¡ ¡ ¡  Items ¡with ¡nearby ¡addresses ¡tend ¡ ¡ block ¡ to ¡be ¡referenced ¡close ¡together ¡in ¡:me ¡  How ¡do ¡caches ¡take ¡advantage ¡of ¡this? ¡ ¡ 14 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Example: ¡Locality? ¡ sum = 0; for (i = 0; i < n; i++) sum += a[i]; return sum;  Data: ¡  Temporal: ¡ sum ¡referenced ¡in ¡each ¡itera:on ¡  Spa:al: ¡array ¡ a[] ¡ accessed ¡in ¡stride-‑1 ¡pabern ¡  InstrucPons: ¡  Temporal: ¡cycle ¡through ¡loop ¡repeatedly ¡  Spa:al: ¡reference ¡instruc:ons ¡in ¡sequence ¡  Being ¡able ¡to ¡assess ¡the ¡locality ¡of ¡code ¡is ¡a ¡crucial ¡skill ¡ for ¡a ¡programmer ¡ ¡ 15 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

University ¡of ¡Washington ¡ Locality ¡Example ¡#1 ¡ int sum_array_rows(int a[M][N]) { int i, j, sum = 0; a[0][0] ¡ a[0][1] ¡ a[0][2] ¡ a[0][3] ¡ a[1][0] ¡ a[1][1] ¡ a[1][2] ¡ a[1][3] ¡ for (i = 0; i < M; i++) a[2][0] ¡ a[2][1] ¡ a[2][2] ¡ a[2][3] ¡ for (j = 0; j < N; j++) sum += a[i][j]; ¡ ¡1: ¡a[0][0] ¡ ¡ ¡2: ¡a[0][1] ¡ return sum; ¡ ¡3: ¡a[0][2] ¡ } ¡ ¡4: ¡a[0][3] ¡ ¡ ¡5: ¡a[1][0] ¡ ¡ ¡6: ¡a[1][1] ¡ ¡ ¡7: ¡a[1][2] ¡ ¡ ¡8: ¡a[1][3] ¡ ¡ ¡9: ¡a[2][0] ¡ 10: ¡a[2][1] ¡ 11: ¡a[2][2] ¡ 12: ¡a[2][3] ¡ stride-‑1 ¡ 16 ¡ Winter ¡2013 ¡ Memory ¡and ¡Caches ¡I ¡

The Hardware/So<ware Interface CSE351 Winter 2013 Memory - PowerPoint PPT Presentation

University of Washington The Hardware/So<ware Interface CSE351 Winter 2013 Memory and Caches I University of Washington Roadmap Data & addressing

So#ware(Project Lecture'4 Wouter'Swierstra So#ware(project((Lecture(4 1 Last%&me

CS 5150 So(ware Engineering 2. Steps in the so(ware development process William Y. Arms So(ware

So#ware(Project Lecture'3 Wouter'Swierstra So#ware(project((Lecture(3 1 Last%&me

Hardware Observability Framework Hardware Observability Framework Hardware Observability

I/O Bus and Interface Data Bus Addr Bus CPU Control Interface Interface Interface Interface

So%ware Architecture Beyond the Blueprints Aligning So%ware Architecture with the facets of

Toward so)ware engineering in prac0ce Claire Le Goues 15-214 April 27, 2017 1 Learning Goals

CS 5150 So(ware Engineering 18. Reuse and Design Pa9erns William Y. Arms So(ware Reuse It is

CS 5150 So(ware Engineering 12. System Architecture William Y. Arms Design Design in So:ware

Readings Covered Further Readings Ware: Evaluation Appendix Ware, Appendix C: The Perceptual

CS 5150 So(ware Engineering 3. So(ware Development Processes William Y. Arms Sequence of

Readings Covered Further Readings Ware Interaction: Data Manipulation Ware, Chap 10: Interacting

The Hardware/So?ware Interface CSE351 Winter 2013 Excep5onal

The Hardware/So=ware Interface CSE351 Winter 2013 x86 Programming

The Hardware/So=ware Interface CSE351 Winter 2013 Virtual Memory

The Hardware/So<ware Interface CSE351 Winter 2013 Procedures and

Offline First @caolan Unlike the always-wired machines of the past, computers are now truly

Stealing Web Browser Cookies ben-holland.com Whats a cookie? Web 2.0 Cookies provide

jsprobes cross-platform browser instrumentation using JavaScript Brian Burg (@brrian /

TrustJS: Trusted Client-side Execution of JavaScript David Goltzsche 1 , Colin Wulf 1 , Divya

Roadmap Memory & Data Integers/FP Machine code

The power of the network Andrew Betts Principal developer advocate | Fastly Who is this guy?

Presentation is loading The Latency Problem 100 ms Average : 9,3s Loading -1% Revenue The

I Know Where Youve Been: ! Geo-Inference Attacks via the Browser Cache ! Yaoqi Jia Yaoqi Jia

Sambuz

Useful Links

Newsletter

Mail Us

The Hardware/So<ware Interface CSE351 Winter 2013 Memory - PowerPoint PPT Presentation

University of Washington The Hardware/So<ware Interface CSE351 Winter 2013 Memory and Caches I University of Washington Roadmap Data & addressing

So#ware(Project Lecture'4 Wouter'Swierstra So#ware(project((Lecture(4 1 Last%&amp;me

CS 5150 So(ware Engineering 2. Steps in the so(ware development process William Y. Arms So(ware

So#ware(Project Lecture'3 Wouter'Swierstra So#ware(project((Lecture(3 1 Last%&amp;me

Hardware Observability Framework Hardware Observability Framework Hardware Observability

I/O Bus and Interface Data Bus Addr Bus CPU Control Interface Interface Interface Interface

So%ware Architecture Beyond the Blueprints Aligning So%ware Architecture with the facets of

Toward so)ware engineering in prac0ce Claire Le Goues 15-214 April 27, 2017 1 Learning Goals

CS 5150 So(ware Engineering 18. Reuse and Design Pa9erns William Y. Arms So(ware Reuse It is

CS 5150 So(ware Engineering 12. System Architecture William Y. Arms Design Design in So:ware

Readings Covered Further Readings Ware: Evaluation Appendix Ware, Appendix C: The Perceptual

CS 5150 So(ware Engineering 3. So(ware Development Processes William Y. Arms Sequence of

Readings Covered Further Readings Ware Interaction: Data Manipulation Ware, Chap 10: Interacting

The Hardware/So?ware Interface CSE351 Winter 2013 Excep5onal

The Hardware/So=ware Interface CSE351 Winter 2013 x86 Programming

The Hardware/So=ware Interface CSE351 Winter 2013 Virtual Memory

The Hardware/So&lt;ware Interface CSE351 Winter 2013 Procedures and

Offline First @caolan Unlike the always-wired machines of the past, computers are now truly

Stealing Web Browser Cookies ben-holland.com Whats a cookie? Web 2.0 Cookies provide

jsprobes cross-platform browser instrumentation using JavaScript Brian Burg (@brrian /

TrustJS: Trusted Client-side Execution of JavaScript David Goltzsche 1 , Colin Wulf 1 , Divya

Roadmap Memory &amp; Data Integers/FP Machine code

The power of the network Andrew Betts Principal developer advocate | Fastly Who is this guy?

Presentation is loading The Latency Problem 100 ms Average : 9,3s Loading -1% Revenue The

I Know Where Youve Been: ! Geo-Inference Attacks via the Browser Cache ! Yaoqi Jia Yaoqi Jia

Sambuz

Useful Links

Newsletter

Mail Us

So#ware(Project Lecture'4 Wouter'Swierstra So#ware(project((Lecture(4 1 Last%&me

So#ware(Project Lecture'3 Wouter'Swierstra So#ware(project((Lecture(3 1 Last%&me

The Hardware/So<ware Interface CSE351 Winter 2013 Procedures and

Roadmap Memory & Data Integers/FP Machine code