ENCM 501 W14 Slides for Lecture 6

Slides for Lecture 6
ENCM 501: Principles of Computer Architecture
Winter 2014 Term

Steve Norman, PhD, PEng
Electrical & Computer Engineering
Schulich School of Engineering
University of Calgary
28 January, 2014

Previous Lecture

◮ introduction to ISA design ideas
◮ memory-register and load-store architectures
◮ a very brief history of RISC versus CISC
◮ aspects of the ISA view of memory: flat address spaces, alignment rules

Today’s Lecture

◮ endianness
◮ addressing modes
◮ examples of tradeoffs in instruction set design

Related reading in Hennessy & Patterson: Sections A.3–A.7

Endianness

This is not really an aspect of computer design in which there are interesting cost or performance tradeoffs. Rather, it’s an annoying detail that will occasionally bite you if you aren’t aware of it.

Registers inside processor cores do not have endianness. An N-bit register just has bits N − 1 (MSB), N − 2, . . . , 2, 1, 0 (LSB).

Endianness is a property of the interface between the processor core and the memory, and comes from the fact that most ISAs allow memory reads and writes with various sizes, typically 1-byte, 2-byte, 4-byte, and 8-byte.

Endianness in 64-bit MIPS doublewords

The byte offset gives the address of an individual byte relative to the address of the entire doubleword.

Bit numbering: 63 is MSB, 0 is LSB

  Bits                   63..56  55..48  47..40  39..32  31..24  23..16  15..8  7..0
  LITTLE-endian offset     +7      +6      +5      +4      +3      +2     +1     +0
  BIG-endian offset        +0      +1      +2      +3      +4      +5     +6     +7

Endianness in 32-bit MIPS words

The byte offset gives the address of an individual byte relative to the address of the entire word.

Bit numbering: 31 is MSB, 0 is LSB

  Bits                   31..24  23..16  15..8  7..0
  LITTLE-endian offset     +3      +2     +1     +0
  BIG-endian offset        +0      +1     +2     +3
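The byte-offset tables for 32-bit words can be mirrored in C. The following is only a sketch, not part of the original slides: the function names (word_bytes_in_memory, host_is_little_endian, store_le32, store_be32, load_le32, load_be32) are hypothetical helpers invented for illustration, and the host check assumes the machine is either purely little-endian or purely big-endian. The store and load helpers place each byte at an explicit offset, so they give the same result on any host.

```c
#include <stdint.h>
#include <string.h>

/* Copy the bytes of a 32-bit word, exactly as the host keeps them in
   memory, into out[0..3]. out[k] is the byte at offset +k. */
void word_bytes_in_memory(uint32_t word, unsigned char out[4])
{
    memcpy(out, &word, sizeof word);
}

/* Returns 1 if the host stores bits 7..0 at offset +0 (little-endian),
   0 otherwise. Assumes the host is either little- or big-endian. */
int host_is_little_endian(void)
{
    unsigned char b[4];
    word_bytes_in_memory(0x12345678u, b);
    return b[0] == 0x78;
}

/* Little-endian layout: offset +0 holds bits 7..0, +3 holds bits 31..24. */
void store_le32(unsigned char out[4], uint32_t word)
{
    out[0] = (unsigned char)(word & 0xFFu);
    out[1] = (unsigned char)((word >> 8) & 0xFFu);
    out[2] = (unsigned char)((word >> 16) & 0xFFu);
    out[3] = (unsigned char)((word >> 24) & 0xFFu);
}

/* Big-endian layout: offset +0 holds bits 31..24, +3 holds bits 7..0. */
void store_be32(unsigned char out[4], uint32_t word)
{
    out[0] = (unsigned char)((word >> 24) & 0xFFu);
    out[1] = (unsigned char)((word >> 16) & 0xFFu);
    out[2] = (unsigned char)((word >> 8) & 0xFFu);
    out[3] = (unsigned char)(word & 0xFFu);
}

/* Rebuild a word from four bytes laid out in little-endian order. */
uint32_t load_le32(const unsigned char in[4])
{
    return (uint32_t)in[0] | ((uint32_t)in[1] << 8) |
           ((uint32_t)in[2] << 16) | ((uint32_t)in[3] << 24);
}

/* Rebuild a word from four bytes laid out in big-endian order. */
uint32_t load_be32(const unsigned char in[4])
{
    return ((uint32_t)in[0] << 24) | ((uint32_t)in[1] << 16) |
           ((uint32_t)in[2] << 8) | (uint32_t)in[3];
}
```

For the word 0x12345678, store_le32 fills offsets +0 to +3 with the hex bytes 78 56 34 12, and store_be32 fills them with 12 34 56 78. Writing bytes in one agreed order like this, rather than copying raw memory, is one common way to make files and network streams independent of the endianness of the machine that produced them.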

Example effect of endianness in MIPS32

Assume that R8 contains some valid address that is a multiple of four.

    # LI: pseudoinstruction for "load immediate"
    LI R9, 0x12345678
    SW R9, 0(R8)
    LB R10, 0(R8)
    LB R11, 1(R8)
    LB R12, 2(R8)
    LB R13, 3(R8)

What goes into R10, R11, R12, R13, if the processor chip is in little-endian mode?

What if the processor chip is in big-endian mode?

Why is endianness a practical concern?

Practical code rarely (if ever) writes data as a word and later reads it back as bytes, as was done in the example on the last slide. Here is a practical problem:

◮ Program P1 on Computer C1 copies an array of integers or FP numbers from memory into a file using a function like fwrite in the C library.
◮ On disk, the file is just a long sequence of bytes.
◮ Program P2 on Computer C2 opens the file and tries to read the array of numbers from the file into memory using a function like fread in the C library.
◮ But C2 does not have the same endianness as C1, so the data does not make sense to P2.

The same kind of problem can happen when streaming multi-byte numbers over a network.

Endianness and real systems

Today little-endianness is much more common than big-endianness. Here are some little-endian systems:

◮ anything running on x86 or x86-64;
◮ Apple iOS, Linux (including Android), and Windows running on ARM.

Some historically important big-endian machines were:

◮ Macs with 68000- or PowerPC-based processors;
◮ 68000- and SPARC-based computers from Sun Microsystems.

Many modern ISA families, for example, MIPS and ARM, allow the processor to switch back and forth between little- and big-endian modes.

Addressing modes

Unlike endianness, selection of addressing modes for an ISA is a set of design decisions that involve interesting tradeoffs.

Addressing mode is a slightly misleading term, because it refers to the way in which an operand is accessed by an instruction, and that might or might not involve generation of a memory address.

Addressing modes for data access are discussed as part of Section A.3 in the textbook.

Addressing modes for instruction access (needed, for example, by branches and jumps) are discussed in Section A.6.

Examples of addressing modes for data

Figure A.6 in the textbook gives examples covering most addressing modes available in ISAs of the present and the recent past.

A typical ISA will support some but not all of these addressing modes. (Historical note: I think the MC68000 series supported all of them and more, which is kind of awesome.)

This lecture won’t explain every addressing mode in detail, but instead will look at the ones that are most common and important. Let’s start with the two modes that don’t involve generation of a memory address . . .

Addressing modes: Register and Immediate

Register: Data is coming from or going to a register. All three operands are accessed in register mode in this MIPS64 instruction:

    DADDU R10, R8, R9

Immediate: Source data is a constant written into the instruction. Here is a MIPS64 example in which two operands are register-mode and one is immediate-mode:

    DADDIU R16, R16, 8

Encoding of immediate operands in example ISAs

x86-64: Instruction size is variable, so 1, 2, 4, or 8 bytes are used, as necessary, to describe the constant.

MIPS32 and MIPS64: Instructions are always 32 bits wide and the field size for immediate operands is always 16 bits wide. The range of constants is −32768 to +32767 for instructions that use signed constants and 0 to 65535 for those that use unsigned constants.

ARM: 12 bits within the fixed instruction size of 32 bits are used for an immediate operand, in a complicated and interesting way that could totally derail a lecture! (That’s one of a few very good reasons why it would not be easy to switch from MIPS to ARM in ENCM 369.)

The two simplest addressing modes for memory access

Hint for comprehension: Roughly speaking, indirect means “via a pointer”.

Register indirect: Use the bits in a register as a memory address. MIPS64 example:

    LD R8, (R9)      # R8 = doubleword at address in R9

Displacement: Add a constant to the bits in a register to generate a memory address. MIPS64 example:

    # R10 = doubleword at address R11 + 64 bytes
    LD R10, 64(R11)

Why is register indirect mode really just a special case of displacement mode?

Scaled mode: Good for array element access

Here is some x86-64 assembly language code you will look at in Assignment 2 . . .

    .L16:
        mov (%rbx,%rax,4), %edx
        addq $1, %rax
        addq %rdx, %rbp
        cmpq $500000000, %rax
        jne .L16

The mov instruction uses scaled mode: The address used to read memory is

    %rbx + 4 × %rax

%rbx is the address of element 0 of an array of 4-byte elements, and %rax is an index into that array.

Autoincrement and autodecrement modes (1)

Other names for these modes are post-increment and pre-decrement.

In either of these modes a load causes two register updates: one to a destination register, and another to a pointer register. A store also causes two updates: one update to a memory location and another to a pointer register.

Both are useful for walking through arrays using pointer arithmetic.

A store using pre-decrement mode is an efficient way to push a register value onto a stack.

And a load using post-increment mode is an efficient way to pop a register value from a stack.

Autoincrement and autodecrement modes (2)

These modes closely match some famously tricky C and C++ expressions.

Let’s write a couple of C statements that could each be implemented using a single instruction if autoincrement and autodecrement modes are available.

Memory indirect mode

Example, using syntax from textbook Figure A.6:

    MOV R0, @(R1)

The address in R1 is used to read a second address from memory. That second address is used to read from memory into R0. In a typical load/store architecture this would be done with two instructions: a load followed by another load.

Another example, using the same syntax:

    MOV @(R2), R3

The address in R2 is used to read a second address from memory. That second address is used to write the data from R3 to memory. In a typical load/store architecture this would be done with two instructions: a load followed by a store.

This mode is somewhat obsolete these days, but thinking about it helps to understand pointer-to-pointer types in C and C++.
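The scaled, autoincrement/autodecrement, and memory indirect modes described above each have a close counterpart in C. The following is a hedged sketch rather than code from the course: the Stack type and the functions sum_u32, stack_push, stack_pop, load_indirect, and store_indirect are hypothetical names chosen for illustration, and whether a compiler really emits a single instruction for any of these statements depends on the target ISA and the compiler.

```c
#include <stddef.h>
#include <stdint.h>

/* Scaled mode: element i of an array of 4-byte elements lives at
   base + 4 * i, which is the address an indexed access computes. */
uint64_t sum_u32(const uint32_t *base, size_t count)
{
    uint64_t sum = 0;
    for (size_t i = 0; i < count; i++)
        sum += base[i];          /* read from address base + 4 * i */
    return sum;
}

/* Hypothetical stack that grows toward lower addresses.
   sp points at the most recently pushed element. */
typedef struct {
    uint32_t *sp;
} Stack;

/* Push: store using pre-decrement. The pointer moves down one element
   first, then the value is written through it. */
void stack_push(Stack *s, uint32_t value)
{
    *--s->sp = value;
}

/* Pop: load using post-increment. The value is read through the
   pointer, then the pointer moves up one element. */
uint32_t stack_pop(Stack *s)
{
    return *s->sp++;
}

/* Memory indirect load, like MOV R0, @(R1): pp holds an address at
   which a second address is stored; the data is read via that. */
uint32_t load_indirect(uint32_t **pp)
{
    return **pp;
}

/* Memory indirect store, like MOV @(R2), R3: the second address is
   read from memory, then the data is written through it. */
void store_indirect(uint32_t **pp, uint32_t value)
{
    **pp = value;
}
```

The statements `*--s->sp = value;` and `return *s->sp++;` are the kind of C expressions the autoincrement slides refer to: each combines one memory access with one pointer update. No bounds checking is done here, so a caller has to make sure the stack pointer stays inside its array.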
