1 Register-memory Register-register (Load-store) There is no - PDF document

What Is ISA? Instruction set architecture is the structure of a computer that a machine language Lecture 3: Instruction Set programmer (or a compiler) must understand Architecture to write a correct (timing independent) program for that machine. For IBM System/360, 1964 ISA types, register usage, memory addressing, endian and alignment, quantitative evaluation Class ISA types: Stack, Accumulator, and General-purpose register ISA is mature and stable � Why do we study it? 1 2 Stack Accumulator The accumulator provides an Implicit operands on stack implicit input, and is the implicit place to store the Ex. C = A + B result. Push A Ex. C = A + B Push B Load R1, A Add Add R3, R1, B Pop C Store R3, c Good code density; used in Used before 1980 60’s-70’s; now in Java VM 3 4 General-purpose Registers Variants of GRP Architecture General-purpose registers are preferred by Number of operands in ALU instructions: two or compilers three � Reduce memory traffic Add R1, R2, R3 Add R1, R2 � Improve program speed Maximal number of operands in ALU instructions: � Improve code density zero, one, two, or three Usage of general-purpose registers Load R1, A Load R1, A � Holding temporal variables in expression evaluation Load R2, B Add R3, R1, B � Passing parameters Add R3, R1, R2 � Holding variables Three popular combinations GPR and RISC and CISC � RISC ISA is extensively used for desktop, server, and � register-register (load-store): 0 memory, 3 operands embedded: MIPS, PowerPC, UltraSPARC, ARM, MIPS16, � register-memory: 1 memory, 2 operands Thumb � memory-memory: 2 memories, 2 operands; or 3 memories, 3 � CISC: IBM 360/370, an VAX, Intel 80x86 operands 5 6 1

Register-memory Register-register (Load-store) There is no implicit operand Both operands are registers One input operand is Values in memory must be register, and one in memory loaded into a register and stored back Ex. C = A + B Ex. C = A + B Load R1, A Add R3, R1, B Load R1, A Store R3, C Load R2, B Processors include VAX, Add R3, R1, R2 80x86 Store R3, C Processors: MIPS, SPARC 7 8 How Many Registers? ISA and Performance If the number of registers increase: CPU time = #inst × CPI × cycle time RISC with Register-Register instructions � Allocate more variables in registers (fast accesses) � Simple, fix-length instruction encoding � Reducing code spill � Simple code generation � Regularity in CPI � Reducing memory traffic � Higher instruction counts � Lower instruction density � Longer register specifiers (difficult encoding) CISC with Register-memory instructions � Increasing register access time (physical registers) � No extra load in accessing data in memory � Easy encoding � More registers to save in context switch � Operands being not equivalent � Restricted #registers due to encoding memory address � Irregularity in CPI MIPS64: 32 general-purpose registers 9 10 Memory Addressing Little or Big: Where to Start? Instructions see registers, constant values, and memory Byte ordering: Where is the first Addressing mode decides how to specify an object to access Number 0x5678 byte? Object can be memory location, register, or a constant � Memory addressing is complicated � Little-endian Big-endian Big-endian : IBM, Memory addressing involves many factors Memory addressing mode SPARC, Mororola � Object size � byte ordering Little-endian: Intel, � alignment � DEC For a memory location, its effective address is calculated in a Supporting both: 00000003 5 8 certain form of register content, immediate address, and PC, as specified by the addressing mode MIPS, PowerPC 6 7 00000002 7 6 00000001 00000000 8 5 11 12 2

Alignment MIPS Data Addressing Modes Align n-byte objects on n-byte Register boundaries (n = 1, 2, 4, 8) ADD $16, $7, $8 Immediate One align position, n-1 misaligned positions ADDI $17, $7, 100 Misaligned access is undiserable Displacement � Expensive logic, slow references LW $18, 100($9) Aligning in registers may be necessary for bytes and half words Only the three are supported for data addressing 13 14 Storage Used by Compilers Memory Addressing Seen in CISC Register storage Direct (absolute) ADD R1, (1001) � Holding temporal variables in expression Register indirect SUB R2, (R1) evaluation Indexed ADD R1, (R2 + R3) � Passing parameters Scaled SUB R2, 100(R2)[R3] � Holding variables Autoincrement ADD R1, (R2)+ Memory storages consists of Autodecrement SUB R2, -(R1) � Stack: to hold local variables Memory indirect ADD R1, @(R3) � Global data area: to hold statically declared objects (see textbook p98) � Heap: to hold dynamic objects And more … 15 16 Choosing of Memory Addressing Modes How Often Are Those Address Modes? Choosing complex addressing modes � Close to addressing in high-level language � May reduce instruction counts (thus fast) � Increase implementation complexity (may increase cycle time) � Increase CPI RISC ISA comes with simple memory addressing, and CISC ISA with complex ones Usage of address modes, VAX machine, SPEC89 17 18 3

Immediate Size in RISC Usage of Immediate Operands In RISC Alpha, SPEC CINT2000 & CFP2000 Alpha, SPEC CINT2000 & CFP2000 19 20 Displacement Size in RISC Operands size, type and format In MIPS Opcode encodes operand size � Ex. ADD for signed integer, ADDU for unsigned integer, ADD.D for double-precision FP Most common types include � Integer: complement binary numbers � Character: ASCII � Floating point: IEEE standard 754, single-precision or double-precision Decimal format � 4-bits for one decimal digit (0-9), one byte for two decimal digits � Necessary for business applications Fixed Point format in DSP processors: � Representing fractions in (-1, +1) � 11000101 fixed point = -0.1000101 2 Displacement bit size: Alpha ISA, SPEC CPU2000 Integer and FP 21 22 Dynamic Instruction Mix (MIPS) Compiler Effects SPEC2K Int SPEC2K FP Load 26% 15% Store 10% 2% Add 19% 23% Compare 5% 2% Cond br 12% 4% Cond mv 2% 0% Jump 1% 0% LOGIC 18% 4% Architectures change for the needs of compilers FP load 15% • How do compilers use registers? How many? FP store 7% • How do compilers use addressing modes? FP others 19% • Anything that compilers do not like? 23 24 4

1 Register-memory Register-register (Load-store) There is no - PDF document

What Is ISA? Instruction set architecture is the structure of a computer that a machine language Lecture 3: Instruction Set programmer (or a compiler) must understand Architecture to write a correct (timing independent) program for that

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Roadmap Integers & floats Machine code & C C: Java: x86 assembly Car c = new Car();

Memory management Part I Michel Schinz based on Erik Stenmans slides 20070330

Implicit Memory Alloca6on: Garbage Collec6on Garbage collec+on:

T ag- I solated M emory B ringing Fine-grained E nclaves to R ISC- V Samuel Weiser Mario Werner

Checkmate Checkmate: Breaking the Memory Wall with Optimal Tensor Rematerialization Paras Jain

CS356 Unit 10 Memory Allocation & Heap Management 10.2 BASIC OS CONCEPTS & TERMINOLOGY

Improving Virtual Prototyping and Certification with Implicit Finite Element Method at Scale Seid

Mobile Device and Platform Security Part II John Mitchell Two lectures on mobile security

Dynamic Memory Alloca/on: Advanced Concepts 15-213:

How to Get Good Performance by Using OpenMP ! ! Loop optimizations ! ! Measuring OpenMP

CPSC 213 Introduction to Computer Systems Unit 3 Course Review 1 Learning Goals 1 Memory

Building WebGPU with Rust Fosdem, 2th Feb 2020 Dzmitry Malyshau @kvark (Mozilla / Graphics

An implicit graph Suppose we represent each node in graph

Technologies for Dementia Andrea Wilkinson & Mark Chignell TRO 2017 Annual Conference

ECE 3574: Applied Software Design Threads Today we are going to start looking at threads,

The Multicore Challenge performance? sustainability? affordability? SVP SVP SVP SVP SVP

Dynamic Memory Allocation Today Dynamic memory allocation mechanisms & policies

4.1 Polygon Meshes and Implicit Surfaces Hao Li http://cs420.hao-li.com 1 Geometric

MEMORY SYNCHRONIZATION Mahdi Nazm Bojnordi Assistant Professor School of Computing University

Chapter 9 : Linguistics in Prague and Vienna between the wars John A. Goldsmith May 5 , 2016 1

Whats up with the fruit? Visual literacy is the ability

for Adults Beliefs & Prayer SIGN IN AT THE WELCOME TABLE AND ENJOY THE FOOD RCIA Journey of

3. Other Important Terms The terms already defined relate to the study of literature and

1 Register-memory Register-register (Load-store) There is no - PDF document

What Is ISA? Instruction set architecture is the structure of a computer that a machine language Lecture 3: Instruction Set programmer (or a compiler) must understand Architecture to write a correct (timing independent) program for that

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Roadmap Integers &amp; floats Machine code &amp; C C: Java: x86 assembly Car c = new Car();

Memory management Part I Michel Schinz based on Erik Stenmans slides 20070330

Implicit Memory Alloca6on: Garbage Collec6on Garbage collec+on:

T ag- I solated M emory B ringing Fine-grained E nclaves to R ISC- V Samuel Weiser Mario Werner

Checkmate Checkmate: Breaking the Memory Wall with Optimal Tensor Rematerialization Paras Jain

CS356 Unit 10 Memory Allocation &amp; Heap Management 10.2 BASIC OS CONCEPTS &amp; TERMINOLOGY

Improving Virtual Prototyping and Certification with Implicit Finite Element Method at Scale Seid

Mobile Device and Platform Security Part II John Mitchell Two lectures on mobile security

Dynamic Memory Alloca/on: Advanced Concepts 15-213:

How to Get Good Performance by Using OpenMP ! ! Loop optimizations ! ! Measuring OpenMP

CPSC 213 Introduction to Computer Systems Unit 3 Course Review 1 Learning Goals 1 Memory

Building WebGPU with Rust Fosdem, 2th Feb 2020 Dzmitry Malyshau @kvark (Mozilla / Graphics

An implicit graph Suppose we represent each node in graph

Technologies for Dementia Andrea Wilkinson &amp; Mark Chignell TRO 2017 Annual Conference

ECE 3574: Applied Software Design Threads Today we are going to start looking at threads,

The Multicore Challenge performance? sustainability? affordability? SVP SVP SVP SVP SVP

Dynamic Memory Allocation Today Dynamic memory allocation mechanisms &amp; policies

4.1 Polygon Meshes and Implicit Surfaces Hao Li http://cs420.hao-li.com 1 Geometric

MEMORY SYNCHRONIZATION Mahdi Nazm Bojnordi Assistant Professor School of Computing University

Chapter 9 : Linguistics in Prague and Vienna between the wars John A. Goldsmith May 5 , 2016 1

Whats up with the fruit? Visual literacy is the ability

for Adults Beliefs &amp; Prayer SIGN IN AT THE WELCOME TABLE AND ENJOY THE FOOD RCIA Journey of

3. Other Important Terms The terms already defined relate to the study of literature and

Roadmap Integers & floats Machine code & C C: Java: x86 assembly Car c = new Car();

CS356 Unit 10 Memory Allocation & Heap Management 10.2 BASIC OS CONCEPTS & TERMINOLOGY

Technologies for Dementia Andrea Wilkinson & Mark Chignell TRO 2017 Annual Conference

Dynamic Memory Allocation Today Dynamic memory allocation mechanisms & policies

for Adults Beliefs & Prayer SIGN IN AT THE WELCOME TABLE AND ENJOY THE FOOD RCIA Journey of