Instruction Set Architectures Part II: x86, RISC, and CISC - PowerPoint PPT Presentation

Instruction Set Architectures Part II: x86, RISC, and CISC Readings: 2.16-2.18 1

Goals for this Class • Understand how CPUs run programs • How do we express the computation the CPU? • How does the CPU execute it? • How does the CPU support other system components (e.g., the OS)? • What techniques and technologies are involved and how do they work? • Understand why CPU performance varies • How does CPU design impact performance? • What trade-offs are involved in designing a CPU? • How can we meaningfully measure and compare computer performance? • Understand why program performance varies • How do program characteristics affect performance? • How can we improve a programs performance by considering the CPU running it? • How do other system components impact program performance? 2

Goals • Start learning to read x86 assembly • Understand the design trade-offs involved in crafting an ISA • Understand RISC and CISC • Motivations • Origins • Learn something about other current ISAs • Very long instruction word (VLIW) • Arm and Thumb 3

The Stack Frame • Example A function’s “stack frame” holds main: • It’s local variables � addiu � $sp,$sp,-32 • Copies of callee-saved registers (if � sw � � $fp,24($sp) needs to used them) � move � $fp,$sp • Copies of caller-saved registers � sw � � $0,8($fp) (when it makes function calls). � li � � $v0,1 � • The frame pointer ($fp) points to the � sw � � $v0,12($fp) base of the frame stack frame. � li � � $v0,2 � • The frame pointer in action. � sw � � $v0,16($fp) • Adjust the stack pointer to allocate � lw � � $3,12($fp) the frame � lw � � $v0,16($fp) • Save the $fp into the frame (it’s � addu � $v0,$3,$v0 callee-saved) � sw � � $v0,8($fp) • Copy from the $sp to the $fp � lw � � $v0,8($fp) • � move � $sp,$fp Use the $sp as needed for function � lw � � $fp,24($sp) calls. • � addiu � $sp,$sp,32 Refer to local variables relative to � j � � $ra $fp. • Clean up when you’re done. 4

The Stack Frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp sp-> 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: PC-> � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp sp-> 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: PC-> � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: � addiu � $sp,$sp,-32 PC-> ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: � addiu � $sp,$sp,-32 PC-> ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � old fp � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) PC-> � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � old fp � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) PC-> � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � old fp � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> fp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame The stack frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) PC-> � move � $fp,$sp 0x1020 � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � old fp � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> fp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame The stack frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp 0x1020 PC-> � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � old fp � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> fp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

The Stack Frame The stack frame main: � addiu � $sp,$sp,-32 ... value fp-relative � sw � � $fp,24($sp) � move � $fp,$sp 0x1020 PC-> � sw � � $0,8($fp) 0x101C +32 � li � � $v0,1 � old fp � sw � � $v0,12($fp) 0x1018 +24 � li � � $v0,2 � 0x1014 +20 � sw � � $v0,16($fp) 0x1010 +16 � lw � � $3,12($fp) � lw � � $v0,16($fp) 0x100C +12 � addu � $v0,$3,$v0 0 0x1008 +8 � sw � � $v0,8($fp) � lw � � $v0,8($fp) 0x1004 +4 � move � $sp,$fp sp-> fp-> 0x1000 +0 � lw � � $fp,24($sp) 0x0FFC � addiu � $sp,$sp,32 � j � � $ra 5

Instruction Set Architectures Part II: x86, RISC, and CISC - PowerPoint PPT Presentation

Instruction Set Architectures Part II: x86, RISC, and CISC Readings: 2.16-2.18 1 Goals for this Class Understand how CPUs run programs How do we express the computation the CPU? How does the CPU execute it? How does the

Instruction Set Architectures Part II: x86, RISC, and CISC Readings: 2.16-2.18 1 Which ISA

CISC / RISC Complex / Reduced Instruction Set Computers CISC / RISC p. 1/12 Instruction

DLX computer Electronic Computers M 1 RISC architectures RISC vs CISC (Reduced Instruction Set

Roadmap 1. Instruction Set Architectures (ISA) What is CISC? What is RISC? Why did RISC prevail

Roadmap 1. Instruction Set Architectures (ISA) What is CISC? What is RISC? Why did RISC prevail

CISC vs. RISC x86 is the epitome of a Complex Instruction x86 or Set Computer Hundreds of

SYSC3601 Microprocessor Systems Unit 13: RISC/CISC Comparison Topics/Reading 1. Review:

x86-32 and x86-64 Assembly (Part 2) (I know Kung-Fu !) Emmanuel Fleury

PROCESSOR DEVELOPMENT THE FREE AND OPEN RISC INSTRUCTION SET ARCHITECTURE Codasip is the

Instruction Set 2 Architecting a vocabulary for the HW INSTRUCTION SET OVERVIEW 3 Instruction

CISC Semiconductor GmbH Dr. Markus PISTAUER CEO m.pistauer@cisc.at Company at a glance

x86 Introduction Philipp Koehn 25 October 2019 Philipp Koehn Computer Systems Fundamentals: x86

The future of operating systems on RISC-V Alex Bradbury asb@lowrisc.org @asbradbury 4th

QEMU Support for the RISC-V Instruction Set Architecture Sagar Karandikar

RISC Processors Chapter 14 S. Dandamudi Outline Introduction Itanium processor

ARM Advanced RISC Machines The ARM Instruction Set The ARM Instruction Set - ARM University

http://fpanalysistools.org/ 1 This work was performed under the auspices of the U.S. Department

Operating System An Introduction Virendra Singh Associate Professor Computer Architecture and

CSE 3320 Operating Systems Introduction Jia Rao Department of Computer Science and Engineering

Lexpresso: a Controlled Natural Language Adam Saulwick Defence Science and Technology

RAW FP John Hughes Mary Sheeran 9,000,000 100,000 50% QuickSort Feldspar Functional

Multiple Issue Previous techniques - Try to achieve an

1 Introduction Co-Occurrences Frequent Item Tree Association rule mining FP Growth Ying

We Well llcom come to TA TACO CO! GCIG/KGOG1027/TGCS2012: Randomized Phase III Clinical