SLIDE 1
Processor Performance and Parallelism
- Y. K. Malaiya
Processor Performance and Parallelism Y. K. Malaiya Processor - - PowerPoint PPT Presentation
Processor Performance and Parallelism Y. K. Malaiya Processor Execution time The time taken by a program to execute is the product of n Number of machine instructions executed n Number of clock cycles per instruction (CPI) n Single clock period
2
3
4
5 Load/store instructions are about 20-30%
6
Demo: Threads in Mac
7
n time
n Time in example
n Non-stop
8
c= 800ps)
c= 200ps)
9
– Ex: AMD Opteron x4 – CPI can be less than 1!.
10
11
12
13
14
n Instruction level parallelism is still SISD n SSE (Streaming SIMD Extensions): vector operations n Intel Xeon e5345: 4 cores n Does not model Instruction level/task level parallelism
15
16
17
18