cse240a: Graduate Computer Architecture (Steven Swanson, Hung-Wei Tseng)


  1. cse240a: Graduate Computer Architecture • Steven Swanson • Hung-Wei Tseng

  2. Today’s Agenda • What is architecture? • Why is it important? • At the highest level, where is architecture today? Where is it going? • What’s in this class?

  3. What is architecture? • How do you build a machine that computes? • Quickly, safely, cheaply, efficiently, in technology X, for application Y, etc. • “Civilization advances by extending the number of important operations which we can perform without thinking about them.” -- Alfred North Whitehead

  4. Orientation The internet

  6. Orientation • [Diagram labels: System Bus, IO (PCI), Memory, Power] • Architecture begins about here.

  9. You are here cse240a

  10. The processors go here…

  12. Abstractions of the Physical World… • Layers: Physics/Materials, Devices, Micro-architecture, Processors, Architectures • [Figure annotations: This Course; cse241a/ECE dept; Physics/Chemistry/Material science]

  14. …for the Rest of the System • [Figure labels: JVM; Processor; Software; Compilers; Languages; Architectures; Abstraction; Engineers/Applications]

  16. Why study architecture? • As CEs or CSs you should understand how computers work • Processors are the basis for everything in CS (except theory) • They are where the rubber meets the road. • Performance is important • Faster machines make applications cheaper • Understanding hardware is essential to understanding how systems behave • It’s cool! • Microprocessors are among the most sophisticated devices manufactured by people • How they work (and even that they work) as reliably and as quickly as they do is amazing. • Architecture is undergoing a revolution • The future is uncertain • Opportunities for innovation abound.

  17. Performance and You! • Live Demo

  18. Processors are Cool! • Chips are made of silicon • Aka “sand” • The second most abundant element in the earth’s crust (after oxygen). • Extremely pure (<1 part per billion impurities) • This is the purest stuff people make

  19. Building Chips • Photolithography • [Figure sequence, one step per panel: silicon wafer; grow silicon dioxide (SiO2); apply photoresist; expose to UV through a mask; patterned resist; etch SiO2; deposit metal; etch SiO2 (or not)]

  28. Building Blocks: Transistors

  29. Building Blocks: Wires

  30. State of the art CPU • 1-2 billion transistors • 45 nm features • 3-4 GHz • Several hundred designers • >5 years • $3 billion fab • 70 GFLOPS

  31. Current state of architecture

  32. Since 1940 • Plug boards -> Java • Hand assembling -> GCC • No OS -> Windows Vista • 50,000x speedup • >1,000,000,000x density (Moore’s Law) • Flexible performance is a liquid asset

  35. Moore’s Law: Raw transistors

  36. Computer Performance • [Plot: relative performance (log scale, 1 to 10000) vs. year (1990 to 2010) for specINT95, specINT2000, and specINT2006] • Trend lines: 47% per year, 39% per year, 25% per year
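
The growth rates on the performance plot compound year over year, so small differences in rate add up quickly. The sketch below is not from the slides; it assumes each rate holds constant over the period, and the function names and the 10-year horizon are illustrative choices.

```python
import math

def doubling_time_years(annual_growth: float) -> float:
    """Years needed to double performance at a constant annual growth rate (0.47 means 47%)."""
    return math.log(2) / math.log(1 + annual_growth)

def cumulative_speedup(annual_growth: float, years: float) -> float:
    """Total speedup after `years` of compounding at `annual_growth`."""
    return (1 + annual_growth) ** years

# The three trend-line rates quoted on the slide; 10 years is an arbitrary horizon.
for rate in (0.47, 0.39, 0.25):
    print(f"{rate:.0%} per year: doubles every {doubling_time_years(rate):.1f} years, "
          f"{cumulative_speedup(rate, 10):.1f}x after 10 years")
```

Under this assumption, dropping from 47% to 25% per year stretches the doubling time from under two years to over three.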

  41. The clock speed addiction • [Plot: clock speed (MHz, 0 to 5000) vs. year (1996 to 2010) for specINT2000 and specINT2006] • Clock speed is the biggest contributor to power • Chip manufacturers (Intel, esp.) pushed clock speeds very hard in the 90s and early 2000s. • Doubling the clock speed increases power by 2-8x • Clock speed scaling is essentially finished.
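
One way to see where a 2-8x range can come from is the standard first-order model of CMOS dynamic power, P ∝ C·V²·f. The slide does not state this model, so the sketch below is an assumption-laden illustration: it ignores leakage, and the idea that supply voltage may have to rise in proportion to frequency is an assumption made to show the upper bound.

```python
def relative_dynamic_power(freq_scale: float, voltage_scale: float = 1.0) -> float:
    """Relative dynamic power under P ~ C * V^2 * f, with capacitance held fixed."""
    return voltage_scale ** 2 * freq_scale

# Doubling the clock with the supply voltage unchanged: power scales linearly.
print(relative_dynamic_power(2.0))        # 2.0

# Doubling the clock when voltage must also double (assumed worst case): cubic scaling.
print(relative_dynamic_power(2.0, 2.0))   # 8.0
```

Real designs fall somewhere between the two cases, depending on how much extra voltage the higher frequency requires.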

  42. Power • [Plot: power density (Watts/cm², log scale, 1 to 1000) vs. process feature size, from 1.5µ down to 0.07µ]

  48. What’s Next: Brainiacs • Hold the clock rate steady. • Be smarter in silicon • More sophisticated processors • More clever algorithms • This continues to deliver about 25% per year. • But for how long?

  49. What’s Next: Parallelism • This is all the rage right now • You probably own a multiprocessor; they used to be pretty exotic. • They provide some performance, but it’s hard to use. • There aren’t that many threads • Remember, flexible performance is a liquid asset • Remember or look forward to OS

  50. Intel P4: 1 core • Intel Core 2 Duo: 2 cores • Intel Nehalem: 4 cores • SPARC T1: 8 cores • AMD Barcelona: 4 cores • Cell BE: 8 + 1 cores • Intel Prototype: 80 cores
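
The parallelism slide's point that extra cores are hard to use can be illustrated with Amdahl's law, which the slides do not name; it is brought in here only as a familiar model. The sketch assumes a hypothetical workload in which 90% of the work parallelizes perfectly, and evaluates it at the core counts of the chips listed above.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Ideal speedup when only `parallel_fraction` of the work scales with core count."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)

PARALLEL_FRACTION = 0.9  # hypothetical workload: 90% of the work parallelizes

for cores in (1, 2, 4, 8, 80):  # core counts of the chips on the slide
    print(f"{cores:>2} cores: {amdahl_speedup(PARALLEL_FRACTION, cores):.2f}x")
```

In this model the serial 10% caps the speedup below 10x no matter how many cores are added, which is one reason an 80-core chip does not deliver anything close to 80x on typical software.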

  51. Course Staff • Instructor: Steven Swanson • Lectures Tues + Thurs • TA: Hung-Wei Tseng • See the course web page for contact information and office hours.

  52. Who am I? • BA/BS at University of Puget Sound • PhD at the University of Washington • Computer architecture • Ubiquitous computing • Thesis: “The WaveScalar Architecture” • At UCSD since 2006 • Heterogeneous architectures • Non-volatile, solid-state memories • Multi-processor memory system optimizations
