The Energy Efficiency of CMP vs. SMT for Multimedia Workloads - PDF document

The Energy Efficiency of CMP vs. SMT for Multimedia Workloads ∗ Ruchira Sasanka Sarita V. Adve Yen-Kuang Chen Eric Debes University of Illinois at Urbana-Champaign Architecture Research Labs Department of Computer Science Intel Corporation { sasanka, sadve } @cs.uiuc.edu { yen-kuang.chen, eric.debes } @intel.com UIUC CS Technical Report UIUCDCS-R-2003-2325, March 2003 Intel Technical Report 130581, March 2003 Abstract 1 Introduction This paper compares the energy efficiency of This paper compares the energy efficiency of chip multi- chip multiprocessing (CMP) [10] and simulta- processing (CMP) and simultaneous multithreading (SMT) neous multithreading (SMT) [19] for multime- on modern out-of-order processors for the increasingly im- dia applications on modern out-of-order general- portant multimedia applications. Since performance is an important metric for real-time multimedia applications, we purpose processors (GPPs). Multimedia applications compare configurations at equal performance . We perform are becoming increasingly important for GPPs in a variety this comparison for a large number of performance points of systems including desktops, laptops, tablet PCs, and derived using different processor architectures and frequen- likely future handheld devices. GPPs have begun to support cies/voltages. multithreading for improved throughput, using either CMP or SMT. These techniques are a good match for multimedia We find that for the design space explored, for each work- applications which are inherently multithreaded. However, load, at each performance point, CMP is more energy effi- multimedia applications often run on portable systems cient than SMT. The difference is small for two thread sys- facing strict energy constraints. It is therefore important to tems, but large (18% to 44%) for four thread systems. We study the energy efficiency of general-purpose CMP and also find that the best SMT and the best CMP configuration SMT architectures for multimedia applications. for a given performance target have different architecture SMT allows multiple application threads to be run at the and frequency/voltage. Therefore, their relative energy ef- same time, within the same processor, potentially increasing ficiency depends on a subtle interplay between various fac- utilization of the processor resources. Specifically, current tors such as capacitance, voltage, IPC, frequency, and the wide issue out-of-order processors are often unable to uti- level of clock gating, as well as workload features. We per- lize the full supported fetch/decode/issue width for a single form a detailed analysis considering these factors and de- thread. SMT utilizes these otherwise wasted resources for velop a mathematical model to explain these results. other threads, potentially improving total throughput with Although CMP shows a clear energy advantage for four- little additional hardware. CMP, on the other hand, im- thread (and higher) workloads, it comes at the cost of in- proves throughput by adding additional processors rather creased silicon area. We therefore investigate a hybrid solu- than improving their utilization. tion where a CMP is built out of SMT cores, and find it to be At first glance, SMT may appear to be inherently more an effective compromise. Finally, we find that we can reduce energy efficient than CMP since it potentially uses its re- energy further for CMP with a straightforward application sources more effectively – SMT can get more IPC (instruc- of previously proposed techniques of adaptive architectures tions per cycle) from less hardware. However, in reality, and dynamic voltage/frequency scaling. the comparison is more complex, both in the analysis to un- derstand the experimental results and in the methodology to ∗ This work is supported in part by an equipment donation generate the right results. from AMD Corp., a gift from Intel Corp., and the National Sci- Sources of complexity and our solutions. For real-time ence Foundation under Grant No. EIA-0103645, CCR-0209198, multimedia applications, performance is a key constraint. A CCR-0205638, EIA-0224453, and CCR-0313286. Sarita V. Adve fair comparison of energy must therefore also consider per- was also supported by an Alfred P. Sloan Research Fellowship. formance. As a result, we compare the energy of SMT and Ruchira Sasanka was supported by an Intel graduate fellowship and began this work as a summer intern at Intel. 1

The Energy Efficiency of CMP vs. SMT for Multimedia Workloads - PDF document

The Energy Efficiency of CMP vs. SMT for Multimedia Workloads Ruchira Sasanka Sarita V. Adve Yen-Kuang Chen Eric Debes University of Illinois at Urbana-Champaign Architecture Research Labs Department of Computer Science Intel Corporation

Atelier Num erique OMP Code Optimization: Vectorization Bertrand Putigny July 5, 2016 1 / 27

Pre-2012 CMP 2012 CMP Amendments 2018 CMP Amendments Above: Solar panel carports

Workshop 1 North Central Texas Council of Governments CMP Workshop Overview Overview of

THE CMP INTEGRATES LIFELONG LEARNING WITH ASSESSMENT THE CMP INTEGRATES LIFELONG LEARNING WITH

http://cmp.imag.fr CMP annual users meeting, 4 Feb. 2016, PARIS Pr Process Portf rtfolio lio fr

SMT WORLDWIDE SMT America, Europe and Asia staff has over 20 years experience in the SMT field

POLYMETALLIC PRODUCER AGM PRESENTATION June 30, 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL: SMT

SMT Solvers: A Disruptive Technology John Rushby Computer Science Laboratory SRI International

Using SMT solvers for binary analysis and exploitation A primer on SMT, SMT solvers, Z3 & angr

El Paso Electric El Paso Electric Energy Efficiency Energy Efficiency Standard Offer Programs -

NHEC Perspectives on Energy NHEC Perspectives on Energy Efficiency and Sustainable Energy

POLYMETALLIC PRODUCER CORPORATE PRESENTATION July 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL:

SMT in Asia Content Teknek and the SMT industry The market Why cleaning is needed

POLYMETALLIC PRODUCER CORPORATE PRESENTATION February 2020 TSX: SMT | NYSE AMERICAN: SMTS |

DIVERSIFIED PRODUCER CORPORATE PRESENTATION August 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL:

DIVERSIFED PRODUCER CORPORATE PRESENTATION August 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL:

An Integrated Tropical Cyclone Information System Bjorn Lambrigtsen , Yi Chao, Svetla

Outreach to Industry 4th Meeting, Trento Thursday 9 - Friday 10 September 2004 Venue: Trento ,

New de Young Museum Design by Herzog & de Meuron Engineering by Fong & Chan Seismic

Web-Based Information Everyday Activity Systems The Web has primarily been people

Retiring Presented by Kevin Wenndt Senior Retirement Benefits Officer IPERS is The largest

CBOE Holdings, Inc. Third Quarter Earnings Conference Call November 3, 2011 p. 1 p. 1 CBOE

KERS Contribution Rates 90% 83.0% 80% 70% 60% 49.0% 49.0% 49.0% 49.0% 49.0% 50% 39.0%

Richard Thripp A Survey of Investing and Retirement Knowledge and Preferences of Florida

The Energy Efficiency of CMP vs. SMT for Multimedia Workloads - PDF document

The Energy Efficiency of CMP vs. SMT for Multimedia Workloads Ruchira Sasanka Sarita V. Adve Yen-Kuang Chen Eric Debes University of Illinois at Urbana-Champaign Architecture Research Labs Department of Computer Science Intel Corporation

Atelier Num erique OMP Code Optimization: Vectorization Bertrand Putigny July 5, 2016 1 / 27

Pre-2012 CMP 2012 CMP Amendments 2018 CMP Amendments Above: Solar panel carports

Workshop 1 North Central Texas Council of Governments CMP Workshop Overview Overview of

THE CMP INTEGRATES LIFELONG LEARNING WITH ASSESSMENT THE CMP INTEGRATES LIFELONG LEARNING WITH

http://cmp.imag.fr CMP annual users meeting, 4 Feb. 2016, PARIS Pr Process Portf rtfolio lio fr

SMT WORLDWIDE SMT America, Europe and Asia staff has over 20 years experience in the SMT field

POLYMETALLIC PRODUCER AGM PRESENTATION June 30, 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL: SMT

SMT Solvers: A Disruptive Technology John Rushby Computer Science Laboratory SRI International

Using SMT solvers for binary analysis and exploitation A primer on SMT, SMT solvers, Z3 &amp; angr

El Paso Electric El Paso Electric Energy Efficiency Energy Efficiency Standard Offer Programs -

NHEC Perspectives on Energy NHEC Perspectives on Energy Efficiency and Sustainable Energy

POLYMETALLIC PRODUCER CORPORATE PRESENTATION July 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL:

SMT in Asia Content Teknek and the SMT industry The market Why cleaning is needed

POLYMETALLIC PRODUCER CORPORATE PRESENTATION February 2020 TSX: SMT | NYSE AMERICAN: SMTS |

DIVERSIFIED PRODUCER CORPORATE PRESENTATION August 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL:

DIVERSIFED PRODUCER CORPORATE PRESENTATION August 2020 TSX: SMT | NYSE AMERICAN: SMTS | BVL:

An Integrated Tropical Cyclone Information System Bjorn Lambrigtsen , Yi Chao, Svetla

Outreach to Industry 4th Meeting, Trento Thursday 9 - Friday 10 September 2004 Venue: Trento ,

New de Young Museum Design by Herzog &amp; de Meuron Engineering by Fong &amp; Chan Seismic

Web-Based Information Everyday Activity Systems The Web has primarily been people

Retiring Presented by Kevin Wenndt Senior Retirement Benefits Officer IPERS is The largest

CBOE Holdings, Inc. Third Quarter Earnings Conference Call November 3, 2011 p. 1 p. 1 CBOE

KERS Contribution Rates 90% 83.0% 80% 70% 60% 49.0% 49.0% 49.0% 49.0% 49.0% 50% 39.0%

Richard Thripp A Survey of Investing and Retirement Knowledge and Preferences of Florida

Using SMT solvers for binary analysis and exploitation A primer on SMT, SMT solvers, Z3 & angr

New de Young Museum Design by Herzog & de Meuron Engineering by Fong & Chan Seismic