Evaluating the Effectiveness of Model Based Power Characterization - PowerPoint PPT Presentation

Evaluating the Effectiveness of Model ‐ Based Power Characterization John McCullough, Yuvraj Agarwal , Jaideep Chandrashekhar (Intel), Sathya Kuppuswamy, Alex C. Snoeren, Rajesh Gupta Computer Science and Engineering, UC San Diego http://variability.org http://synergy.ucsd.edu

Motivation • Computing platforms are ubiquitous – Sensors, mobile devices, PCs to data centers – Significant consumers of energy, slated to grow significantly • Reducing energy consumption – Battery powered devices: goal of all day computing – Mains powered devices: reduce energy costs, carbon footprint

Detailed Power Characterization is Key • Managing energy consumption within platforms – Requires visibility into where energy is being consumed • Granularity of power characterization matters – “Total System Power” or “Individual Subsystem Power” – Depends on level of power optimizations desired • Defining question, from the software stack perspective: – How can power consumption be characterized effectively – What are the limits: accuracy, granularity, complexity? • Power characterization has been well studied – Need to revisit given the characteristics of modern platforms 3

Modern Systems ‐ Larger Dynamic Range Dynamic Total Power Total Power Dynamic Fixed / Base Fixed / Base 0% Utilization 100% 0% Utilization 100% • Prior generation of computing platforms: – Systems with high base power ‐ > small dynamic range – Dynamic component not critical to capture • Modern platforms: – Increasing dynamic fraction – Critical to capture dynamic component for accuracy 4

Power Characterization: Measure or Model • Two options: Directly measured, or indirectly modeled – Modeling preferred because of less hardware complexity • Many different power models have been proposed – Linear regression, learning, stochastic, .. • Question: how good are these models? – Component level as well as system level power predictions

Outline • Describe power measurement infrastructure – Fine grained, per component breakdown • Present different power models – Linear regression (prior work), complex models • Compare models with real measurements – Different workloads (SpecCPU, PARSEC, synthetic) • Results: Power modeling ‐ > high error – Reasons range from complexity, hidden states – Modeling errors will only get worse with variability 6

Power Measurement Infrastructure • Highly instrumented Intel “Calpella” Platform – Nehalem core i7, core i5, 50 sense resistors – High precision NI DAQs, 16bit / 1.25MS/s, 32 ADCs 7

Prior Work in Power Modeling • Total System Power Modeling – [Economou MOBS’06] ‐ Regression model, MANTIS • AMD blade: < 9% error across benchmarks • Itanium server: <21% error – [Riviore HotPower ‘08] – Compare regression models • Core2Duo/XEON, Itanium, Mobile FileServer, AMD Turion • Mean error < 10% across SPEC CPU/JBB benchmarks • Subsystem Models – [Bircher ISPASS ‘07] – linear regression models • P4 XEON system: Error < 9% across all subsystems Prior work: single ‐ threaded workloads, systems with high base power, less complex systems. 8

Power Modeling Methodology Performance Counters, (OS, CPU,..) Training Set Build Power + (Applications) Models Power Measurements Power Testing Set Performance Counters, Power Prediction Model (OS, CPU,..) (Applications) • Counters: CPU + OS/Device counters – For CPU: measure only 4 (programmable) + 2 (fixed) – Remove uncorrelated counters, add based on coefficients • Benchmarks: “training set” and “testing set” – k X 2 ‐ fold cross ‐ validation (do this n = 10 times) – Removes any bias in choosing training and testing set 9

Power Consumption Models • “MANTIS” [Prior Work] – Linear Regression – Uses domain knowledge for counter selection • “Linear ‐ lasso” – Linear Regression – Counters selection: “MANTIS” + Lasso/GLMNET • “nl ‐ poly ‐ lasso” – Non Linear Regression (NLR) – Counters selection: “MANTIS” + Lasso/GLMNET • “nl ‐ poly ‐ exp ‐ lasso” – NLR + Poly term + Exp. Term – Counters selection: “MANTIS” + Lasso/GLMNET “svm_rbf” – Support Vector Machines • – Unlike Lasso, SVM does not force model to be sparse. 10

Benchmarks • “ SpecCPU ” – 22 Benchmarks, single ‐ threaded – More CPU centric • “PARSEC” – emerging multi ‐ core workloads – Include file ‐ dedup, x264 encoding • Specific workloads – specific subsystems – “Bonnie” – I/O heavy benchmark – “Linux Build” – Multi threaded parallel build – StressTestApp, CPULoad, memcached 11

“Calpella” Platform – Power Breakdown • Subsystem level power breakdown – PSU power not shown, GPU constant – Large dynamic range – 23W (Idle) to 57W (stream)! 12

Modeling Total System Power Error bars indicate max-min per-benchmark mean error • Increased Complexity ‐ > Single core to Multi ‐ Core – Modeling error increases significantly – Mean Modeling Error < 10%, worse error > 15% 13

Modeling Subsystem Power – CPU Error bars indicate max-min per-benchmark mean error • Increased Complexity ‐ > Single core to Multi ‐ Core – CPU Power modeling error increases significantly – Multicore ‐ Mean Error ~20%, worst case > 150% – Simplest case: HT and TurboBoost are Disabled 14

CPU Power: Single ‐ > Multicore Single ‐ core: Multi ‐ core: CMP inherently increases prediction complexity 15

Accurate Power Modeling is Challenging • Hidden system states – SSDs: wear leveling, TRIM, delayed writes, erase cycles – Processors: aggressive clock gating, “Turbo Boost” • Increasing system complexity – Too many states: Nehalem CPU has hundreds of counters – Interactions hard to capture: resource contention • E.g. consider SSDs vs traditional HDDs Power Prediction Error on SSD is 2X higher than HDD! 16

Adding Hardware Variability to the Mix P1 P2 P3 P4 • Variability in hardware is increasing – Identical parts, not necessarily identical in power, perf. – Can be due to: manufacturing, environment, aging, … – “Model one, apply to other instances” may not hold • Experiment: Measure CPU power variability – Identical dual ‐ core Core i5 ‐ 540M ‐‐ 540M ‐ 1, 540M ‐ 2 – Same benchmark, different configurations, 5 runs each 17

Variability Leads to Higher Modeling Error Processor Power Variability on 1 benchmark • 12% Variability across 540M ‐ 1 and 540M ‐ 2 – 20% modeling error + 12% variability  34% error! • Part variability slated to increase in the future 18

Summary • Power characterization using modeling – Becoming infeasible for complex modern platforms – Total power: 1% ‐ 5% (single core) to 10% ‐ 15% error (multi ‐ core) – Per ‐ component model predictions even worse: • CPU 20% ‐ 150% error • Memory 2% ‐ 10% error, HDD 3% ‐ 22% error, and SSD 5% ‐ 35% error • Challenge: hidden state and system complexity • Variability in components makes it even worse Need low cost instrumentation solutions for accurate power characterization. 19

Questions? http://synergy.ucsd.edu http://www.variability.org

Total Power: Single ‐ > Multicore Single ‐ core: Multi ‐ core: Increase in error, sensitivity to individual benchmarks 21

Evaluating the Effectiveness of Model Based Power Characterization - PowerPoint PPT Presentation

Evaluating the Effectiveness of Model Based Power Characterization John McCullough, Yuvraj Agarwal , Jaideep Chandrashekhar (Intel), Sathya Kuppuswamy, Alex C. Snoeren, Rajesh Gupta Computer Science and Engineering, UC San Diego

Evaluating Heat Treatment Evaluating Heat Treatment Effectiveness Effectiveness Bh. .

(power x 0) == 1 (power x (+ n 1)) == (* (power x n) x) (power x 0) == 1 (power x (+ (* 2 m)

CSC Effectiveness Review CSC Effectiveness Review Team October 2018 ICANN63 Need for Review of

Defining Teacher Effectiveness Source: A Practical Guide for Evaluating Teaching Effectiveness

WALES SOFT POWER BAROMETER 2018 Measuring soft power beyond the nation-state April 2018 01 WHAT

Evaluating the Effectiveness of SESCDPs 2 Todays Session What is a logic model?

Office of Institutional Effectiveness Overview Institutional Effectiveness Overview The Office of

Educator Effectiveness Grant New Teacher S upport and Development Educator Effectiveness Grant

Determining the Determining the Effectiveness & ROI Effectiveness & ROI of Your GRC

An Insight- -Based Based An Insight Methodology for Evaluating Methodology for Evaluating

Hydro Power Generation e-Power CLA-VAL Europe Product Range e-Power IP e-Power HP e-Power MP

THE POWER OF US THE POWER OF US FIRST NATIONAL WEBINAR September 12, 2017 WEBINAR AGENDA

How does the power industry support How does the power industry support How does the power

Power Converters and Power Quality II CERN Accelerator School on Power Converters Baden, Friday 9

Evaluating the Effectiveness of the ISO 27001:2013 based on the Annex A Bahareh Shojaie Hannes

Presentations Evaluating the Effectiveness of Security Mechanisms at Scale David Nicol

ACRS MEETING WITH THE U.S. NUCLEAR REGULATORY COMMISSION June 7, 2012 Overview Sam Armijo

Field Theory Approach to Equilibrium Critical Phenomena Uwe C. T auber Department of Physics

Response surface models for the Elliott, Rothenberg, Stock DF-GLS unit-root test Christopher F

Strangeness Production in AA Collisions at SIS18 Introduction strangeness in dense baryonic

Pu Wang 1 Pu Wang 2 Pu Wang 3 Pu Wang 4 4 1 2 3 Path: 1,2,3,4 Pu Wang 5 Pu Wang 6

Power and Energy Charles Li and Deepak Pallerla Power: A First-Class Architectural Design

DRAM POWER MANAGEMENT Mahdi Nazm Bojnordi Assistant Professor School of Computing University of

Q1 FY2015 August 7, 2014 Scan QR code to view our corporate video Safe Harbor Statement This

Evaluating the Effectiveness of Model Based Power Characterization - PowerPoint PPT Presentation

Evaluating the Effectiveness of Model Based Power Characterization John McCullough, Yuvraj Agarwal , Jaideep Chandrashekhar (Intel), Sathya Kuppuswamy, Alex C. Snoeren, Rajesh Gupta Computer Science and Engineering, UC San Diego

Evaluating Heat Treatment Evaluating Heat Treatment Effectiveness Effectiveness Bh. .

(power x 0) == 1 (power x (+ n 1)) == (* (power x n) x) (power x 0) == 1 (power x (+ (* 2 m)

CSC Effectiveness Review CSC Effectiveness Review Team October 2018 ICANN63 Need for Review of

Defining Teacher Effectiveness Source: A Practical Guide for Evaluating Teaching Effectiveness

WALES SOFT POWER BAROMETER 2018 Measuring soft power beyond the nation-state April 2018 01 WHAT

Evaluating the Effectiveness of SESCDPs 2 Todays Session What is a logic model?

Office of Institutional Effectiveness Overview Institutional Effectiveness Overview The Office of

Educator Effectiveness Grant New Teacher S upport and Development Educator Effectiveness Grant

Determining the Determining the Effectiveness &amp; ROI Effectiveness &amp; ROI of Your GRC

An Insight- -Based Based An Insight Methodology for Evaluating Methodology for Evaluating

Hydro Power Generation e-Power CLA-VAL Europe Product Range e-Power IP e-Power HP e-Power MP

THE POWER OF US THE POWER OF US FIRST NATIONAL WEBINAR September 12, 2017 WEBINAR AGENDA

How does the power industry support How does the power industry support How does the power

Power Converters and Power Quality II CERN Accelerator School on Power Converters Baden, Friday 9

Evaluating the Effectiveness of the ISO 27001:2013 based on the Annex A Bahareh Shojaie Hannes

Presentations Evaluating the Effectiveness of Security Mechanisms at Scale David Nicol

ACRS MEETING WITH THE U.S. NUCLEAR REGULATORY COMMISSION June 7, 2012 Overview Sam Armijo

Field Theory Approach to Equilibrium Critical Phenomena Uwe C. T auber Department of Physics

Response surface models for the Elliott, Rothenberg, Stock DF-GLS unit-root test Christopher F

Strangeness Production in AA Collisions at SIS18 Introduction strangeness in dense baryonic

Pu Wang 1 Pu Wang 2 Pu Wang 3 Pu Wang 4 4 1 2 3 Path: 1,2,3,4 Pu Wang 5 Pu Wang 6

Power and Energy Charles Li and Deepak Pallerla Power: A First-Class Architectural Design

DRAM POWER MANAGEMENT Mahdi Nazm Bojnordi Assistant Professor School of Computing University of

Q1 FY2015 August 7, 2014 Scan QR code to view our corporate video Safe Harbor Statement This

Determining the Determining the Effectiveness & ROI Effectiveness & ROI of Your GRC