Profitability-Based Power Allocation for Speculative Multithreaded - - PowerPoint PPT Presentation

profitability based power allocation for speculative
SMART_READER_LITE
LIVE PREVIEW

Profitability-Based Power Allocation for Speculative Multithreaded - - PowerPoint PPT Presentation

Profitability-Based Power Allocation for Speculative Multithreaded Systems Polychronis Xekalakis, Nikolas Ioannou Salman Khan and Marcelo Cintra University of Edinburgh Introduction CMPs are here to stay Power and temperature limit


slide-1
SLIDE 1

Profitability-Based Power Allocation for Speculative Multithreaded Systems

Polychronis Xekalakis, Nikolas Ioannou Salman Khan and Marcelo Cintra

University of Edinburgh

slide-2
SLIDE 2

IPDPS 2010 2

Introduction

  • CMPs are here to stay
  • Power and temperature limit performance
  • No speedup for single thread applications

– Use Thread Level Speculation to extract TLP – Energy Inefficient

slide-3
SLIDE 3

IPDPS 2010 3

Introduction

  • CMPs are here to stay
  • Power and temperature limit performance
  • No speedup for single thread applications

– Use Thread Level Speculation to extract TLP – Energy Inefficient

  • Our proposal:

– Steal power from non-profitable threads – Allocate it where it is useful

slide-4
SLIDE 4

IPDPS 2010 4

Contributions

  • Propose power allocation based on thread

profitability

  • Propose a set of novel predictors to classify

threads in profitable and non-profitable ones

  • Our approach outperforms state-of-the-art

TLS systems:

– ED by 21.2% (up to 39.6%) –… while also reducing the temperature

slide-5
SLIDE 5

IPDPS 2010 5

Speculative Multithreading

  • Basic Idea: Use idle cores/contexts to speculate
  • n future application needs

–TLS: speculatively execute parallel threads –HT/RA: speculatively perform future memory operations –MP: speculatively execute along multiple branch targets

  • When speculation fails, power inefficiency

results

slide-6
SLIDE 6

IPDPS 2010 6

Outline

  • Introduction
  • Profitability Based Power Allocation
  • Estimating Profitability
  • Experimental Setup and Results
  • Conclusions
slide-7
SLIDE 7

IPDPS 2010 7

Profitability Based Power Allocation

slide-8
SLIDE 8

IPDPS 2010 8

Profitability Based Power Allocation

slide-9
SLIDE 9

IPDPS 2010 9

Profitability Based Power Allocation

slide-10
SLIDE 10

IPDPS 2010 10

Outline

  • Introduction
  • Profitability Based Power Allocation
  • Estimating Profitability
  • Experimental Setup and Results
  • Conclusions
slide-11
SLIDE 11

Estimating Profitability

  • Benefits for TLS: TLP/ILP

– TLP (Overlapped Execution) – ILP (Prefetching)

11 IPDPS 2010 Thread 1 Thread 2

Speculative Time

Overlapped Execution Thread 1 Thread 2

Speculative Time

Prefetching

slide-12
SLIDE 12

IPDPS 2010 12

Estimating TLP

slide-13
SLIDE 13

IPDPS 2010 13

Estimating ILP

slide-14
SLIDE 14

IPDPS 2010 14

Power Mode Policy

  • For threads that are predicted to squash:

– Place in low power mode on first prediction

– Place in very low power mode on third prediction

  • For threads that are memory bound:

– Place in low power mode

  • If power budget allows, place safe thread in

high power mode

slide-15
SLIDE 15

IPDPS 2010 15

Outline

  • Introduction
  • Profitability Based Power Allocation
  • Estimating Profitability
  • Experimental Setup and Results
  • Conclusions
slide-16
SLIDE 16

IPDPS 2010 16

Evaluation Environment

  • Simulator, Compiler and Benchmarks:

– SESC (http://sesc.sourceforge.net/) – POSH (Liu et al. PPoPP ‘06) – Spec 2000 Int.

  • Architecture:

– Four way CMP, 4-Issue cores – 16KB L1 Data (multi-versioned) and Instruction Caches – 1MB unified L2 Caches – Inst. window/ROB – 80/104 entries

slide-17
SLIDE 17

IPDPS 2010 17

Power Modes Used

Mode Voltage Freq High Power 1000 mV 5.0 GHz Normal Power 950 mV 4.0 GHz Low Power 900 mV 3.0 GHz Very Low Power 700 mV 1.0 GHz

slide-18
SLIDE 18

IPDPS 2010 18

Performance-Power Analysis

Speedup

slide-19
SLIDE 19

IPDPS 2010 19

Performance-Power Analysis

Power

slide-20
SLIDE 20

IPDPS 2010 20

Performance-Power Analysis

Energy Delay

slide-21
SLIDE 21

IPDPS 2010 21

Thermal Analysis

Base TLS Profitability- based Scheme

slide-22
SLIDE 22

IPDPS 2010 22

Outline

  • Introduction
  • Profitability Based Power Allocation
  • Estimating Profitability
  • Experimental Setup and Results
  • Conclusions
slide-23
SLIDE 23

IPDPS 2010 23

Conclusions

  • CMPs are here to stay
  • Power on chip needs to be effectively utilized
  • Allocating power by profitability leads to improvements
  • Squash and memory boundedness predictors can

estimate thread profitability

  • Our approach outperforms state-of-the-art TLS systems:

– ED by 21.2% (up to 39.6%) – … while also reducing the temperature

slide-24
SLIDE 24

Profitability-Based Power Allocation for Speculative Multithreaded Systems

Polychronis Xekalakis, Nikolas Ioannou Salman Khan and Marcelo Cintra

University of Edinburgh