Coarse-Grained Reconfigurable Acceleration Units FRANCESCA PALUMBO - - PowerPoint PPT Presentation

coarse grained reconfigurable acceleration units
SMART_READER_LITE
LIVE PREVIEW

Coarse-Grained Reconfigurable Acceleration Units FRANCESCA PALUMBO - - PowerPoint PPT Presentation

Design for Low-Power IoT Systems: Coarse-Grained Reconfigurable Acceleration Units FRANCESCA PALUMBO UNIVERSIT DEGLI STUDI DI SASSARI Design for Low-Power Internet-of-Things (IoT) Systems ISCAS 2018 Overview Motivations - What we need


slide-1
SLIDE 1

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Design for Low-Power IoT Systems: Coarse-Grained Reconfigurable Acceleration Units

FRANCESCA PALUMBO

UNIVERSITÀ DEGLI STUDI DI SASSARI

slide-2
SLIDE 2

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Overview

  • Motivations
  • What we need adaptation for
  • Triggers and Types
  • Coarse-Grained Reconfigurable Systems
  • Computing Spectrum and Reconfigurable Systems Classification
  • Heterogeneous and Irregular Coarse-Grained Reconfigurable Accelerators
  • Power Management
  • Issues and Strategies
  • Low-Power Coarse-Grained Reconfigurable Accelerators
  • An FFT Example

2

slide-3
SLIDE 3

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Overview

  • Motivations
  • What we need adaptation for
  • Triggers and Types
  • Coarse-Grained Reconfigurable Systems
  • Computing Spectrum and Reconfigurable Systems Classification
  • Heterogeneous and Irregular Coarse-Grained Reconfigurable Accelerators
  • Power Management
  • Issues and Strategies
  • Low-Power Coarse-Grained Reconfigurable Accelerators
  • An FFT Example

3

slide-4
SLIDE 4

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Numbers: opportunity or issue?

4

> 7 billion 20 MWh/year 1,800 kg oil

=

http://www.gartner.com/newsroom/id/3598917

slide-5
SLIDE 5

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Numbers: opportunity or issue?

4

> 7 billion 20 MWh/year 1,800 kg oil

Designed by Freepik

=

> 1 billion smartphones

http://www.gartner.com/newsroom/id/3598917

slide-6
SLIDE 6

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Numbers: opportunity or issue?

4

> 7 billion 20 MWh/year 1,800 kg oil

Designed by Freepik

=

> 1 billion smartphones 8.4 billion connected things in 2017 (+31% wrt 2016) 20.4 billion by 2020

http://www.gartner.com/newsroom/id/3598917

slide-7
SLIDE 7

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Some examples ...

Connectivity and real-time situation awareness are nowadays common in different scenarios.

5

slide-8
SLIDE 8

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

SMART HEALTH: distributed healthcare assistance to improve quality of life and active and healthy ageing, functionalities can be changed according to the daily tasks.

Some examples ...

Connectivity and real-time situation awareness are nowadays common in different scenarios.

5

slide-9
SLIDE 9

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

SMART-SOCIETY: increased building efficiency and comfort, i.e. lightning/air quality management can be adjusted to the room status. SMART HEALTH: distributed healthcare assistance to improve quality of life and active and healthy ageing, functionalities can be changed according to the daily tasks.

Some examples ...

Connectivity and real-time situation awareness are nowadays common in different scenarios.

5

slide-10
SLIDE 10

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

SMART-TRANSPORTATION: autonomous electric vehicle, improved driver assistance and care. Path towards destinations may vary, even diverging from the optimal

  • ne, according to user preferences.

SMART-SOCIETY: increased building efficiency and comfort, i.e. lightning/air quality management can be adjusted to the room status. SMART HEALTH: distributed healthcare assistance to improve quality of life and active and healthy ageing, functionalities can be changed according to the daily tasks.

Some examples ...

Connectivity and real-time situation awareness are nowadays common in different scenarios.

5

slide-11
SLIDE 11

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Reconfiguration: Recipe for Compromises

Modern digital devices (real-time and ad-hoc) are pervasive (98% of computers are embedded) and interconnected.

6

slide-12
SLIDE 12

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Reconfiguration: Recipe for Compromises

Modern digital devices (real-time and ad-hoc) are pervasive (98% of computers are embedded) and interconnected. They may also present sensing and actuating capabilities, leading to the concept of CPS.

6

slide-13
SLIDE 13

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Reconfiguration: Recipe for Compromises

Modern digital devices (real-time and ad-hoc) are pervasive (98% of computers are embedded) and interconnected. They may also present sensing and actuating capabilities, leading to the concept of CPS.

Safety Security Certif. Distrib. HMI Seamless MPSoC Energy Automotive x x x x x x x Aerospace x x x x x x x Healthcare x x x x x x x x Consumer x x x

6

slide-14
SLIDE 14

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Reconfiguration: Recipe for Compromises

Reconfiguration may allow to optimally implement complex/demanding systems, managing numerous/conflicting requirements and a variety of functionalities. Modern digital devices (real-time and ad-hoc) are pervasive (98% of computers are embedded) and interconnected. They may also present sensing and actuating capabilities, leading to the concept of CPS.

Safety Security Certif. Distrib. HMI Seamless MPSoC Energy Automotive x x x x x x x Aerospace x x x x x x x Healthcare x x x x x x x x Consumer x x x

6

slide-15
SLIDE 15

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Triggers for Adaptation

7

slide-16
SLIDE 16

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Triggers for Adaptation

ENVIRONMENTAL AWARENESS: Influence of the environment on the system, i.e. daylight vs. nocturnal, radiation level changes, etc. Sensors are needed to interact with the environment and capture conditions variations.

7

slide-17
SLIDE 17

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Triggers for Adaptation

ENVIRONMENTAL AWARENESS: Influence of the environment on the system, i.e. daylight vs. nocturnal, radiation level changes, etc. Sensors are needed to interact with the environment and capture conditions variations. USER-COMMANDED: System-User interaction, i.e. user preferences, etc. Proper human-machine interfaces are needed to enable interaction and capture commands.

7

slide-18
SLIDE 18

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Triggers for Adaptation

ENVIRONMENTAL AWARENESS: Influence of the environment on the system, i.e. daylight vs. nocturnal, radiation level changes, etc. Sensors are needed to interact with the environment and capture conditions variations. USER-COMMANDED: System-User interaction, i.e. user preferences, etc. Proper human-machine interfaces are needed to enable interaction and capture commands. SELF-AWARENESS: The internal status of the system varies while operating and may lead to reconfiguration needs, i.e. chip temperature variation, low battery. Status monitors are needed to capture the status of the system.

7

slide-19
SLIDE 19

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Types of Adaptation

8

slide-20
SLIDE 20

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Types of Adaptation

FUNCTIONALITY-ORIENTED: To adapt functionality because the CPS mission changes, or the data being processed changes and adaptation is required. It may be parametric (a constant changes) or fully functional (algorithm changes).

A B C

8

slide-21
SLIDE 21

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Types of Adaptation

FUNCTIONALITY-ORIENTED: To adapt functionality because the CPS mission changes, or the data being processed changes and adaptation is required. It may be parametric (a constant changes) or fully functional (algorithm changes). NON-FUNCTIONAL REQUIREMENTS-ORIENTED: Functionality is fixed, but system requires adaptation to accommodate to changing requirements, i.e. execution time or energy consumption.

A B C

8

slide-22
SLIDE 22

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Types of Adaptation

FUNCTIONALITY-ORIENTED: To adapt functionality because the CPS mission changes, or the data being processed changes and adaptation is required. It may be parametric (a constant changes) or fully functional (algorithm changes). NON-FUNCTIONAL REQUIREMENTS-ORIENTED: Functionality is fixed, but system requires adaptation to accommodate to changing requirements, i.e. execution time or energy consumption. REPAIR-ORIENTED: For safety and reliability purposes, adaptation may be used in case of faults. Adaptation may add self-healing or self-repair features. e.g.: HW task migration for permanent faults, or scrubbing (continuous fault verification) and repair.

A B C

8

slide-23
SLIDE 23

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Overview

  • Motivations
  • What we need adaptation for
  • Triggers and Types
  • Coarse-Grained Reconfigurable Systems
  • Computing Spectrum and Reconfigurable Systems Classification
  • Heterogeneous and Irregular Coarse-Grained Reconfigurable Accelerators
  • Power Management
  • Issues and Strategies
  • Low-Power Coarse-Grained Reconfigurable Accelerators
  • An FFT Example

9

slide-24
SLIDE 24

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Computing Spectrum

Flexibility Efficiency

10

slide-25
SLIDE 25

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Computing Spectrum

DSP GPU CPU GP

Flexibility Efficiency

10

slide-26
SLIDE 26

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Computing Spectrum

ASIC DSP GPU CPU GP

Flexibility Efficiency

10

slide-27
SLIDE 27

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Computing Spectrum

ASIC DSP GPU CPU GP

Flexibility Efficiency

CG RECONF FG

Reconfigurable computing provides a trade-off between execution efficiency typical of ASICs and flexibility mainly exhibited by GP devices.

10

slide-28
SLIDE 28

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Computing Spectrum

ASIC DSP GPU CPU GP

Flexibility Efficiency

CG RECONF FG

Fine-Grained (FG) Coarse-Grained (CG) bit-level word-level flexibility ☺  speed  ☺ memory  

Reconfigurable computing provides a trade-off between execution efficiency typical of ASICs and flexibility mainly exhibited by GP devices.

10

slide-29
SLIDE 29

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Virtual vs. Dynamic & Partial

VRC → Virtual Reconfigurable Circuits

  • High reconfiguration speed
  • Lower operation speed (mux and size)
  • Higher Area Overhead
  • Technology independent (ASIC or FPGA)

DPR → Dynamic and Partial Reconfiguration

  • Lower reconfiguration speeds
  • Better operation speed (no mux/less logic)
  • Better Resource Utilization (no dark logic)
  • Higher Flexibility and Scalability
  • Technology dependent (FPGA)

11

slide-30
SLIDE 30

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

C

Dataflow Model of Computation

COMPUTING PARADIGM:

  • directed flow graph of actors (functional units)
  • communication with tokens (packets of data)

exchange through dedicated channels PECULIARITIES:

  • explicit intrinsic application parallelism
  • modularity favours model re-usability/adaptivity

EXTERNAL INTERFACE:

  • I/O ports number
  • I/O ports depth
  • I/O ports tokens burst

A B D

actions state

13

slide-31
SLIDE 31

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

C

Dataflow Model of Computation

COMPUTING PARADIGM:

  • directed flow graph of actors (functional units)
  • communication with tokens (packets of data)

exchange through dedicated channels PECULIARITIES:

  • explicit intrinsic application parallelism
  • modularity favours model re-usability/adaptivity

EXTERNAL INTERFACE:

  • I/O ports number
  • I/O ports depth
  • I/O ports tokens burst

A B D

actions state

13

slide-32
SLIDE 32

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: approach

coarse grained substrate

C D A B C D A B

1:1

http://sites.unica.it/rpct/

14

slide-33
SLIDE 33

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: approach

coarse grained substrate

C D A B C D A B

1:1

C D A B E D A α β

http://sites.unica.it/rpct/

14

slide-34
SLIDE 34

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: approach

coarse grained substrate

C D A B C D A B

1:1

coarse grained reconfigurable substrate

SB

E

SB

C D A B

2:1

C D A B E D A α β

http://sites.unica.it/rpct/

14

slide-35
SLIDE 35

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: approach

coarse grained substrate

C D A B C D A B

1:1

coarse grained reconfigurable substrate

SB

E

SB

C D A B

2:1

C D A B E D A α β

http://sites.unica.it/rpct/

14

slide-36
SLIDE 36

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: MDC framework

Multi-Dataflow Composer (MDC) tool: Dataflow 2 HW tool

15

slide-37
SLIDE 37

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: MDC framework

Multi-Dataflow Composer (MDC) tool: Dataflow 2 HW tool

  • Given N input dataflow specification, it generates the

HDL Coarse-Grain (CG) reconfigurable substrate

HETEROGENOUS FUNCTIONAL UNITS

15

slide-38
SLIDE 38

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: MDC framework

Multi-Dataflow Composer (MDC) tool: Dataflow 2 HW tool

  • Given N input dataflow specification, it generates the

HDL Coarse-Grain (CG) reconfigurable substrate

  • Handles programmability, by defining switching and

configuration logic

HETEROGENOUS FUNCTIONAL UNITS IRREGULAR INTERCONNECT

15

slide-39
SLIDE 39

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular: MDC framework

Multi-Dataflow Composer (MDC) tool: Dataflow 2 HW tool

  • Given N input dataflow specification, it generates the

HDL Coarse-Grain (CG) reconfigurable substrate

  • Handles programmability, by defining switching and

configuration logic

  • Deploy Xilinx-compliant IP blocks, plus their drivers

HETEROGENOUS FUNCTIONAL UNITS IRREGULAR INTERCONNECT

15

slide-40
SLIDE 40

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

MDC-based Reconfiguration: Adaptation Types

Functional Oriented Non-Functional Oriented

A B C

16

slide-41
SLIDE 41

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

MDC-based Reconfiguration: Adaptation Types

Functional Oriented

A B C in1

  • ut1

B E in2

  • ut2

A SW C in1 Execution Profile

  • ut1

B E SW in2

  • ut2

Non-Functional Oriented

A B C

16

slide-42
SLIDE 42

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

MDC-based Reconfiguration: Adaptation Types

A B C in

  • ut1

A D B in B E C

  • ut2

A SW C in Execution Profile

  • ut

B B SW E D SW

Functional Oriented

A B C in1

  • ut1

B E in2

  • ut2

A SW C in1 Execution Profile

  • ut1

B E SW in2

  • ut2

Non-Functional Oriented

A B C

16

Troughput vs Energy

slide-43
SLIDE 43

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

MDC-based Reconfiguration: Adaptation Types

Execution Profile A SW A SW A in SW

  • ut

A A A in

  • ut1

A A in

  • ut2

A in

  • ut3

A B C in

  • ut1

A D B in B E C

  • ut2

A SW C in Execution Profile

  • ut

B B SW E D SW

Functional Oriented

A B C in1

  • ut1

B E in2

  • ut2

A SW C in1 Execution Profile

  • ut1

B E SW in2

  • ut2

Non-Functional Oriented

A B C

16

Troughput vs Energy QoS Vs. Energy

slide-44
SLIDE 44

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Overview

  • Motivations
  • What we need adaptation for
  • Triggers and Types
  • Coarse-Grained Reconfigurable Systems
  • Computing Spectrum and Reconfigurable Systems Classification
  • Heterogeneous and Irregular Coarse-Grained Reconfigurable Accelerators
  • Power Management
  • Issues and Strategies
  • Low-Power Coarse-Grained Reconfigurable Accelerators
  • An FFT Example

17

slide-45
SLIDE 45

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

The Power Issue

Power consumption = Dynamic power + Static power

Dynamic: activity dependent

  • Short-circuit: when the output line of a transistor is

switching, there is a period of time when both the PMOS and the NMOS transistors are on (I·V·f)

  • Switching power: due to the charging and discharging
  • f the load capacitance when logic transitions occur

(determined by the formula C·V2·f). Static: not activity dependent, but due to leakage currents.

  • Do not depend on switching and operating frequency.
  • As transistors get smaller, channel lengths become shorter and leakage currents increase.

90 nm Inflection Point

18

slide-46
SLIDE 46

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

The Power Issue: Textbook Techniques

  • DESIGN: Multi Vt, Clock gating, Power gating, Multi Vdd, DVFS.
  • PROCESS: Multi Vt, PD SOI, FD SOI, FinFet, Body Bias, Multi oxide devices, Minimize

capacitance by custom design.

  • ARCHITECTURE: power-constrained DSE, hw customization and IP specific techniques,

parallelism, mapping.

Dynamic

Clock gating Variable frequency Voltage islands Variable power supply Multi power supply DVFS

Static

Multi-threshold dev. Power gating Back (substrate) bias Multi-oxide devices SOI CMOS

INTRINSICALLY SYSTEM LEVEL MANAGEABLE AT SYSTEM LEVEL

19

slide-47
SLIDE 47

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-Based Techniques

  • Power consumed by flip-flops.
  • Power consumed by combinatorial logic driven by registers.
  • Power consumed by the clock buffer tree.

20

Reduce frequency whenever you can. Stop the clock when the component is not active.

slide-48
SLIDE 48

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-Based Techniques

  • Power consumed by flip-flops.
  • Power consumed by combinatorial logic driven by registers.
  • Power consumed by the clock buffer tree.

20

Reduce frequency whenever you can. Stop the clock when the component is not active.

Fine-Grained

slide-49
SLIDE 49

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-Based Techniques

  • Power consumed by flip-flops.
  • Power consumed by combinatorial logic driven by registers.
  • Power consumed by the clock buffer tree.

20

Reduce frequency whenever you can. Stop the clock when the component is not active.

Fine-Grained Coarse-Grained

slide-50
SLIDE 50

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-Based Techniques

  • Power consumed by flip-flops.
  • Power consumed by combinatorial logic driven by registers.
  • Power consumed by the clock buffer tree.

20

Reduce frequency whenever you can. Stop the clock when the component is not active.

Fine-Grained Coarse-Grained

50% less dynamic power

slide-51
SLIDE 51

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Power-Based Techniques

  • Power-aware partitioning.

21

Reduce the voltage level of a power island whenever you can. Switch it off when it is not active.

V4 V1 V3 V2 V1 V3 V2 V4

Static Voltage Scaling (SVS )

slide-52
SLIDE 52

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Power-Based Techniques

  • Power-aware partitioning.
  • Switching-off power island brings local leakage to zero.

21

Reduce the voltage level of a power island whenever you can. Switch it off when it is not active.

V4 V4 V1 V3 V2 V1 V3 V2 V4 V1

OFF

V2 V1

OFF

V2 V4

Static Voltage Scaling (SVS ) SVS with Power Shut Off (PSO)

slide-53
SLIDE 53

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

V3

Power-Based Techniques

  • Power-aware partitioning.
  • Switching-off power island brings local leakage to zero.
  • Modes of operation -> power down all the idle chip regions.

21

Reduce the voltage level of a power island whenever you can. Switch it off when it is not active.

V4 V4 V1 V3 V2 V1 V3 V2 V4 V1

OFF

V2 V1

OFF

V2 V4 V4 V1

OFF OFF OFF OFF OFF

Static Voltage Scaling (SVS ) SVS with Power Shut Off (PSO) SVS with PSO and Power Modes

slide-54
SLIDE 54

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

V3 V2

Power-Based Techniques

  • Power-aware partitioning.
  • Switching-off power island brings local leakage to zero.
  • Modes of operation -> power down all the idle chip regions.

21

Reduce the voltage level of a power island whenever you can. Switch it off when it is not active.

V4 V4 V1 V3 V2 V1 V3 V2 V4 V1

OFF

V2 V1

OFF

V2 V4 V4 V1

OFF OFF OFF OFF OFF

Static Voltage Scaling (SVS ) SVS with Power Shut Off (PSO) SVS with PSO and Power Modes

slide-55
SLIDE 55

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-/Power-Based Techniques: Overhead

22

Switch off the clock, applicable to both ASIC and FPGA.

  • @ASIC: simple and gates.
  • @FPGA: dedicated vendor specific cells.
slide-56
SLIDE 56

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-/Power-Based Techniques: Overhead

22

Switch off the clock, applicable to both ASIC and FPGA.

  • @ASIC: simple and gates.
  • @FPGA: dedicated vendor specific cells.

Switch off the power supply, ASIC only (partially FPGA).

slide-57
SLIDE 57

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-/Power-Based Techniques: Overhead

22

Switch off the clock, applicable to both ASIC and FPGA.

  • @ASIC: simple and gates.
  • @FPGA: dedicated vendor specific cells.

Switch off the power supply, ASIC only (partially FPGA).

  • Sleep transistors: to switch on and off power supply.

POWER- DOWN BLOCK

Vdd Power Switch-Off Cell

slide-58
SLIDE 58

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-/Power-Based Techniques: Overhead

22

Switch off the clock, applicable to both ASIC and FPGA.

  • @ASIC: simple and gates.
  • @FPGA: dedicated vendor specific cells.

Switch off the power supply, ASIC only (partially FPGA).

  • Sleep transistors: to switch on and off power supply.
  • Isolation logic: to avoid the transmission of spurious signals from

gated regions to normally-on cells.

POWER- DOWN BLOCK

P_UP

ALWAYS-ON BLOCK

Isolation Cell

POWER- DOWN BLOCK

Vdd Power Switch-Off Cell

slide-59
SLIDE 59

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-/Power-Based Techniques: Overhead

22

Switch off the clock, applicable to both ASIC and FPGA.

  • @ASIC: simple and gates.
  • @FPGA: dedicated vendor specific cells.

Switch off the power supply, ASIC only (partially FPGA).

  • Sleep transistors: to switch on and off power supply.
  • Isolation logic: to avoid the transmission of spurious signals from

gated regions to normally-on cells.

  • Retention logic: to maintain the internal state of gated regions.

MAIN REGISTER SHADOW REGISTER

SAVE RESTORE Retention Register

POWER- DOWN BLOCK

P_UP

ALWAYS-ON BLOCK

Isolation Cell

POWER- DOWN BLOCK

Vdd Power Switch-Off Cell

slide-60
SLIDE 60

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Clock-/Power-Based Techniques: Overhead

23

Switch off the clock, applicable to both ASIC and FPGA.

  • @ASIC: simple and gates.
  • @FPGA: dedicated vendor specific cells.

Switch off the power supply, ASIC only (partially FPGA).

  • Sleep transistors: to switch on and off power supply.
  • Isolation logic: to avoid the transmission of spurious signals from

gated regions to normally-on cells.

  • Retention logic: to maintain the internal state of gated regions.

Vary mode changing Vdd, ASIC only.

  • Level shifters: to pass signals between portions of the design that
  • perate on different voltages.

POWER DOMAIN 1 0.8 V POWER DOMAIN 2 1.2 V

Level Shifters

slide-61
SLIDE 61

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

PSO Example

  • Reconfigurable Filter, the

depth of the filter can vary.

  • 2 different Logic Regions

(LR), one always on and one switchable

24

slide-62
SLIDE 62

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

PSO Example

  • Reconfigurable Filter, the

depth of the filter can vary.

  • 2 different Logic Regions

(LR), one always on and one switchable

  • Switchable LR needs the

insertion of

  • isolation, retention and

power switch cells;

  • one power controller to

handle the control signals [1*shut-o + 1*isol. + 2*reten.]

24

slide-63
SLIDE 63

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: power issue

α C D A B E D A D F β γ

25

slide-64
SLIDE 64

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: power issue

α C D A B E D A D F β γ

25

SB0

E A C B

SB1 SB2

F D Multi-Dataflow Graph

slide-65
SLIDE 65

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: power issue

α C D A B E D A D F β γ

25

Common element

SB0

E A C B

SB1 SB2

F D Multi-Dataflow Graph

slide-66
SLIDE 66

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: power issue

α C D A B E D A D F β γ

SB0

E A C B

SB1 SB2

F D

26

Common element

Multi-Dataflow Graph

α execution: E and F, being not involved, are wasting power!

slide-67
SLIDE 67

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: power issue

α C D A B E D A D F β γ

SB0

E A C B

SB1 SB2

F D

27

Common element

Multi-Dataflow Graph

β execution: B C and D, being not involved, are wasting power!

slide-68
SLIDE 68

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: power issue

α C D A B E D A D F β γ

SB0

E A C B

SB1 SB2

F D

28

Common element

Multi-Dataflow Graph

γ execution: B C and D, being not involved, are wasting power!

slide-69
SLIDE 69

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: MDC approach

E D A D F C D A B

α β γ

29

slide-70
SLIDE 70

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: MDC approach

SB

E A C B

SB SB

F D E D A D F C D A B

α β γ MDC front-end

29

slide-71
SLIDE 71

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: MDC approach

C F D A B E

γ α β

SB

E A C B

SB SB

F D E D A D F C D A B

α β γ MDC front-end

29

LRs Identification

slide-72
SLIDE 72

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Heterogeneous and Irregular CGR: MDC approach

C F D A B E LR 1 2 3 4 5 actors A B,C D E F α 1 1 1 β 1 1 1 γ 1 1

γ α β

SB

E A C B

SB SB

F D E D A D F C D A B

α β γ MDC front-end

29

LRs Identification

slide-73
SLIDE 73

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

AUTOMATED DYNAMIC POWER MANAGEMENT

low power (clock gated) CGR substrate

en generator

C F D A B E

clk

configurator

en1 en2 en3 en4 en5 LR

actors

α β γ 1 A 1 1 2 B,C 1 3 D 1 1 1 4 E 1 5 F 1 1

MDC back-end

30

slide-74
SLIDE 74

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Overview

  • Motivations
  • What we need adaptation for
  • Triggers and Types
  • Coarse-Grained Reconfigurable Systems
  • Computing Spectrum and Reconfigurable Systems Classification
  • Heterogeneous and Irregular Coarse-Grained Reconfigurable Accelerators
  • Power Management
  • Issues and Strategies
  • Low-Power Coarse-Grained Reconfigurable Accelerators
  • An FFT Example

31

slide-75
SLIDE 75

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

CG Reconfigurable FFT Design

32

x0 x4 x2 x6 x1 x5 x3 x7 y0 y1 y2 y3 y4 y5 y6 y7

  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1

radix-2 butterfly stage 1 stage 2 stage 3

slide-76
SLIDE 76

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

CG Reconfigurable FFT Design

33

b_00 b_10 b_20 b_30 b_10 b_11 b_21 b_31 b_02 b_12 b_22 b_32

12 butterflies

b_00 b_10 b_20 b_30

4 butterflies

b_00

1 butterfly

b_00 b_10

2 butterflies

slide-77
SLIDE 77

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Low-Power CG Reconfigurable FFT Design

b_00 b_10 b_20 b_30 b_10 b_11 b_21 b_31 b_02 b_12 b_22 b_32 SWITCHABLE AREA

34

slide-78
SLIDE 78

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Low-Power CG Reconfigurable FFT Design

b_00 b_10 b_20 b_30 b_10 b_11 b_21 b_31 b_02 b_12 b_22 b_32 SWITCHABLE AREA

34

slide-79
SLIDE 79

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Low-Power CG Reconfigurable FFT: 90nm ASIC

FFT: power vs throughput Dynamic trade-off management On ASIC MDC offers automatic implementation of power-gated and clock-gated designs FFT: Area

35

slide-80
SLIDE 80

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Intelligent system DEsign and Application (IDEA) @ UNISS

36

slide-81
SLIDE 81

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

The MDC Group – UNISS + UNICA team

37

UNIVERSITY OF SASSARI UNIVERSITY OF CAGLIARI

slide-82
SLIDE 82

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

The MDC Group – UNISS + UNICA team

37

UNIVERSITY OF SASSARI UNIVERSITY OF CAGLIARI

slide-83
SLIDE 83

Design for Low-Power Internet-of-Things (IoT) Systems – ISCAS 2018

Thanks to …

38

Coordinator: Michal Masin (IBM), michaelm@il.ibm.com Scientific Coordinator: Francesca Palumbo (UniSS), fpalumbo@uniss.it Innovation Manager: Katiuscia Zedda (Abinsula), katiuscia.zedda@abinsula.com Dissemination-Communication Manager: Francesco Regazzoni (USI), francesco.regazzoni@usi.ch www.cerbero-h2020.eu info@cerbero-h2020.eu @CERBERO_h2020

EU Commission for funding the CERBERO (Cross-layer modEl-based fRamework for multi-

  • Bjective dEsign of Reconfigurable systems in unceRtain hybRid envirOnments) project as

part of the H2020 Programme under grant agreement No 732105.