Priority-based Wormhole Networks-on-Chip: challenges and - - PowerPoint PPT Presentation

priority based wormhole networks on chip challenges and
SMART_READER_LITE
LIVE PREVIEW

Priority-based Wormhole Networks-on-Chip: challenges and - - PowerPoint PPT Presentation

Leandro Soares Indrusiak Priority-based Wormhole Networks-on-Chip: challenges and opportunities Leandro Soares Indrusiak Real-Time Systems Group Department of Computer Science University of York United Kingdom RTN 2017 1 Real-Time Systems


slide-1
SLIDE 1

1 Leandro Soares Indrusiak Real-Time Systems Group

Priority-based Wormhole Networks-on-Chip: challenges and opportunities

Leandro Soares Indrusiak

Real-Time Systems Group Department of Computer Science University of York United Kingdom

RTN 2017

slide-2
SLIDE 2

2 Leandro Soares Indrusiak Real-Time Systems Group

Outline

  • Wormhole Networks
  • Networks-on-Chip
  • Real-Time Analysis
  • Resource Management
slide-3
SLIDE 3

3 Leandro Soares Indrusiak Real-Time Systems Group

Motivation

  • Multiprocessor and multicore systems

forced a shift towards communication- centric design

abundant computation resources shared communication media

  • Inter-processor communication

point-to-point bus networks

source: IBM, Intel

slide-4
SLIDE 4

4 Leandro Soares Indrusiak Real-Time Systems Group

Networks – key characteristics

  • topology

mesh, star, torus…

  • routing protocol

deterministic, adaptive…

  • arbitration

round-robin, priority preemptive, priority non- preemptive, TDM…

  • buffering

FIFO, SAFC, SAMQ, DAMQ, hot potato…

  • flow control protocol

handshake, credit-based…

  • switching protocol

circuit, store-and-forward, wormhole

slide-5
SLIDE 5

5 Leandro Soares Indrusiak Real-Time Systems Group

Networks – key characteristics

  • topology

mesh, star, torus…

  • routing protocol

deterministic, adaptive…

  • arbitration

round-robin, priority preemptive, priority non- preemptive, TDM…

  • buffering

FIFO, SAFC, SAMQ, DAMQ, hot potato…

  • flow control protocol

handshake, credit-based…

  • switching protocol

circuit, store-and-forward, wormhole

slide-6
SLIDE 6

6 Leandro Soares Indrusiak Real-Time Systems Group

Circuit switching

  • Packets are forwarded through dedicated paths that are

kept until the transmission is finished

  • Contention can arise when establishing a path

 no further contention once path is established

  • Temporary packet buffering in routers is not required
  • Suitable to long and infrequent messages

 time to establish a path can be high

slide-7
SLIDE 7

7 Leandro Soares Indrusiak Real-Time Systems Group

Circuit Switching

Terminal

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

slide-8
SLIDE 8

8 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-9
SLIDE 9

9 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-10
SLIDE 10

10 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-11
SLIDE 11

11 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-12
SLIDE 12

12 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-13
SLIDE 13

13 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-14
SLIDE 14

14 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-15
SLIDE 15

15 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-16
SLIDE 16

16 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-17
SLIDE 17

17 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-18
SLIDE 18

18 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-19
SLIDE 19

19 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-20
SLIDE 20

20 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

A segment of reserved path is idle for a significant period

  • f time.

Circuit Switching

Terminal Terminal

Packet Header Packet Data

slide-21
SLIDE 21

21 Leandro Soares Indrusiak Real-Time Systems Group

SAF Switching

  • SAF - Store And Forward
  • Routers can only forward a packet once it is completely

received and stored

 packet acquires one link at a time

  • Router input ports must have enough buffering space to

temporarily store a complete packet

slide-22
SLIDE 22

22 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-23
SLIDE 23

23 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-24
SLIDE 24

24 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-25
SLIDE 25

25 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-26
SLIDE 26

26 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-27
SLIDE 27

27 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-28
SLIDE 28

28 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-29
SLIDE 29

29 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-30
SLIDE 30

30 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-31
SLIDE 31

31 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-32
SLIDE 32

32 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-33
SLIDE 33

33 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-34
SLIDE 34

34 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

SAF Switching

slide-35
SLIDE 35

35 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole switching

  • Packet is routed and forwarded as soon the header flit has

arrived

 payload flits follow header

  • Input ports does not need to buffer a complete packet

flits of a packet can be stored across multiple routers

  • Trade-off between buffer overheads and network contention
slide-36
SLIDE 36

36 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Switching

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

slide-37
SLIDE 37

37 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Switching

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

slide-38
SLIDE 38

38 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Switching

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

slide-39
SLIDE 39

39 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Switching

Switch Switch Switch Switch Switch Switch

Terminal

Packet Header Packet Data

slide-40
SLIDE 40

40 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Wormhole Switching

Terminal

Packet Header Packet Data

slide-41
SLIDE 41

41 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Wormhole Switching

Terminal

Packet Header Packet Data

slide-42
SLIDE 42

42 Leandro Soares Indrusiak Real-Time Systems Group

Switch Switch Switch Switch Switch Switch

Wormhole Switching

Terminal

Packet Header Packet Data

slide-43
SLIDE 43

43 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Networks-on-Chip

  • Small buffering overheads
  • f wormhole networks is

particularly attractive to a special class of resource- constrained networks: Networks-on-Chip (NoCs)

small buffers mean smaller area and lower energy dissipation

PE PE PE PE PE PE PE PE R PE R R R R R R R R

slide-44
SLIDE 44

44 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Networks-on-Chip

PE PE PE PE PE PE PE PE R PE R R R R R R R R

Core Router Link

slide-45
SLIDE 45

45 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Networks-on-Chip

PE PE PE PE PE PE PE PE R PE R R R R R R R R

Link

slide-46
SLIDE 46

46 Leandro Soares Indrusiak Real-Time Systems Group

arbitration routing & transmission control routing & transmission control

data out data in data out data in data out data in data out data in data out data in

PE PE PE PE PE PE PE PE R PE R R R R R R R R

Wormhole Networks-on-Chip

slide-47
SLIDE 47

47 Leandro Soares Indrusiak Real-Time Systems Group

Wormhole Networks-on-Chip

slide-48
SLIDE 48

48 Leandro Soares Indrusiak Real-Time Systems Group

NoC parallelism and scalability

CPU CPU CPU CPU RAM CPU I/O CPU

Multiple connections simultaneously

slide-49
SLIDE 49

49 Leandro Soares Indrusiak Real-Time Systems Group

NoC performance

link contention leads to latency variability

CPU CPU CPU CPU RAM CPU I/O CPU

task contention leads to latency variability

slide-50
SLIDE 50

50 Leandro Soares Indrusiak Real-Time Systems Group

Time predictability in embedded NoCs

  • Ability to guarantee an upper bound
  • n the system’s temporal behaviour

 worst-case response time of each task  worst-case latency of each NoC packet  worst-case end-to-end latencies of communicating task chains

  • Ability to constrain the variability of

the system’s temporal behaviour

 limited best/worst case difference

upper bound

time time frequency frequency

slide-51
SLIDE 51

51 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-52
SLIDE 52

52 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-53
SLIDE 53

53 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-54
SLIDE 54

54 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-55
SLIDE 55

55 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-56
SLIDE 56

56 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

packet is blocked Packet Header Packet Data

PE

slide-57
SLIDE 57

57 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-58
SLIDE 58

58 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-59
SLIDE 59

59 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

new packet released Packet Header Packet Data

PE

slide-60
SLIDE 60

60 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-61
SLIDE 61

61 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-62
SLIDE 62

62 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-63
SLIDE 63

63 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Wormhole Networks-on-Chip

PE

Packet Header Packet Data

PE

slide-64
SLIDE 64

64 Leandro Soares Indrusiak Real-Time Systems Group

Performance guarantees in embedded NoCs

  • As the core counts increase, NoC link

contention tends to me the dominant source of latency variability

  • Current solutions

Full traffic separation (i.e. no link contention)

  • deterministic routing, fully disjoint routes (e.g. Hermes)
  • multiple overlay networks (e.g. Tilera)
  • contention over NIs and memory still possible
  • circuit switching (e.g. PNoC)
  • unpredictable circuit setup time
  • very low utilisation
  • state of the art: mixed criticality, virtual traffic separation
slide-65
SLIDE 65

65 Leandro Soares Indrusiak Real-Time Systems Group

Performance guarantees in embedded NoCs

  • As the core counts increase, NoC link

contention tends to me the dominant source of latency variability

  • Current solutions

Virtual traffic separation

  • time-division multiplexing (TDM)
  • fixed traffic slotting (e.g. Aethereal, AElite)
  • round-robin (RR)
  • rate controlling (e.g. Kalray, Nostrum, IDAMC)
  • fixed-priority (FP)
  • priority-arbitrated virtual channels (e.g. QNoC)
slide-66
SLIDE 66

66 Leandro Soares Indrusiak Real-Time Systems Group

Priority preemptive virtual channels

  • Wormhole NoCs using virtual channels

with priority preemptive arbitration can discriminate packets of different levels of urgency

  • Matches previous work on schedulability

analysis in priority-preemptive wormhole networks

slide-67
SLIDE 67

67 Leandro Soares Indrusiak Real-Time Systems Group

Priority preemptive virtual channels

highest priority with remaining credit data_in credit_out data_out credit_in

routing & transmission control priority ID

highest priority with remaining credit

routing & transmission control

PE PE PE PE PE PE PE PE R PE R R R R R R R R

slide-68
SLIDE 68

68 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-69
SLIDE 69

69 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-70
SLIDE 70

70 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

high priority packet released

slide-71
SLIDE 71

71 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-72
SLIDE 72

72 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-73
SLIDE 73

73 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

first packet is preempted

slide-74
SLIDE 74

74 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-75
SLIDE 75

75 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-76
SLIDE 76

76 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-77
SLIDE 77

77 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-78
SLIDE 78

78 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-79
SLIDE 79

79 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-80
SLIDE 80

80 Leandro Soares Indrusiak Real-Time Systems Group

R R R R R R

Priority preemptive virtual channels

PE

wormhole NoC with priority preemptive virtual channels

Packet Header Packet Data

PE

slide-81
SLIDE 81

81 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

Pros vs Cons

slide-82
SLIDE 82

82 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Cons

 not available as COTS

slide-83
SLIDE 83

83 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Cons

 hardware overhead related to virtual channel buffering and arbitration

Xilinx Artix FPGA

  • B. Sudev, L. S. Indrusiak: Low overhead predictability enhancement in non-preemptive network-on-chip routers using Priority Forwarded Packet
  • Splitting. ReCoSoC 2014.
slide-84
SLIDE 84

84 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Cons

 hardware overhead related to virtual channel buffering and arbitration

simple round-robin, no traffic shaping

Xilinx Artix FPGA

  • B. Sudev, L. S. Indrusiak: Low overhead predictability enhancement in non-preemptive network-on-chip routers using Priority Forwarded Packet
  • Splitting. ReCoSoC 2014.
slide-85
SLIDE 85

85 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Cons

 hardware overhead related to virtual channel buffering and arbitration

priority non- preemptive arbitration [Sudev & Indrusiak, ReCoSoC 2014]

Xilinx Artix FPGA

OPEN PROBLEM ALERT

  • B. Sudev, L. S. Indrusiak: Low overhead predictability enhancement in non-preemptive network-on-chip routers using Priority Forwarded Packet
  • Splitting. ReCoSoC 2014.
slide-86
SLIDE 86

86 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Cons

 hardware overhead related to virtual channel buffering and arbitration

priority preemptive arbitration, 4 VCs with 2 position buffers each

Xilinx Artix FPGA

  • B. Sudev, L. S. Indrusiak: Low overhead predictability enhancement in non-preemptive network-on-chip routers using Priority Forwarded Packet
  • Splitting. ReCoSoC 2014.
slide-87
SLIDE 87

87 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Pros

 notion of priorities is very intuitive and natural  no waste of bandwidth through reservation mechanisms  amenable to tight analysis methods (more on this later)  virtual separation of traffic  accommodates change in traffic properties (periods, packet sizes, jitter)

slide-88
SLIDE 88

88 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

  • Pros

 simple protocols to handle mixed-criticality traffic

  • L. S. Indrusiak, J. Harbin, A. Burns: Average and Worst-Case Latency Improvements in Mixed-Criticality Wormhole Networks-on-Chip. ECRTS 2015.

C R R R R C R R C R R C R C C C C C

mode change notification

C R R R R C R R C R R C R C C C C C

after a mode change, routers arbitrate links in criticality order, and in priority

  • rder within the

same criticality

slide-89
SLIDE 89

89 Leandro Soares Indrusiak Real-Time Systems Group

Priority-preemptive wormhole NoCs

vs

slide-90
SLIDE 90

90 Leandro Soares Indrusiak Real-Time Systems Group

Outline

  • Wormhole Networks
  • Networks-on-Chip
  • Real-Time Analysis
  • Resource Management
slide-91
SLIDE 91

91 Leandro Soares Indrusiak Real-Time Systems Group

Performance evaluation

  • How to estimate performance figures for a particular

application mapped to a Network-on-Chip?

 full system prototyping

  • cores + NoC in FPGA, running OS + application
  • extremely costly setup time, can only explore few design alternatives

 accurate system simulation

  • cycle-accurate model of cores + NoC, running OS + application
  • extremely long simulation time, can only explore few design alternatives

 approximately-timed system simulation

  • approximately-timed model of cores + NoC, executing an abstract model of

the OS + application

 analytical system performance models

  • average or worst-case latency estimation for restricted application styles

(periodic independent tasks, synchronous dataflow, etc.)

slide-92
SLIDE 92

92 Leandro Soares Indrusiak Real-Time Systems Group

Performance evaluation

  • How to estimate performance figures for a particular

application mapped to a Network-on-Chip?

 full system prototyping

  • cores + NoC in FPGA, running OS + application
  • extremely costly setup time, can only explore few design alternatives

 accurate system simulation

  • cycle-accurate model of cores + NoC, running OS + application
  • extremely long simulation time, can only explore few design alternatives

 approximately-timed system simulation

  • approximately-timed model of cores + NoC, executing an abstract model of

the OS + application

 analytical system performance models

  • average or worst-case latency estimation for restricted application styles

(periodic independent tasks, synchronous dataflow, etc.)

slide-93
SLIDE 93

93 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • First approaches to analyse priority-preemptive

wormhole networks came during the 90s

 Mutka (1994)  Hary and Ozguner (1997)

  • Key idea is to consider the entire path of a packet

as a single shared resource

 worst-case latency bound of a packet flow can be found by analysing the higher priority packet flows that share at least one link of its route

slide-94
SLIDE 94

94 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

PE PE PE PE PE PE PE PE R PE R R R R R R R R

t4 t3 t2 t1 pri(t1)>pri(t2)>pri(t3)>pri(t4) t2 t4 t3 t1 interference graph

slide-95
SLIDE 95

95 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • Kim et al (1998) recognised

that direct interferences are not enough to produce correct upper bounds

  • Indirect interference must

be considered, in order to take into account back-to- back hits caused by upstream indirect interference

PE PE PE PE PE PE PE PE R PE R R R R R R R R

t2 t3 t1 pri(t1)>pri(t2)>pri(t3)

slide-96
SLIDE 96

96 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • With the introduction of Networks-on-Chip in the

2000s, the approach of Kim et al was revisited by Lu et al (ASP DAC 2005)

aiming to provide upper bounds to sporadic packets over NoCs with priority preemptive virtual channels flawed assumption of a critical instant where all packets start flowing simultaneously

slide-97
SLIDE 97

97 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • Shi and Burns (NOCS 2008) corrected the flaw
  • n Lu et al and produced a response time

formulation that uses a conservative approach to upstream indirect interference

interference jitter Jj

I = Rj-Lj

OPEN PROBLEM ALERT

slide-98
SLIDE 98

98 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • Several lines of work were derived from Shi and

Burns 2008

highly cited: 145 (Google Scholar) many works on priority assignment and task mapping a few on analysis improvement, aiming to make it tighter

  • Nikolic et al (arxiv 2016) considered that the interference should not

be calculated based on the full path, but the contention domain

  • Kashif et al (Trans Comp 2015) attempted to analyse packet paths
  • n a link-by-link manner, but assumed infinite buffering (i.e. did not

consider backpressure)

  • Kashif and Patel (RTAS 2016) attempted to consider buffering and

backpressure effects

  • all of them upper-bounded by Shi and Burns 2008
slide-99
SLIDE 99

99 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • Xiong et al (GLSVLSI 2016) has made two key

contributions

new formulation to the upstream indirect interference problem, aiming to be tighter than Shi and Burns 2008 new formulation to the downstream indirect interference problem, aiming to capture a previously unseen issue, and showing that Shi and Burns 2008 is optimistic and unsafe (and so are all the analyses upper-bounded by it)

slide-100
SLIDE 100

100 Leandro Soares Indrusiak Real-Time Systems Group

Real-Time Analysis

  • Indrusiak et al (arxiv 2016) has shown that

Xiong et al’s formulation to the upstream indirect interference problem was flawed Xiong et al’s formulation to the downstream indirect interference problem was correct, but unnecessarily pessimistic (i.e. it assumed all indirect interference as if it is direct interference) a tighter upper bound that considers the downstream indirect interference problem is possible

  • Xiong et al published a corrected analysis on

IEEE Trans Comp in 2017

OPEN PROBLEM ALERT

slide-101
SLIDE 101

101 Leandro Soares Indrusiak Real-Time Systems Group

Outline

  • Wormhole Networks
  • Networks-on-Chip
  • Real-Time Analysis
  • Resource Management
slide-102
SLIDE 102

102 Leandro Soares Indrusiak Real-Time Systems Group

Resource Management

  • Analytical models can be used to test

whether a particular NoC configuration meets its hard real-time constraints

  • It can be used as a fitness function in a

search-based optimisation

guides the search towards full schedulability much faster than simulation, therefore can cover a wider search space

slide-103
SLIDE 103

103 Leandro Soares Indrusiak Real-Time Systems Group

  • P. Mesidis and L. S. Indrusiak, “Genetic mapping of hard real-time applications onto NoC-based MPSoCs — A first approach,” in Int Workshop on

Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC), 2011.

  • M. N. S. M. Sayuti and L. S. Indrusiak, “Real-time low-power task mapping in Networks-on-Chip,” in IEEE Computer Society Annual Symposium on

VLSI (ISVLSI), 2013.

Optimisation Algorithm

  • Optimisation performed by a population-based

evolutionary algorithm

slide-104
SLIDE 104

104 Leandro Soares Indrusiak Real-Time Systems Group

Optimisation Algorithm

  • A population contains a group of individuals represented

by a chromosome structure

  • Creation of new individuals is facilitated by operators:

 Selection  Crossover  Mutation

slide-105
SLIDE 105

105 Leandro Soares Indrusiak Real-Time Systems Group

Optimisation algorithm

  • Goal: evolve a fully schedulable mapping
  • ver generations

# unschedulable tasks and flows generations

slide-106
SLIDE 106

106 Leandro Soares Indrusiak Real-Time Systems Group

Experiment results

  • Autonomous vehicle application (AVA) benchmark (38

communicating tasks), 4x4 Mesh NoC

  • M. N. S. M. Sayuti and L. S. Indrusiak, “Real-time low-power task mapping in Networks-on-Chip,” in IEEE Computer Society Annual Symposium on

VLSI (ISVLSI), 2013.

slide-107
SLIDE 107

107 Leandro Soares Indrusiak Real-Time Systems Group

Multi-objective optimisation algorithm

  • Additional fitness functions can be added to the

evolutionary algorithm

 example: evolve mappings that are fully schedulable and low power

# unschedulable tasks and flows generations dissipated power generations

slide-108
SLIDE 108

108 Leandro Soares Indrusiak Real-Time Systems Group

  • Autonomous vehicle

application (AVA) benchmark, 4x4 Mesh

  • Comparison of best

solution convergence between single and multiple objectives

Experiment results

  • M. N. S. M. Sayuti and L. S. Indrusiak, “Real-time low-power task mapping in Networks-on-Chip,” in IEEE Computer Society Annual Symposium on

VLSI (ISVLSI), 2013.

slide-109
SLIDE 109

109 Leandro Soares Indrusiak Real-Time Systems Group

  • Synthetic application

(SA) benchmark, 4x4 Mesh

  • Comparison of best

solution convergence between single and multiple objectives

Experiment results

  • M. N. S. M. Sayuti and L. S. Indrusiak, “Real-time low-power task mapping in Networks-on-Chip,” in IEEE Computer Society Annual Symposium on

VLSI (ISVLSI), 2013.

slide-110
SLIDE 110

110 Leandro Soares Indrusiak Real-Time Systems Group

  • Synthetic application

(SA) benchmark, 5x5 Mesh

  • Comparison of best

solution convergence between single and multiple objectives

Experiment results

  • M. N. S. M. Sayuti and L. S. Indrusiak, “Real-time low-power task mapping in Networks-on-Chip,” in IEEE Computer Society Annual Symposium on

VLSI (ISVLSI), 2013.

slide-111
SLIDE 111

111 Leandro Soares Indrusiak Real-Time Systems Group

Conclusions

  • Priority-preemptive wormhole networks

have interesting properties, making them amenable to real-time analysis

  • Analysis is less trivial than originally

thought

advances needed on accounting for indirect interference

  • Analytical models are useful guides to

search heuristics attempting to configure multiple system aspects

task mapping, packet routing, priority assignment, security features, voltage and frequency scaling

OPEN PROBLEM ALERT OPEN PROBLEM ALERT

slide-112
SLIDE 112

112 Leandro Soares Indrusiak Real-Time Systems Group

Priority-based Wormhole Networks-on-Chip: challenges and opportunities

RTN 2017

Leandro Soares Indrusiak

with contributions by: Alan Burns, Rob Davis, Iain Bate, Neil Audsley, James Harbin, Piotr Dziurzanski, Amit Singh, Paris Mesidis, Adrian Racu, Norazizi Sayuti, Zheng Shi, Yunfeng Ma, Rosh Mendis, Bharath Sudev with funding from: EU (DreamCloud, T-CREST), EPSRC (MCC, MCCps)