Computing the throughput of probabilistic and replicated streaming - - PowerPoint PPT Presentation

computing the throughput of probabilistic and replicated
SMART_READER_LITE
LIVE PREVIEW

Computing the throughput of probabilistic and replicated streaming - - PowerPoint PPT Presentation

Intro Framework TEGs Computing Comparison Conclusion Computing the throughput of probabilistic and replicated streaming applications Anne Benoit, Fanny Dufoss e, Matthieu Gallet, Bruno Gaujal and Yves Robert Laboratoire de


slide-1
SLIDE 1

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput of probabilistic and replicated streaming applications

Anne Benoit, Fanny Dufoss´ e, Matthieu Gallet, Bruno Gaujal and Yves Robert

Laboratoire de l’Informatique du Parall´ elisme ´ Ecole Normale Sup´ erieure de Lyon, France

Roma Working Group

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 1/ 39

slide-2
SLIDE 2

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 2/ 39

slide-3
SLIDE 3

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 3/ 39

slide-4
SLIDE 4

Intro Framework TEGs Computing Comparison Conclusion

Problem description

We are given (i) a streaming application, dependence graph = linear chain; (ii) a one-to-many mapping of appliction onto heterogeneous platform; (iii) a set of I.I.D. (Independent and Identically-Distributed) variables to model computation/communication time in the mapping. How can we compute the throughput of the application, i.e., the rate at which data sets can be processed? Two execution models: Strict and Overlap

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 4/ 39

slide-5
SLIDE 5

Intro Framework TEGs Computing Comparison Conclusion

Problem description

We are given (i) a streaming application, dependence graph = linear chain; (ii) a one-to-many mapping of appliction onto heterogeneous platform; (iii) a set of I.I.D. (Independent and Identically-Distributed) variables to model computation/communication time in the mapping. How can we compute the throughput of the application, i.e., the rate at which data sets can be processed? Two execution models: Strict and Overlap

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 4/ 39

slide-6
SLIDE 6

Intro Framework TEGs Computing Comparison Conclusion

Problem description

We are given (i) a streaming application, dependence graph = linear chain; (ii) a one-to-many mapping of appliction onto heterogeneous platform; (iii) a set of I.I.D. (Independent and Identically-Distributed) variables to model computation/communication time in the mapping. How can we compute the throughput of the application, i.e., the rate at which data sets can be processed? Two execution models: Strict and Overlap

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 4/ 39

slide-7
SLIDE 7

Intro Framework TEGs Computing Comparison Conclusion

Motivation

No replication, i.e., one-to-one mapping: throughput dictated by critical hardware resource With replication, deterministic case: surprisingly difficult! (remember previous work, cases with no critical resources) Contributions: (i) general method (exponential cost) to compute throughput with I.I.E. exponential laws; (ii) bounds for arbitrary I.I.E. and N.B.U.E. (New Better than Used in Expectation) variables: between exponential and deterministic values; (iii) the problem of finding the optimal mapping is NP-complete.

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 5/ 39

slide-8
SLIDE 8

Intro Framework TEGs Computing Comparison Conclusion

Motivation

No replication, i.e., one-to-one mapping: throughput dictated by critical hardware resource With replication, deterministic case: surprisingly difficult! (remember previous work, cases with no critical resources) Contributions: (i) general method (exponential cost) to compute throughput with I.I.E. exponential laws; (ii) bounds for arbitrary I.I.E. and N.B.U.E. (New Better than Used in Expectation) variables: between exponential and deterministic values; (iii) the problem of finding the optimal mapping is NP-complete.

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 5/ 39

slide-9
SLIDE 9

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 6/ 39

slide-10
SLIDE 10

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F1 F2 F0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-11
SLIDE 11

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-12
SLIDE 12

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-13
SLIDE 13

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-14
SLIDE 14

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-15
SLIDE 15

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-16
SLIDE 16

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-17
SLIDE 17

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-18
SLIDE 18

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-19
SLIDE 19

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-20
SLIDE 20

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-21
SLIDE 21

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-22
SLIDE 22

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-23
SLIDE 23

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-24
SLIDE 24

Intro Framework TEGs Computing Comparison Conclusion

Application

A linear workflow with many instances T1 T2 T3 T0 F0 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 7/ 39

slide-25
SLIDE 25

Intro Framework TEGs Computing Comparison Conclusion

Platform

A fully connected platform Heterogeneous processors and communication links

P1 P2 P3 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 8/ 39

slide-26
SLIDE 26

Intro Framework TEGs Computing Comparison Conclusion

Platform

A fully connected platform Heterogeneous processors and communication links

P0 P1 P2 P3

b1,2 b1,3 b0,3 b2,3

s2 s3

b0,1

s0

b0,2

s1

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 8/ 39

slide-27
SLIDE 27

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P0 P2 P4 P6 P1 R3 = 1 R2 = 3 R1 = 2 P3 R0 = 1 P5 T2 T1 F0 F1 F2 T3 T0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-28
SLIDE 28

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P3 P2 P4 P6 P1 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P5 F2 T1 T0 T3 T2 F0 F1

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-29
SLIDE 29

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P3 P2 P4 P6 P1 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P5 F2 T1 T0 T3 T2 F0 F1

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-30
SLIDE 30

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P3 P2 P4 P6 P1 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P5 F2 T1 T0 T3 T2 F0 F1

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-31
SLIDE 31

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P3 P2 P4 P6 P1 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P5 F2 T1 T0 T3 T2 F0 F1

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-32
SLIDE 32

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-33
SLIDE 33

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-34
SLIDE 34

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-35
SLIDE 35

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-36
SLIDE 36

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-37
SLIDE 37

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-38
SLIDE 38

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

P6 R3 = 1 R2 = 3 R0 = 1 R1 = 2 P0 P3 P5 P2 P4 P1 F0 T0 T3 T2 T1 F1 F2

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-39
SLIDE 39

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task

Input data Path in the system P0 → P1 → P3 → P6 1 P0 → P2 → P4 → P6 2 P0 → P1 → P5 → P6 3 P0 → P2 → P3 → P6 4 P0 → P1 → P4 → P6 5 P0 → P2 → P5 → P6 6 P0 → P1 → P3 → P6 7 P0 → P2 → P4 → P6

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-40
SLIDE 40

Intro Framework TEGs Computing Comparison Conclusion

Mapping

A processor processes at most 1 task A task is mapped on possibly many processors Replication count of Ti: Ri Round-Robin distribution of each task Theorem Assume that stage Ti is mapped onto Ri distinct processors. Then the number of paths is equal to R = lcm (R0, . . . , Rn−1).

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 9/ 39

slide-41
SLIDE 41

Intro Framework TEGs Computing Comparison Conclusion

Communication models

Strict: receptions, computations and transmissions are sequential Overlap:

  • verlap of computations by communications

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 10/ 39

slide-42
SLIDE 42

Intro Framework TEGs Computing Comparison Conclusion

Communication models

Strict: receptions, computations and transmissions are sequential Overlap:

  • verlap of computations by communications

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 10/ 39

slide-43
SLIDE 43

Intro Framework TEGs Computing Comparison Conclusion

Random variables

Xi(n): time required by Pi to process its n-th data set Yi,j(n): time required by Pi to send its n-th file to Pj Deterministic case Exponential variables I.I.D.: Independent and Identically-Distributed variables N.B.U.E.: New Better than Used in Expectation variables

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 11/ 39

slide-44
SLIDE 44

Intro Framework TEGs Computing Comparison Conclusion

Random variables

Xi(n): time required by Pi to process its n-th data set Yi,j(n): time required by Pi to send its n-th file to Pj Deterministic case Exponential variables I.I.D.: Independent and Identically-Distributed variables N.B.U.E.: New Better than Used in Expectation variables

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 11/ 39

slide-45
SLIDE 45

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 12/ 39

slide-46
SLIDE 46

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-47
SLIDE 47

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-48
SLIDE 48

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-49
SLIDE 49

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-50
SLIDE 50

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-51
SLIDE 51

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-52
SLIDE 52

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-53
SLIDE 53

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-54
SLIDE 54

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-55
SLIDE 55

Intro Framework TEGs Computing Comparison Conclusion

Short presentation of Timed Event Graphs (TEGs)

Some transitions Some places Connections between transitions and places. . . and between places and transitions Some tokens allowing transitions to be fired Time between the consumption of the input tokens and the creation of the output tokens

τ2 τ3 τ4 τ1

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 13/ 39

slide-56
SLIDE 56

Intro Framework TEGs Computing Comparison Conclusion

Timed Event Graph model

Transitions: communications and computations Places: dependences between two successive operations Each path followed by the input data must be fully developed in the TEG Exponential size of the TEG

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 14/ 39

slide-57
SLIDE 57

Intro Framework TEGs Computing Comparison Conclusion

Overlap model

A communication cannot begin before the end of the computation

T1 T2 T3 T0 F0 F1 F2

P6 P0 P6 P0 P6 P3 P0 P6 P4 P5 P4 P5 P1 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 15/ 39

slide-58
SLIDE 58

Intro Framework TEGs Computing Comparison Conclusion

Overlap model

A computation cannot begin before the end of the communication

T1 T2 T3 T0 F0 F1 F2

P6 P0 P6 P0 P6 P3 P0 P6 P4 P5 P4 P5 P1 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 15/ 39

slide-59
SLIDE 59

Intro Framework TEGs Computing Comparison Conclusion

Overlap model

Dependences due to the round-robin distribution of computations

T2 T1 T3 T0 F0 F2 F1

P6 P3 P0 P6 P4 P5 P4 P5 P1 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 15/ 39

slide-60
SLIDE 60

Intro Framework TEGs Computing Comparison Conclusion

Overlap model

Dependences due to the round-robin distribution of outgoing communications

T3 T0 T1 T2 F1 F2 F0

P0 P6 P4 P5 P4 P5 P1 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0 P6 P3

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 15/ 39

slide-61
SLIDE 61

Intro Framework TEGs Computing Comparison Conclusion

Overlap model

Dependences due to the round-robin distribution of incoming communications

T1 T2 T3 T0 F0 F1 F2

P6 P0 P6 P0 P6 P3 P0 P6 P4 P5 P4 P5 P1 P1 P2 P2 P1 P2 P0 P6 P0 P6 P3 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 15/ 39

slide-62
SLIDE 62

Intro Framework TEGs Computing Comparison Conclusion

Overlap model

All dependences!

T2 T0 T1 T3 F1 F2 F0

P3 P4 P5 P4 P5 P6 P1 P0 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0 P6

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 15/ 39

slide-63
SLIDE 63

Intro Framework TEGs Computing Comparison Conclusion

Strict model

Dependences between communications and computations

T1 T2 T3 T0 F0 F1 F2

P6 P0 P6 P0 P6 P3 P0 P6 P4 P5 P4 P5 P1 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 16/ 39

slide-64
SLIDE 64

Intro Framework TEGs Computing Comparison Conclusion

Strict model

Dependences due to the Strict model

T1 T2 T0 T3 F2 F1 F0

P6 P4 P5 P4 P5 P1 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0 P6 P3 P0

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 16/ 39

slide-65
SLIDE 65

Intro Framework TEGs Computing Comparison Conclusion

Strict model

All dependences!

T1 T3 T0 T2 F0 F1 F2

P4 P5 P4 P6 P0 P3 P6 P0 P6 P6 P0 P0 P3 P6 P0 P6 P0 P1 P2 P2 P2 P1 P1 P5

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 16/ 39

slide-66
SLIDE 66

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 17/ 39

slide-67
SLIDE 67

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – deterministic case

Equivalent to find critical cycles C is a cycle of the TEG L(C) is its length (total time of transitions) t(C) is the total number of tokens in places traversed by C A critical cycle achieves the largest ratio maxCcycle

L(C) t(C)

This ratio gives the period P of the system Can be computed in time O(M3R3) (R = lcm (R0, . . . , RM−1))

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 18/ 39

slide-68
SLIDE 68

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – deterministic case

(previous result) Strict model: the TEG has an exponential size! Overlap model: Theorem Consider a pipeline of M stages T0, . . . , TM−1, such that stage Ti is mapped onto Ri distinct processors. Then the average throughput of this system can be computed in time O M−2

i=0

  • (RiRi+1)3

.

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 19/ 39

slide-69
SLIDE 69

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

General case: Theorem Let us consider a system formed by the mapping of an application

  • nto a platform. Then the throughput can be computed in time

O

  • exp(R)3

.

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 20/ 39

slide-70
SLIDE 70

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

General case: model the system by a timed event graph Exponential in the size of the system transform this timed event graph into a Markov chain Exponential in the size of the TEG compute the stationary measure of this Markov chain derive the throughput from the marginals of the stationary measure

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 21/ 39

slide-71
SLIDE 71

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

General case: model the system by a timed event graph Exponential in the size of the system transform this timed event graph into a Markov chain Exponential in the size of the TEG compute the stationary measure of this Markov chain derive the throughput from the marginals of the stationary measure

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 21/ 39

slide-72
SLIDE 72

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

General case: model the system by a timed event graph Exponential in the size of the system transform this timed event graph into a Markov chain Exponential in the size of the TEG compute the stationary measure of this Markov chain derive the throughput from the marginals of the stationary measure

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 21/ 39

slide-73
SLIDE 73

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Transformation into a Markov chain: each marking of the TEG becomes a state

T2 T0 T1 T3 F1 F2 F0

P3 P4 P5 P4 P5 P6 P1 P0 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0 P6

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 22/ 39

slide-74
SLIDE 74

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Transformation into a Markov chain: each marking of the TEG becomes a state

c d f e a b

P6 P4 P12 P8 P7 P10 P9 P11 P2 P3 P1 P5

P3 P4 P5 P6 P2

T3 T2

F2

P2 P2 P3 P3 P3 P2 P4 P4 P5 P6 P5 P6

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 23/ 39

slide-75
SLIDE 75

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Transformation into a Markov chain: list of all possible states

(0, 1, 0, 1, 0, 0, 0, 1, 1, 0, 1, 0) (1, 0, 0, 1, 0, 0, 0, 1, 0, 1, 0, 1) (0, 0, 1, 1, 0, 0, 0, 1, 0, 1, 1, 0) (1, 0, 0, 1, 0, 0, 1, 0, 1, 0, 1, 0) (0, 1, 0, 0, 1, 0, 0, 1, 0, 1, 0, 1) (0, 0, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0) (0, 0, 1, 0, 0, 1, 0, 1, 0, 1, 0, 1) (0, 1, 0, 0, 0, 1, 0, 1, 1, 0, 0, 1) (0, 1, 0, 0, 1, 0, 1, 0, 1, 0, 1, 0) (0, 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 0) (1, 0, 0, 0, 0, 1, 1, 0, 1, 0, 0, 1) (1, 0, 0, 0, 1, 0, 1, 0, 0, 1, 0, 1)

d

d d b a f f b a c f a b e e e c c

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 24/ 39

slide-76
SLIDE 76

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Theorem Let us consider a system formed by the mapping of an application

  • nto a platform. Then the throughput can be computed in time

O

  • N exp( max

1≤i≤N(Ri))3

  • .

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 25/ 39

slide-77
SLIDE 77

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: split the timed event graph into columns Ci, with 1 ≤ i ≤ 2N − 1 separately consider each column Ci separately consider each connected component Dj of Ci single component Dj: many copies of the same pattern Pj, of size uj × vj transform Pj into a Markov chain Mj determine a stationary measure of Mj compute the throughput of Pj in isolation combine the inner throughputs of all components to get the global throughput of the system

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 26/ 39

slide-78
SLIDE 78

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: split the timed event graph into columns Ci, with 1 ≤ i ≤ 2N − 1 separately consider each column Ci separately consider each connected component Dj of Ci single component Dj: many copies of the same pattern Pj, of size uj × vj transform Pj into a Markov chain Mj determine a stationary measure of Mj compute the throughput of Pj in isolation combine the inner throughputs of all components to get the global throughput of the system

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 26/ 39

slide-79
SLIDE 79

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Communication column:

T2 T0 T1 T3 F1 F2 F0

P3 P4 P5 P4 P5 P6 P1 P0 P1 P2 P2 P2 P1 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0 P6

R0 = 5, R1 = 21, R2 = 27, R3 = 11

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 27/ 39

slide-80
SLIDE 80

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Communication column:

F0 F1 F2

P1 P4 P5 P1 P1 P2 P2 P2 P0 P6 P0 P6 P3 P0 P6 P0 P6 P0 P6 P3 P0 P6 P4 P5

R0 = 5, R1 = 21, R2 = 27, R3 = 11

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 27/ 39

slide-81
SLIDE 81

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Communication column:

9 columns 7 rows 55 patterns

R0 = 5, R1 = 21, R2 = 27, R3 = 11

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 27/ 39

slide-82
SLIDE 82

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Communication column:

7 rows 9 columns

R0 = 5, R1 = 21, R2 = 27, R3 = 11

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 27/ 39

slide-83
SLIDE 83

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: split the timed event graph into columns Ci, with 1 ≤ i ≤ 2N − 1 separately consider each column Ci separately consider each connected component Dj of Ci single component Dj: many copies of the same pattern Pj, of size uj × vj transform Pj into a Markov chain Mj determine a stationary measure of Mj compute the throughput of Pj in isolation combine the inner throughputs of all components to get the global throughput of the system

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 28/ 39

slide-84
SLIDE 84

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Representation of a valid marking on the TEG

i u j Fired k + 1 times v Fired k − 1 times Fired k times

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 29/ 39

slide-85
SLIDE 85

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Representation of a valid marking with Young diagrams

u − i v − j (v, 0) i (0, 0) j (0, v) (u, v)

⇒ Number of states easily determined Exponential number of states in each connected component

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 30/ 39

slide-86
SLIDE 86

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model, homogeneous communication network: Theorem Let us consider a system formed by the mapping of an application

  • nto a platform, following the Overlap communication model with

a homogeneous communication network. Then the throughput can be computed in polynomial time.

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 31/ 39

slide-87
SLIDE 87

Intro Framework TEGs Computing Comparison Conclusion

Computing the throughput – exponential laws

Overlap model: Reachable states from a given position

(0, v) (u, v) u − i v − j (v, 0) i (0, 0) j

⇒ Same number of incoming and outgoing states + Same firing rate (homogeneous communication network) = Invariant measure is uniform

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 32/ 39

slide-88
SLIDE 88

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 33/ 39

slide-89
SLIDE 89

Intro Framework TEGs Computing Comparison Conclusion

Comparison between two systems

Theorem Consider two systems (X (1), Y (1)) and (X (2), Y (2)). If we have for all n, ∀1 ≤ p ≤ M, X (1)

p (n) ≤st X (2) p (n) and

∀1 ≤ p, q ≤ M, Y (1)

p,q (n) ≤st Y (2) p,q (n), then ρ(1) ≥ ρ(2).

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 34/ 39

slide-90
SLIDE 90

Intro Framework TEGs Computing Comparison Conclusion

Comparison between two systems with I.I.D. laws

Theorem Let us consider two systems with I.I.D. communication and processing times (X (1), Y (1)) and (X (2), Y (2)). If we have for all n, ∀1 ≤ p ≤ M, X (1)

p (n) ≤icx X (2) p (n) and

∀1 ≤ p, q ≤ M, Y (1)

p,q (n) ≤icx Y (2) p,q (n), then ρ(1) ≥ ρ(2).

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 35/ 39

slide-91
SLIDE 91

Intro Framework TEGs Computing Comparison Conclusion

Bounds on the expected throughput

Theorem Let us consider any system (X (1), Y (1)), such that X (1)

p (n) and

Y (1)

p,q (n) are N.B.U.E.. Let us also consider two new systems

(X (2), Y (2)) and (X (3), Y (3)) such that: ∀1 ≤ p ≤ M, X (2)

p (n) has an exponential distribution, and

E[X (2)

p (n)] = E[X (1) p (n)],

∀1 ≤ p, q ≤ M, Y (2)

p,q (n) has an exponential distribution, and

E[Y (2)

p,q (n)] = E[Y (1) p,q (n)],

∀1 ≤ p ≤ M, X (3)

p (n) is deterministic and for all n,

X (3)

p (n) = E[X (1) p (n)],

∀1 ≤ p, q ≤ M, Y (3)

p,q (n) is deterministic and for all n,

Y (3)

p,q (n) = E[Y (1) p,q (n)].

Then we have: ρ(3) ≥ ρ(1) ≥ ρ(2).

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 36/ 39

slide-92
SLIDE 92

Intro Framework TEGs Computing Comparison Conclusion

Numerical experiments

Evolution of the measured throughput with the number of samples

Exponential laws 1.115 1.12 1.125 1.13 1.135 1.14 1.145 1.15 1.155 100 1000 10000 100000 1e+06 Throughput Number of events Constant values 1.11

Distribution Constant Exponential Uniform Uniform Pareto value c mean c c/2 - 3c/2 c/10 - 19c/10 mean c Throughput 2.0299 2.0314 2.0304 2.0305 2.0300

Table: Throughput obtained with several distributions of same mean.

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 37/ 39

slide-93
SLIDE 93

Intro Framework TEGs Computing Comparison Conclusion

Outline

1

Introduction

2

Framework

3

Timed Event Graphs

4

Computing the throughput

5

Comparison results

6

Conclusion

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 38/ 39

slide-94
SLIDE 94

Intro Framework TEGs Computing Comparison Conclusion

Conclusion and future work

Even if the mapping is given, the throughput is hard to determine Expectation of the throughput can be computed in many cases:

General case with exponential laws: exponential time Overlap model with exponential laws: smaller exponential time Overlap model, homogeneous communications: polynomial time General case, N.B.U.E. laws: bounds can be established

Determining the mapping that maximizes the throughput is an NP-complete problem, even in the simpler deterministic case with no communication costs Future work: Design efficient mapping heuristics

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 39/ 39

slide-95
SLIDE 95

Intro Framework TEGs Computing Comparison Conclusion

Conclusion and future work

Even if the mapping is given, the throughput is hard to determine Expectation of the throughput can be computed in many cases:

General case with exponential laws: exponential time Overlap model with exponential laws: smaller exponential time Overlap model, homogeneous communications: polynomial time General case, N.B.U.E. laws: bounds can be established

Determining the mapping that maximizes the throughput is an NP-complete problem, even in the simpler deterministic case with no communication costs Future work: Design efficient mapping heuristics

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 39/ 39

slide-96
SLIDE 96

Intro Framework TEGs Computing Comparison Conclusion

Conclusion and future work

Even if the mapping is given, the throughput is hard to determine Expectation of the throughput can be computed in many cases:

General case with exponential laws: exponential time Overlap model with exponential laws: smaller exponential time Overlap model, homogeneous communications: polynomial time General case, N.B.U.E. laws: bounds can be established

Determining the mapping that maximizes the throughput is an NP-complete problem, even in the simpler deterministic case with no communication costs Future work: Design efficient mapping heuristics

Matthieu Gallet Roma Working Group, April 2010 Computing the throughput, probabilistic and replicated 39/ 39