Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella - PowerPoint PPT Presentation

Elastic channel and its protocol not Valid Valid * Stop Retry Idle Sender Receiver Data Valid * not Stop Valid Transfer Stop 60

Elastic channel protocol Sender Receiver * D D * C C C B * A Data Data 0 1 1 0 1 1 1 1 0 1 Valid Valid 0 0 1 0 0 1 1 0 0 0 Stop Stop Transfer Retry Idle 61

Basic VS block En i En i V i-1 V i V i-1 V i VS S i-1 S i S i-1 S i VS block + data-path latch = elastic HALF-buffer (EHB) EHB + EHB = elastic buffer with capacity 2 62

Control specification of the EB 63

Two implementations 64

Elastic buffer keeps data while stop is in flight EBs = FIFOs with two parameters: Forward latency Capacity W2R1 Backward latency for stop propagation assumed (but need not be) equal to fwd latency W1R2 Typical case: (1,2) - 1 cycle forward latency with capacity of 2 W2R2 Replaces “normal” registers Decoupling buffers W1R1 Cannot be done with Single Edge Flops without double pumping W1R1 Can use latches inside Master-Slave as shown before 65

Join + V 1 V VS S 1 VS S V 2 S 2 VS 66

(Lazy) Fork V V 1 S 1 V 2 S S 2 67

Eager Fork S 1 ^ V 1 V V 2 ^ S S 2 68

Eager fork (another implementation) VS VS VS VS VS 69

Variable Latency Units [0 - k] cycles done clear go V/S V/S 70

Coarse grain control 71

Elasticization Synchronous Elastic 72

CLK 73

FORK IF/ID ID/EX EX/MEM MEM/WB F J O PC O R I J K N O I N CLK 74

FORK J F J V V V V V O O O I R I S S S S S N N K CLK 75

0 FORK 0 J F J 1 1 1 1 1 O O O I R I 0 0 0 0 0 N N K CLK 76

Elastic control layer 1 1 1 1 1 Generation of gated clocks 0 0 0 0 0 CLK 77

Equivalence Synchronous: stream of data D: a b c d e d f g h i j … SELF: elastic stream of data D: a * b * * c d e * d f * g h * * i j … V: 1 0 1 1 0 1 1 1 1 1 1 1 1 1 0 1 1 1 … S: 0 0 0 1 0 0 0 0 1 0 0 1 0 0 0 1 0 0 … Transfer sub-stream = original stream Called: transfer equivalence, flow equivalence, or latency equivalence 78

Marked Graph models of elastic systems 79

Modelling elastic control with Petri nets data-token bubble data-token bubble 80

Modelling elastic control with Petri nets bubble data-token 2 data-tokens Hiding internal transitions of elastic buffers 81

Modelling elastic control with Marked Graphs 82

Modelling elastic control with Marked Graphs Forward (Valid or Request) Backward (Stop or Acknowledgement) 83

Elastic control with Timed Marked Graphs. Continuous time = asynchronous d=151ps d=250ps Delays in time units 250 151 84

Elastic control with Timed Marked Graphs. Discrete time = synchronous elastic d=1 d=1 Latencies in clock cycles 1 1 85

Elastic control with Timed Marked Graphs. Discrete time. Multi-cycle operation d=1 d=2 2 1 86

Elastic control with Timed Marked Graphs. Discrete time. Variable latency operation ∈ d=1 d {1,2} e.g. discrete probabilistic distribution: average latency 0.8*1 + 0.2*2 = 1.2 {1,2} 1 87

Modeling forks and joins d=1 1 88

Modelling combinational elastic blocks d=0 d=1 1 0 89

Elastic Marked Graphs An Elastic Marked Graph (EMG) is a Timed MG such that for any arc a there exists a complementary arc a’ satisfying the following condition • a = a ’• and • a ’ = a • Initial number of tokens on a and a’ (M 0 (a)+M 0 (a’)) = capacity of the corresponding elastic buffer Similar forms of “pipelined” Petri Nets and Marked Graphs have been previously used for modeling pipelining in HW and SW (e.g. Patil 1974; Tsirlin, Rosenblum 1982) 90

Reminder: Performance analysis of Marked graphs Th = operations / cycle = number of firings per time unit The throughput is given by the Th=min(Th(A), Th(B), Th(C))=2/5 minimum mean-weight cycle Th(A)=3/7 Th(B)=3/5 A B C Th(C)=2/5 Efficient algorithms: (Karp 1978), (Dasdan,Gupta 1998) 91

Early evaluation Naïve solution: introduce choice places – issue tokens at choice node only into one (some) relevant path – problem: tokens can arrive to merge nodes out-of-order later token can overpass the earlier one Solution: change enabling rule – early evaluation – issue negative tokens to input places without tokens, i.e. keep the same firing rule – Add symmetric sub-channels with negative tokens – Negative tokens kill positive tokens when meet Two related problems: Early evaluation and Exceptions (how to kill a data-token) 92

Examples of early evaluation MULTIPLEXOR a T c if s = T then c := a -- don’t wait for b b F else c := b -- don’t wait for a s MULTIPLIER a c if a = 0 then c := 0 -- don’t wait for b * b 93

Related work Petri nets – Extensions to model OR causality Kishinevsky et al. Change Diagrams [e.g. book of 1994] Yakovlev et al. Causal Nets 1996 Asynchronous systems – Reese et al 2002: Early evaluation – Brej 2003: Early evaluation with anti-tokens – Ampalan & Singh 2006: preemption using anti-tokens 94

Dual Marked Graph Marking: Arcs (places) −> Z (allow negative markings) Some nodes are labeled as early-enabling Enabling rules for a node: – Positive enabling: M(a) > 0 for every input arc – Early enabling (for early enabling nodes): M(a) > 0 for some input arcs – Negative enabling: M(a) < 0 for every output arc Firing rule: the same as in regular MG 95

Dual Marked Graphs Early enabling can be associated with an external guard that depends on data variables (e.g., a select signal of a multiplexor) Actual enabling guards are abstracted away (unless needed) Anti-token generation: When an early enabled node fires, it generates anti-tokens in the predecessor arcs that had no tokens Anti-token propagation counterflow: When negative enabled node fires, it propagates the anti-tokens from the successor to the predecessor arcs 96

Dual Marked Graph model -1 -1 -1 -1 -1 Enabled ! 97

Passive anti-token Passive DMG = version of DMG without negative enabling Negative tokens can only be generated due to early enabling, but cannot propagate Let D be a strongly connected DMG such that all cycles have positive cumulative marking Let D p be a corresponding passive DMG . If environment (consumers) never generate negative tokens, and there are no multi-cycle operations then throughput ( D ) = throughput ( D p ) – If capacity of input places for early enabling transitions is unlimited, then active anti-tokens do not improve performance – Active anti-tokens reduce activity in the data-path (good for power reduction) 98

Properties of DMGs Firing invariant: Let node n be simultaneously positive (early) and negative enabled in marking M. Let M 1 be the result of firing n from M due to positive (early) enabling. Let M 2 be the result of firing n from M due to negative enabling. Then, M 1 = M 2 Token preservation. Let c be a cycle of a strongly connected DMG with initial marking M 0 . For every reachable marking M : M(c) = M 0 (c) Liveness. A strongly connected passive DMG is live iff for every cycle c: M(c) > 0. – For DMGs this is a sufficient condition of liveness – It is also a necessary condition for positive liveness Repetitive behavior. In a SC DMG: a firing sequence s from M leads to the same marking iff every node fires in s the same number of times DMGs have properties similar to regular MGs 99

Implementing early enabling 100

Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella - PowerPoint PPT Presentation

Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella Intel Universitat Politecnica Strategic CAD Labs de Catalunya Hillsboro, USA Barcelona, Spain DAC Summer School July 26, 2009 Contributors to SELF research

Synchronous Elastic Systems Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella

Synchronous Grammars Synchronous grammars are a way of simultaneously generating pairs of

Bibliography on Synchronous Elastic (aka Latency Insensitive) Systems July 20, 2009 Latency

Using Kieker with Elastic APM: An Experience Report Valentin Seifermann Duan Okanovi SSP

HW/SW Codesign w/ FPGAs Data Flow Modeling II ECE 522 Synchronous Data Flow Graphs Synchronous

Monitor your containers with the Elastic Stack Monica Sarbu Monica Sarbu Team lead, Beats team

Outline Synchronous Programming Introduction of Reactive Systems The Data-Flow Language Lustre

Synchronous Computations, Basic techniques (Secs. 6.1-6.2) T-79.4001 Seminar on Theoretical

A Theory of A Theory of Elastic Presentation Space Elastic Presentation Space Sheelagh

A Theory of A Theory of Elastic Presentation Space Elastic Presentation Space Sheelagh

Elastic Search - Aditi Choksi (EW18455) Elastic Search Search engine Distributed

Unconstrained Elastic Matching Unconstrained Elastic Matching and Eigen Eigen- -Deformations

Trapped Modes in Elastic Media for Zero Poisson Coefficient Three-dimensional elastic plate with

Transparent boundary conditions for the elastic Transparent boundary conditions for the elastic

Notes 1D Elastic Continuum From last class: elastic rod Required reading: linear

1 in a Nutshell 2019 Pass the S ALT Workshop Overview 2 Introduction to Elastic S tack

Course Information Homepage: http://www.ccs.neu.edu/home/mirek/classes/ CS 6240: Parallel

MAY COMMUNITY MEETING BOARD UPDATE The TC Business Architecture Forum THE TWIN CITIES BUSINESS

Session State: Beyond Soft State Benjamin C. Ling, Emre Kcman and Armando Fox Computer

Internet Server Clusters Internet Server Clusters Jeff Chase Duke University, Department of

The NS (v2) Sim ulator W orkshop br ought to you by Kevin F all La wrence Berk

CSE Seminar Series Networks and Distributed Systems (+ Data as bonus - mainly Big)

Vulnerable Populations PRESENTED BY ATHENA BREWER POLITICS, PHILOSOPHY, & ECONOMICS MAJOR

18 More Package Management, and some distros CS 2043: Unix Tools and Scripting, Spring 2019

Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella - PowerPoint PPT Presentation

Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella Intel Universitat Politecnica Strategic CAD Labs de Catalunya Hillsboro, USA Barcelona, Spain DAC Summer School July 26, 2009 Contributors to SELF research

Synchronous Elastic Systems Synchronous Elastic Systems Mike Kishinevsky and Jordi Cortadella

Synchronous Grammars Synchronous grammars are a way of simultaneously generating pairs of

Bibliography on Synchronous Elastic (aka Latency Insensitive) Systems July 20, 2009 Latency

Using Kieker with Elastic APM: An Experience Report Valentin Seifermann Duan Okanovi SSP

HW/SW Codesign w/ FPGAs Data Flow Modeling II ECE 522 Synchronous Data Flow Graphs Synchronous

Monitor your containers with the Elastic Stack Monica Sarbu Monica Sarbu Team lead, Beats team

Outline Synchronous Programming Introduction of Reactive Systems The Data-Flow Language Lustre

Synchronous Computations, Basic techniques (Secs. 6.1-6.2) T-79.4001 Seminar on Theoretical

A Theory of A Theory of Elastic Presentation Space Elastic Presentation Space Sheelagh

A Theory of A Theory of Elastic Presentation Space Elastic Presentation Space Sheelagh

Elastic Search - Aditi Choksi (EW18455) Elastic Search Search engine Distributed

Unconstrained Elastic Matching Unconstrained Elastic Matching and Eigen Eigen- -Deformations

Trapped Modes in Elastic Media for Zero Poisson Coefficient Three-dimensional elastic plate with

Transparent boundary conditions for the elastic Transparent boundary conditions for the elastic

Notes 1D Elastic Continuum From last class: elastic rod Required reading: linear

1 in a Nutshell 2019 Pass the S ALT Workshop Overview 2 Introduction to Elastic S tack

Course Information Homepage: http://www.ccs.neu.edu/home/mirek/classes/ CS 6240: Parallel

MAY COMMUNITY MEETING BOARD UPDATE The TC Business Architecture Forum THE TWIN CITIES BUSINESS

Session State: Beyond Soft State Benjamin C. Ling, Emre Kcman and Armando Fox Computer

Internet Server Clusters Internet Server Clusters Jeff Chase Duke University, Department of

The NS (v2) Sim ulator W orkshop br ought to you by Kevin F all La wrence Berk

CSE Seminar Series Networks and Distributed Systems (+ Data as bonus - mainly Big)

Vulnerable Populations PRESENTED BY ATHENA BREWER POLITICS, PHILOSOPHY, &amp; ECONOMICS MAJOR

18 More Package Management, and some distros CS 2043: Unix Tools and Scripting, Spring 2019

Vulnerable Populations PRESENTED BY ATHENA BREWER POLITICS, PHILOSOPHY, & ECONOMICS MAJOR