High Level Synthesis Design Representation Intermediate - PDF document

High Level Synthesis Design Representation • Intermediate representation essential for efficient processing. – Input HDL behavioral descriptions translated into some canonical intermediate representation. • Language independent • Uniform view across CAD tools and users – Synthesis tools carry out transformations of the intermediate representation. CAD for VLSI 2 1

Scope of High Level Synthesis Verilog / VHDL Description Transformation Control and Data Flow Graph (CDFG) Scheduling Allocation FSM DataPath Controller Structure CAD for VLSI 3 Simple Transformation A = B + C; D = A * E; X = D – A; Stmt 1 Stmt 2 Stmt 3 Read B Read C Read A Read E Read D Read A + * – Write A Write D Write X CAD for VLSI 4 2

Read B Read C + Read E * Data Flow Graph – Write X CAD for VLSI 5 Transformation with Control/Data Flow case (C) 1: begin X = X + 3; A = X + 1; end 2: A = X + 5; default: A = X + Y; endcase CAD for VLSI 6 3

Data flow graph can be drawn similarly, consisting X = X + 3; A = X + 5; A = X + Y; A = X + 1; of “Read” and “Write” boxes, operation nodes, and muliplexers. Control Flow Graph CAD for VLSI 7 Another Example if (X == 0) A = B + C; D = B – C; else D = D – 1; CAD for VLSI 8 4

Read B Read C Read D 1 − − − − − − − − + Read A Read X 0 = 1 0 0 1 MUX Write D Write A CAD for VLSI 9 Compiler Transformations • Set of operations carried out on the intermediate representation. – Constant folding – Redundant operator elimination – Tree height transformation – Control flattening – Logic level transformation – Register-Transfer level transformation CAD for VLSI 10 5

Constant Folding Constant 4 Constant 12 Constant 16 + Write X Write X CAD for VLSI 11 Redundant Operator Elimination C = A * B; D = A * B; Read A Read B Read A Read B Read A Read B * * * Write C Write D Write C Write D CAD for VLSI 12 6

Tree Height Transformation a = b – c + d – e + f + g a a − − − − − − − − − − − − + b b − − − − + c + + d e + e + f g c d f g CAD for VLSI 13 Control Flattening CAD for VLSI 14 7

Logic Level Transformation Read A Read B Read A Read B NOT OR AND Write C OR Write C C = A + A ′ ′ B = A + B ′ ′ CAD for VLSI 15 High Level Synthesis PARTITIONING 8

Why Required? • Used in various steps of high level synthesis: – Scheduling – Allocation – Unit selection • The same techniques for partitioning are also used in physical design automation tools. – To be discussed later. CAD for VLSI 17 Component Partitioning • Given a netlist, create a partition which satisfies some objective function. – Clusters almost of equal sizes. – Minimum interconnection strength between clusters. • An example to illustrate the concept. CAD for VLSI 18 9

Cut 1 = 4 Cut 2 = 4 Size 1 = 15 Size 2 = 16 Size 3 = 17 CAD for VLSI 19 Behavioral Partitioning • With respect to Verilog, can be used when: – Multiple modules are instantiated in a top-level module description. • Each module becomes a partition. – Several concurrent “always” blocks are used. • Each “always” block becomes a partition. CAD for VLSI 20 10

Partitioning Techniques • Broadly two classes of algorithms: 1. Constructive • Random selection • Cluster growth • Hierarchical clustering 2. Iterative-improvement • Min-cut • Simulated annealing CAD for VLSI 21 Random Selection • Randomly select nodes one at a time and place them into clusters of fixed size, until the proper size is reached. • Repeat above procedure until all the nodes have been placed. • Quality/Performance: – Fast and easy to implement. – Generally produces poor results. – Usually used to generate the initial partitions for iterative placement algorithms. CAD for VLSI 22 11

Cluster Growth m : size of each cluster, V : set of nodes n = |V| / m ; for (i=1; i<=n; i++) { seed = vertex in V with maximum degree; V i = {seed}; V = V – {seed}; for (j=1; j<m; j++) { t = vertex in V maximally connected to V i ; V i = V i U {t}; V = V – {t}; } } CAD for VLSI 23 Hierarchical Clustering • Consider a set of objects and group them depending on some measure of closeness. – The two closest objects are clustered first, and considered to be a single object for further partitioning. – The process continues by grouping two individual objects, or an object or cluster with another cluster. – We stop when a single cluster is generated and a hierarchical cluster tree has been formed. • The tree can be cut in any way to get clusters. CAD for VLSI 24 12

Example v 1 v 1 v 241 v 2413 7 5 6 7 5 v 3 v 2 v 24 1 4 4 v 3 1 v 3 9 4 v 5 v 5 4 v 4 v 5 v 5 v 24135 CAD for VLSI 25 v 24135 v 2413 v 241 v 24 v 5 v 1 v 3 v 2 v 4 CAD for VLSI 26 13

Min-Cut Algorithm (Kernighan-Lin) • Basically a bisection algorithm. – The input graph is partitioned into two subsets of equal sizes. • Till the cutsets keep improving: – Vertex pairs which give the largest decrease in cutsize are exchanged. – These vertices are then locked. – If no improvement is possible and some vertices are still unlocked, the vertices which give the smallest increase are exchanged. CAD for VLSI 27 Example 1 8 1 5 3 2 6 2 6 7 3 7 4 8 5 4 Initial Solution Final Solution CAD for VLSI 28 14

Steps of Execution 1 3 2 6 Choose 5 and 3 for exchange 5 7 4 8 CAD for VLSI 29 • Drawbacks of K-L Algorithm – It is not applicable for hyper-graphs. • It considers edges instead of hyper-edges. • It cannot handle arbitrarily weighted graphs. • Partition sizes have to be specified a priori. – Time complexity is high. • O(n 3 ). – It considers balanced partitions only. CAD for VLSI 30 15

Goldberg-Burstein Algorithm • Performance of K-L algorithm depends on the ratio R of edges to vertices. • K-L algorithm yields good bisections if R > 5. • For typical VLSI problems, 1.8 < R < 2.5. • The basic improvement attempted is to increase R. – Find a matching M in graph G. – Each edge in the matching is contracted to increase the density of the graph. – Any bisection algorithm is applied to the modified graph. – Edges are uncontracted within each partition. CAD for VLSI 31 Example of G-B Algorithm After Contracting Matching of Graph CAD for VLSI 32 16

Simulated Annealing • Iterative improvement algorithm. – Simulates the annealing process in metals. – Parameters: • Solution representation • Cost function • Moves • Termination condition • Randomized algorithm – To be discussed later. CAD for VLSI 33 High Level Synthesis SCHEDULING 17

What is Scheduling? • Task of assigning behavioral operators to control steps. – Input: • Control and Data Flow Graph (CDFG) – Output: • Temporal ordering of individual operations (FSM states) • Basic Objective: – Obtain the fastest design within constraints (exploit parallelism). CAD for VLSI 35 Example • Solving 2nd order differential equations (HAL) module HAL (x, dx, u, a, clock, y); input x, dx, u, a, clock; output y; always @(posedge clock) while (x < a) begin x1 = x + dx; u1 = u – (3 * x * u * dx) – (3 * y * dx); y1 = y + (u * dx); x = x1; u = u1; y = y1; end endmodule CAD for VLSI 36 18

CAD for VLSI 37 Scheduling Algorithms • Three popular algorithms: – As Soon As Possible (ASAP) – As Late As Possible (ALAP) – Resource Constrained (List scheduling) CAD for VLSI 38 19

As Soon As Possible (ASAP) • Generated from the DFG by a breadth-first search from the data sources to the sinks. – Starts with the highest nodes (that have no parents) in the DFG, and assigns time steps in increasing order as it proceeds downwards. – Follows the simple rule that a successor node can execute only after its parent has executed. • Fastest schedule possible – Requires least number of control steps. – Does not consider resource constraints. CAD for VLSI 39 ASAP Schedule for HAL v 1 v 2 v 3 v 4 v 10 * * * * + v 5 v 6 v 9 v 11 * * + < v 7 - v 8 - CAD for VLSI 40 20

As Late As Possible (ALAP) • Works very similar to the ALAP algorithm, except that it starts at the bottom of the DFG and proceeds upwards. • Usually gives a bad solution: – Slowest possible schedule (takes the maximum number of control steps). – Also does not necessarily reduce the number of functional units needed. CAD for VLSI 41 ALAP Schedule for HAL v 1 v 2 * * v 3 v 5 * * v 4 v 7 v 6 v 10 - * * + v 8 v 9 v 11 + < - CAD for VLSI 42 21

High Level Synthesis Design Representation Intermediate - PDF document

High Level Synthesis Design Representation Intermediate representation essential for efficient processing. Input HDL behavioral descriptions translated into some canonical intermediate representation. Language independent Uniform

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

Total Synthesis of the Polycyclic Total Synthesis of the Polycyclic Total Synthesis of the

Chemical Synthesis Techniques Chemical Synthesis Techniques Chemical Synthesis Techniques

UN High UN High UN High UN High- - - -Level Meeting on TB Level Meeting on TB Level Meeting

High-Level Synthesis Creating Custom Circuits from High-Level Code Hao Zheng Comp Sci & Eng

Towards Layout-Friendly High-Level Synthesis Jason Cong UCLA Bin Liu UCLA Peking University

Post-Synthesis Simulation VITAL Models, SDF Files, Timing Simulation Post-synthesis simulation

Synthesis of Carbon Synthesis of Carbon Nanotubes Nanotubes Polina Shifrina Supervisors: Dr.

Solid Texture Synthesis Solid Texture Synthesis Solid Texture Synthesis from 2D Exemplars from

Synthesis of Ranking Functions and Synthesis of Inductive Invariants and Synthesis of

Text-to-Image Generation Yu Cheng Text-to-Image Synthesis Text-to-Image Synthesis

Text-to-Speech Synthesis Bernd Mbius Language Science and Technology Saarland University

CTP431- Music and Audio Computing Sound Synthesis Graduate School of Culture Technology KAIST

Texture Synthesis Given a texture, create more CS176: Texture Synthesis All examples from Wei

Co-synthesis techniques for embedded systems embedded systems Kelvin Yuk June 5, 2002 EEC282 -

PowerWizard Level 1.0 & Level 2.0 Control Systems Training Systems Comparison Level 2

A high-level implementation of software pipelining in LLVM Roel Jordans 1 , David Moloney 2 1

Systems for Resource Management Corso di Sistemi e Architetture per Big Data A.A. 2019/2020

Multipath TCP Architecture: Towards Consensus Towards Consensus draft-ford-mptcp-architecture-01

Kick-Off: TDAQ Phase-II Upgrade - Overview Outline o High-level design o Effort and cost

Introduction to Coding in Python Fermilab - TARGET 2017 Week 1 Low to High Level Programing

Towards a High-Level Implementation of Flexible Parallelism Primitives for Symbolic Languages

Implementation of Lambda-Free Higher-Order Superposition Petar Vukmirovi Automatic theorem

Higher Order Logic for Syntacticians Carl Pollard Department of Linguistics Ohio State

High Level Synthesis Design Representation Intermediate - PDF document

High Level Synthesis Design Representation Intermediate representation essential for efficient processing. Input HDL behavioral descriptions translated into some canonical intermediate representation. Language independent Uniform

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

Total Synthesis of the Polycyclic Total Synthesis of the Polycyclic Total Synthesis of the

Chemical Synthesis Techniques Chemical Synthesis Techniques Chemical Synthesis Techniques

UN High UN High UN High UN High- - - -Level Meeting on TB Level Meeting on TB Level Meeting

High-Level Synthesis Creating Custom Circuits from High-Level Code Hao Zheng Comp Sci &amp; Eng

Towards Layout-Friendly High-Level Synthesis Jason Cong UCLA Bin Liu UCLA Peking University

Post-Synthesis Simulation VITAL Models, SDF Files, Timing Simulation Post-synthesis simulation

Synthesis of Carbon Synthesis of Carbon Nanotubes Nanotubes Polina Shifrina Supervisors: Dr.

Solid Texture Synthesis Solid Texture Synthesis Solid Texture Synthesis from 2D Exemplars from

Synthesis of Ranking Functions and Synthesis of Inductive Invariants and Synthesis of

Text-to-Image Generation Yu Cheng Text-to-Image Synthesis Text-to-Image Synthesis

Text-to-Speech Synthesis Bernd Mbius Language Science and Technology Saarland University

CTP431- Music and Audio Computing Sound Synthesis Graduate School of Culture Technology KAIST

Texture Synthesis Given a texture, create more CS176: Texture Synthesis All examples from Wei

Co-synthesis techniques for embedded systems embedded systems Kelvin Yuk June 5, 2002 EEC282 -

PowerWizard Level 1.0 &amp; Level 2.0 Control Systems Training Systems Comparison Level 2

A high-level implementation of software pipelining in LLVM Roel Jordans 1 , David Moloney 2 1

Systems for Resource Management Corso di Sistemi e Architetture per Big Data A.A. 2019/2020

Multipath TCP Architecture: Towards Consensus Towards Consensus draft-ford-mptcp-architecture-01

Kick-Off: TDAQ Phase-II Upgrade - Overview Outline o High-level design o Effort and cost

Introduction to Coding in Python Fermilab - TARGET 2017 Week 1 Low to High Level Programing

Towards a High-Level Implementation of Flexible Parallelism Primitives for Symbolic Languages

Implementation of Lambda-Free Higher-Order Superposition Petar Vukmirovi Automatic theorem

Higher Order Logic for Syntacticians Carl Pollard Department of Linguistics Ohio State

High-Level Synthesis Creating Custom Circuits from High-Level Code Hao Zheng Comp Sci & Eng

PowerWizard Level 1.0 & Level 2.0 Control Systems Training Systems Comparison Level 2