Partitioning Introduction to Partitioning Mahapatra-Texas - PDF document

Partitioning Introduction to Partitioning Mahapatra-Texas A&M-Spring’02 1 System partitioning System level partitioning problem ⇓ Assignment of operations to hardware or software • Assignment of an operation to HW or SW determines the delay of the operation • Assignment of operation to a processor and to more application-specific HW circuits involve additional delays due to communication overhead. Good partitioning scheme ⇒ Minimize this communication Mahapatra-Texas A&M-Spring’02 2 1

System partitioning contd.. • Increasing operations in software on a single processor ⇒ increases processor utilization • system performance : depends on hw-sw partition on utilization of processor and bandwidth of bus between processor and application specific hardware. • Characteristic of Partitioning scheme : capture and make use of partition’s effect on system performance in making trade-off between hw and sw implementation of an operation. – Devise a “ partition cost function ”. Mahapatra-Texas A&M-Spring’02 3 Partitioning • Cost function: – Directs the partitioning algorithm towards desired solution • optimum solution is minimum cost function • Need to capture: – effects of size of hw/sw parts – effects on timing behavior of these portions on cost function ( contrast: optimized area/pinout) • difficult due to the problem being global in nature • approximation is used to account the effect on total latency Mahapatra-Texas A&M-Spring’02 4 2

Partitioning • Partitioning in software : extensive use of statistical timing properties to drive partitioning algorithm. – Dynamic or runtime, excess time.., flexible • Partitioning in hardware : attempts to divide circuits that implement schedule operations. – Static, less time, non-flexible • An intermediate approach is advised: incrementally computable of cost function f. – partial, deterministic bound on timing properties, Mahapatra-Texas A&M-Spring’02 5 Timing properties in partition cost function Timing properties in partition cost function Statistical Program- level Deterministic Hardware- bounds software None Logic-level Static Partial Dynamic Scheduling flexibility Mahapatra-Texas A&M-Spring’02 6 3

A Partitioning cost function • Consider software model in terms of set of program threads and cost function f. ρ 1 ρ 2 ρ 3 r1 r2 ASIC λ 1 λ 2 λ 3 r3 Bus – where, λ i (per second) is thread latency: execution delay – ρ i (per second) thread reaction rate: invocation rate of the program thread n • processor utilization P is calculated by P = Σ λ i ρ i m i=1 • Bus utilization B (per second) = Σ r j “ m variables to be transferred, j=1 rj = inverse of minimum time interval between consecutive samples for 7 variable r ”. Partition cost function • Software characterization using λ , ρ , P and B: static bound – can be used to select appropriate partition of system functionality between hardware and software. • over estimation of processor and bus bandwidth is possible (since actual distribution of data communication is not captured above) • Include S H ( hardware size ) bottom up. – From the size estimates of the resources implementing operations • Characterize interface using set of communication ports (one per variable) – overhead due to communication between hw and sw is manifested by the utilization of bus bandwidth. Mahapatra-Texas A&M-Spring’02 8 4

Partitioning with cost function • From a given set of sequencing graph models and timing constraints, create two sets of sequencing graph models such that one can be implemented in hw and the other in sw and the following is true: – timing constraints are satisfied for the two sets of graph models – processor utilization, P ≤ 1 – bus utilization, B ≤ B’ – A partition cost function, f = f (S H, , B, P -1 , m) is minimized. Mahapatra-Texas A&M-Spring’02 9 Partitioning using heuristics • Minimum cost function can be achieved by trying very large number of solutions ( exponential relation to number of operations ) ⇒ heuristics are used for good solution that may show minimum cost function for some local properties • Start with constructive initial solution on which iterative procedure can be applied to improve the solution – exchange operations or paths between partitions, apply procedure • A good heuristic is relatively insensitive to initial solution – exchange of large number of operation makes it more insensitive to starting solution Mahapatra-Texas A&M-Spring’02 10 5

Partitioning trend Many applications consist of one or small number of very large processes • Partitioning before synthesis or compilation has advantages – order of magnitude reduction in logic synthesis runtime. – Improved system performance as smaller processes can be synthesized with shorter clock period than one large processor. – Improved satisfaction of I/O and size constraints on a package, reducing inter-package signals (compared to structural partitioning) Mahapatra-Texas A&M-Spring’02 11 Partitioning approaches • Functional • Structural specification specification partitioning synthesis Control specifi cation Datapath unit synthesis partitioning Con trol Data Control Data Control un it unit path path unit data path Mahapatra-Texas A&M-Spring’02 12 6

Functional Partitioning • Divides a system’s functional specification into multiple sub-specification. • Each sub-specification represents the functionality of a system component, such as a custom-hardware or software processor. • Then the components are synthesized down to gates or compiled to machine codes. Mahapatra-Texas A&M-Spring’02 13 Advantages of FP • Power reduction due to mutual exclusive components • smaller board size, lower cost • increase software speed • concurrent synthesis and debugging • less physical design problems Mahapatra-Texas A&M-Spring’02 14 7

Problem description: Model • Input: process x (C program or VHDL process) • A view of the process: set of procedures F = {f1, f2,…fn} with one as main procedure. • Variable: simple processor with read and write being the procedure calls. • Execution of F: procedures executing sequentially, staring with main and that calls other procedures; only one is active at a time Mahapatra-Texas A&M-Spring’02 15 Problem description: Model • Functional partitioning creates a partition P consisting of a set of parts {p 1 , p 2 ,…p m }, such that every procedure f i is assigned to exactly one part p j , i.e. p 1 ∪ p 2 ∪ …p m = F and p i ∩ p j = 0 for all i, j, i ≠ j. • Each p j represents the function to be implemented on a single processor. The processors are mutually exclusive. • Each part p j is converted to a single process before synthesis; this process consists of a loop that detects a request for one of the part’s procedures, receive input parameters, calls the procedure, and sends back output parameters. Mahapatra-Texas A&M-Spring’02 16 8

Model contd... • Function Bus: single bus carries parameter passing between processors • Protocol: putting destination procedure’s address, pulsing address request, putting parameter, pulsing the data request. Synthesis • Process custom processor component C i • For application we target, Ci = non-trivial datapath and a complex controller with hundreds of states. • Procedure on Ci may be implemented either as a control subroutine or datapath component. • Synthesis may implement process’s procedures in parallel if data dependencies are not violated. – While procedures are not mutually exclusive after partitioning, processors are still mutually exclusive. Mahapatra-Texas A&M-Spring’02 17 Five tasks for good partitioning • Model creation – converts input to an internal model (call graph model) • Allocation – Instantiating processors of varying type • Partitioning – Dividing input process among allocated processors • Transformation – modifies the input process into one with different organization but same overall functionality, leading to better partition. • Estimation – provides data used to create values for design metrics. Pre- estimation and online-estimation. Mahapatra-Texas A&M-Spring’02 18 9

Partitioning Methodology Access Graph • Three-step method: Granularity Selection Pre-Estimation Sequence of partitioning steps proposed by Vahid Pre-Clustering Online N-way Estimation Assignment Partitioned Access Graph Mahapatra-Texas A&M-Spring’02 19 Design Space Exploration System Behavior Mohanty, Mahapatra & Pre-Allocation Performance Choi. Proposed Pre- Estimation Allocation allocation… Partitioning Estimation Mahapatra-Texas A&M-Spring’02 20 10

Partitioning Introduction to Partitioning Mahapatra-Texas - PDF document

Partitioning Introduction to Partitioning Mahapatra-Texas A&M-Spring02 1 System partitioning System level partitioning problem Assignment of operations to hardware or software Assignment of an operation to HW or SW determines

Partitioning and Divide-and- Conquer Strategies Partitioning Strategies Partitioning simply

Partitioning under the hood in MySQL 5.5 Mattias Jonsson, Partitioning developer Mikael

1 1 Slide 5 Slide 6 Partitioning and Load Balancing Partitioning Goals Assignment of

Partitioning Problem and Usage Lecture 8 CSCI 4974/6971 26 Sep 2016 1 / 14 Todays Biz 1.

Investigating hypergraph-partitioning-based sparse matrix partitioning methods Bora U car

Background MapReduce Model SCOPE Language and Cosmos system Advanced partitioning

Power grid partitioning Data-Driven Partitioning of Power Networks Via Koopman Mode

Partitioning Tens and Ones Can you put these numbers into tens and ones? 37 = 7 30 3 7

Using Processor Partitioning to Using Processor Partitioning to Evaluate the Performance of MPI,

Optimal Partitioning of Multicast Receivers Min Sik Kim minskim@cs.utexas.edu Co-authors: Simon

Data Life Cycle Management for Oracle @ CERN with partitioning Oracle @ CERN with partitioning,

Some Results on the Online Partitioning of Permutations Benjamin Leroy-Beaulieu 1 Marc Demange 2 1

Territory partitioning is ... art Territory Partitioning for Minimalist Gossiping Robots

KPart: A Hybrid Cache Sharing-Partitioning Technique for Commodity Multicores Nosayba EI-Sayed

Program Partitioning Program Partitioning for Secure E xecution for Secure E xecution Cha rle

BACKEND DESIGN Circuit Partitioning Partitioning System Design Decomposition of a complex

Laboratoire Kastler Brossel Collge de France, ENS, UPMC, CNRS Introduction to Ultracold Atoms

GaN HEMT Reliability J. A. del Alamo and J. Joh Microsystems Technology Laboratories, MIT ESREF

Depth Estimation for Ranking Query Optimization KarlSchnaitter,UCSantaCruz

Creating Shared Value to End Extreme Poverty with Science, Technology, Innovation, and

The Pumping Lemma for Regular Languages The Pumping Lemma forRegular Languages p.1/39

The Pumping Lemma for CFLs Statement Applications 1 Intuition Recall the pumping lemma

Review Languages and Grammars Alphabets, strings, languages Regular Languages CS 301 -

Review Languages and Grammars Alphabets, strings, languages CS 301 - Lecture 6 Regular

Sambuz

Useful Links

Newsletter

Mail Us