Interconnect Andr DeHon <andre@cs.caltech.edu> Thursday, - - PDF document

interconnect
SMART_READER_LITE
LIVE PREVIEW

Interconnect Andr DeHon <andre@cs.caltech.edu> Thursday, - - PDF document

Interconnect Andr DeHon <andre@cs.caltech.edu> Thursday, June 20, 2002 CBSSS 2002: DeHon Physical Entities Idea: Computations take up space Bigger/smaller computations Size resources cost Size distance


slide-1
SLIDE 1

1

CBSSS 2002: DeHon

Interconnect

André DeHon <andre@cs.caltech.edu> Thursday, June 20, 2002

CBSSS 2002: DeHon

Physical Entities

  • Idea: Computations take up space

– Bigger/smaller computations – Sizeresourcescost – Sizedistancedelay

slide-2
SLIDE 2

2

CBSSS 2002: DeHon

Impact

  • Consequence is:

– Properties of the physical world ultimately affect our computations

  • Delay = Distance / Speed
  • Scattering, mean-free-path
  • Thermodynamics (reversibility, kT,…)

CBSSS 2002: DeHon

Interconnect

  • Perhaps nowhere is this more present

than in interconnect

– Speed of light delay – Finite size of devices

  • Ultimate limits (Feynman’s “Bottom”)
  • What we can pattern and control today
  • How well we can localize phenomena (tunneling)

– Area and geometry of wires

slide-3
SLIDE 3

3

CBSSS 2002: DeHon

Today

  • Interconnect
  • Wires and VLSI
  • Dominance of Interconnect
  • Implications for physical computing

systems

CBSSS 2002: DeHon

Physical Interconnect

  • Anything that allows one physical

component of the computer to communicate with another

– Wires that connect transistors or gates – Traces on printed circuit boards that connect components – Cables and backplanes that connect boards – Ethernet and video cables that connect workstations, switches, and IO – Fibers that connect our building routers

slide-4
SLIDE 4

4

CBSSS 2002: DeHon

Interconnect

  • Today, let’s concentrate on

– gates and wires

  • Modern component contains millions of

gates (e.g. 2-input nor gate)

  • Each gate takes up finite space
  • To work together, these gates need to

communicate with each other

– Need wires for interconnect

CBSSS 2002: DeHon

Last Time

  • We saw that

– Modest size programmable gates – Connected by programmable interconnect

  • Are more efficient than

– Tiny programmable gates – Large LUTs

  • Even though the interconnect may take

up most of the area!

slide-5
SLIDE 5

5

CBSSS 2002: DeHon

Small Example

CBSSS 2002: DeHon

Physical Layout

slide-6
SLIDE 6

6

CBSSS 2002: DeHon

More typically, we have a very large number of gates that need to be connected.

Larger Example

DES Circuit

CBSSS 2002: DeHon

Larger Example (DES) Routed

Must find place for all those wires.

slide-7
SLIDE 7

7

CBSSS 2002: DeHon

Closeup (DES Routed)

Wires can take up significant space.

CBSSS 2002: DeHon

Claim

  • For

– Sufficiently large computations – “arbitrary” design (and many particular) – with finite size wires

  • Area associated with interconnect will

dominate that required for gates.

– Natural consequence of physical geometry in two-dimensional space

  • (any finite dimensions)
slide-8
SLIDE 8

8

CBSSS 2002: DeHon

Wires and VLSI

  • Simple VLSI model

– Have a small, finite number of wiring layers

  • E.g.

–one for horizontal wiring –one for vertical wiring

– Assume wires can run over gates

nand2

– Gates have fixed size (Agate) – Wires have finite spacing (Wwire)

CBSSS 2002: DeHon

Visually: Wires and VLSI

  • r2

inv

  • r2

xnor2 nor2 nand2 xor2 inv and2

slide-9
SLIDE 9

9

CBSSS 2002: DeHon

Important Consequence

  • A set of wires

W = 7 Wwire

  • take up space:

W = (N x Wwire) / Nlayers

  • crossing a line

CBSSS 2002: DeHon

Thompson’s Argument

  • The minimum area of a VLSI

component is bounded by the larger of:

– The area to hold all the gates

  • Achip ≥ N × Agate

– The area required by the wiring

  • Achip ≥ Nhorizontal Wwire × Nvertical Wwire
slide-10
SLIDE 10

10

CBSSS 2002: DeHon

How many wires?

  • We can get a lower bound on the total

number of horizontal (vertical) wires by considering the bisection of the computational graph:

– Cut the graph of gates in half – Minimize connections between halves – Count number of connections in cut – Gives a lower bound on number of wires

CBSSS 2002: DeHon

Bisection

Bisection Width

3

slide-11
SLIDE 11

11

CBSSS 2002: DeHon

Next Question

  • In general, if we:

– Cut design in half – Minimizing cut wires

  • How many wires will be in the

bisection?

N/2 N/2 cutsize

CBSSS 2002: DeHon

Arbitrary Graph

  • Graph with N nodes
  • Cut in half

– N/2 gates on each side

  • Worst-case:

– Every gate output on each side – Is used somewhere on other side – Cut contains N wires

slide-12
SLIDE 12

12

CBSSS 2002: DeHon

Arbitrary Graph

  • For a random graph

– Something proportional to this is likely

  • That is:

– Given a random graph with N nodes – The number of wires in the bisection is likely to be: c×N

CBSSS 2002: DeHon

Particular Computational Graphs

  • Some important computations have

exactly this property

– FFT (Fast Fourier Transform) – Sorting

slide-13
SLIDE 13

13

CBSSS 2002: DeHon

FFT

CBSSS 2002: DeHon

FFT

  • Can implement with N/2 nodes

– Group row together

  • Any bisection will cut N/2 wire bundles

– True for any reordering

slide-14
SLIDE 14

14

CBSSS 2002: DeHon

Assembling what we know

  • Achip ≥ N × Agate
  • Achip ≥ Nhorizontal Wwire × Nvertical Wwire
  • Nhorizontal = c × N
  • Nvertical = c × N

– [bound true recursively in graph]

  • Achip ≥ cN Wwire × c N Wwire

CBSSS 2002: DeHon

Assembling …

  • Achip ≥ N × Agate
  • Achip ≥ cN Wwire × cN Wwire
  • Achip ≥ (cN Wwire)2
  • Achip ≥ N2 × c′
slide-15
SLIDE 15

15

CBSSS 2002: DeHon

Result

  • Achip ≥ N × Agate
  • Achip ≥ N2 × c′
  • Wire area grows faster than gate area
  • Wire area grows with the square of gate

area

  • For sufficiently large N,

– Wire area dominates gate area

CBSSS 2002: DeHon

Intuitive Version

  • Consider a region of a chip
  • Gate capacity in the region goes as area

(s2)

  • Wiring capacity into region goes as

perimeter (4s)

  • Perimeter grows more slowly than area

– Wire capacity saturates before gate

slide-16
SLIDE 16

16

CBSSS 2002: DeHon

Result

  • Achip ≥ N2 × c′
  • Wire area grows with the square of gate

area

  • Troubling:

– To double the size of our computation – Must quadruple the size of our chip!

CBSSS 2002: DeHon

Interlude

slide-17
SLIDE 17

17

CBSSS 2002: DeHon

Miles of Wire

  • Consider FPGA

– Programmable Gate Arrays – Today providing ~1 Million gate capacity devices

  • “What we really sell is miles of wiring.”

– Clive McCarthy (Altera) circa 1998

  • 15mm die ×15mm/0.5µm wire spacing
  • (450m/layer) × 5 layers > 2 km

CBSSS 2002: DeHon

So what?

What do we do with this

  • bservation?
slide-18
SLIDE 18

18

CBSSS 2002: DeHon

First Observation

  • Not all designs have this large of a

bisection

  • Architecture is about understanding

structure

  • What is typical?

CBSSS 2002: DeHon

Array Multiplier

Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy bit Mpy Bit Mpy bit

Bisection Width Sqrt(N)

slide-19
SLIDE 19

19

CBSSS 2002: DeHon

Shift Register

reg reg reg reg reg reg reg reg reg reg reg reg reg reg reg reg

Bisection Width 1 Regardless of size

CBSSS 2002: DeHon

Bisection Width

  • Trying to assess wiring or total area

requirements on gates alone is short sighted.

– But most people try to do this…

  • Bisection width is an important, first
  • rder property of a design.
slide-20
SLIDE 20

20

CBSSS 2002: DeHon

Rent’s Rule

  • In the world of circuit design, an

empirical relationship to capture:

IO = c Np

  • 0≤p≤1
  • p – characterizes interconnect richness
  • Typical: 0.5≤p≤0.7
  • “High-Speed” Logic p=0.67

CBSSS 2002: DeHon

Empirical Characterization of Bisection

Log-log plot

IO N

C=7 P=0.68 Fit: IO=cN

p

slide-21
SLIDE 21

21

CBSSS 2002: DeHon

As a function of Bisection

  • Achip ≥ N × Agate
  • Achip ≥ Nhorizontal Wwire × Nvertical Wwire
  • Nhorizontal = Nvertical = IO = cN

p

  • Achip ≥ (cN)2p
  • If p<0.5

Achip ∝ N

  • If p>0.5

Achip ∝ N2p

CBSSS 2002: DeHon

In terms of Rent’s Rule

  • If p<0.5, Achip ∝ N
  • If p>0.5, Achip ∝ N2p
  • Typical designs have p>0.5

→ interconnect dominates

slide-22
SLIDE 22

22

CBSSS 2002: DeHon

Programmable Machine Impact

Design of Multiprocessors, FPGAs…

CBSSS 2002: DeHon

Impact on Programmables?

  • What does this mean for our

programmable devices?

– Devices which may solve any problem? – E.g. multiprocessors, FPGAs

  • Do we design for worst case?

– Put N2 area into interconnect – And guarantee can use all the gates?

  • Or design to use the wires?

– Wasting gates (processors) as necessary?

slide-23
SLIDE 23

23

CBSSS 2002: DeHon

  • Mapping procedure
  • Benchmark set

– MCNC 4-LUT mapped Details: FPGA’99

  • Parameterizable

network

– tree of meshes/fat- tree – bisection bw = CnP bisection bw = CnP

Interconnect: Experiment

bisection bw = CnP bisection bw = CnP

  • VLSI area model

CBSSS 2002: DeHon

Effects of P on Area

0.25 P=0.5 0.37 P=0.67 1.00 P=0.75

1024 LUT Area Comparison

slide-24
SLIDE 24

24

CBSSS 2002: DeHon

Resources × Area Model ⇒ Area Resources × Area Model ⇒ Area Resources × Area Model ⇒ Area Resources × Area Model ⇒ Area

CBSSS 2002: DeHon

Picking Network Design Point

Must provide reasonable level of interconnect; …but don’t guarantee 100% compute utilization.

slide-25
SLIDE 25

25

CBSSS 2002: DeHon

Single Design

  • Previous is for a set of designs
  • What about a single design?

– Do we minimize the area by providing enough wires to use all the gates for that single design?

CBSSS 2002: DeHon

Gate Utilization predict Area?

Single design

slide-26
SLIDE 26

26

CBSSS 2002: DeHon

Consequences

  • Even for a single design

– We do not, necessarily, win by maximizing gate utilization – Are better off focusing on efficiently using the wires

  • Focus on using the most expensive resource!

CBSSS 2002: DeHon

Key Ideas

  • Matter Computes

– our computing machines are built out of physical phenomena – physical effects ultimately determine landscape for computations

  • Interconnect requirements may

dominate all other requirements

– Compute, memory – Direct consequence of physical properties

  • Efficient computations

– May waste gates (compute) to use wires efficiently and minimize total area

slide-27
SLIDE 27

27

CBSSS 2002: DeHon

Admin

  • Project Discussion

– 4:30pm here – Pitch projects, discuss ideas