Outline Introduction to CMOS VLSI Single-bit Addition Design - - PDF document

outline
SMART_READER_LITE
LIVE PREVIEW

Outline Introduction to CMOS VLSI Single-bit Addition Design - - PDF document

Outline Introduction to CMOS VLSI Single-bit Addition Design Carry-Ripple Adder Carry-Skip Adder Carry-Lookahead Adder Lecture 11: Carry-Select Adder Adders Carry-Increment Adder Tree Adder David Harris Harvey Mudd


slide-1
SLIDE 1

1

Introduction to CMOS VLSI Design

Lecture 11: Adders

David Harris

Harvey Mudd College Spring 2004

11: Adders Slide 2 CMOS VLSI Design

Outline

Single-bit Addition Carry-Ripple Adder Carry-Skip Adder Carry-Lookahead Adder Carry-Select Adder Carry-Increment Adder Tree Adder

11: Adders Slide 3 CMOS VLSI Design

Single-Bit Addition

Half Adder Full Adder

1 1 1 1 S Cout B A 1 1 1 1 1 1 1 1 1 1 1 1 S Cout C B A

A B S Cout A B C S Cout

  • ut

S C = =

  • ut

S C = =

11: Adders Slide 4 CMOS VLSI Design

Single-Bit Addition

Half Adder Full Adder

1 1 1 1 1 1 1 S Cout B A 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 S Cout C B A

A B S Cout A B C S Cout

  • ut

S A B C A B = ⊕ = ฀

  • ut

( , , ) S A B C C MAJ A B C = ⊕ ⊕ =

11: Adders Slide 5 CMOS VLSI Design

PGK

For a full adder, define what happens to carries – Generate: Cout = 1 independent of C

  • G =

– Propagate: Cout = C

  • P =

– Kill: Cout = 0 independent of C

  • K =

11: Adders Slide 6 CMOS VLSI Design

PGK

For a full adder, define what happens to carries – Generate: Cout = 1 independent of C

  • G = A • B

– Propagate: Cout = C

  • P = A ⊕ B

– Kill: Cout = 0 independent of C

  • K = ~A • ~B
slide-2
SLIDE 2

2

11: Adders Slide 7 CMOS VLSI Design

Full Adder Design I

Brute force implementation from eqns

  • ut

( , , ) S A B C C MAJ A B C = ⊕ ⊕ =

A B C S Cout MAJ A B C A B B B A C S C C C B B B A A A B C B A C B A A B C Cout C A A B B

11: Adders Slide 8 CMOS VLSI Design

Full Adder Design II

Factor S in terms of Cout S = ABC + (A + B + C)(~Cout) Critical path is usually C to Cout in ripple adder

S S Cout A B C Cout MINORITY 11: Adders Slide 9 CMOS VLSI Design

Layout

Clever layout circumvents usual line of diffusion – Use wide transistors on critical path – Eliminate output inverters

11: Adders Slide 10 CMOS VLSI Design

Full Adder Design III

Complementary Pass Transistor Logic (CPL) – Slightly faster, but more area

A C S S B B C C C B B Cout Cout C C C C B B B B B B B B A A A

11: Adders Slide 11 CMOS VLSI Design

Full Adder Design IV

Dual-rail domino – Very fast, but large and power hungry – Used in very fast multipliers

Cout _h A_h B_h C_h B_h A_h φ Cout _l A_l B_l C_l B_l A_l φ S_h S_l A_h B_h B_h B_l A_l C_l C_h C_h φ 11: Adders Slide 12 CMOS VLSI Design

Carry Propagate Adders

N-bit adder called CPA – Each sum bit depends on all previous carries – How do we compute all these carries quickly?

+ BN...1 AN...1 SN...1 Cin Cout 11111 1111 +0000 0000

A4...1 carries B4...1 S4...1 Cin Cout

00000 1111 +0000 1111

Cin Cout

slide-3
SLIDE 3

3

11: Adders Slide 13 CMOS VLSI Design

Carry-Ripple Adder

Simplest design: cascade full adders – Critical path goes from Cin to Cout – Design full adder to have fast carry delay

Cin Cout B1 A1 B2 A2 B3 A3 B4 A4 S1 S2 S3 S4 C1 C2 C3

11: Adders Slide 14 CMOS VLSI Design

Inversions

Critical path passes through majority gate – Built from minority + inverter – Eliminate inverter and use inverting full adder

Cout Cin B1 A1 B2 A2 B3 A3 B4 A4 S1 S2 S3 S4 C1 C2 C3

11: Adders Slide 15 CMOS VLSI Design

Generate / Propagate

Equations often factored into G and P Generate and propagate for groups spanning i:j Base case Sum:

: : i j i j

G P = =

: : i i i i i i

G G P P ≡ = ≡ =

0:00:00inGC 0:00:00inGC

0:0 0:0

G G P P ≡ = ≡ =

i

S =

11: Adders Slide 16 CMOS VLSI Design

Generate / Propagate

Equations often factored into G and P Generate and propagate for groups spanning i:j Base case Sum:

: : : 1: : : 1: i j i k i k k j i j i k k j

G G P G P P P

− −

= + = ฀ ฀

: : i i i i i i i i i i

G G A B P P A B ≡ = ≡ = ⊕ ฀

0:00:00inGC 0:00:00inGC

0:0 0:0 in

G G C P P ≡ = ≡ =

1:0 i i i

S P G − = ⊕

11: Adders Slide 17 CMOS VLSI Design

PG Logic

S1 B1 A1 P1 G1 G0:0 S2 B2 P2 G2 G1:0 A2 S3 B3 A3 P3 G3 G2:0 S4 B4 P4 G4 G3:0 A4 Cin G0 P0 1: Bitwise PG logic 2: Group PG logic 3: Sum logic C0 C1 C2 C3 Cout C4

11: Adders Slide 18 CMOS VLSI Design

Carry-Ripple Revisited

:0 1:0 i i i i

G G P G − = + ฀

S1 B1 A1 P1 G1 G0:0 S2 B2 P2 G2 G1:0 A2 S3 B3 A3 P3 G3 G2:0 S4 B4 P4 G4 G3:0 A4 Cin G0 P0 C0 C1 C2 C3 Cout C4

slide-4
SLIDE 4

4

11: Adders Slide 19 CMOS VLSI Design

Carry-Ripple PG Diagram

Delay 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 Bit Position

ripple

t =

11: Adders Slide 20 CMOS VLSI Design

Carry-Ripple PG Diagram

Delay 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 Bit Position

ripple xor

( 1)

pg AO

t t N t t = + − +

11: Adders Slide 21 CMOS VLSI Design

PG Diagram Notation

i:j i:j i:k k-1:j i:j i:k k-1:j i:j Gi:k Pk-1:j Gk-1:j Gi:j Pi:j Pi:k Gi:k Gk-1:j Gi:j Gi:j Pi:j Gi:j Pi:j Pi:k Black cell Gray cell Buffer

11: Adders Slide 22 CMOS VLSI Design

Carry-Skip Adder

Carry-ripple is slow through all N stages Carry-skip allows carry to skip over groups of n bits – Decision based on n-bit propagate signal

Cin + S4:1 P4:1 A4:1 B4:1 + S8:5 P8:5 A8:5 B8:5 + S12:9 P12:9 A12:9 B12:9 + S16:13 P16:13 A16:13 B16:13 Cout C4 1 C8 1 C12 1 1

11: Adders Slide 23 CMOS VLSI Design

Carry-Skip PG Diagram

For k n-bit groups (N = nk)

skip

t =

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 16:0

11: Adders Slide 24 CMOS VLSI Design

Carry-Skip PG Diagram

For k n-bit groups (N = nk) ( )

skip xor

2 1 ( 1)

pg AO

t t n k t t = + − + − + ⎡ ⎤ ⎣ ⎦

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 16:0

slide-5
SLIDE 5

5

11: Adders Slide 25 CMOS VLSI Design

Variable Group Size

Delay grows as O(sqrt(N))

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 16:0 11: Adders Slide 26 CMOS VLSI Design

Carry-Lookahead Adder

Carry-lookahead adder computes Gi:0 for many bits in parallel. Uses higher-valency cells with more than two inputs.

Cin + S4:1 G4:1 P4:1 A4:1 B4:1 + S8:5 G8:5 P8:5 A8:5 B8:5 + S12:9 G12:9 P12:9 A12:9 B12:9 + S16:13 G16:13 P16:13 A16:13 B16:13 C4 C8 C12 Cout

11: Adders Slide 27 CMOS VLSI Design

CLA PG Diagram

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 16:0 11: Adders Slide 28 CMOS VLSI Design

Higher-Valency Cells

i:j i:k k-1:l l-1:m m-1:j Gi:k Gk-1:l Gl-1:m Gm-1:j Gi:j Pi:j Pi:k Pk-1:l Pl-1:m Pm-1:j

11: Adders Slide 29 CMOS VLSI Design

Carry-Select Adder

Trick for critical paths dependent on late input X – Precompute two possible outputs for X = 0, 1 – Select proper output when X arrives Carry-select adder precomputes n-bit sums – For both possible carries into n-bit group

Cin + A4:1 B4:1 S4:1 C4 + + 1 A8:5 B8:5 S8:5 C8 + + 1 A12:9 B12:9 S12:9 C12 + + 1 A16:13 B16:13 S16:13 Cout 1 1 1

11: Adders Slide 30 CMOS VLSI Design

Carry-Increment Adder

Factor initial PG and final XOR out of carry-select

5:4 6:4 7:4 9:8 10:8 11:8 13:12 14:12 15:12

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

increment

t =

slide-6
SLIDE 6

6

11: Adders Slide 31 CMOS VLSI Design

Carry-Increment Adder

Factor initial PG and final XOR out of carry-select

5:4 6:4 7:4 9:8 10:8 11:8 13:12 14:12 15:12

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

( )

increment xor

1 ( 1)

pg AO

t t n k t t = + − + − + ⎡ ⎤ ⎣ ⎦

11: Adders Slide 32 CMOS VLSI Design

Variable Group Size

Also buffer noncritical signals

3:2 5:4 6:4 8:7 9:7 12:11 13:11 14:11 15:11 10:7 3:2 5:4 6:4 8:7 9:7 12:11 13:11 14:11 15:11 10:7 6:0 3:0 1:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

11: Adders Slide 33 CMOS VLSI Design

Tree Adder

If lookahead is good, lookahead across lookahead! – Recursive lookahead gives O(log N) delay Many variations on tree adders

11: Adders Slide 34 CMOS VLSI Design

Brent-Kung

1:0 3:2 5:4 7:6 9:8 11:10 13:12 15:14 3:0 7:4 11:8 15:12 7:0 15:8 11:0 5:0 9:0 13:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

11: Adders Slide 35 CMOS VLSI Design

Sklansky

1:0 2:0 3:0 3:2 5:4 7:6 9:8 11:10 13:12 15:14 6:4 7:4 10:8 11:8 14:12 15:12 12:8 13:8 14:8 15:8

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

11: Adders Slide 36 CMOS VLSI Design

Kogge-Stone

1:0 2:1 3:2 4:3 5:4 6:5 7:6 8:7 9:8 10:9 11:10 12:11 13:12 14:13 15:14 3:0 4:1 5:2 6:3 7:4 8:5 9:6 10:7 11:8 12:9 13:10 14:11 15:12 4:0 5:0 6:0 7:0 8:1 9:2 10:3 11:4 12:5 13:6 14:7 15:8 2:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

slide-7
SLIDE 7

7

11: Adders Slide 37 CMOS VLSI Design

Tree Adder Taxonomy

Ideal N-bit tree adder would have – L = log N logic levels – Fanout never exceeding 2 – No more than one wiring track between levels Describe adder with 3-D taxonomy (l, f, t) – Logic levels: L + l – Fanout: 2f + 1 – Wiring tracks: 2t Known tree adders sit on plane defined by l + f + t = L-1

11: Adders Slide 38 CMOS VLSI Design

Tree Adder Taxonomy

f (Fanout) t (Wire Tracks) l (Logic Levels) 0 (2) 1 (3) 2 (5) 3 (9) 0 (4) 1 (5) 2 (6) 3 (8) 2 (4) 1 (2) 0 (1) 3 (7)

11: Adders Slide 39 CMOS VLSI Design

Tree Adder Taxonomy

f (Fanout) t (Wire Tracks) l (Logic Levels) 0 (2) 1 (3) 2 (5) 3 (9) 0 (4) 1 (5) 2 (6) 3 (8) 2 (4) 1 (2) 0 (1) 3 (7) Kogge-Stone Brent-Kung Sklansky

11: Adders Slide 40 CMOS VLSI Design

Han-Carlson

1:0 3:2 5:4 7:6 9:8 11:10 13:12 15:14 3:0 5:2 7:4 9:6 11:8 13:10 15:12 5:0 7:0 9:2 11:4 13:6 15:8

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

11: Adders Slide 41 CMOS VLSI Design

Know les [2, 1, 1, 1]

1:0 2:1 3:2 4:3 5:4 6:5 7:6 8:7 9:8 10:9 11:10 12:11 13:12 14:13 15:14 3:0 4:1 5:2 6:3 7:4 8:5 9:6 10:7 11:8 12:9 13:10 14:11 15:12 4:0 5:0 6:0 7:0 8:1 9:2 10:3 11:4 12:5 13:6 14:7 15:8 2:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

11: Adders Slide 42 CMOS VLSI Design

Ladner-Fischer

1:0 3:2 5:4 7:6 9:8 11:10 13:12 3:0 7:4 11:8 15:12 5:0 7:0 13:8 15:8 15:14 15:8 13:0 11:0 9:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

slide-8
SLIDE 8

8

11: Adders Slide 43 CMOS VLSI Design

Taxonomy Revisited

f (Fanout) t (Wire Tracks) l (Logic Levels) 0 (2) 1 (3) 2 (5) 3 (9) 0 (4) 1 (5) 2 (6) 3 (8) 2 (4) 1 (2) 0 (1) 3 (7) Kogge- Stone Sklansky Brent- Kung Han- Carlson Knowles [2,1,1,1] Knowles [4,2,1,1] Ladner- Fischer Han- Carlson Ladner- Fischer New (1,1,1) (c) Kogge-Stone

1:0 2:1 3:2 4:3 5:4 6:5 7:6 8:7 9:8 10:9 11:10 12:11 13:12 14:13 15:14 3:0 4:1 5:2 6:3 7:4 8:5 9:6 10:7 11:8 12:9 13:10 14:11 15:12 4:0 5:0 6:0 7:0 8:1 9:2 10:3 11:4 12:5 13:6 14:7 15:8 2:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 (e) Knowles [2,1,1,1]

1:0 2:1 3:2 4:3 5:4 6:5 7:6 8:7 9:8 10:9 11:10 12:11 13:12 14:13 15:14 3:0 4:1 5:2 6:3 7:4 8:5 9:6 10:7 11:8 12:9 13:10 14:11 15:12 4:0 5:0 6:0 7:0 8:1 9:2 10:3 11:4 12:5 13:6 14:7 15:8 2:0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:0 12:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

(b) Sklansky

1:0 2:0 3:0 3:2 5:4 7:6 9:8 11:10 13:12 15:14 6:4 7:4 10:8 11:8 14:12 15:12 12:8 13:8 14:8 15:8 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:0 12:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0 1:0 3:2 5:4 7:6 9:8 11:10 13:12 3:0 7:4 11:8 15:12 5:0 7:0 13:8 15:8 15:14 15:8 13:0 11:0 9:0 1 2 3 4 5 6 7 8 9 1 1 1 1 2 1 3 1 4 1 5 15:0 14:0 13:0 12:0 11:0 10:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

(f) Ladner-Fischer (a) Brent-Kung

1:0 3:2 5:4 7:6 9:8 11:10 13:12 15:14 3:0 7:4 11:8 15:12 7:0 15:8 11:0 5:0 9:0 13:0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

1:0 3:2 5:4 7:6 9:8 11:10 13:12 15:14 3:0 5:2 7:4 9:6 11:8 13:10 15:12 5:0 7:0 9:2 11:4 13:6 15:8

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15:014:013:012:011:010:0 9:0 8:0 7:0 6:0 5:0 4:0 3:0 2:0 1:0 0:0

(d) Han-Carlson

11: Adders Slide 44 CMOS VLSI Design

Summary

Nlog2N N/2 2 log2N (0, 0, L-1) Kogge-Stone 0.5 Nlog2N 1 N/2 + 1 log2N (0, L-1, 0) Sklansky 2N 1 2 2log2N – 1 (L-1, 0, 0) Brent-Kung 2N 1 4 N/4 + 2 Carry-Inc. n=4 1.25N 1 2 N/4 + 5 Carry-Skip n=4 N 1 1 N-1 Carry-Ripple Cells Tracks Max Fanout Logic Levels Classification Architecture

Adder architectures offer area / power / delay tradeoffs. Choose the best one for your application.