An Efficient Softcore Multiplier Architecture for Xilinx FPGAs
22nd IEEE Symposium on Computer Arithmetic
Martin Kumm, Shahid Abbas and Peter Zipf
University of Kassel, Germany
An Efficient Softcore Multiplier Architecture for Xilinx FPGAs 22 nd - - PowerPoint PPT Presentation
An Efficient Softcore Multiplier Architecture for Xilinx FPGAs 22 nd IEEE Symposium on Computer Arithmetic Martin Kumm, Shahid Abbas and Peter Zipf University of Kassel, Germany CONTENTS 1. State-of-the-art 2. Proposed multiplier 3.
University of Kassel, Germany
2
3
4
5
1 1 1
Carry Logic
1
LUT LUT LUT LUT
6
1 1 1
Carry Logic
1
LUT LUT LUT LUT
6
7
M
m=0 m even
bm+1 bm bm−1 BEm zm cm sm 1 1 1 1 1 1 1 2 1 1
1 1 1 1
1 1 1
1 1 1 1 1
8
9
LSB MSB
10
LSB MSB
1 1
Carry Logic
1 0 1 0 1 0 1
LUT LUT LUT
0 1 0 1 0 1
LUT
1
11
1 1
Carry Logic
1 0 1 0 1 0 1
LUT LUT LUT
0 1 0 1 0 1
LUT
1
11
12
slices per row
no of rows
13
8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 200 400 600 800 1,000 1,200 1,400 1,600 1,800 2,000 Input word size (N) #Slices 1x4 LUT Multiplier 3x2 LUT Multiplier 3x3 LUT Multiplier Parandeh-Afshar Multiplier Coregen (area) Coregen (speed) proposed
14
8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 20 40 60 80 Input word size (N) Slice reduction (%) 1x4 LUT Multiplier 3x2 LUT Multiplier 3x3 LUT Multiplier Parandeh-Afshar Multiplier Coregen (area) Coregen (speed)
15
8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 100 200 300 400 500 600 700 Input word size (N) Frequency [MHz] 1x4 LUT Multiplier 3x2 LUT Multiplier 3x3 LUT Multiplier Parandeh-Afshar Multiplier Coregen (area) Coregen (speed) proposed
16
8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 200 400 600 800 1,000 1,200 1,400 1,600 1,800 2,000 Input word size (N) #Slices 1x4 LUT Multiplier 3x2 LUT Multiplier 3x3 LUT Multiplier Parandeh-Afshar Multiplier Coregen (area) Coregen (speed) proposed
17
18
8 12 16 20 24 28 32 36 40 44 48 52 56 60 −10 10 20 30 40 50 60 70 80 Input word size (N) Slice reduction (%) 1x4 LUT Multiplier 3x2 LUT Multiplier 3x3 LUT Multiplier Parandeh-Afshar Multiplier Coregen (area) Coregen (speed)
8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 100 200 300 400 500 600 700 Input word size (N) Frequency [MHz] 1x4 LUT Multiplier 3x2 LUT Multiplier 3x3 LUT Multiplier Parandeh-Afshar Multiplier Coregen (area) Coregen (speed) proposed
19
20
21
22
[Parandeh-Afshar 2011]: Parandeh-Afshar & Ienne Measuring and Reducing the Performance Gap between Embedded and Soft Multipliers on FPGAs, FPL 2011 [Brunie 2013]: Brunie, de Dinechin, Istoan, Sergent, Illyes & Popa Arithmetic Core Generation Using Bit Heaps, FPL 2013 [de Dinechin 2012]: de Dinechin & Pasca Designing Custom Arithmetic Data Paths with FloPoCo IEEE Design & Test of Computers 2012
23
BE0=−2b1+b0
BE2=(−2b3+b2+b1)22
M
m=0 m even
25
26
D FF/LAT INIT1 INIT0 SRHI SRLO SR CE CK D6:1 CE Q CK SR Q SRHI SRLO INIT1 INIT0
27