Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN - PowerPoint PPT Presentation

Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN Accelerator School on Digital Signal Processing Sigtuna, Sweden, 31 May – 9 June 2007 Javier Serrano, CERN AB-CO-HT

Outline � Digital Signal Processing using FPGAs � Introduction. Why FPGAs for DSP? � Fixed point and its subtleties. � Doing arithmetic in hardware. � Distributed Arithmetic (DA). � COordinate Rotation DIgital Computer (CORDIC).

Why FPGAs for DSP? (1) Reason 1: FPGAs handle high computational workloads FPGA Conventional DSP Device (Von Neumann architecture) Data In Data In Reg1 Reg2 Reg0 Reg255 Reg C0 C1 C2 C255 .... MAC unit Data Out Data Out 256 Loops needed to process samples All 256 MAC operations in 1 clock cycle

FPGAs are ideal for multi-channel DSP designs 80MHz 20MHz ch1 LPF Samples Samples ch2 LPF LPF ch3 LPF Multi Channel Filter ch4 LPF � Many low sample rate channels can be multiplexed (e.g. TDM) and processed in the FPGA, at a high rate. � Interpolation (using zeros) can also drive sample rates higher.

Why FPGAs for DSP? (2) Reason 2: Tremendous Flexibility A × + Q = (A x B) + (C x D) + (E x F) + (G x H) B C × + + can be implemented in parallel D Q E + × + F G × + H But is this the only way in the FPGA?

Customize Architectures to Suit Your Ideal Algorithms Parallel Semi-Parallel Serial × + + × + + × + D Q + + D Q + × × × + + + × + Speed Optimized for? Cost FPGAs allow Area (cost) / Performance tradeoffs

Why FPGAs for DSP? (3) Reason 3: Integration simplifies PCBs Hundreds of Termination Resistors A/D SDRAM AFE A/D MACs MACs DSP ASSP SSTL3 Procs. C C Quad Translators P P Control DDC DDC Control r r DDC DDC TRx e e w w o o P P FPGA DDC DDC Network DDC DDC Quad SDRAM TRx Card FPGA D/A DUC DUC DUC DUC DSP Card D/A DUC DUC DUC DUC A/D SDRAM ASSP A/D Control PL4 Pow erPC Pow erPC 3.125 Gbps MACs, DUCs, D/A DDCs, Logic Pow erPC Pow erPC D/A SDRAM Control CORBA

Unsigned integers: positive values only

2’s complement

Fixed point binary numbers Example: 3 integer bits and 5 fractional bits

Fixed point truncation vs. rounding Note that in 2’s complement, truncation is biased while rounding isn’t.

The Full Adder (FA)

Add/subtract circuit S = A+B when Control=‘0’ S = A-B when Control=‘1’

Saturation You can’t let the data path become arbitrarily wide. Saturation involves overflow detection and a multiplexer. Useful in accumulators (like the one in the PI controller we use in the lab).

Multiplication: pencil & paper approach

A 4-bit unsigned multiplier using Full Adders and AND gates Of course, you can use embedded multipliers if your chip has them!

Constant coefficient multipliers using ROM For “easy” coefficients, there are smarter ways. E.g. to multiply a number A by 31, left-shift A by 5 places then subtract A.

Division: pencil & paper � Uses add/subtract blocks presented earlier. � MSB produced first: this will usually imply we have to wait for whole operation to finish before feeding result to another block. � Longer combinational delays than in multiplication: an N by N division will always take longer than an N by N multiplication.

Pipelining the division array

Square root � Take a division array, cut it in half (diagonally) and you have square root. Square root is therefore faster than division! � Although with less ripple through, this block suffers from the same problems as the division array. � Alternative approach: first guess with a ROM, then use an iterative algorithm such as Newton-Raphson.

Distributed Arithmetic (DA) 1/2 Digital filtering is about sums of products: − N 1 ∑ = ⋅ y c [ n ] x [ n ] = n 0 c[n] constant (prerequisite to use DA) Let’s assume: x[n] input signal B bits wide ⎛ ⎞ − − N 1 B 1 x b [n] is bit number b ∑ ∑ = ⎜ ⋅ ⋅ ⎟ b y c [ n ] x b n [ ] 2 Then: of x[n] (either 0 or 1) ⎝ ⎠ = = n 0 b 0 ⎛ ⎞ − − B 1 N 1 ∑ ∑ = ⋅ ⋅ ⎜ ⎟ And after some b y c n x n 2 [ ] [ ] b ⎝ ⎠ rearrangement of terms: = = b 0 n 0 This can be implemented with an N-input LUT

Distributed Arithmetic (DA) 2/2 ⎛ ⎞ − − B 1 N 1 ∑ ∑ = ⋅ ⎜ ⋅ ⎟ b y 2 c [ n ] x [ n ] b ⎝ ⎠ = = b 0 n 0 …… x B [0] x 1 [0] x 0 [0] …… x B [1] x 1 [1] x 0 [1] y …….... …….... …….... LUT + Register …… x B [N-1] x 1 [N-1] x 0 [N-1] 2 -1 Generates a result every B clock ticks. Replicating logic one can trade off speed vs. area, to the limit of getting one result per clock tick.

COrdinate Rotation DIgital Computer

Pseudo-rotations

Basic CORDIC iterations

Angle accumulator

The scaling factor

Rotation Mode

Example: calculate sin and cos of 30º

Vectoring Mode Vector magnitude

Circular coordinate system

Other coordinate systems

Generalized CORDIC equations

Summary of CORDIC functions

Precision and convergence

FPGA implementation

Iterative bit-serial design

Acknowledgements � Many thanks to Jeff Weintraub (Xilinx University Program) and Bob Stewart (University of Strathclyde) for many of these slides.

Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN - PowerPoint PPT Presentation

Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN Accelerator School on Digital Signal Processing Sigtuna, Sweden, 31 May 9 June 2007 Javier Serrano, CERN AB-CO-HT Outline Digital Signal Processing using FPGAs

Field Programmable Gate Arrays by Ketil Red Field Programmable Gate Array Integrated

Advanced GATE Embedded Track II, Module 8 Second GATE Training Course May 2010 Advanced GATE

Regulatory Guidance on the Use of Field Programmable Gate of Field Programmable Gate Arrays in

Arrays (2) Higher-Dimensional Arrays Arrays of Character Strings Topics Variables and Arrays

Lesson 6 Combinational Logic Circuits Gate Review AND Gate OR Gate NOT Gate NAND

Data Abstraction Copying Arrays. Sorting Arrays. 2D Arrays. Janyl Jumadinova September 30 and

CHAPTER IV GATE DESIGN R.M. Dansereau; v.1.0 GATE NETWORKS INTRO. TO COMP. ENG. GATE

Gate B Gate B Gate B Gate D Gate D Gate D Gate E Gate E Gate E Ferry Plaza Ferry Plaza

The GATE Embedded API Track II, Module 5 Second GATE Training Course May 2010 The GATE Embedded

GATE APIs Track II, Module 6 Second GATE Training Course May 2010 GATE APIs 1 / 62 Using Java

ROMs, PLAs and FPGAs October 5, 2006 Typeset by Foil T EX Why Programmable Logic?

High-Speed Computing & Co-Processing with FPGAs FPGAs (Field Programmable Gate Arrays) are

USING FIELD PROGRAMMABLE GATE ARRAYS IN A BEOWULF CLUSTER Matthew J. Krzych Naval Undersea

PicoCrack: The Art of Efficient Cracking FPGAs (Field Programmable Gate Arrays) allow custom

The Next Generation of Cryptanalytic Hardware FPGAs (Field Programmable Gate Arrays) allow custom

Unit 18 Field Programmable Gate Arrays (FPGAs) HARDWARE IMPLEMENTATION Implementing Logic

Design Activity Update Schedule Coordination Feb 20: Cost Estimation Begins Feb

Rubbish Services and Operating Budget City of Saginaw 5-6-19 Rubbish Services Curbside

Joe La Mariana, Executive Director South Bayside Waste Management Authority Slides Courtesy of

Dont Trash It! Answers to your questions about home recycling, composting, and beyond Earth

Field Programmable Gate Array - What is it? 2D array of logic blocks surrounded by a

Hardware-Accelerated Flexible Flow Measurement Pavel eleda celeda@liberouter.org Martin

Simon McIntosh-Smith Computer Science University of Bristol 1 IWOCL 2014 ~110 delegates

Challenges of Parallel Processor Design Martti Forsell (VTT Oulu) Ville Lepp anen (University

Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN - PowerPoint PPT Presentation

Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN Accelerator School on Digital Signal Processing Sigtuna, Sweden, 31 May 9 June 2007 Javier Serrano, CERN AB-CO-HT Outline Digital Signal Processing using FPGAs

Field Programmable Gate Arrays by Ketil Red Field Programmable Gate Array Integrated

Advanced GATE Embedded Track II, Module 8 Second GATE Training Course May 2010 Advanced GATE

Regulatory Guidance on the Use of Field Programmable Gate of Field Programmable Gate Arrays in

Arrays (2) Higher-Dimensional Arrays Arrays of Character Strings Topics Variables and Arrays

Lesson 6 Combinational Logic Circuits Gate Review AND Gate OR Gate NOT Gate NAND

Data Abstraction Copying Arrays. Sorting Arrays. 2D Arrays. Janyl Jumadinova September 30 and

CHAPTER IV GATE DESIGN R.M. Dansereau; v.1.0 GATE NETWORKS INTRO. TO COMP. ENG. GATE

Gate B Gate B Gate B Gate D Gate D Gate D Gate E Gate E Gate E Ferry Plaza Ferry Plaza

The GATE Embedded API Track II, Module 5 Second GATE Training Course May 2010 The GATE Embedded

GATE APIs Track II, Module 6 Second GATE Training Course May 2010 GATE APIs 1 / 62 Using Java

ROMs, PLAs and FPGAs October 5, 2006 Typeset by Foil T EX Why Programmable Logic?

High-Speed Computing &amp; Co-Processing with FPGAs FPGAs (Field Programmable Gate Arrays) are

USING FIELD PROGRAMMABLE GATE ARRAYS IN A BEOWULF CLUSTER Matthew J. Krzych Naval Undersea

PicoCrack: The Art of Efficient Cracking FPGAs (Field Programmable Gate Arrays) allow custom

The Next Generation of Cryptanalytic Hardware FPGAs (Field Programmable Gate Arrays) allow custom

Unit 18 Field Programmable Gate Arrays (FPGAs) HARDWARE IMPLEMENTATION Implementing Logic

Design Activity Update Schedule Coordination Feb 20: Cost Estimation Begins Feb

Rubbish Services and Operating Budget City of Saginaw 5-6-19 Rubbish Services Curbside

Joe La Mariana, Executive Director South Bayside Waste Management Authority Slides Courtesy of

Dont Trash It! Answers to your questions about home recycling, composting, and beyond Earth

Field Programmable Gate Array - What is it? 2D array of logic blocks surrounded by a

Hardware-Accelerated Flexible Flow Measurement Pavel eleda celeda@liberouter.org Martin

Simon McIntosh-Smith Computer Science University of Bristol 1 IWOCL 2014 ~110 delegates

Challenges of Parallel Processor Design Martti Forsell (VTT Oulu) Ville Lepp anen (University

High-Speed Computing & Co-Processing with FPGAs FPGAs (Field Programmable Gate Arrays) are