FPGAs! Basic Concepts Building Blocks There are (3) fundamental - PowerPoint PPT Presentation

FPGAs!

Basic Concepts – Building Blocks • There are (3) fundamental building blocks found in digital devices interconnect gates flip flops – Gates – Flip-Flops D Q > – Interconnect (or routing) D Q > D Q > D Q > 2

Digital Logic Landscape The following slides provide a history of the various logic devices Full Design Capacity Custom (gates) Standard Cell Gate Programmable Array Logic FPGA CPLD SPLD Standard Logic Development Time hours days weeks months years 3

Digital Logic History - PLDs interconnect gates flip flops • Developed in the late 70s A very common D Q low cost IC package > • Major player has pins on all 4 sides called a Plastic-Leaded D Q today: Lattice Chip Carrier (PLCC) > • First device that D Q needs software > • 50 – 200 gates D Q > 4

PLD Example 5

Digital Logic History - Gate Array Definition: A pre-built IC consisting of a regular arrangement of gates and interconnect (routing) where the interconnect is modified to achieve a customer’s desired functions. – The customer designs the behaviors/functions – The vendor manipulates/changes the gates interconnect metal interconnect to arrive at the customer’s specified functions (that is, the vendor hooks up the gates) – Sometimes called an Uncommitted Logic Array (ULA). Packaging Enhancement: To increase the number of I/Os (Inputs/Outputs), the pin thickness and spacing (pitch) are dramatically Gate Array in a reduced in this Thin Quad TQFP package FlatPack package (TQFP). 1,000,000+ gates 6

Gate Array • The ultimate building tool set for digital designers • Advantages – Very dense (today over 10,000,000 gates (10 million)) – Fast performance (200 – 500 MHz) – Very low unit cost • Disadvantages – Long turn around time (3 - 6 months) – $50K - $500K NRE • NRE = Non-Recurring Engineering charges, which are one- time “set - up” charges to ready the “fab” to build the custom part (“fab” = the “factory” where the ICs are manufactured; the “fabrication plant”) – Risk of re-spins 7

Digital Logic History - Standard Cell • This device features a series of customized “cells” – Each cell is optimized for its “standard” function • Cells are chosen form a library from the Standard Cell vendor, customized, and connected to the other cells and the routing on the part. • There are no standard layers to the device; each layer is a unique design • Advantages: – More optimized die size compared to GA – Cheaper device price compared to GA – Can add analog functions • Disadvantages: – Extremely high NRE charges (up to $1M) – Requires >250k+ units/year – Much longer development time – Much higher risk (re-spins, etc.) 8

CPLDs, FPGAs Full Design Capacity Custom (gates) Standard Cell Gate Array FPGA Programmable Logic CPLD SPLD Standard Logic Development Time hours days weeks months years 9

Digital Logic History - CPLD Complex Programmable Logic Device macrocells interconnect Definition: A CPLD contains a bunch of PLD blocks whose inputs and outputs are connected together by a global interconnection matrix. CPLD has two levels of programmability: --Each PLD block can be programmed --The interconnection between the PLDs can be programmed. CPLD technology was introduced in the late 80s 32-1024 macrocells 10

CPLDs • Vendors: Altera, Lattice, Cypress, Xilinx • 2 Primary Technologies – EEPROM (old technology) – FLASH (technology used by Xilinx CPLDs) • FPGAs vs. CPLDs – FPGAs have much greater capacity – CPLDs are faster for some small applications – Both are easy to design 11

Digital Logic History - FPGA Field Programmable Gate Array Definition: logic cells interconnect • An array of “ logic cells ” surrounded by substantial routing, both of which are under the user’s control • The CLB (Configurable Logic Block) is/was the fundamental building block of the logic cell, although today’s FPGAs use a very sophisticated collection of gates that goes beyond the original CLB design – The early Xilinx CLBs contained a (4) input look-up table (LUT), a flip-flop, and “carry logic” >10 million gates 12

FPGA Building Blocks 13

An Early Xilinx CLB 14

Digital Logic History FPGA - Field Programmable Gate Array 2 types of FPGAs flip flop LUT • Reprogrammable (SRAM-based) 0 1 1 0 0 – Xilinx, Altera, Lattice, Atmel 1 0 1 1 0 1 1 0 0 1 0 0 0 1 1 1 0 1 0 0 1 1 1 1 1 • One-time Programmable (OTP) SRAM logic cell – Actel, Quicklogic, EZchip gates flip flop OTP logic cell 15

Basic Concepts - Logic Interconnect • Method to hook-up gates inside a single device • Need to have enough routing to connect most gates • Larger gate counts result in lots of routing, bigger die size, increased cost vertical interconnect A B used horizontal interconnect interconnect path gates 16

Basic Concepts - I/Os Inputs and Outputs • All signals on & off O chip must go through I/O buffer an I/O buffer I • User can choose package pin many I/O buffer options silicon die 17

Basic Concepts Propagation Delay (t PD ) Definition: The time required for a signal to travel from A to B, measured in nanoseconds (ns). Gate Delay Interconnect Delay “A” “B” “A” “B” t PD = 3ns t PD = 1ns 18

Basic Concepts Path Delay Definition: The sum of all the gate and net delays from starting to ending point. “C” fanout=2 “B” “A” t PD = 3ns t PD = 1.2ns t PD = 3ns t PD = 1.8ns t PD = 3ns Path Delay “A” to “B” = sum of all gate + net delays 3ns + 1.2ns + 3ns + 1.8ns + 3ns = 12ns 19

Basic Concepts Maximum System Performance (f MAX ) Definition: The fastest speed a circuit containing flip-flops can operate, measured In Megahertz (MHz). Circuit Events per Second: D Q 1 = 1 Hertz (Hz) 1,000 = kilo (kHz) > 1,000,000 = mega (MHz) 1,000,000,000 = giga (GHz) t PD = 2ns t CQ = 2.5ns t PD = 1ns t PD = 0.5ns t PD = 2ns 1 f MAX = longest flip-flop path delay f MAX = 1/(flip-flop delay + gate delays + net delays) = 1/(2.5 + 1 + 2 + 0.5 + 2)ns = 125 MHz 20

Xilinx FPGA Architecture

How are they arranged 18×18 18Kbits Spartan 6 Multiplier Dual Port RAM CLB (Configurable Logic Block) = 4 Slices Slice I3 SET CE I2 O D Q I1 RST I0 I3 SET CE I2 O D Q I1 RST I0 124 multi-standard I/O with JTAG Low Cost Design 22

How they are arranged Kintex-7 FPGA

Typical FPGA Logic Structure • LUT • Flip flop

Typical 4 Input LUT • 4 Inputs • One Output • Any 4 input Logic function can be implemented.

Flip Flop • Input D • Input Clock SET • Input Clock Enable CE • Input Set Q D • Input Reset RST • Output Q

Making the Most of Controls Dedicated Flip-Flop controls make designs smaller and faster. LUT4 SET I3 CE 1 level of logic - fast and small I2 O Q D I1 Up to 4 data inputs plus 3 controls I0 RST t SU 2 levels of logic - significantly slower and twice the size (and cost) LUT4 LUT4 SET I3 I3 CE I2 I2 O O Q net D I1 I1 I0 I0 RST t SU t SU Low Cost Design 27

Workshop - How can this be implemented? This simple code describes a 4-input function followed by a Flip-Flop. What size and performance is this function? process (clk,reset) begin if reset='1' then reset data_out <= '0'; elsif clk'event and clk='1' then enable if enable='1' then if force_high='1' then set data_out <= '1'; else data_out <= a and b and c and d; logic end if; end if; end if; end process; Low Cost Design 28

Making the Most LUTs and FFs Dedicated Flip-Flop controls make designs smaller and faster. LUT4 SET I3 CE 1 level of logic - fast and small I2 O Q D I1 Up to 4 data inputs plus 3 controls I0 RST t SU 2 levels of logic - significantly slower and twice the size (and cost) LUT4 LUT4 SET I3 I3 CE I2 I2 O O Q net D I1 I1 I0 I0 RST t SU t SU Low Cost Design 29

Workshop - How can this be implemented? This simple code describes a 4-input function followed by a Flip-Flop. What size and performance is this function? process (clk,reset) begin if reset='1' then reset data_out <= '0'; elsif clk'event and clk='1' then enable if enable='1' then if force_high='1' then set data_out <= '1'; else data_out <= a and b and c and d; logic end if; end if; end if; end process; Low Cost Design 30

TWICE the Cost and Half the Speed Report Cell Usage : # BELS : 2 TWICE as Big as it # LUT2 : 1 # LUT4 : 1 should be and Slow! # FlipFlops/Latches : 1 # FDCE : 1 enable LUT4 PRE LUT2 force_high I3 CE d I1 b I2 data_out O O Q D c I0 I1 a I0 CLR Solution reset Low Cost Design 31

CLB (Configurable Logic Block) Multiple LUTs and FFs CLB Slice Slice PRE PRE D Q D Q LUT Carry LUT Carry CE CE CLR CLR LUT Carry PRE LUT Carry PRE D Q D Q CE CE CLR CLR 2 Slices in Each CLB • Each Slice has Two LUTs and Two Flipflops

FPGAs! Basic Concepts Building Blocks There are (3) fundamental - PowerPoint PPT Presentation

FPGAs! Basic Concepts Building Blocks There are (3) fundamental building blocks found in digital devices interconnect gates flip flops Gates Flip-Flops D Q > Interconnect (or routing) D Q > D Q > D Q

The BIST History of FPGAs FPGAs The BIST History of The BISTory BISTory of of FPGAs FPGAs

FPGAs 1 CMPE691/491: Advanced FPGA Design FPGAs Large array of configurable logic blocks

Blocks What is syntax (delimiters) Where can blocks be used Scope and blocks Do

Physical Design For FPGAs Rajeev Jayaraman Physical Implementation Tools Xilinx Inc. ISPD-2001

Efficient Multi-Ported Memories for FPGAs Eric LaForest Greg Steffan University of Toronto

FBPQ and building blocks FBPQ and building blocks Mark Drye Director of Asset Management

STARTER PLANT CONCRETE BLOCKS 1 X 8 INCH Quality building blocks are essential in the safe

Analog Integrated Circuits Fundamental Building Blocks Fundamental Building Blocks Basic

Virtex-7 FPGAs Target Software Virtex-7 FPGAs Target Software Defined Radio Applications Defined

Linux and FPGAs Chad D. Kersey chad@cdkersey.com cdkersey@gatech.edu Linux and FPGAs - p. 1/9

Physical optimization for Physical optimization for FPGAs using post- FPGAs using post-

Hybrid Dot-Product Design for FP-Enabled FPGAs Bogdan Pasca Intel ARITH 2019, June 10-12, 2019

High-Speed Computing & Co-Processing with FPGAs FPGAs (Field Programmable Gate Arrays) are

A Network of Time Division Multiplexing for FPGAs Rosemary Francis Motivation FPGAs are

with FP FPGAs: Cas ase Stu tudy on on a a Key-Value Store FPGAs in the Cloud Wider

Gigabit Ethernet Gigabit Ethernet implementation for implementation for FPGAs FPGAs Grzegorz

Nernst Branes from special geometry David Errington March 5, 2015 arXiv:hep-th/1501 . 07863 Paul

KEY PARTNERS PARTNERS OCRT SCHOLARSHIP FUND FRIENDS THANK YOU TO ALL OUR 2018 OCRT SPONSORS

An 8-core, 64-thread, 64-bit, power efficient SPARC SoC (Niagara2) Umesh Nawathe, Jim Ballard,

Ti e E ffi cient Server Audit Problem, Deduplicated Re-execution, and the Web Cheng Tan, Lingfan

Tenth Workshop on Statistical Machine Translation 0 Ond rej Bojar (Charles University in

A Day Has Only 241 Hours Miroslav ediv eumiro 1 / 73 A Day Has Only 241 Hours

Greening the OpenSolaris Kernel OSDevCon 2009, Dresden Eric Saxe <eric.saxe@sun.com>

Introduction Instructor: Nima Honarmand Spring 2015 :: CSE 502 Computer Architecture CSE 502

FPGAs! Basic Concepts Building Blocks There are (3) fundamental - PowerPoint PPT Presentation

FPGAs! Basic Concepts Building Blocks There are (3) fundamental building blocks found in digital devices interconnect gates flip flops Gates Flip-Flops D Q > Interconnect (or routing) D Q > D Q > D Q

The BIST History of FPGAs FPGAs The BIST History of The BISTory BISTory of of FPGAs FPGAs

FPGAs 1 CMPE691/491: Advanced FPGA Design FPGAs Large array of configurable logic blocks

Blocks What is syntax (delimiters) Where can blocks be used Scope and blocks Do

Physical Design For FPGAs Rajeev Jayaraman Physical Implementation Tools Xilinx Inc. ISPD-2001

Efficient Multi-Ported Memories for FPGAs Eric LaForest Greg Steffan University of Toronto

FBPQ and building blocks FBPQ and building blocks Mark Drye Director of Asset Management

STARTER PLANT CONCRETE BLOCKS 1 X 8 INCH Quality building blocks are essential in the safe

Analog Integrated Circuits Fundamental Building Blocks Fundamental Building Blocks Basic

Virtex-7 FPGAs Target Software Virtex-7 FPGAs Target Software Defined Radio Applications Defined

Linux and FPGAs Chad D. Kersey chad@cdkersey.com cdkersey@gatech.edu Linux and FPGAs - p. 1/9

Physical optimization for Physical optimization for FPGAs using post- FPGAs using post-

Hybrid Dot-Product Design for FP-Enabled FPGAs Bogdan Pasca Intel ARITH 2019, June 10-12, 2019

High-Speed Computing &amp; Co-Processing with FPGAs FPGAs (Field Programmable Gate Arrays) are

A Network of Time Division Multiplexing for FPGAs Rosemary Francis Motivation FPGAs are

with FP FPGAs: Cas ase Stu tudy on on a a Key-Value Store FPGAs in the Cloud Wider

Gigabit Ethernet Gigabit Ethernet implementation for implementation for FPGAs FPGAs Grzegorz

Nernst Branes from special geometry David Errington March 5, 2015 arXiv:hep-th/1501 . 07863 Paul

KEY PARTNERS PARTNERS OCRT SCHOLARSHIP FUND FRIENDS THANK YOU TO ALL OUR 2018 OCRT SPONSORS

An 8-core, 64-thread, 64-bit, power efficient SPARC SoC (Niagara2) Umesh Nawathe, Jim Ballard,

Ti e E ffi cient Server Audit Problem, Deduplicated Re-execution, and the Web Cheng Tan, Lingfan

Tenth Workshop on Statistical Machine Translation 0 Ond rej Bojar (Charles University in

A Day Has Only 241 Hours Miroslav ediv eumiro 1 / 73 A Day Has Only 241 Hours

Greening the OpenSolaris Kernel OSDevCon 2009, Dresden Eric Saxe &lt;eric.saxe@sun.com&gt;

Introduction Instructor: Nima Honarmand Spring 2015 :: CSE 502 Computer Architecture CSE 502

High-Speed Computing & Co-Processing with FPGAs FPGAs (Field Programmable Gate Arrays) are

Greening the OpenSolaris Kernel OSDevCon 2009, Dresden Eric Saxe <eric.saxe@sun.com>