Fast FPGA prototyping with Software Development Kit for FPGA

Fast FPGA prototyping with Software Development Kit for FPGA (SDK4FPGA) Andrea Suardi cas.ee.ic.ac.uk/projects/SDK4FPGA This research has been supported by EPSRC Impact Acceleration grant number EP/K503733/1

Outline
• What is SDK4FPGA?
• Why SDK4FPGA for embedded optimisation?
• How does SDK4FPGA work? (Case study: Fast Gradient for real-time audio processing)
  1. Algorithm coding
  2. Verification (off-line simulation)
  3. FPGA prototype

What is SDK4FPGA?
Algorithm coded in C/C++ → SDK4FPGA → FPGA prototype
• Open Source framework
• Automated design flow
• Customisable templates and example designs

Why SDK4FPGA for embedded optimisation?
Pros:
• fast FPGA prototype [< 1 day]
• low power consumption [< 1 W]
• low cost [< 10 $]
• applications with fast dynamics [~ms-μs]
• small packaging
• easy algorithm numerical validation [floating-point, fixed-point]
• no FPGA knowledge required
Cons:
• algorithm must already be coded in C/C++ and verified
• no Matlab-to-FPGA coding support
• think parallel / small memory
• no automated circuit design optimisation support
[Figure: plot against power [Watt] on a logarithmic axis from 10^-4 to 10^0; labels: fixed point, double precision, J_cl, J_hw]

Fast Gradient for real-time audio processing (CLIP algorithm)
B. Defraene, T. van Waterschoot, H.J. Ferreau, M. Diehl, and M. Moonen, "Real-time perception-based clipping of audio signals using convex optimisation", IEEE Transactions on Audio, Speech, and Language Processing
[Figure: Fast Gradient Method and its configuration parameters]

Fast Gradient for real-time audio processing (CLIP algorithm)
[Figure: block diagram; labels: FFT, IFFT, c_{k+1}, c_{k+1} − x, ∇f, w]

1. Algorithm coding
[Figure: two pie charts, one for the conventional hand-coded HDL approach and one for the present-day High Level Synthesis approach; categories: Matlab, C/C++, TCL, FPGA, HDL; values shown: 2%, 11%, 14%, 4%, 49%, 49%, 71%]

1. Algorithm coding
Radar design: 1024 x 64 QRD, floating point (source: www.xilinx.com)

                       conventional hand-coded    nowadays High Level
                       HDL approach               Synthesis approach
Design language        VHDL/Verilog               C
Design Time (weeks)    12                         1
Latency (ms)           37                         21
Memory (RAMB36E1)      273                        138
Registers              29826                      14263
Logic (LUTs)           28152                      24257

1. Algorithm coding
• User:
  • defines input/output data:
    • scalar
    • vector of any size
  • defines data representation:
    • floating-point single precision
    • any fixed-point up to 32 bits word length
  • codes algorithm in C/C++
• SDK4FPGA:
  • provides a customised function template
  • calls Xilinx Vivado HLS to build the circuit
[Figure: IP block containing the algorithm, with input data and output data]
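To make the fixed-point option concrete, here is a minimal sketch in plain C++ of what a signed fixed-point format does to a value. It assumes a 12-bit word with 4 integer bits (sign included) and 8 fractional bits, truncation towards minus infinity, and saturation on overflow. The function name `quantize` is hypothetical and not part of SDK4FPGA or Vivado HLS; the real design would use the HLS fixed-point types instead of this emulation.

```cpp
#include <cassert>
#include <cmath>

// Assumed example format: 4 integer bits (sign included) + 8 fractional bits.
constexpr int INTEGER_LENGTH = 4;
constexpr int FRACTION_LENGTH = 8;

// Hypothetical helper: map a double onto the fixed-point grid.
// Truncation is a floor (towards minus infinity); overflow saturates.
double quantize(double v) {
    assert(!std::isnan(v));
    const int word = INTEGER_LENGTH + FRACTION_LENGTH;
    const double scale = std::ldexp(1.0, FRACTION_LENGTH);   // 2^8 = 256
    const double raw_max = std::ldexp(1.0, word - 1) - 1.0;  // 2047
    const double raw_min = -std::ldexp(1.0, word - 1);       // -2048
    double raw = std::floor(v * scale);                      // truncate
    if (raw > raw_max) raw = raw_max;                        // saturate high
    if (raw < raw_min) raw = raw_min;                        // saturate low
    return raw / scale;
}
```

With these assumed lengths the representable range is [-8, 8 - 1/256] in steps of 1/256, so `quantize(100.0)` saturates to 7.99609375 and `quantize(1.0 / 3.0)` becomes 85/256.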

1. Algorithm coding

#include <ap_fixed.h>  // Vivado HLS fixed-point types

#define NUMBER_ITERATIONS 30
#define INTEGER_LENGTH 4
#define FRACTION_LENGTH 8
#define N 512

// Signed fixed-point: 12-bit word, 4 integer bits, truncation, saturation
typedef ap_fixed<INTEGER_LENGTH+FRACTION_LENGTH, INTEGER_LENGTH, AP_TRN, AP_SAT> data_t;

// Kmax is not defined on the slide; delta[] is indexed by the iteration counter k
void clip(data_t x[N], data_t w[N], data_t bmin[N], data_t bmax[N],
          data_t delta[Kmax], data_t lipschitz, data_t y_out[N])
{
    // variables (mapped to memory)
    data_t Grad[N];
    data_t Grad_lipschitz[N];
    data_t new_Grad[N];
    data_t y_tilde[N];
    data_t y_new[N];
    data_t y[N];
    data_t y_delta[N];
    data_t y_delta_delta[N];
    data_t c_new[N];
    data_t c[N];

    int k, i;

1. Algorithm coding

    // initialization (executed in N steps)
    initialization_loop: for (i = 0; i < N; i++) {
        Grad[i] = 0;
        c[i] = x[i];
        y[i] = x[i];
    }

1. Algorithm coding

    // Fast Gradient iterations loop
    FG_loop: for (int k = 0; k < NUMBER_ITERATIONS; k++) {

        // Iteration (pipelined: executed in N+7 steps)
        inner_loop_row: for (i = 0; i < N; i++) {
            // Gradient * Lipschitz
            Grad_lipschitz[i] = Grad[i] * lipschitz;
            // unconstrained update
            y_tilde[i] = c[i] - Grad_lipschitz[i];
            // projection
            if (y_tilde[i] > bmax[i]) y_new[i] = bmax[i];
            else if (y_tilde[i] < bmin[i]) y_new[i] = bmin[i];
            else y_new[i] = y_tilde[i];
            // update c
            y_delta[i] = y_new[i] - y[i];
            y_delta_delta[i] = delta[k] * y_delta[i];
            c_new[i] = y_new[i] + y_delta_delta[i];
            to_fft[i] = c_new[i] - x[i];
        }

        // FFT (built-in function)
        hls::fft(to_fft, fft_out);

        // apply weights
        w_loop: for (i = 0; i < N; i++) {
            to_ifft[i].real() = fft_out[i].real() * w[i];
            to_ifft[i].imag() = fft_out[i].imag() * w[i];
        }

        // IFFT (built-in function)
        hls::ifft(to_ifft, new_Grad);

        // update variables
        update_loop: for (i = 0; i < N; i++) {
            Grad[i] = new_Grad[i];
            c[i] = c_new[i];
            y[i] = y_new[i];
        }
    }

1. Algorithm coding

    // update output
    update_output_loop: for (i = 0; i < N; i++) {
        y_out[i] = y[i];
    }
} // end of clip()
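The step counts annotated on the loops (N for the initialisation loop, N+7 for the pipelined inner loop) agree with the usual estimate for a pipelined loop: the first iteration completes after the pipeline depth, and each further iteration starts one initiation interval later. The sketch below only illustrates that arithmetic. The depth of 8 and the initiation interval of 1 are assumptions chosen because they reproduce N+7; the actual figures come from the HLS report. Both function names are hypothetical.

```cpp
#include <cassert>

// Estimated cycles for a pipelined loop:
// depth cycles for the first iteration, then one new result every ii cycles.
long pipelined_cycles(long trip_count, long depth, long ii) {
    assert(trip_count >= 1 && depth >= 1 && ii >= 1);
    return depth + (trip_count - 1) * ii;
}

// Estimated cycles when iterations run strictly one after another.
long sequential_cycles(long trip_count, long depth) {
    assert(trip_count >= 1 && depth >= 1);
    return trip_count * depth;
}
```

For N = 512, `pipelined_cycles(512, 8, 1)` gives 519, that is N+7, whereas `sequential_cycles(512, 8)` gives 4096 for the same assumed depth.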

2. Verification (off-line simulation)
• User:
  • provides stimulus and analyses results from Matlab
  • defines computing precision
• SDK4FPGA:
  • handles the simulation, interfacing Matlab with Xilinx Vivado HLS
  • reports circuit latency (delay) and resources (silicon area)
[Figure: stimulus generation and results analysis connected through virtual memory to the IP (RTL/C model) simulated in HLS (C model)]
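For off-line comparison it can help to have a double-precision software model of the same iteration. The sketch below mirrors the loop structure of the clip() slides (projection onto [bmin, bmax], extrapolation with delta[k], gradient from FFT, weights, IFFT) in standard C++. It is an illustration under stated assumptions, not the SDK4FPGA flow: a naive O(N^2) DFT stands in for hls::fft and hls::ifft, the number of iterations is taken from delta.size(), and the names `dft` and `clip_reference` are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <vector>

using cvec = std::vector<std::complex<double>>;

// Naive DFT (assumed stand-in for the FFT core).
// sign = -1: forward transform; sign = +1: inverse, scaled by 1/N.
cvec dft(const cvec& in, int sign) {
    const std::size_t n = in.size();
    const double pi = std::acos(-1.0);
    cvec out(n);
    for (std::size_t k = 0; k < n; ++k) {
        std::complex<double> acc(0.0, 0.0);
        for (std::size_t t = 0; t < n; ++t) {
            const double angle = sign * 2.0 * pi *
                static_cast<double>((k * t) % n) / static_cast<double>(n);
            acc += in[t] * std::polar(1.0, angle);
        }
        out[k] = (sign > 0) ? acc / static_cast<double>(n) : acc;
    }
    return out;
}

// Double-precision mirror of the slide's loop; runs delta.size() iterations.
std::vector<double> clip_reference(const std::vector<double>& x,
                                   const std::vector<double>& w,
                                   const std::vector<double>& bmin,
                                   const std::vector<double>& bmax,
                                   const std::vector<double>& delta,
                                   double lipschitz) {
    const std::size_t n = x.size();
    assert(w.size() == n && bmin.size() == n && bmax.size() == n);
    std::vector<double> grad(n, 0.0), c = x, y = x;  // initialisation
    for (std::size_t k = 0; k < delta.size(); ++k) {
        std::vector<double> y_new(n), c_new(n);
        cvec spectrum(n);
        for (std::size_t i = 0; i < n; ++i) {
            const double y_tilde = c[i] - grad[i] * lipschitz;  // unconstrained update
            y_new[i] = std::min(std::max(y_tilde, bmin[i]), bmax[i]);  // projection
            c_new[i] = y_new[i] + delta[k] * (y_new[i] - y[i]);  // update c
            spectrum[i] = c_new[i] - x[i];
        }
        spectrum = dft(spectrum, -1);                               // FFT
        for (std::size_t i = 0; i < n; ++i) spectrum[i] *= w[i];    // apply weights
        const cvec g = dft(spectrum, +1);                           // IFFT
        for (std::size_t i = 0; i < n; ++i) grad[i] = g[i].real();
        c = c_new;
        y = y_new;
    }
    return y;
}
```

A simple sanity check: with all weights equal to 1, all delta values equal to 0, and a lipschitz factor of 1, the gradient reduces to c − x and the result is x clamped to [bmin, bmax]. The output of such a model can then be compared sample by sample with the fixed-point simulation results.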

3. FPGA prototype
• User:
  • provides stimulus and analyses results with a Matlab API
  • defines target Evaluation Board
  • selects host PC configuration interface (UDP/TCP)
• SDK4FPGA:
  • builds the FPGA circuit calling Xilinx Vivado
  • handles communication between host PC and FPGA
[Figure: host PC (client) linked over Ethernet (UDP/IP or TCP/IP) to the FPGA (server), which contains the IP and shared memory (DDR3) for input/output data]

cas.ee.ic.ac.uk/projects/SDK4FPGA Andrea Suardi [a.suardi@imperial.ac.uk] Algorithm coded in C/C++ SDK4FPGA FPGA prototype This research has been supported by EPSRC Impact Acceleration grant number EP/K503733/1
