Neural Network Overlay Using FPGA DSP Blocks Lenos Ioannou and - PowerPoint PPT Presentation

Nov 10, 2022 •26 likes •96 views

Neural Network Overlay Using FPGA DSP Blocks Lenos Ioannou and Suhaib A. Fahmy School of Engineering, University of Warwick, UK Introduction Long back-end tool compilation hinders rapid deployment of Neural Networks on FPGAs at the edge

Neural Network Overlay Using FPGA DSP Blocks Lenos Ioannou and Suhaib A. Fahmy School of Engineering, University of Warwick, UK
Introduction • Long back-end tool compilation hinders rapid deployment of Neural Networks on FPGAs at the edge • Use of overlays to build abstractions on top of the FPGA • Effectively enabling rapid deployment • Core NN operation, multiply-accumulate, maps well to DSP Blocks • Most FPGA NN implementations operate sub-max frequencies [1] • Can be solved by optimising the overlay around the DSP blocks [3]
Neural Network Test Cases • Trained 3 NNs using Tensorflow [2], each one comprises four layers • Use of ReLU in the intermediate layers • Considering the input bit-widths of the DSP48E2: • 18 bit weights • 27 bit inputs • 48 bit biases
Overlay • Each neuron is mapped to a single DSP block • DSP blocks alternate between two opmodes • Serial data flow • Needs to stall when # neurons > # inputs • Adjustable latency
Implementation Results • Implemented the overlay targeting the Zynq Ultrascale+ ZU7EV • Maintains low resource utilization • Feedforward serial data flow is highly efficient • High operating frequency • Near the DSP blocks’ theoretical maximum
Conclusion • Not offering peak performance in a particular NN implementation • Contribute to the more rapid deployment of NNs on FPGAs at the edge • Prioritise low resource utilization and energy efficiency Future work • Implement a mechanism to handle the data flow and stall accordingly • Expand the overlay for deeper topologies • Integration with a rapid compiler flow
References [1] E. Wu, X. Zhang, D. Berman, and I. Cho, “A high-throughput reconfigurable processing array for neural networks,” in Int. Conference on Field Programmable Logic and Applications (FPL), Sep. 2017. [2] Martin Abadi et al. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. [3] A. K. Jain, D. L. Maskell, and S. A. Fahmy, “Throughput oriented FPGA overlays using DSP blocks,” in 2016 Design, Automation Test in Europe Conference Exhibition (DATE), March 2016, pp. 1628–1633. [4] A. K. Jain, X. Li, P. Singhai, D. L. Maskell, and S. A. Fahmy, “DeCO: A DSP block based FPGA accelerator overlay with low overhead interconnect,” in Proc. Int. Symposium on Field-Programmable Custom Computing Machines (FCCM), 2016, pp. 1–8.

Recommend

6/23/09 J-DSP: An Online DSP Laboratory Overview J-DSP J-DSP Editor Editor J-DSP blocks

6/23/09 J-DSP: An Online DSP Laboratory Overview J-DSP J-DSP Editor Editor J-DSP blocks to generate, process and understand analog signals. Application of J-DSP to courses in Arts, Media and Engineering Department at Arizona State

539 views • 4 slides

Highlights of the work J-DSP J-DSP Editor Editor Online DSP Quiz integrated with J-DSP

Highlights of the work J-DSP J-DSP Editor Editor Online DSP Quiz integrated with J-DSP Java-DSP Current and Future Improvements environment. Quiz statistics to better understand students J-DSP Editor performance. Shalin

156 views • 4 slides

1 Collaborative Project Collaborative EMD Overview J-DSP J-DSP Editor Editor PLANNED IN THIS

J-DSP: A Distance Learning Paradigm J-DSP J-DSP Editor Editor Existing J-DSP Prototype Java-DSP Online Virtual Laboratories Run simulation Evaluate lecture DSP Simulations J-DSP streaming video Streaming Extensions to other

552 views • 7 slides

J-DSP and Sensor Motes for Universally accessible DSP functions J-DSP Embeds Interactive

Overview J-DSP J-DSP Editor Editor A Web-based DSP Simulation Tool J-DSP and Sensor Motes for Universally accessible DSP functions J-DSP Embeds Interactive Simulations in Web pages Education and Research Seamlessly Integrates Animated

297 views • 4 slides

Reverse Engineering DSP Code GameCube DSP Analyzing GCN DSP code Pierre Bourdon Conclusion

Reverse Engineering DSP Code Pierre Bourdon Introduction DSP Reverse Engineering DSP Code GameCube DSP Analyzing GCN DSP code Pierre Bourdon Conclusion delroth@lse.epita.fr http://lse.epita.fr February 12, 2013 Context Reverse

856 views • 18 slides

Contents Slide 1-1 Some DSP Chip History Slide 1-2 Other DSP Manufacturers Slide 1-3 DSP

Contents Slide 1-1 Some DSP Chip History Slide 1-2 Other DSP Manufacturers Slide 1-3 DSP Applications Slide 1-4 TMS320C6713 DSP Starter Kit (DSK) Slide 1-5 TMS320C6713 DSK Features Slide 1-6 TMS320C6713 Architecture Slide 1-7 Main

538 views • 27 slides

CS5412: OVERLAY NETWORKS Lecture IV Ken Birman Overlay Networks 2 We use the term overlay

CS5412 Spring 2012 (Cloud Computing: Birman) 1 CS5412: OVERLAY NETWORKS Lecture IV Ken Birman Overlay Networks 2 We use the term overlay network when one network (or a network-like data structure) is superimposed upon an underlying

794 views • 57 slides

Solano Community College DSP Solano Community College DSP NVDA & JAWS Screen Reader Student

Solano Community College DSP Solano Community College DSP Solano Community College DSP Solano Community College DSP NVDA & JAWS Screen Reader Student Guide NVDA & JAWS Screen Reader Student Guide NVDA & JAWS is screen reading

319 views • 6 slides

Contents Slide 1 Some DSP Chip History Slide 2 Other DSP Manufacturers Slide 3 DSP

Contents Slide 1 Some DSP Chip History Slide 2 Other DSP Manufacturers Slide 3 DSP Applications Slide 4 TMS320C6701 Evaluation Module (EVM) Slide 5 TMS320C6701 EVM Features Slide 6 EVM Stereo Codec Interface Slide 7 TMS320C6701

917 views • 23 slides

A Novel Approach for Cooperative Overlay-Maintenance in Multi-Overlay Environments 1 Wu-Chun

A Novel Approach for Cooperative Overlay-Maintenance in Multi-Overlay Environments 1 Wu-Chun Chung, National Tsing Hua University 2010/11/30 A Novel Approach for Cooperative Overlay-Maintenance in Multi-Overlay Environments Chin-Jung Hsu, CS,

903 views • 35 slides

Blocks What is syntax (delimiters) Where can blocks be used Scope and blocks Do

Blocks What is syntax (delimiters) Where can blocks be used Scope and blocks Do blocks return a value (use in expressions) Entry point(s) to blocks Exit point(s) from blocks Block syntax and use Typically need start/stop

463 views • 5 slides

De DeCO: : A DS DSP Block Based FPGA Accelerator Overlay Wi With Low Overhead Interconnect Ab

De DeCO: : A DS DSP Block Based FPGA Accelerator Overlay Wi With Low Overhead Interconnect Ab Abhishek Kumar Ja Jain, Xiangwei Li, Pranjul Singhai, Douglas L. Maskell School of Computer Science and Engineering Nanyang Technological

564 views • 39 slides

C55 intro Highlights of the new C55x DSP Architecture The C55x DSP core supports new

C55 intro Highlights of the new C55x DSP Architecture The C55x DSP core supports new programming capabilities, while maintaining complete software compatibility with existing C54x code Advanced Automatic Power Management the C55x DSP

414 views • 9 slides

BELTLINE OVERLAY DISTRICT Z-06-121 Beltline Zoning Overlay District Regulations CITY OF ATLANTA

BELTLINE OVERLAY DISTRICT Z-06-121 Beltline Zoning Overlay District Regulations CITY OF ATLANTA Department of Planning & Community Development BeltLine Overlay District What is the legislation? An amendment to the Zoning Ordinance that

433 views • 27 slides

An introduction to FPGA-based acceleration of neural networks Marco Pagani 1 What is an FPGA?

An introduction to FPGA-based acceleration of neural networks Marco Pagani 1 What is an FPGA? Field-programmable gate array (FPGA) are integrated circuits designed to be con fi gured after manufacturing for implementing arbitrary logic

429 views • 15 slides

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural IR tasks Neural IR architecture Feature Representations Neural IR query auto completion Neural IR query suggestion Neural IR document

1.48k views • 18 slides

Low Power Design Prof. Dr. J. Henkel CES - Chair for Embedded Systems KIT, Germany III. Battery

Low Power Design Prof. Dr. J. Henkel CES - Chair for Embedded Systems KIT, Germany III. Battery Modeling Prof. Jrg Henkel, Low Power Design, SS2014 ces.itec.kit.edu 2 Course overview: topics Components

981 views • 34 slides

Secure Resource Sharing for Embedded Protected Module Architectures Jo Van Bulck imec-DistriNet,

Secure Resource Sharing for Embedded Protected Module Architectures Jo Van Bulck imec-DistriNet, KU Leuven, Celestijnenlaan 200A, B-3001 Belgium MSc Thesis Presentation BELCLIV-CLUSIB, April 21, 2017 Jo Van Bulck Secure Resource Sharing for

515 views • 29 slides

Designing asynchronous circuits with timing conditions Vision statement for possible CAD

Designing asynchronous circuits with timing conditions Vision statement for possible CAD development under Workcraft Original document: A. Yakovlev, Synthesis from timing diagrams: rough notes and examples, Tech Memo, March 7, 1998; written on

683 views • 19 slides

HW/SW Codesign w/ FPGAs The Nature of HW/SW I ECE 522 Hardware Software Codesign with FPGAs

HW/SW Codesign w/ FPGAs The Nature of HW/SW I ECE 522 Hardware Software Codesign with FPGAs Instructor: Prof. Jim Plusquellic Text: A Practical Introduction to Hardware/Software Codesign, 2nd Edition", Patrick Schaumont, Springer,

649 views • 10 slides

Design for Testability Maurcio F. Aniche M.F.Aniche@tudelft.nl Controllability and

Design for Testability Maurcio F. Aniche M.F.Aniche@tudelft.nl Controllability and Observability Controllability determines the work it takes to set up and run test cases and the extent to which individual functions and features of the

268 views • 16 slides

TDDE45 - Lecture 7: Testability Martin Sjlund Department of Computer and Information Science

TDDE45 - Lecture 7: Testability Martin Sjlund Department of Computer and Information Science Linkping University 2020-10-06 Part I Testing What is testing? This course is not a course primarily in Software Testing (TDDD04 HT1).

310 views • 19 slides

UNIT TESTING 3 / 8 1 / 8 Unit testing involves: Lots of small, independent tests Reporting

UNIT TESTING 3 / 8 1 / 8 Unit testing involves: Lots of small, independent tests Reporting passes, failures, and errors Some optional setup and teardown shared across tests Aggregation (combining tests into test suites) 3 / 8 2 / 8 WHY

89 views • 8 slides

Specification-Based Testing 1 Stuart Anderson Stuart Anderson Specification-Based Testing 1

Specification-Based Testing 1 Stuart Anderson Stuart Anderson Specification-Based Testing 1 2011 c 1 Overview Basic terminology A view of faults through failure Systematic versus randomised testing A systematic approach to

493 views • 32 slides