hardware accelerators
play

Hardware Accelerators Francesca Palumbo 1 , Claudio Rubattu 1,2 , - PowerPoint PPT Presentation

Exploiting Dataflows for Reconfigurable Hardware Accelerators Francesca Palumbo 1 , Claudio Rubattu 1,2 , Carlo Sau 3 , Tiziana Fanni 3 , Luigi Raffo 3 1 University of Sassari, PolComIng Information Engineering Group 2 University of Rennes,


  1. Exploiting Dataflows for Reconfigurable Hardware Accelerators Francesca Palumbo 1 , Claudio Rubattu 1,2 , Carlo Sau 3 , Tiziana Fanni 3 , Luigi Raffo 3 1 University of Sassari, PolComIng – Information Engineering Group 2 University of Rennes, INSA Group 3 University of Cagliari, Diee – Microelectronics and Bioengineering Group Rennes, 12-14 December 2017

  2. Who and Where UNIVERSITY OF SASSARI UNIVERSITY OF CAGLIARI

  3. Who and Where UNIVERSITY OF SASSARI UNIVERSITY OF CAGLIARI

  4. Outline • The origins of our dataflow to hardware studies: the RPCT Project – Context – Target Technologies – Project Development • The MDC tool – Approach – Baseline Functionality and Extensions • Contexts of application – Neural Signal Decoding – HEVC Interpolation Filters • Final Remarks

  5. Outline • The origins of our dataflow to hardware studies: the RPCT Project – Context – Target Technologies – Project Development • The MDC tool – Approach – Baseline Functionality and Extensions • Contexts of application – Neural Signal Decoding – HEVC Interpolation Filters • Final Remarks

  6. Modern Embedded Systems Embedded Systems ( real-time computing systems with a dedicated functionality) are pervasive ( 98% of computers are embedded) and may present sensing and actuating capabilities.

  7. Modern Embedded Systems Embedded Systems ( real-time computing systems with a dedicated functionality) are pervasive ( 98% of computers are embedded) and may present sensing and actuating capabilities. Complex Seamless Security Distrib. MPSoC Energy Safety Certif. functionalities. HMI Automotive x x x x x x x Aerospace x x x x x x x Colliding technical Healthcare x x x x x x x x requirements. Consumer x x x IDC - Design of Future ES

  8. Multimedia Domain HIGH PERFORMANCES real time, portability, long battery life UP-TO-DATE SOLUTIONS last audio/video codecs, file formats... MORE INTEGRATED FEATURES MP3, Camera, Video, GPS... MARKET DEMAND convenient form factor, affordable price, fashion

  9. Target & Technological Challenges • DATAFLOW MODEL OF COMPUTATION – Modularity and parallelism  EASIER INTEGRATION AND FAVOURED RE-USABILITY • COARSE-GRAINED RECONFIGURABILITY – Flexibility and resource sharing  MULTI-APPLICATION PORTABLE DEVICES The RPCT project (2012-2015) has been funded by Sardinian Regional Government (L.R. 7/2007, CRP-18324). http://sites.unica.it/rpct/

  10. Target & Technological Challenges • DATAFLOW MODEL OF COMPUTATION – Modularity and parallelism  EASIER INTEGRATION AND FAVOURED RE-USABILITY • COARSE-GRAINED RECONFIGURABILITY – Flexibility and resource sharing  MULTI-APPLICATION PORTABLE DEVICES Reconfigurable Platform Composer Tool Project Automated are fundamental to guarantee . Dealing with systems, in particular for , state of the art still lacks in providing a broadly accepted solution. The RPCT project (2012-2015) has been funded by Sardinian Regional Government (L.R. 7/2007, CRP-18324). http://sites.unica.it/rpct/

  11. Reasons for Coarser-Grain Flexibility Performance GP RECONF CPU GPU DSP FG CG ASIC

  12. Reasons for Coarser-Grain Flexibility Performance GP RECONF CPU GPU DSP FG CG ASIC • Coarse Grained (CG): – both in ASIC and FPGA Fine Grained Coarse Grained bit-level word-level – 1 clock cycle switching, with ☺  dedicated switching blocks. Flexibility  ☺ • Fine Grained (FG): Speed   – FPGA only Memory – switching requires a new bit- stream

  13. Framework Development 2010 2011 2012 2013 2014 2015 2016 Baseline tool specification: Multi-Dataflow Composer (MDC) tool MPEG-RVC Framework Integration: Orcc + MDC + Xronos + Turnus

  14. Framework Development 2010 2011 2012 2013 2014 2015 2016 Baseline tool specification: Multi-Dataflow Composer (MDC) tool MPEG-RVC Framework Integration: Orcc + MDC + Xronos + Turnus MDC: Structural Profiler MDC: Low-Power Extension MDC: Co-processor Generator

  15. Framework Evaluation 2010 2011 2012 2013 2014 2015 2016 Reconfigurable Image/Video Coding: JPEG e H.264 Adaptive Filtering: HEVC Encoding

  16. Framework Evaluation 2010 2011 2012 2013 2014 2015 2016 Reconfigurable Image/Video Coding: JPEG e H.264 Neural Signal Decoding Adaptive Filtering: HEVC Encoding Cryptograph ic Systems

  17. Outline • The origins of our dataflow to hardware studies: the RPCT Project – Context – Target Technologies – Project Development • The MDC tool – Approach – Baseline Functionality and Extensions • Contexts of application – Neural Signal Decoding – HEVC Interpolation Filters • Final Remarks

  18. Design Suite & Targeted Challenges Multi Dataflow Co-Processor Generator Composer Tool Structural Profiler Dynamic Power Manager MDC design suite http://sites.unica.it/rpct/

  19. Design Suite & Targeted Challenges Functional Complexity Time to Market: Design & Mapping Automation Multi Dataflow Co-Processor Generator Composer Tool Structural Profiler Dynamic Power Manager MDC design suite http://sites.unica.it/rpct/

  20. Design Suite & Targeted Challenges Functional Complexity Time to Market: Design & Mapping Automation Multi Dataflow Co-Processor Generator Composer Tool Constraint Driven Structural Profiler Optimisation Dynamic Power Manager MDC design suite http://sites.unica.it/rpct/

  21. Design Suite & Targeted Challenges Functional Complexity Time to Market: Design & Mapping Automation Multi Dataflow Co-Processor Generator Composer Tool Constraint Driven Structural Profiler Optimisation Dynamic Power Manager MDC design suite Power Efficiency http://sites.unica.it/rpct/

  22. Design Suite & Targeted Challenges Functional Complexity Time to Market: Fast Integration Design & Mapping and Prototyping Automation Multi Dataflow Co-Processor Generator Composer Tool Constraint Driven Structural Profiler Optimisation Dynamic Power Manager MDC design suite Power Efficiency http://sites.unica.it/rpct/

  23. Baseline: Dataflow to HW coarse grained A A substrate C D C D 1:1 B B

  24. Baseline: Dataflow to HW coarse grained A A substrate C D C D 1:1 B B coarse grained A reconfigurable E substrate A C D SB B 2:1 D SB C A E D B

  25. MDC Front-End: Multi-Dataflow Generator α β γ A C D A E D F D B MDC front-end E 0 0 SB 0 1 2 SB A SB 0 α 1 1 0 0 1 SB 1 D β 0 0 0 C 1 2 B γ x x 1 F 1 shared multi-dataflow

  26. Datapath Merging Problem: Graph Model GRAPHS Gᵢ = (Vᵢ, Eᵢ) a₁₁ b₁₁ G₁ a₁₂ c₁₁ a₂₁ b₂₁ G₂ a₂₂ c₂₁ a₂₃

  27. Datapath Merging Problem: Graph Model GRAPHS LABELING πᵢ : V ᵢ  T Gᵢ = (Vᵢ, Eᵢ) a₁₁ b₁₁ π₁ G₁ a₁₁ A a₁₂ c₁₁ a₂₁ b₂₁ π₂ G₂ a₂₁ A a₂₂ c₂₁ a₂₃

  28. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃

  29. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃ PROBLEM STATEMENT: find a Reconfigurable Graph G (V,E) with the minimum costs ( min|V| and min |E| )

  30. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃ PROBLEM STATEMENT: find a Reconfigurable Graph G (V,E) with the minimum costs ( min|V| and min |E| ) feasible solution: ꓯ T ϵ T , Vᵀ={ v : π (v) = T}  |V ᵀ| = max |V ᵢᵀ|, Vᵢᵀ={ v ᵢ : πᵢ(vᵢ) = T}

  31. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃ PROBLEM STATEMENT: find a Reconfigurable Graph G (V,E) with the minimum costs ( min|V| and min |E| ) feasible solution: ꓯ T ϵ T , Vᵀ={ v : π (v) = T}  |V ᵀ| = max |V ᵢᵀ|, Vᵢᵀ={ v ᵢ : πᵢ(vᵢ) = T} optimal solution: feasible solution with min|E|

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend