tvm
play

TVM Deep Learning on Bare-Metal Devices Pratyush Patel No OS - PowerPoint PPT Presentation

TVM Deep Learning on Bare-Metal Devices Pratyush Patel No OS stack Extend TVM to support bare-metal devices Optimization High-Level Differentiable IR AutoTVM Tensor Expression IR LLVM, CUDA VTA AutoVTA Hardware FPGA ASIC Fleet


  1. μ TVM Deep Learning on Bare-Metal Devices Pratyush Patel

  2. No OS stack

  3. Extend TVM to support bare-metal devices Optimization High-Level Differentiable IR AutoTVM Tensor Expression IR LLVM, CUDA VTA AutoVTA Hardware FPGA ASIC Fleet

  4. Extend TVM to support bare-metal devices Optimization High-Level Differentiable IR AutoTVM Tensor Expression IR Most bare-metal devices LLVM, CUDA VTA AutoVTA do not support LLVM Hardware FPGA ASIC Fleet

  5. Extend TVM to support bare-metal devices Optimization High-Level Differentiable IR AutoTVM Tensor Expression IR LLVM, CUDA VTA C, C++ AutoVTA Hardware FPGA ASIC Fleet

  6. Extend TVM to support bare-metal devices Optimization High-Level Differentiable IR AutoTVM Tensor Expression IR Upstreamed! LLVM, CUDA VTA C, C++ AutoVTA Hardware FPGA ASIC Fleet

  7. Extend TVM to support bare-metal devices Optimization High-Level Differentiable IR AutoTVM Tensor Expression IR Upstreamed! LLVM, CUDA VTA C, C++ AutoVTA Hardware FPGA ASIC Fleet Many other backends

  8. μ TVM builds upon AutoTVM μ TVM Runtime μ Device C Code Generator API send program run optimize

  9. A closer look at μ TVM μ TVM Runtime telnet JTAG μ Device C Code OpenOCD Generator API run

  10. A closer look at μ TVM μ TVM Runtime telnet JTAG μ Device C Code OpenOCD Generator API run IR -> code infer.c

  11. A closer look at μ TVM μ TVM Runtime telnet JTAG μ Device C Code OpenOCD Generator API run IR -> code vendor 
 gcc infer.c infer.o

  12. A closer look at μ TVM μ TVM Runtime telnet JTAG μ Device C Code OpenOCD Generator API run IR -> code vendor 
 ld linker gcc remap infer.c infer.o infer

  13. A closer look at μ TVM μ TVM Runtime telnet JTAG μ Device C Code OpenOCD Generator API custom loader run IR -> code vendor 
 ld linker gcc remap infer.c infer.o infer

  14. A closer look at μ TVM μ TVM Runtime telnet JTAG μ Device C Code OpenOCD Generator API send program custom loader run IR -> code vendor 
 ld linker gcc remap infer.c infer.o infer

  15. Next steps • Iron out interfaces with actual hardware. • Optimize with AutoTVM. • Support restricted and configurable model sizes. • Enable custom data types such as fixed-point precision formats. Get in touch! Pratyush Patel — patelp1@cs.uw.edu

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend