TVM for edge computing platform
NTT Software Innovation Center
Kazutaka Morita

Inference in 5G era

[Diagram: edge devices offload inference to a MEC (Mobile edge computing) server at the base station (~10 ms latency) or to the cloud over the Internet]
[Diagram: computing resources per tier; device (AI chip, CPU), edge (GPU), cloud; inference combines 5G data from devices with big data in the cloud]
- Edge is one of the targets of AI accelerators
- High-end server-spec accelerators are available
- AI chips are unavailable for low-end devices
- Real-time inference with big data
- 5G: interaction with other devices
- Object segmentation inference: plane detection
- Object detection inference can also provide colliders from moving real-world objects (e.g., a bouncing object)
- Occlusion: point cloud
Many inference tasks; inference with big data in the cloud
HYPER-REALITY: https://vimeo.com/166807261
[Diagram: captured images and point cloud data sent over the Internet to the cloud]
Developing framework for edge computing
[Diagram: a developer uses the SDK to target devices; runtimes and data flow across device, edge, and cloud]
- Offload inference if necessary, based on device and communication status
- Distribute runtimes to device, edge, and cloud
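The offload decision described above can be sketched as a small policy: run locally while the device has headroom, move to the MEC edge when it is close enough, and fall back to the cloud otherwise. This is an illustrative sketch, not the talk's implementation; the class name, thresholds, and tier labels are all assumptions.

```python
# Hypothetical sketch of "offload inference if necessary, based on
# device and communication status". Thresholds are illustrative.
class OffloadScheduler:
    """Pick an execution tier based on device and communication status."""

    def __init__(self, edge_rtt_budget_ms=10.0, device_load_limit=0.8):
        # ~10 ms matches the MEC latency figure from the slides.
        self.edge_rtt_budget_ms = edge_rtt_budget_ms
        self.device_load_limit = device_load_limit

    def choose_tier(self, device_load, edge_rtt_ms, edge_reachable=True):
        # Run locally while the device still has compute headroom.
        if device_load < self.device_load_limit:
            return "device"
        # Offload to the MEC edge server when it is reachable and close.
        if edge_reachable and edge_rtt_ms <= self.edge_rtt_budget_ms:
            return "edge"
        # Otherwise fall back to the cloud.
        return "cloud"

scheduler = OffloadScheduler()
print(scheduler.choose_tier(device_load=0.3, edge_rtt_ms=8))    # device
print(scheduler.choose_tier(device_load=0.95, edge_rtt_ms=8))   # edge
print(scheduler.choose_tier(device_load=0.95, edge_rtt_ms=40))  # cloud
```

In a real system the load and RTT inputs would come from live monitoring rather than being passed in directly.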
- Heterogeneous runtime with offloading support
- Dynamic runtime
- Smart NIC support
- Auto-tuning support would also be nice
- Switch based on device and communication status
- Execute on edge via RPC
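"Execute on edge via RPC" corresponds to TVM's standard RPC workflow: a `tvm.rpc.Server` runs on the edge host, and the device connects, uploads a compiled module, and runs it remotely. A minimal sketch, assuming such a server is already running; the host, port, library path, and input name `data` are placeholders:

```python
def run_on_edge(lib_path, host, port=9090):
    """Upload a TVM-compiled module to an edge server and run it via RPC.

    Imports are deferred so the sketch can be loaded without TVM installed.
    """
    import os
    import numpy as np
    import tvm
    from tvm import rpc
    from tvm.contrib import graph_executor

    remote = rpc.connect(host, port)      # connect to rpc.Server on the edge
    remote.upload(lib_path)               # ship the compiled library over
    rlib = remote.load_module(os.path.basename(lib_path))
    dev = remote.cpu(0)                   # or a remote GPU device on the edge
    module = graph_executor.GraphModule(rlib["default"](dev))
    module.set_input("data", tvm.nd.array(
        np.zeros((1, 3, 224, 224), dtype="float32"), dev))
    module.run()
    return module.get_output(0).numpy()
```

The dynamic-runtime idea is then a matter of calling either the local module or `run_on_edge` depending on the scheduler's decision.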
[Diagram: a scheduler on the device switches execution between on-device and on-edge runtimes]

Smart NIC support
- Smart NIC combines NIC and FPGA on the edge server, alongside its CPU and GPU
- No overhead of PCIe communication or host memory access