SLIDE 1
Optimizing DNN Computation with Relaxed Graph Substitutions
Tim Lazarus 26 November, 2019
SLIDE 2 Graph Substitutions
We can optimise DNNs by replacing subgraphs with equivalent ones that improve overall performance.
For a particular input I, a computation graph G produces an output O, written O = G(I). Two graphs G and G′ are equivalent if they produce the same output for every input: ∀I : G(I) = G′(I).
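As a minimal sketch (not from the paper), this equivalence can be checked empirically on random inputs; the helper below and its names are hypothetical, and a handful of random inputs only gives evidence, not a proof:

```python
import numpy as np

def empirically_equivalent(g, g_prime, input_shape, trials=10, tol=1e-5):
    """Evidence (not proof) that two computation graphs agree:
    run both on random inputs and compare the outputs."""
    for _ in range(trials):
        x = np.random.randn(*input_shape).astype(np.float32)
        if not np.allclose(g(x), g_prime(x), atol=tol):
            return False
    return True

# Example: folding two element-wise scalings into one preserves the output.
g       = lambda x: (x * 2.0) * 3.0
g_prime = lambda x: x * 6.0
assert empirically_equivalent(g, g_prime, (4, 4))
```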
SLIDE 3 Relaxed Graph Substitutions
This is a local form of optimisation and may not produce an optimal graph. Previous work on graph substitutions employed a greedy approach. As with most modern optimising compilers, further optimisations can sometimes be unlocked by accepting worse performance in intermediate steps.
SLIDE 4
Example
Figure: Example relaxed graph substitution optimisation
SLIDE 5
Defining substitutions
Essentially a mapping between a source graph and a target graph. The source graph defines constraints on a subgraph; the target graph uses those constraints to construct the substituted subgraph. The substitution must be valid: the target subgraph has to be equivalent to the source subgraph it replaces (see the sketch below).
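A minimal sketch of what a substitution definition might look like; the Substitution class, the dictionary-based node representation and the fused-convolution example are illustrative assumptions, not the paper's actual API:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Substitution:
    name: str
    # Source graph: a predicate that checks the constraints a concrete
    # subgraph must satisfy (operator types, shared inputs, matching shapes).
    matches: Callable[[dict], bool]
    # Target graph: builds the replacement subgraph from the matched nodes.
    rewrite: Callable[[dict], dict]

# Illustrative substitution: two convolutions that read the same input with
# identical hyper-parameters can be fused into one wider convolution.
fuse_parallel_convs = Substitution(
    name="fuse_parallel_convs",
    matches=lambda sub: (sub["a"]["op"] == "conv2d"
                         and sub["b"]["op"] == "conv2d"
                         and sub["a"]["input"] == sub["b"]["input"]
                         and sub["a"]["stride"] == sub["b"]["stride"]),
    rewrite=lambda sub: {"op": "conv2d",
                         "input": sub["a"]["input"],
                         "stride": sub["a"]["stride"],
                         "out_channels": sub["a"]["out_channels"]
                                         + sub["b"]["out_channels"]},
)
```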
SLIDE 6
Example
Figure: Example substitution definition
SLIDE 7
Cost Model
We need to estimate the cost of each candidate graph. The cost model incorporates many metrics and can also accurately estimate dynamic execution (a sketch follows below).
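A minimal sketch of a per-operator cost model, assuming nodes are dictionaries and per-operator metrics have been measured separately; the specific metric names and weights are my assumptions, not the paper's numbers:

```python
def graph_cost(graph, op_costs, weights=(1.0, 0.0, 0.0)):
    """Estimate the cost of a whole graph as a weighted sum of per-operator
    metrics. Because each operator's metrics are measured once and cached,
    the cost of a candidate graph can be estimated without running it
    end-to-end.

    graph    -- iterable of nodes, each a dict with an 'op' key
    op_costs -- dict: operator name -> (runtime_ms, mem_bytes, launches)
    weights  -- relative importance of each metric
    """
    total = 0.0
    for node in graph:
        metrics = op_costs[node["op"]]
        total += sum(w * m for w, m in zip(weights, metrics))
    return total

# Example usage with made-up per-operator measurements.
op_costs = {"conv2d": (1.2, 4e6, 1), "relu": (0.1, 1e6, 1)}
print(graph_cost([{"op": "conv2d"}, {"op": "relu"}], op_costs))
```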
SLIDE 8
Searching the Space
Use a priority queue to explore the lowest-cost graph first and backtrack if necessary. The space can be huge if we consider all possible substitutions, so a parameter α determines the trade-off between search time and space explored. (See next slide)
SLIDE 9
Search Algorithm
Algorithm 1: A Backtracking Search Algorithm
Input: an initial computation graph G0, a cost model Cost(·), a list of valid graph substitutions {S1, ..., Sm}, and a hyperparameter α
Output: an optimised computation graph Gopt

// Q is a priority queue of graphs sorted by Cost(·)
Q = {G0}; Gopt = G0
while Q ≠ {} do
    G = Q.dequeue()
    for i = 1 to m do
        G′ = Si(G)
        if Cost(G′) < Cost(Gopt) then
            Gopt = G′
        end
        if Cost(G′) < α · Cost(Gopt) then
            Q.enqueue(G′)
        end
    end
end
return Gopt
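A sketch of Algorithm 1 in Python, under assumptions not in the slide: each substitution is a callable that returns a rewritten graph (or None if it does not match), cost is the cost model, and a visited set keyed by `key` is added so that mutually inverse substitutions cannot cycle forever:

```python
import heapq
import itertools

def backtracking_search(g0, cost, substitutions, alpha=1.05, key=repr):
    """Cost-based backtracking search over graph substitutions (Algorithm 1).
    `key` hashes a graph; the visited set is not in the pseudocode but is
    assumed here so the search terminates when substitutions undo each other."""
    counter = itertools.count()               # tie-breaker for the heap
    queue = [(cost(g0), next(counter), g0)]
    seen = {key(g0)}
    g_opt, best = g0, cost(g0)
    while queue:
        _, _, g = heapq.heappop(queue)        # dequeue the cheapest graph
        for subst in substitutions:
            g_new = subst(g)                  # G' = S_i(G); None if S_i does not match
            if g_new is None or key(g_new) in seen:
                continue
            seen.add(key(g_new))
            c = cost(g_new)
            if c < best:                      # new best graph found
                g_opt, best = g_new, c
            if c < alpha * best:              # keep slightly worse graphs to backtrack
                heapq.heappush(queue, (c, next(counter), g_new))
    return g_opt
```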
SLIDE 10 Graph Splitting
Split the graph into smaller subgraphs so the search is more manageable. For each node v, define Cap(v) as the number of substitutions that map onto an in- or out-edge of v. Minimising the number of substitutions that span a split then maps to a minimum vertex cut problem. A local search around the splits can recover further potential optimisations (see the sketch below).
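A rough sketch of the capacity idea, under assumed representations (nodes as a list, each matched substitution as a set of edges); the greedy split below stands in for the minimum vertex cut computation rather than reproducing it:

```python
def capacity(graph_nodes, matches):
    """Cap(v): for each node v, the number of matched substitutions whose
    subgraph touches an in- or out-edge of v. 'matches' is a list of edge
    sets, one per candidate substitution."""
    cap = {v: 0 for v in graph_nodes}
    for edges in matches:
        touched = {u for (u, _) in edges} | {w for (_, w) in edges}
        for v in touched:
            cap[v] += 1
    return cap

def choose_split(graph_nodes, matches):
    # Greedy stand-in for the minimum vertex cut: split at the node that
    # the fewest candidate substitutions span, so few opportunities are lost.
    cap = capacity(graph_nodes, matches)
    return min(cap, key=cap.get)

# Example: a chain a -> b -> c -> d with substitutions spanning (a, b) and
# (b, c); splitting at d invalidates no substitutions.
print(choose_split(["a", "b", "c", "d"], [{("a", "b")}, {("b", "c")}]))
```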
SLIDE 11
Evaluation
Figure: Compared with TensorFlow, TensorRT and TensorFlow XLA
SLIDE 12
Evaluation
Figure: Comparison of different cost metrics
SLIDE 13
Evaluation
Figure: Evaluation of varying values of α
SLIDE 14 Criticism
Strengths:
- Well-defined problem
- System is open-source
- Good testing of the system
- Can be used on top of existing optimisations
SLIDE 15 Criticism
Weaknesses:
- Paper lacked implementation detail
- Poor analysis of results
SLIDE 16
Extensions
Can be used alongside existing systems like TVM or FlexFlow (as we saw last week). There's a new paper in town...
SLIDE 17
TASO
Extends this paper by automatically generating possible graph substitutions. For a given set of operators, it enumerates all possible subgraphs up to a fixed size. It then finds equivalent subgraphs through formal verification.
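The following toy sketch (my own illustration, not TASO's implementation) shows the enumerate-then-group idea on a made-up operator set: candidate operator chains are bucketed by a fingerprint computed on fixed random inputs, and TASO would then formally verify each colliding pair rather than trust the fingerprint:

```python
import itertools
import numpy as np

# Toy operator set; TASO enumerates over real tensor operators.
OPS = {
    "add":  lambda a, b: a + b,
    "mul":  lambda a, b: a * b,
    "relu": lambda a, b: np.maximum(a, 0),   # ignores b
}

def fingerprint(chain, inputs):
    """Evaluate a chain of operators on fixed random inputs. Equal
    fingerprints suggest (but do not prove) equivalent subgraphs."""
    a, b = inputs
    for op in chain:
        a = OPS[op](a, b)
    return np.round(a, 5).tobytes()

rng = np.random.default_rng(0)
inputs = (rng.standard_normal((2, 2)), rng.standard_normal((2, 2)))

# Enumerate all operator chains up to a fixed size, bucket by fingerprint.
buckets = {}
for size in (1, 2):
    for chain in itertools.product(OPS, repeat=size):
        buckets.setdefault(fingerprint(chain, inputs), []).append(chain)

# Buckets with more than one chain are candidate substitutions to verify.
candidates = [group for group in buckets.values() if len(group) > 1]
print(candidates)   # e.g. ('relu',) collides with ('relu', 'relu')
```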
SLIDE 18
Questions?