Standardizing Evaluation of Neural Network Pruning
Jose Javier Gonzalez, Davis Blalock, John V. Guttag
1
Overview
ShrinkBench: an open-source library to facilitate development and standardized evaluation of neural network pruning methods
- Rapid prototyping of NN pruning methods
- Makes it easy to use standardized datasets, pretrained models, and finetuning setups
- Controls for potential confounding factors
- Pretrained networks are often quite accurate, but large
- Pruning: systematically remove parameters from a network
2
Neural Network Pruning
- Goal: reduce the size of the network as much as possible with minimal drop in accuracy
- Often requires finetuning afterwards
3
Neural Network Pruning
[Figure: Accuracy of Pruned Networks. Accuracy vs. compression ratio, 1x to 16x]
4
Traditional Pipeline
Running pruning experiments requires a whole pipeline:
Data → Model → Pruning Algorithm → Finetuning → Evaluation
5
Traditional Pipeline
Data → Model → Pruning Algorithm → Finetuning → Evaluation
But only the pruning algorithm usually changes
6
Traditional Pipeline
Data → Model → Pruning Algorithm → Finetuning → Evaluation
Duplicate effort & confounding variables
7
ShrinkBench
Library to facilitate standardized evaluation of pruning methods
Data → Model → Pruning Algorithm → Finetuning → Evaluation
(the shrinkbench library supplies every stage except the pruning algorithm, plus shared utils)
8
ShrinkBench
- Provides standardized datasets, pretrained models, and evaluation metrics
- Simple and generic parameter masking API
- Measures nonzero parameters, activations, and FLOPs
- Controlled experiments show the need for standardized evaluation
9
Towards Standardization
But how do we standardize?
- Standardized datasets: widely adopted, representative of real-world tasks. Larger datasets (ImageNet) will be more insightful than smaller ones (CIFAR-10)
- Standardized architectures: with a reproducibility record; crucial to match the complexity of the network to the complexity of the dataset/task
- Pretrained models: even for a fixed architecture and dataset, the exact weights may affect results, so it is important to use the same ones
- Finetuning setup: we want improvement coming from pruning, not just from better hyperparameters
10
Masking API
We can capture an arbitrary removal pattern using binary masks.
[Figure: Model (+ Data) fed to a pruning method yields pruning masks; a weight matrix (e.g., -2.1, 4.6, 0.8, ...) is paired with a binary mask marking which parameters survive]
13
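The binary-mask idea above can be sketched in a few lines. The function names and the global magnitude-pruning criterion here are illustrative assumptions, not ShrinkBench's actual API; the weight values are taken from the figure.

```python
import numpy as np

def magnitude_mask(weights, compression):
    # Global magnitude pruning sketch: keep the 1/compression
    # largest-magnitude weights, mask out the rest.
    # (Hypothetical helper, not ShrinkBench's real API.)
    flat = np.abs(weights).ravel()
    k = max(1, int(flat.size // compression))   # parameters to keep
    threshold = np.sort(flat)[-k]               # k-th largest magnitude
    return (np.abs(weights) >= threshold).astype(weights.dtype)

weights = np.array([[-2.1, 4.6, 0.8],
                    [-0.1, 0.2, 1.5],
                    [-4.9, 5.0, 2.3]])
mask = magnitude_mask(weights, compression=3.0)  # keep 9/3 = 3 weights
pruned = weights * mask                          # elementwise masking
```

Finetuning then proceeds with the mask held fixed, so that zeroed parameters stay zero.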
Masks → Accuracy
Given a pruning method in terms of masks, ShrinkBench finetunes the model and systematically evaluates it
[Figure: pruning masks, finetuning, and the resulting accuracy-vs-compression curve]
14
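The masks-to-accuracy evaluation described above can be sketched as a loop over compression ratios. All four callables here are hypothetical stand-ins for the real pipeline stages, not ShrinkBench function names.

```python
import numpy as np

def accuracy_curve(weights, prune_fn, finetune_fn, evaluate_fn,
                   ratios=(1, 2, 4, 8, 16)):
    # For each compression ratio: derive masks, finetune the masked
    # model, and evaluate it. (Illustrative sketch only.)
    curve = {}
    for ratio in ratios:
        mask = prune_fn(weights, ratio)
        tuned = finetune_fn(weights * mask, mask)  # zeros stay zero
        curve[ratio] = evaluate_fn(tuned)
    return curve

def toy_prune(w, ratio):
    # Toy magnitude criterion: keep the 1/ratio largest weights
    k = max(1, w.size // int(ratio))
    thresh = np.sort(np.abs(w).ravel())[-k]
    return (np.abs(w) >= thresh).astype(w.dtype)

w = np.linspace(0.1, 1.6, 16)
curve = accuracy_curve(
    w, toy_prune,
    finetune_fn=lambda w, m: w,  # no-op stand-in for finetuning
    evaluate_fn=lambda w: float(np.count_nonzero(w)) / w.size,
)
```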
ShrinkBench Results I
- ShrinkBench returns both compression & speedup, since they interact differently with pruning
[Table: per-model compression and speedup results]
15
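Both metrics follow from the counts ShrinkBench measures: compression from nonzero parameters, speedup from FLOPs. The helper names below are assumptions for illustration, not the library's API.

```python
import numpy as np

def compression_ratio(weight_tensors):
    # Compression = total parameters / parameters that survived pruning
    total = sum(w.size for w in weight_tensors)
    nonzero = sum(int(np.count_nonzero(w)) for w in weight_tensors)
    return total / nonzero

def theoretical_speedup(dense_flops, pruned_flops):
    # FLOPs-based speedup; differs from compression because layers
    # contribute unequally to compute
    return dense_flops / pruned_flops

layers = [np.array([1.0, 0.0, -2.0, 0.0]),   # toy layer, half pruned
          np.array([0.0, 3.0, 0.0, -1.0])]
print(compression_ratio(layers))              # 8 params / 4 nonzero
```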
ShrinkBench Results II
- ShrinkBench evaluates at varying compression ratios and with several (dataset, architecture) combinations
16
ShrinkBench Results III
- ShrinkBench controls for confounding factors such as pretrained weights or finetuning hyperparameters
- ShrinkBench: an open-source library to facilitate development and standardized evaluation of neural network pruning methods
- Our controlled experiments across hundreds of models demonstrate the need for standardized evaluation.
18