When Ensembling Smaller Models is More Efficient than Single Large Models

Jan 27, 2023

When Ensembling Smaller Models is More Efficient than Single Large Models
WebVision 2020
Dan Kondratyuk, Mingxing Tan, Matthew Brown, Boqing Gong
{dankondratyuk,tanmingxing,mtbr,bgong}@google.com

Model Ensembles
● Train multiple models and average their predictions during inference
  ○ E.g., train a neural network architecture with different random initializations
● Easy method to reduce prediction error
● Introduces heavy efficiency penalties
  ○ Most commonly reserved for the largest models
● Can small ensembles be efficient?
[Figure: an input example is passed to Model 1, Model 2, ..., Model N; their outputs are aggregated into a single prediction]
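
The averaging step on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the helper names `ensemble_predict` and `argmax` and the toy probability vectors are assumptions made for the example, and each model is represented only by the class probabilities it outputs.

```python
from typing import List, Sequence


def ensemble_predict(per_model_probs: Sequence[Sequence[float]]) -> List[float]:
    """Average class probabilities from several models (hypothetical helper)."""
    if not per_model_probs:
        raise ValueError("need at least one model's predictions")
    num_classes = len(per_model_probs[0])
    if any(len(p) != num_classes for p in per_model_probs):
        raise ValueError("all models must predict the same number of classes")
    n = len(per_model_probs)
    # Element-wise mean across models, one value per class.
    return [sum(p[c] for p in per_model_probs) / n for c in range(num_classes)]


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value (the first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


# Made-up outputs of three models for one input over three classes.
probs = [
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.5, 0.4, 0.1],
]
averaged = ensemble_predict(probs)
predicted_class = argmax(averaged)
```

In this toy case the second model disagrees with the other two, and the averaged distribution still favors class 0.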

Image Classification - Wide ResNet - CIFAR-10
● Ensembles can be both more accurate and more efficient
  ○ Each line represents one model architecture
  ○ Each point indicates the number of models ensembled
  ○ As model sizes get larger, the performance gap widens
  ○ Larger ensembles produce diminishing returns and become less efficient

Image Classification - EfficientNet - ImageNet
● This trend appears for highly optimized models on larger datasets as well
  ○ EfficientNet scales the width, depth, and resolution of each model size
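
For readers unfamiliar with how EfficientNet scales width, depth, and resolution together, the sketch below illustrates the compound-scaling rule. It is not taken from the slides: the base coefficients (1.2, 1.1, 1.15) are the values reported in the EfficientNet paper, the function name `compound_scale` is hypothetical, and published model variants may use rounded or adjusted multipliers.

```python
from typing import Tuple

# Base coefficients for depth, width, and resolution (EfficientNet paper values).
ALPHA, BETA, GAMMA = 1.2, 1.1, 1.15


def compound_scale(phi: float) -> Tuple[float, float, float, float]:
    """Return (depth, width, resolution, approx. FLOPs) multipliers for phi."""
    depth = ALPHA ** phi
    width = BETA ** phi
    resolution = GAMMA ** phi
    # FLOPs grow roughly linearly with depth and quadratically with
    # width and input resolution.
    flops = depth * width ** 2 * resolution ** 2
    return depth, width, resolution, flops


baseline = compound_scale(0)   # all multipliers equal 1.0
one_step = compound_scale(1)   # FLOPs multiplier is a little under 2
```

Under this rule each unit increase of `phi` roughly doubles the FLOPs, which gives the family of model sizes that the ensembles are compared against.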

NAS Ensemble - ImageNet
● Can we use NAS to generate diverse ensemble architectures?
  ○ Can architecture diversity boost the accuracy to FLOPs/latency ratio?
  ○ Pareto curve shown for model ensembles searched with NAS
  ○ Surprisingly, a single searched model performs nearly the same as a diverse ensemble
[Figure: Pareto curve; x-axis: Latency (ms)]
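
The Pareto curve mentioned on this slide keeps only the candidates that no other candidate beats on both latency and accuracy. The sketch below shows one simple way to compute such a front. The function name `pareto_front` and the (latency, accuracy) pairs are made up for illustration and are not results from the talk.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (latency in ms, accuracy in %)


def pareto_front(points: Sequence[Point]) -> List[Point]:
    """Keep points for which no other point is at least as fast and more accurate."""
    # Sort by latency ascending; on equal latency, higher accuracy comes first.
    ordered = sorted(points, key=lambda p: (p[0], -p[1]))
    front: List[Point] = []
    best_accuracy = float("-inf")
    for latency, accuracy in ordered:
        # A slower candidate is kept only if it improves on the best accuracy so far.
        if accuracy > best_accuracy:
            front.append((latency, accuracy))
            best_accuracy = accuracy
    return front


# Hypothetical candidates (single models or ensembles), not measured values.
candidates = [(10.0, 70.0), (20.0, 75.0), (25.0, 74.0), (40.0, 78.0), (40.0, 77.0)]
front = pareto_front(candidates)
```

Here the candidates at 25 ms and the weaker one at 40 ms are dropped because a faster or equally fast candidate is more accurate.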

Conclusion
● Ensembles of smaller models can be more accurate and more efficient than single large models, especially as model size grows
  ○ One can use ensembles as a more flexible trade-off between a model’s inference speed and accuracy
  ○ Ensembles can be easily distributed across multiple workers, further increasing efficiency
● A single searched model using NAS can find a well-optimized architecture for ensembling
  ○ However, ensembling diverse architectures from a search on multiple models performs nearly the same as ensembling one model architecture from the search
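
The point about distributing an ensemble across workers can be made concrete with a rough cost model. This is a simplified sketch under stated assumptions: one model per worker, no communication or aggregation overhead, and made-up FLOPs and latency figures. The function name `ensemble_cost` is hypothetical.

```python
from typing import Dict, Sequence


def ensemble_cost(flops: Sequence[float], latency_ms: Sequence[float]) -> Dict[str, float]:
    """Rough compute and latency cost of running every ensemble member once."""
    if not flops or len(flops) != len(latency_ms):
        raise ValueError("provide one FLOPs value and one latency per model")
    return {
        # Total compute is the same however the models are scheduled.
        "total_flops": sum(flops),
        # On a single worker the models run one after another.
        "sequential_latency_ms": sum(latency_ms),
        # With one worker per model, the slowest member sets the latency
        # (assumption: aggregation and communication costs are ignored).
        "parallel_latency_ms": max(latency_ms),
    }


# Three identical small models with illustrative, made-up costs.
cost = ensemble_cost([1.0e9, 1.0e9, 1.0e9], [12.0, 12.0, 12.0])
```

Under these assumptions the ensemble costs three times the FLOPs of one member, but its parallel latency equals that of a single member.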
