SLIDE 1

AutoML for Object Detection

Xiangyu Zhang MEGVII Research

SLIDE 2

AutoML for Object Detection

  • Advances in AutoML
  • Search for Detection Systems


SLIDE 4

Introduction

• AutoML

  • A meta-approach to generate machine learning systems
  • Automatically search vs. manually design

• AutoML for Deep Learning

  • Neural architecture search (NAS)
  • Hyper-parameter tuning
  • Loss function
  • Data augmentation
  • Activation function
  • Backpropagation

SLIDE 5

Revolution of AutoML

• ImageNet 2012

  • Hand-crafted features vs. deep learning

• Era of Deep Learning begins!

[Chart] ImageNet Classification Top-5 Error (%): OXFORD 27, ISI 26.2, AlexNet 16.4, SPPnet 8.1, VGG 7.3, GoogleNet 6.6, PReLU 4.9, ResNet-152 3.57

SLIDE 6

Revolution of AutoML (cont’d)

• ImageNet 2017

  • Manual architectures vs. AutoML models

[Chart] ImageNet Classification Top-1 Error (%): ResNeXt-101 19.1, SENet 17.3, NASNet-A 17.3, PNASNet-5 17.1, AmoebaNet-A 16.1, EfficientNet 15.6

Era of AutoML?

SLIDE 7

Revolution of AutoML (cont’d)

• Literature

  • 200+ papers since 2017
SLIDE 8

Revolution of AutoML (cont’d)

• Literature

  • 200+ papers since 2017

• Google Trends

SLIDE 9

Recent Advances in AutoML (1)

• Surpassing handcrafted models

  • NASNet

• Keynotes

  • RNN controller + policy gradient
  • Flexible search space
  • Proxy task needed

Zoph et al. Neural Architecture Search with Reinforcement Learning
Zoph et al. Learning Transferable Architectures for Scalable Image Recognition
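The controller-plus-policy-gradient loop above can be sketched in a few lines. This is a toy REINFORCE-style illustration, not the actual NASNet controller: the op list, layer count, proxy reward, baseline, and learning rate are all invented for the demo.

```python
# Toy sketch of an RL architecture controller (REINFORCE-style policy
# gradient). The RNN controller of Zoph et al. is replaced here by
# independent per-layer logits; the "evaluator" is a fake proxy reward.
import math
import random

OPS = ["conv3x3", "conv5x5", "maxpool", "identity"]  # candidate ops (invented)
N_LAYERS = 4
LR = 0.5

# Policy parameters: one logit vector per layer.
logits = [[0.0] * len(OPS) for _ in range(N_LAYERS)]

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def sample_architecture():
    """Sample one op index per layer from the current policy."""
    arch = []
    for layer in logits:
        probs = softmax(layer)
        arch.append(random.choices(range(len(OPS)), weights=probs)[0])
    return arch

def proxy_reward(arch):
    """Fake evaluator: pretend conv3x3 everywhere is best. This stands in
    for training and validating the child network on a proxy task."""
    return sum(1.0 for op in arch if op == 0) / len(arch)

def reinforce_step(baseline=0.5):
    """Policy-gradient update: grad of log softmax is (indicator - prob)."""
    arch = sample_architecture()
    advantage = proxy_reward(arch) - baseline
    for layer, choice in zip(logits, arch):
        probs = softmax(layer)
        for k in range(len(OPS)):
            grad = (1.0 - probs[k]) if k == choice else -probs[k]
            layer[k] += LR * advantage * grad

random.seed(0)
for _ in range(300):
    reinforce_step()

# After training, the policy should favor the op the proxy reward prefers.
best = [max(range(len(OPS)), key=lambda k: layer[k]) for layer in logits]
```

The same loop structure (sample, evaluate, update the controller) underlies the RL-based methods on the following slides; only the evaluator cost changes.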

SLIDE 10

Recent Advances in AutoML (2)

• Search on the target task

  • MnasNet

• Keynotes

  • Search directly on ImageNet
  • Platform-aware search
  • Very costly (thousands of TPU-days)

Tan et al. MnasNet: Platform-Aware Neural Architecture Search for Mobile

SLIDE 11

Recent Advances in AutoML (3)

• Weight Sharing for Efficient Search & Evaluation

  • ENAS
  • One-shot methods

• Keynotes

  • Super network
  • Finetuning & inference only instead of retraining
  • Inconsistency in super-net evaluation

Pham et al. Efficient Neural Architecture Search via Parameter Sharing
Bender et al. Understanding and Simplifying One-Shot Architecture Search
Guo et al. Single Path One-Shot Neural Architecture Search with Uniform Sampling

SLIDE 12

Recent Advances in AutoML (4)

• Gradient-based methods

  • DARTS
  • SNAS, FBNet, ProxylessNAS, …

• Keynotes

  • Joint optimization of architectures and weights
  • Weight sharing implied
  • Sometimes less flexible

Liu et al. DARTS: Differentiable Architecture Search
Xie et al. SNAS: Stochastic Neural Architecture Search
Cai et al. ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Wu et al. FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
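The "joint optimization of architectures and weights" works by continuous relaxation: every candidate op stays in the graph, weighted by a softmax over architecture parameters α, so α itself can be trained by gradient descent. A minimal single-edge sketch, with toy scalar ops and a toy target invented for the demo:

```python
# DARTS-style continuous relaxation on a single edge: the output is a
# softmax(alpha)-weighted sum over ALL candidate ops, and alpha is updated
# by gradient descent on the loss. Ops here are scalar functions, not layers.
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

OPS = [lambda x: x, lambda x: 2 * x, lambda x: 0.0]  # toy candidate ops
alpha = [0.0, 0.0, 0.0]  # architecture parameters

def mixed_op(x):
    """Continuous relaxation: softmax(alpha)-weighted sum of op outputs."""
    w = softmax(alpha)
    return sum(wi * op(x) for wi, op in zip(w, OPS))

def alpha_grad(x, target):
    """d(loss)/d(alpha) for loss = (mixed_op(x) - target)^2, by hand,
    using d w_i / d alpha_j = w_i * (delta_ij - w_j)."""
    w = softmax(alpha)
    out = mixed_op(x)
    douts = [op(x) for op in OPS]
    grads = []
    for j in range(len(alpha)):
        dout = sum(w[i] * ((1 if i == j else 0) - w[j]) * douts[i]
                   for i in range(len(OPS)))
        grads.append(2 * (out - target) * dout)
    return grads

# Target behaviour is "double the input", so the search should put its
# weight on the 2*x op.
for _ in range(200):
    g = alpha_grad(1.0, 2.0)
    alpha = [a - 0.5 * gi for a, gi in zip(alpha, g)]

chosen = max(range(len(OPS)), key=lambda i: alpha[i])  # discretization step
```

The final `argmax` over α is the discretization step; the "sometimes less flexible" keynote refers to this relaxation forcing a fixed menu of ops per edge.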

SLIDE 13

Recent Advances in AutoML (5)

• Performance Predictor

  • Neural Architecture Optimization
  • ChamNet

• Keynotes

  • Architecture encoding
  • Performance prediction models
  • Cold-start problem

Luo et al. Neural Architecture Optimization
Dai et al. ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
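The predictor idea reduces to three steps: encode architectures as vectors, fit a cheap model on already-evaluated (architecture, accuracy) pairs, then rank unseen candidates by prediction. A linear-regression sketch with an invented one-hot encoding and synthetic ground truth (NAO and ChamNet use learned encoders and real measurements):

```python
# Performance-predictor sketch: one-hot architecture encoding + a linear
# model fit by SGD. The "true_accuracy" oracle is synthetic, standing in
# for the expensive evaluations that seed the predictor (the cold start).
import random

N_LAYERS = 5  # each architecture: one op id (0..3) per layer

def encode(arch):
    """One-hot encoding of the op choice at every layer."""
    vec = []
    for op in arch:
        vec.extend(1.0 if k == op else 0.0 for k in range(4))
    return vec

def true_accuracy(arch):
    """Hidden ground truth for the demo: op 2 is good, op 3 is bad."""
    return 0.5 + 0.1 * arch.count(2) - 0.1 * arch.count(3)

def fit_linear(xs, ys, epochs=500, lr=0.05):
    """Least-squares fit by per-sample gradient descent."""
    w = [0.0] * len(xs[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            err = b + sum(wi * xi for wi, xi in zip(w, x)) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    return lambda x: b + sum(wi * xi for wi, xi in zip(w, x))

random.seed(1)
train = [[random.randrange(4) for _ in range(N_LAYERS)] for _ in range(80)]
predictor = fit_linear([encode(a) for a in train],
                       [true_accuracy(a) for a in train])

good, bad = [2] * N_LAYERS, [3] * N_LAYERS  # the predictor should rank these
```

The cold-start keynote is visible even in the sketch: the 80 "training" architectures must be evaluated the expensive way before the predictor is of any use.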

SLIDE 14

Recent Advances in AutoML (6)

• Hardware-aware Search

  • Search with complexity budget
  • Quantization friendly
  • Energy-aware search

• Keynotes

  • Complexity-aware loss & reward
  • Multi-target search
  • Device in the loop

Wu et al. Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search
Véniat et al. Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Wang et al. HAQ: Hardware-Aware Automated Quantization with Mixed Precision
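One concrete form of a "complexity-aware reward" is the MnasNet-style multi-objective reward, accuracy scaled by (latency / target)^w, which softly penalizes models over budget. The exponent and the numbers below are illustrative:

```python
# MnasNet-style multi-objective reward: acc * (latency / target)^w with a
# small negative exponent w, so slower-than-target models lose reward and
# faster ones gain a little. The w value here is only an example.

def complexity_aware_reward(acc, latency_ms, target_ms, w=-0.07):
    """Scalar reward trading accuracy off against measured latency."""
    return acc * (latency_ms / target_ms) ** w

fast = complexity_aware_reward(0.74, 60.0, 80.0)   # under budget: small bonus
slow = complexity_aware_reward(0.76, 160.0, 80.0)  # 2x over budget: penalized
```

With these numbers the slightly less accurate but much faster model wins, which is exactly the behaviour a "device in the loop" search wants: measured latency, not a FLOPs proxy, feeds the reward.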

SLIDE 15

Recent Advances in AutoML (7)

• AutoML in Model Pruning

  • NetAdapt
  • AMC
  • MetaPruning

• Keynotes

  • Search for the pruned architecture
  • Hyper-parameters like channels, spatial size, …

Yang et al. NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications
He et al. AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Liu et al. MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

SLIDE 16

Recent Advances in AutoML (8)

• Handcraft + NAS

  • Human-expert-guided search (IRLAS)
  • Boosting existing handcrafted models (EfficientNet, MobileNet v3)

• Keynotes

  • Very competitive performance
  • Efficient
  • Search space may be restricted

Howard et al. Searching for MobileNetV3
Tan et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Guo et al. IRLAS: Inverse Reinforcement Learning for Architecture Search

SLIDE 17

Recent Advances in AutoML (9)

• Various Tasks

  • Object Detection
  • Semantic Segmentation
  • Super-resolution
  • Face Recognition
  • …

• Not only NAS: search for everything!

  • Activation function
  • Loss function
  • Data augmentation
  • Backpropagation

Liu et al. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation
Chu et al. Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search
Ramachandran et al. Searching for Activation Functions
Alber et al. Backprop Evolution

SLIDE 18

Recent Advances in AutoML (10)

• Rethinking the Effectiveness of NAS

  • Random search
  • Randomly wired networks

• Keynotes

  • Reproducibility
  • Search algorithm or search space?
  • Baselines

Li et al. Random Search and Reproducibility for Neural Architecture Search
Xie et al. Exploring Randomly Wired Neural Networks for Image Recognition

SLIDE 19

Summary: Trends and Challenges

• Trends

  • Efficient & high-performance algorithms
  • Flexible search space
  • Device-aware optimization
  • Multi-task / multi-target search

• Challenges

  • Trade-offs between efficiency, performance, and flexibility
  • Search space matters!
  • Fair benchmarks
  • Pipeline search

[Diagram] Trade-off triangle: Efficiency / Flexibility / Performance

SLIDE 20

AutoML for Object Detection

  • Advances in AutoML
  • Search for Detection Systems


SLIDE 21

AutoML for Object Detection

• Components to search

  • Image preprocessing
  • Backbone
  • Feature fusion
  • Detection head & loss function

SLIDE 26

Search for Detection Systems

Backbone Feature Fusion Augmentation

DetNAS

Chen et al. DetNAS: Backbone Search for Object Detection

SLIDE 27

Challenges of Backbone Search

• Similar to general NAS, but …

  • Controller & evaluator loop
  • Performance evaluation is very slow

• Detection backbone evaluation involves a costly pipeline

  • ImageNet pretraining
  • Finetuning on the detection dataset (e.g. COCO)
  • Evaluation on the validation set
SLIDE 28

Related Work: Single Path One-shot NAS

• Decoupled weight training and architecture optimization
• Super-net training

Guo et al. Single Path One-Shot Neural Architecture Search with Uniform Sampling
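The decoupling can be sketched directly: the super net owns all candidate weights, each training step samples one path uniformly and updates only that path's weights, and afterwards any path can be scored with inherited weights. The "blocks" below are toy scalar multipliers, not real layers, and the regression target is invented:

```python
# Single-path one-shot sketch: uniform path sampling over a super net of
# choice blocks; each SGD step touches only the sampled path's weights.
import random

N_BLOCKS = 3
N_CHOICES = 4
# weights[b][c]: the parameter owned by candidate c of choice block b
weights = [[1.0] * N_CHOICES for _ in range(N_BLOCKS)]

def sample_path():
    """Uniform sampling: one candidate per choice block."""
    return [random.randrange(N_CHOICES) for _ in range(N_BLOCKS)]

def forward(path, x):
    """A 'network' is just the product of the chosen blocks' weights."""
    for b, c in enumerate(path):
        x = x * weights[b][c]
    return x

def train_step(path, x, target, lr=0.01):
    """SGD on the sampled path only; other candidates stay untouched."""
    out = forward(path, x)
    err = out - target
    for b, c in enumerate(path):
        grad = 2 * err * out / weights[b][c]  # d out / d w = out / w
        weights[b][c] -= lr * grad

random.seed(0)
for _ in range(2000):
    train_step(sample_path(), 1.0, 8.0)  # drive every path's output to 8

# Any path can now be evaluated with inherited weights, no retraining --
# which is what makes the later evolutionary search cheap.
some_path_out = forward([0, 1, 2], 1.0)
```

The "inconsistency in super-net evaluation" keynote from the earlier slide shows up here too: inherited weights only rank paths approximately, since no single path was trained to convergence on its own.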

SLIDE 29

Pipeline

• Single-pass approach

  • Pretrain and finetune super net only once
SLIDE 30

Search Space

• Single-path super net

  • 20 (small) or 40 (large) choice blocks
  • 4 candidates for each choice block
  • Search space size: 4^20 or 4^40
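The quoted sizes follow from independent choices, 4 candidates per block multiplied across 20 or 40 blocks (variable names below are mine):

```python
# Search-space size check: 4 independent candidates per choice block.
small_space = 4 ** 20   # 20 choice blocks (small super net)
large_space = 4 ** 40   # 40 choice blocks (large super net)
```

At roughly 10^12 and 10^24 configurations, exhaustive evaluation is hopeless, which is why the next slide turns to evolutionary search over super-net weights.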
SLIDE 31

Search Algorithm

• Evolutionary search

  • Sample & reuse the weights from super net
  • Very efficient
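The sample-and-reuse loop is ordinary evolutionary search over path encodings; only the fitness call changes (in DetNAS it would be detection mAP measured with weights inherited from the super net). A toy version with an invented fitness function and arbitrary sizes and rates:

```python
# Evolutionary search over path encodings: score candidates, keep the top
# parents, produce children by crossover + mutation. The fitness function
# is a fake stand-in for super-net evaluation.
import random

N_BLOCKS, N_CHOICES = 8, 4

def fitness(path):
    """Fake evaluator: candidate 1 is secretly best in every block."""
    return path.count(1) / len(path)

def mutate(path, p=0.2):
    return [random.randrange(N_CHOICES) if random.random() < p else c
            for c in path]

def crossover(a, b):
    return [random.choice(pair) for pair in zip(a, b)]

def evolve(pop_size=20, n_parents=5, generations=15):
    random.seed(0)
    pop = [[random.randrange(N_CHOICES) for _ in range(N_BLOCKS)]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[:n_parents]          # elitism: keep the best as-is
        children = []
        while len(children) < pop_size - n_parents:
            a, b = random.sample(parents, 2)
            children.append(mutate(crossover(a, b)))
        pop = parents + children
    return max(pop, key=fitness)

best = evolve()
```

Because evaluating `fitness` needs only inference through the shared weights, thousands of candidates can be scored at a tiny fraction of the cost of training each one, which is the "very efficient" claim above.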
SLIDE 32

Results

• High performance

  • Significant improvements over commonly used backbones (e.g. ResNet-50) with fewer FLOPs
  • Best classification backbones may be suboptimal for object detection
SLIDE 33

Results

• Search cost

  • Super nets greatly speed up the search!
SLIDE 34

Search for Detection Systems

Backbone Feature Fusion Augmentation

NAS-FPN

Ghaisi et al. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection

SLIDE 35

Feature Fusion Modules

• Multi-scale feature fusion

  • Used in state-of-the-art detectors (e.g. SSD, FPN, SNIP, FCOS, …)

• Automatic search vs. manual design

SLIDE 36

First Glance

• Searched architecture

  • Very different from handcrafted structures
SLIDE 37

Search Space

• Stacking repeated FPN blocks
• For each FPN block, N different merging cells
• For each merging cell, a 4-step generation process
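The 4-step generation for one merging cell can be sketched directly: pick a first input feature, pick a second, pick an output resolution, pick the binary op that merges them. Features below are 1-D lists standing in for pyramid feature maps, and the op set is simplified to sum/max (NAS-FPN's actual binary ops are sum and global-pooling attention):

```python
# NAS-FPN merging-cell sketch: each cell makes 4 choices and appends a new
# feature, which later cells may reuse. Toy 1-D "feature maps" throughout.
import random

def resize(feat, size):
    """Nearest-neighbour resize of a 1-D 'feature map'."""
    n = len(feat)
    return [feat[min(int(i * n / size), n - 1)] for i in range(size)]

def merging_cell(features, rng):
    """One 4-step generation: returns the new merged feature."""
    f1 = rng.choice(features)              # step 1: first input feature
    f2 = rng.choice(features)              # step 2: second input feature
    out_size = len(rng.choice(features))   # step 3: output resolution
    a, b = resize(f1, out_size), resize(f2, out_size)
    op = rng.choice(["sum", "max"])        # step 4: binary merge op
    if op == "sum":
        return [x + y for x, y in zip(a, b)]
    return [max(x, y) for x, y in zip(a, b)]

rng = random.Random(0)
# A tiny 3-level "pyramid": resolutions 8, 4, 2.
pyramid = [[1.0] * 8, [2.0] * 4, [3.0] * 2]
for _ in range(3):  # N merging cells per FPN block
    pyramid.append(merging_cell(pyramid, rng))
```

Letting each new feature re-enter the candidate pool is what produces the unusual cross-scale connections of the searched architecture on the previous slide.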

SLIDE 38

Search Algorithm

• Controller

  • RNN-based controller
  • Search with Proximal Policy Optimization (PPO)

• Candidate evaluation

  • Training a lightweight proxy task
SLIDE 39

Architectures During Search

• Many downsamples and upsamples

SLIDE 40

Results

• State-of-the-art speed/AP trade-off

SLIDE 41

Search for Detection Systems

Backbone Feature Fusion Augmentation

Auto-Augment for Detection

Zoph et al. Learning Data Augmentation Strategies for Object Detection

SLIDE 42

Data Augmentation for Object Detection

• Augmentation pool

  • Color distortions
  • Geometric transforms
  • Random noise (e.g. cutout, DropBlock, …)
  • Mix-up

• Search for the best augmentation configurations

SLIDE 43

Search Space Design

• Mainly follows AutoAugment
• Random sampling from K sub-policies
• For each sub-policy, N image transforms
• Each image transform selected from 22 operations:

  • Color operations
  • Geometric operations
  • Bounding box operations

Cubuk et al. AutoAugment: Learning Augmentation Strategies from Data
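The K-sub-policies / N-transforms structure above can be sketched as data plus two functions. The operations below are toy stand-ins for a few of the 22, the "image" is a flat list of pixel values, and all probabilities and magnitudes are random for the demo:

```python
# AutoAugment-style policy structure: K sub-policies, each a sequence of
# (operation, probability, magnitude) triples; one sub-policy is sampled
# per image and its ops are applied stochastically.
import random

OPERATIONS = {
    # name -> toy transform on a list of pixel values (magnitude-scaled)
    "brightness": lambda img, m: [min(255, p + 10 * m) for p in img],
    "contrast":   lambda img, m: [int(p * (1 + 0.1 * m)) for p in img],
    "cutout":     lambda img, m: [0] * m + img[m:],
}

def random_sub_policy(rng, n_transforms=2):
    """One sub-policy: n_transforms (op, prob, magnitude) triples."""
    return [(rng.choice(list(OPERATIONS)), rng.random(), rng.randrange(1, 10))
            for _ in range(n_transforms)]

def apply_policy(policy, img, rng):
    """Sample one of the K sub-policies, apply its ops stochastically."""
    sub = rng.choice(policy)
    for op_name, prob, mag in sub:
        if rng.random() < prob:
            img = OPERATIONS[op_name](img, mag)
    return img

rng = random.Random(0)
K = 5
policy = [random_sub_policy(rng) for _ in range(K)]
image = [100] * 16  # toy "image"
augmented = apply_policy(policy, image, rng)
```

The search described on the next slide optimizes exactly the discrete contents of `policy` (which ops, with what probability and magnitude); for detection, the bounding-box operations must additionally update the box coordinates alongside the pixels.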

SLIDE 44

Search Space Design (cont’d)

SLIDE 45

Search Algorithm

• Very similar to NAS-FPN
• Controller

  • RNN-based controller
  • Search with Proximal Policy Optimization (PPO)

• Evaluation

  • A small proxy dataset
  • Short-time training
SLIDE 46

Results

• Significantly outperforms the previous state of the art

SLIDE 47

Analysis

• Better regularization

SLIDE 48

Future Work

• More search dimensions

  • E.g. loss, anchor boxes, assignment rules, post-processing, …

• Reducing search cost
• Joint optimization

SLIDE 49

Q & A