Albert-Ludwigs-Universität Freiburg
NAS-Bench-1Shot1:
Benchmarking and Dissecting One-Shot Neural Architecture Search
DeToL 07.11.2019
Julien Siems, Arbër Zela and Frank Hutter
Under review as a conference paper at ICLR 2020
NAS-Bench-1Shot1: Benchmarking and Dissecting One-Shot Neural - - PowerPoint PPT Presentation
NAS-Bench-1Shot1: Benchmarking and Dissecting One-Shot Neural Architecture Search Albert-Ludwigs-Universitt Freiburg DeToL 07.11.2019 Julien Siems, Arbr Zela and Frank Hutter Under review as a conference paper at ICLR 2020 Motivation
Albert-Ludwigs-Universität Freiburg
DeToL 07.11.2019
Julien Siems, Arbër Zela and Frank Hutter
Under review as a conference paper at ICLR 2020
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 2
shot model to perform the search.
Figure adapted from: Dong, Xuanyi, and Yi Yang. "One-Shot Neural Architecture Search via Self-Evaluated Template Network." arXiv preprint arXiv:1910.05733 (2019).
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 3
shot model to perform the search.
Figure adapted from: Dong, Xuanyi, and Yi Yang. "One-Shot Neural Architecture Search via Self-Evaluated Template Network." arXiv preprint arXiv:1910.05733 (2019).
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 4
shot model to perform the search.
hyperparameters?
expensive.
Figure adapted from: Dong, Xuanyi, and Yi
Architecture Search via Self-Evaluated Template Network." arXiv preprint arXiv:1910.05733 (2019).
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 5
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 6
DARTS Search Phases Architecture Search Architecture Evaluation
Liu et al. 2018
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 7
DARTS Search Phases Architecture Search Architecture Evaluation
Price to pay to check intermediate architectures
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 8
Architecture Evaluation
NASBench-101
space CIFAR-10 [REF]
How can we use NASBench for Architecture Evaluation?
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 9
DARTS Search Space NASBench Search Space
combinations of tensors
node outputs
cell:
Architectures in the DARTS Search Space are usually not part of the NASBench Search Space.
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 10
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 11
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 12
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 13
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 14
This allowed the following analysis:
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 15
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 16
DARTS [Liu et al. 18]
Figure from Xu, Yuhui, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, and Hongkai Xiong. "PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search." (2019).
PC- DARTS [Xu et al. 19] GDAS [Dong et al. 19] Random Search with Weight Sharing [Li et al. 19]
through each cell.
need to be evaluated
each batch and train one-shot model weights.
Discrete optimizers:
More optimizers to be done …
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 17
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 18
Optimizer Comparison Search Space 3 Search Space 1
number of epochs
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 19
Regularized Search (Cutout) – Search Space 3 GDAS PC-DARTS DARTS
architectures.
positive impact
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 20
Regularized Search (Weight Decay) – Search Space 3 DARTS GDAS PC-DARTS
Higher regularization -> less stable search Higher regularization -> less stable search High regularization -> less stable search
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 21
Effect of one-shot learning rate – Search Space 3 DARTS GDAS PC-DARTS
High learning-rate -> less stable search High learning-rate -> better search High learning-rate -> less stable search
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 22
Correlation
DARTS GDAS PC-DARTS Random-WS 1 2 3
11/07/2019 Benchmarking and Dissecting One-Shot Neural Architecture Search 23
Optimize the hyperparameters of one-shot NAS optimizers using BOHB [Falkner et al. 2018]
black-box NAS optimizers
11/07/2019 Benchmarking and Disecting One-shot Neural Architecture Search 24
benchmarks that enable to evaluate the anytime performance of
tuned properly Future work:
[Cai et al. 2019], etc.
tables.