overcoming multi model forgetting
play

Overcoming Multi-Model Forgetting Y. Benyahia, K. Yu, K. - PowerPoint PPT Presentation

Overcoming Multi-Model Forgetting Y. Benyahia, K. Yu, K. Bennani-Smires, M. Jaggi, A. Davison, M. Salzmann, C. Musat 1 The Weight Sharing In One of the first NAS papers using Reinforcement Learning, Zoph et Al. (Google) used more than 800 gpus


  1. Overcoming Multi-Model Forgetting Y. Benyahia, K. Yu, K. Bennani-Smires, M. Jaggi, A. Davison, M. Salzmann, C. Musat 1

  2. The Weight Sharing In One of the first NAS papers using Reinforcement Learning, Zoph et Al. (Google) used more than 800 gpus in parallel for two weeks . Weight Sharing was introduced in NAS to speed up the process Efficient Neural Architecture Search (Pham et al.) 2

  3. Assumptions Our hypothesis : 1. Weight-sharing can negatively affect architectures . 2. If justified, this can lead to a wrong evaluation of candidates in NAS , making the evaluation phase closer to random 3

  4. Multi-Model Forgetting 4

  5. Study of Weight-Sharing Simple scenario of two models sharing parameters: and Assume that we have access to the optimal parameters of the first model Maximizing the posterior distribution , Cross-entropy loss Weight importance L2 regularization 5

  6. Experiments on Two Models - WPL reduces multi-model forgetting - WPL have a minimal effect on the learning of the second model 6

  7. ENAS on PTB 7

  8. Summing up To recap, our main contributions are: 1. Weight Sharing negatively impacts NAS 2. Weight Sharing can cause the search phase in NAS to become closer to random 3. WPL reduces Multi-Model Forgetting Pacific Ballroom #19 (6:30pm - 9pm) 8

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend