

SLIDE 1

Device Placement Optimization with Reinforcement Learning

Azalia Mirhoseini, Hieu Pham, Quoc V. Le, Benoit Steiner, Rasmus Larsen, Yuefeng Zhou, Naveen Kumar, Mohammad Norouzi, Samy Bengio, Jeff Dean

SLIDE 2

What is device placement

  • Consider a TensorFlow computational graph G, which consists of M operations {o1, o2, …, oM}, and a list of D available devices.
  • A placement P = {p1, p2, …, pM} is an assignment of an operation oi to a device pi (see the sketch below).
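
To make the notation concrete, here is a minimal Python sketch; the op names and device strings are illustrative placeholders, not taken from the paper:

```python
# A placement is a mapping from each operation in the graph to one
# of the available devices (all names here are illustrative).
ops = ["embed", "lstm", "attention", "softmax"]   # M operations {o1, ..., oM}
devices = ["/cpu:0", "/gpu:0", "/gpu:1"]          # D available devices

# One possible placement P = {p1, ..., pM}: operation oi -> device pi
placement = {
    "embed": "/gpu:0",
    "lstm": "/gpu:0",
    "attention": "/gpu:1",
    "softmax": "/cpu:0",
}
```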

SLIDE 3

Why device placement

  • Trend toward many-device training, bigger models, and larger batch sizes
  • Growth in the size and computational requirements of training and inference

SLIDE 4

Typical approaches

  • Use a heterogeneous distributed environment with a mixture of many CPUs and GPUs
  • Often based on greedy heuristics
  • Require deep understanding of devices: bandwidth and latency behavior
  • Are not flexible enough and do not generalize well

SLIDE 5

ML for device placement

  • ML is increasingly replacing rule-based heuristics
  • RL can be applied to device placement:

– Effective search across large state and action spaces to find optimal solutions

– Automatic learning from the underlying environment, based only on a reward function

SLIDE 6

RL-based device placement

[Figure: the RL model takes as input the neural model and the available devices (CPUs and GPUs); the policy outputs an assignment of the model's ops to devices, and the runtime of that placement is evaluated as the reward.]
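
The loop in the diagram can be sketched in a few lines of Python; the helpers below are stand-ins for the learned policy and for actually executing the placed graph, not the paper's code:

```python
import random

ops = ["embed", "lstm", "attention", "softmax"]  # ops of the neural model
devices = ["/cpu:0", "/gpu:0", "/gpu:1"]         # available devices

def sample_placement():
    # Stand-in for the learned policy: uniform random assignment.
    return {op: random.choice(devices) for op in ops}

def measure_runtime(placement):
    # Stand-in for executing the placed graph and timing it.
    return 1.0 + random.random()

best_placement, best_time = None, float("inf")
for step in range(100):
    p = sample_placement()
    t = measure_runtime(p)   # measured runtime is the reward signal
    if t < best_time:
        best_placement, best_time = p, t
```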

SLIDE 7

Problem formulation

The policy is trained to minimize the expected runtime of the placements it samples:

J(θ) = E_{P∼π(P|G;θ)}[r(P)]

where J(θ) is the expected runtime, θ are the trainable parameters of the policy, r(P) is the runtime of a placement, π is the policy, and P are the output placements.
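
A Monte Carlo estimate of this objective is just the average runtime over sampled placements; a minimal sketch, reusing the stand-in helpers from the previous slide:

```python
# Estimate J(theta) ~= (1/K) * sum_i r(P_i) with K sampled placements.
def estimate_objective(sample_placement, measure_runtime, k=10):
    runtimes = [measure_runtime(sample_placement()) for _ in range(k)]
    return sum(runtimes) / k
```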

SLIDE 8

Training with REINFORCE

  • Learn the network parameters using the Adam optimizer, based on policy gradients computed via the REINFORCE equation:

∇θ J(θ) = E_{P∼π(P|G;θ)}[r(P) · ∇θ log p(P|G;θ)]

  • Use K placement samples to estimate the policy gradient, and a baseline term B to reduce variance (see the sketch after this list):

∇θ J(θ) ≈ (1/K) Σ_{i=1..K} (r(Pi) − B) · ∇θ log p(Pi|G;θ)
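
A minimal NumPy sketch of this estimator, under simplifying assumptions: a factored softmax policy (one categorical distribution per op), a simulated runtime in place of real measurements, and a plain gradient step instead of Adam:

```python
import numpy as np

rng = np.random.default_rng(0)
M, D = 4, 3                  # M ops, D devices
theta = np.zeros((M, D))     # per-op device logits (policy parameters)
lr, K = 0.1, 8               # learning rate, samples per update
baseline = None              # moving-average baseline B

def softmax(x):
    z = np.exp(x - x.max(axis=-1, keepdims=True))
    return z / z.sum(axis=-1, keepdims=True)

def fake_runtime(placement):
    # Simulated reward: pretend device 1 is fastest for every op.
    return 1.0 + 0.2 * np.sum(placement != 1)

for step in range(200):
    probs = softmax(theta)   # pi(pi | oi; theta) for each op
    grads, runtimes = [], []
    for _ in range(K):
        placement = np.array([rng.choice(D, p=probs[i]) for i in range(M)])
        runtimes.append(fake_runtime(placement))
        # grad_theta log p(P | theta) = one-hot(chosen device) - probs
        g = -probs.copy()
        g[np.arange(M), placement] += 1.0
        grads.append(g)
    r = np.array(runtimes)
    baseline = r.mean() if baseline is None else 0.9 * baseline + 0.1 * r.mean()
    # Descend on (1/K) * sum_i (r_i - B) * grad log p(P_i) to reduce runtime.
    update = sum((ri - baseline) * gi for ri, gi in zip(r, grads)) / K
    theta -= lr * update
```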

SLIDE 9

Model architecture

SLIDE 10

Challenges

  • Vanishing and exploding gradient issues
  • Large memory footprints

SLIDE 11

Distributed training

SLIDE 12

Experiments

  • Recurrent Neural Language Model (RNNLM)
  • Neural Machine Translation with attention mechanism (NMT)
  • Inception-V3

SLIDE 13

Learned placement on NMT

SLIDE 14

NMT end-to-end runtime

SLIDE 15

Learned placement on Inception-V3

SLIDE 16

Inception-V3 end-to-end runtime

SLIDE 17

Profiling on NMT

SLIDE 18

Profiling on Inception-V3

SLIDE 19

Profiling on Inception-V3

SLIDE 20

Running times (in seconds)

SLIDE 21

Summary

  • Propose an RL model to optimize device placement for neural networks
  • Use policy gradients to learn the parameters
  • The policy finds non-trivial assignments of operations to devices that outperform heuristic approaches
  • Profiling of the results shows that the policy learns implicit trade-offs between computation and communication in hardware