towards a smart data transfer node
play

Towards a Smart Data Transfer Node Zhengchun Liu, Rajkumar - PowerPoint PPT Presentation

The 4th Innovating the Network for Data Intensive Science (INDIS) workshop Towards a Smart Data Transfer Node Zhengchun Liu, Rajkumar Kettimuthu, Ian Foster, Peter H. Beckman Presented by: Zhengchun Liu November 12, 2017, Denver CO Motivation


  1. The 4th Innovating the Network for Data Intensive Science (INDIS) workshop Towards a Smart Data Transfer Node Zhengchun Liu, Rajkumar Kettimuthu, Ian Foster, Peter H. Beckman Presented by: Zhengchun Liu November 12, 2017, Denver CO

  2. Motivation

  3. Motivation Computer systems are getting ever more sophisticated , and human-lead empirical- based approach towards system optimization is not the most e ffi cient way to realize the full potential of these modern and complex high performance computing systems.

  4. Motivation Computer systems are getting ever more sophisticated , and human-lead empirical- based approach towards system optimization is not the most e ffi cient way to realize the full potential of these modern and complex high performance computing systems. ๏ The e ff ectiveness of parameters are not straightforward or intuitive understandable. ๏ The system is dynamic. Fairly impossible to design a one-size-fits-all rule. ๏ Parameter space is very big and very time consuming to explore. ๏ Environment and platform are di ff erent.

  5. Motivation Computer systems are getting ever more sophisticated , and human-lead empirical- based approach towards system optimization is not the most e ffi cient way to realize the full potential of these modern and complex high performance computing systems. ๏ The e ff ectiveness of parameters are not straightforward or intuitive understandable. ๏ The system is dynamic. Fairly impossible to design a one-size-fits-all rule. ๏ Parameter space is very big and very time consuming to explore. ๏ Environment and platform are di ff erent. The data transfer nodes (DTN) are compute systems dedicated for wide area data transfers in distributed science environments.

  6. Motivation Computer systems are getting ever more sophisticated , and human-lead empirical- based approach towards system optimization is not the most e ffi cient way to realize the full potential of these modern and complex high performance computing systems. ๏ The e ff ectiveness of parameters are not straightforward or intuitive understandable. ๏ The system is dynamic. Fairly impossible to design a one-size-fits-all rule. ๏ Parameter space is very big and very time consuming to explore. ๏ Environment and platform are di ff erent. The data transfer nodes (DTN) are compute systems dedicated for wide area data transfers in distributed science environments. Inspired by work from Google Deepmind about using reinforcement learning to play games (e.g., AlphaGo, Atari). We use reinforcement machine learning methods to discover the “just right ” control parameters for data transfer nodes in dynamic environment.

  7. Motivation * Aggregate incoming transfer rate vs. total concurrency (i.e., instantaneous number of GridFTP server instances) at two heavily used endpoints, with Weibull curve fitted. * Z. Liu, P . Balaprakash, R. Kettimuthu, I. Foster, Explaining wide area data transfer performance. HPDC’17

  8. Motivation * Aggregate incoming transfer rate vs. total concurrency (i.e., instantaneous number of GridFTP server instances) at two heavily used endpoints, with Weibull curve fitted. Luckily, the optimal operating point of these two endpoints are almost fixed. However, the optimal operating point of most endpoints are dynamical because of continuously changing external load. * Z. Liu, P . Balaprakash, R. Kettimuthu, I. Foster, Explaining wide area data transfer performance. HPDC’17

  9. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . https://en.wikipedia.org/wiki/Q-learning

  10. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . Agent (Controller) https://en.wikipedia.org/wiki/Q-learning

  11. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . Agent (Controller) Environment / Object https://en.wikipedia.org/wiki/Q-learning

  12. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . Agent (Controller) State S t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  13. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . Agent (Controller) State Action S t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  14. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . Agent (Controller) State Reward Action S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  15. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . Learn Agent (Controller) State Reward Action S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  16. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . S t The state of environment (control object) at any given time t Learn Agent (Controller) State Reward Action S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  17. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . S t The state of environment (control object) at any given time t The corresponding optimal action at any given time t A t Learn Agent (Controller) State Reward Action S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  18. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . S t The state of environment (control object) at any given time t The corresponding optimal action at any given time t A t Learn R t The actual reward from , i.e., what we want to optimize A t Agent (Controller) State Reward Action S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  19. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . S t The state of environment (control object) at any given time t The corresponding optimal action at any given time t A t Learn R t The actual reward from , i.e., what we want to optimize A t Agent (Controller) State Reward Action Q-learning S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  20. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . S t The state of environment (control object) at any given time t The corresponding optimal action at any given time t A t Learn R t The actual reward from , i.e., what we want to optimize A t Agent (Controller) State Reward Action Q-learning S t R t A t Environment / Object https://en.wikipedia.org/wiki/Q-learning

  21. Reinforcement Learning [Idea] An agent interacting with an environment, which provides its current state and numeric reward signals after each action the agent takes. [Goal] Learn how to take actions in order to maximize reward . S t The state of environment (control object) at any given time t The corresponding optimal action at any given time t A t Learn R t The actual reward from , i.e., what we want to optimize A t Agent (Controller) State Reward Action Q-learning S t R t A t Environment / Object Policy Gradient https://en.wikipedia.org/wiki/Q-learning

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend