A Hybrid Reinforcement Learning Approach to Autonomic Resource - PowerPoint PPT Presentation

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation (2006) Paper: Gerald Tesauro, Nicholas K. Jongt, Rajarshi Das and Mohamed N. Bennanit Presentation: Nat McAleese (nm583)

What is autonomic computing? “ computing systems that can manage themselves given high-level objectives from administrators ” An IBM marketing buzzword? (probably) [0] Totally inevitable due to system complexity? Elastic scaling? [0] Kephart, Jeffrey O., and David M. Chess. "The vision of autonomic computing." Computer 36.1 (2003): 41-50.

Problem Maximise system-wide utility, which is assumed to be based on system performance relative to SLAs (Service Level Objectives) Considers the sum of utilities from application specific utility functions. For example web services are penalised if they fall below some latency, batch jobs improve linearly with more added resources

The scheme

Critical Detail “Given enough training samples, RL can converge to the correct value function V π associated with any fixed policy π, and that the new policy whose behavior greedily maximizes V π is guaranteed to improve upon the original policy.” [0] [0] G. Tesauro et al.: A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation, ICAC, 2006.

The scheme Each application manager submits a utility function, computed by evaluating the learnt Q(s, a) for all a. The resource arbiter then optimises [0] But it involves solving an NP-hard mixed integer programming model at the resource arbiter. Could it help with adding/removing processes? Is there an (unintentionally) adversarial component? Need more data. [0] Walsh, William E., et al. "Utility functions in autonomic systems." Autonomic Computing, 2004. Proceedings. International Conference on. IEEE, 2004.

The network 3 inputs - current demand λ, n t-1 , n t 12 sigmoid hidden units 1 output. It’s tiny. [0] G. Tesauro et al.: A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation, ICAC, 2006.

Results [0] G. Tesauro et al.: A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation, ICAC, 2006.

(With the benefit of hindsight, paper was 2006) Future work? These approaches could be improved by *loads* of recent developments. I’d start with applying some (or all) of the recently discovered optimisations for reinforcement learning [0]: And recent work to improve the raw performance Of fully connected nets [1] The real problem is (in my opinion) is lack of a standard benchmark. [0] Hessel, Matteo, et al. "Rainbow: Combining Improvements in Deep Reinforcement Learning." arXiv preprint arXiv:1710.02298 (2017). [1] Klambauer, Günter, et al. "Self-Normalizing Neural Networks." arXiv preprint arXiv:1706.02515 (2017).

(With the benefit of hindsight, paper was 2006) Criticism Does the system design make any sense at all? Is it more scalable? Their previous paper does not present a coherent argument for why this scheme is useful with *learned* utility functions, instead arguing that they are intuitive to design. Compatibility with software that’s actually used - OpenStack is becoming increasingly popular, and has been the target of similar work. Standard benchmarks Standard benchmarks.

(With the benefit of hindsight, paper was 2006) Criticism From next week, the papers that lack a standard benchmarking approach: Dutreilh, Xavier, et al. "Using reinforcement learning for autonomic resource allocation in clouds: towards a fully automated workflow." ICAS 2011, The Seventh International Conference on Autonomic and Autonomous Systems. 2011. LaCurts, Katrina, et al. "Cicada: Introducing Predictive Guarantees for Cloud Networks." HotCloud 14 (2014): 14-19. Roy, Nilabja, Abhishek Dubey, and Aniruddha Gokhale. "Efficient autoscaling in the cloud using predictive models for workload forecasting." Cloud Computing (CLOUD), 2011 IEEE International Conference on. IEEE, 2011. Delimitrou, Christina, and Christos Kozyrakis. "Quasar: resource-efficient and QoS-aware cluster management." ACM SIGPLAN Notices. Vol. 49. No. 4. ACM, 2014. The CherryPick guys tried though! - used, SparkKM, SparkReg TPC-DS, TPC-H, TeraSort. Unfortunately these are just batch processing workloads. - Alipourfard, Omid, et al. "CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics." NSDI. 2017.

A Hybrid Reinforcement Learning Approach to Autonomic Resource - PowerPoint PPT Presentation

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation (2006) Paper: Gerald Tesauro, Nicholas K. Jongt, Rajarshi Das and Mohamed N. Bennanit Presentation: Nat McAleese (nm583) What is autonomic computing? computing systems

Autonomic Systems Autonomic Systems Autonomic : adaptive : adaptive Autonomic Self

Autonomic Dysfunction: Autonomic Non-Epileptic Seizures and the Autonomic Epilepsies James J.

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Autonomic Security Compliance Framework Cihan Tunc and Salim Hariri Cloud and Autonomic

AUTONOMIC DISORDERS AND AUTONOMIC TESTING Kamal R. Chmali, MD Associate Professor of Clinical

Autonomic Web-based Simulation Yingping Huang and Gregory Madey Computer Science and Engineering

From autonomic computing to autonomic ICT Fabrice Saffre Pervasive ICT Research Centre Fabrice

Autonomic Addressing draft-behringer-anima-autonomic-addressing-02.txt 94 rd IETF, 2 Nov 2015

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model for t he Dist

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Hybrid Construction Hybrid Construction Hybrid Construction Hybrid Construction 1 VP

Status of This Document This is the outcome document from the third GNSO Council Strategic

Performing Vendor Risk Assessments You can outsource the work, but you cant outsource the

T. R. Dua Deputy Director General, COAI 1 TM Broadband is no longer a luxury. It has

LaGOV LaGOV Version 2.02 Updated: 11/7/2008 8:58 AM Agenda Logistics, Ground Rules &

Presentation at the United Nations Convention on Certain Conventional Weapons Paul Scharre April

SmartSA Vendor Summit Mobility We plan to keep you moving Cooperative Comprehensive Continuous

Kirkland Citywide ITS Study City of Kirkland ITS Development Process 1996-2006 NE 124 th ITS

DR. WERNER HUBER. HEAD OF DRIVER ASSISTANCE AND PERCEPTION. BMW GROUP. ON THE WAY TOWARDS

A Hybrid Reinforcement Learning Approach to Autonomic Resource - PowerPoint PPT Presentation

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation (2006) Paper: Gerald Tesauro, Nicholas K. Jongt, Rajarshi Das and Mohamed N. Bennanit Presentation: Nat McAleese (nm583) What is autonomic computing? computing systems

Autonomic Systems Autonomic Systems Autonomic : adaptive : adaptive Autonomic Self

Autonomic Dysfunction: Autonomic Non-Epileptic Seizures and the Autonomic Epilepsies James J.

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Autonomic Security Compliance Framework Cihan Tunc and Salim Hariri Cloud and Autonomic

AUTONOMIC DISORDERS AND AUTONOMIC TESTING Kamal R. Chmali, MD Associate Professor of Clinical

Autonomic Web-based Simulation Yingping Huang and Gregory Madey Computer Science and Engineering

From autonomic computing to autonomic ICT Fabrice Saffre Pervasive ICT Research Centre Fabrice

Autonomic Addressing draft-behringer-anima-autonomic-addressing-02.txt 94 rd IETF, 2 Nov 2015

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model for t he Dist

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Hybrid Construction Hybrid Construction Hybrid Construction Hybrid Construction 1 VP

Status of This Document This is the outcome document from the third GNSO Council Strategic

Performing Vendor Risk Assessments You can outsource the work, but you cant outsource the

T. R. Dua Deputy Director General, COAI 1 TM Broadband is no longer a luxury. It has

LaGOV LaGOV Version 2.02 Updated: 11/7/2008 8:58 AM Agenda Logistics, Ground Rules &amp;

Presentation at the United Nations Convention on Certain Conventional Weapons Paul Scharre April

SmartSA Vendor Summit Mobility We plan to keep you moving Cooperative Comprehensive Continuous

Kirkland Citywide ITS Study City of Kirkland ITS Development Process 1996-2006 NE 124 th ITS

DR. WERNER HUBER. HEAD OF DRIVER ASSISTANCE AND PERCEPTION. BMW GROUP. ON THE WAY TOWARDS

LaGOV LaGOV Version 2.02 Updated: 11/7/2008 8:58 AM Agenda Logistics, Ground Rules &