A Unified Monitoring Framework for Energy Consumption and Network - PowerPoint PPT Presentation

A Unified Monitoring Framework for Energy Consumption and Network Traffic Florentin Clouet, Simon Delamare, Jean-Patrick Gelas, Laurent Lefèvre, Lucas Nussbaum, Clément Parisot, Laurent Pouilloux, François Rossigneux Grid’5000 1 / 16

Context: Grid’5000 ◮ Versatile testbed for research on HPC, Clouds, Big Data ◮ 10 sites (1 outside France) ◮ 24 clusters, 1000 nodes, 8000 cores ◮ 10-Gbps backbone (RENATER) ◮ Widely used since 2005: � 500+ users per year � 700+ publications since 2009 https://www.grid5000.fr/ 2 / 16

Maximizing support for advanced experiments Application ◮ Complete control of the testbed’s resources, over the whole stack: Programming environment � Bare-metal system image deployment � Customize your kernel, use your own Cloud stack Application runtime � Network isolation using KaVLAN Grid, Cloud or P2P middleware � no perturbation; protect rest of the testbed Operating system ◮ Trustworthiness : automatic inventory and Networking verification of resources (TRIDENTCOM’2014 paper) ◮ Fully programmable through a REST API � Automating experiments � reproducible research ◮ Higher level tools to facilitate HPC, Clouds, Big Data experiments 3 / 16

Maximizing support for advanced experiments Application ◮ Complete control of the testbed’s resources, over the whole stack: Programming environment � Bare-metal system image deployment � Customize your kernel, use your own Cloud stack Application runtime � Network isolation using KaVLAN Grid, Cloud or P2P middleware � no perturbation; protect rest of the testbed Operating system ◮ Trustworthiness : automatic inventory and Networking verification of resources (TRIDENTCOM’2014 paper) ◮ Fully programmable through a REST API � Automating experiments � reproducible research ◮ Higher level tools to facilitate HPC, Clouds, Big Data experiments This paper: observability, monitoring, measurement 3 / 16

COTS observability tools 4 / 16

COTS observability tools But: ◮ Need to be configured by the experimenters ◮ Often intrusive (running on users’ nodes, non-negligible overhead) 4 / 16

Monitoring solutions for system administration ◮ MRTG, Munin, Ganglia, Nagios, etc. ◮ Main focus: monitor long term variations, tendencies ◮ Designed for low resolution (5 mins) � unsuitable for experimenters 5 / 16

This talk: Kwapi ◮ Monitoring and measurement framework for the Grid’5000 testbed ◮ Initially designed as a power consumption measurement framework for OpenStack – then adapted to Grid’5000’s needs and extended ◮ For energy consumption and network traffic ◮ Measurements taken at the infrastructure level (SNMP on network equipment, power distribution units, etc.) ◮ High frequency (aiming at 1 measurement per second) 6 / 16

Architecture 7 / 16

Multi-metrics support: energy and networking ◮ Future work: extension to other metrics (reactive power, network errors, Infiniband, storage systems, server room temperature, etc.) 8 / 16

Multi-metrics support: energy and networking ◮ 18:39:28 – machines are turned off ◮ 18:40:28 – machines are turned on again and generate network traffic as they boot via PXE ◮ 18:49:28 – machines reservation is terminated, causing a reboot to the default system 8 / 16

Data access and storage ◮ Metrics collected by Kwapi are stored: � In RRD files (typical for monitoring systems) � In HDF5 files, for long-term loss-less archival ⋆ One year of Grid’5000 monitoring = 720 GB ◮ Visualization via a web interface (selection by nodes or job numbers) ◮ Data also exported via the Grid’5000 REST API 9 / 16

Development and deployment challenges ◮ SNMP: � GetBulkRequest to fetch all metrics at once � 64 bits counters (32 bits cycle in 4s on a 10 Gbps network) ◮ Configuration generated automatically from Grid’5000 Reference API � Describes each node’s hardware, including where it is connected (network switch port, PDU port) � Format of SNMP’s IF-Descr fields GigabitEthernet1/%LINECARD%/%PORT% TenGigabitEthernet%LINECARD%/%PORT% Unit: %LINECARD% Slot: 0 Port: %PORT% Gigabit - Level � Includes handling of complex cases (2+ NIC, 2 PSU, shared PDU) ◮ Configuration is automatically tested (Stress CPU and network � compare data retrieved from REST API) 10 / 16

Monitoring overhead ◮ Network traffic: all monitoring traffic on a separate network (also used for e.g. remote control of nodes) ◮ Load on network equipment: no visible impact on performance 11 / 16

Some example use cases 12 / 16

Visualizing TCP congestion control ◮ Linux’s implementation of TCP CUBIC includes the Hystart heuristic � Detects congestion by measuring RTT � Broken until Linux 2.6.32 160 140 120 Bandwidth (MB/s) 100 80 60 40 20 disabled enabled 0 00:00 00:05 00:10 00:15 00:20 00:25 00:30 00:35 00:40 Time (s) ◮ Not as accurate as nuttcp or iperf but: � Measurements are completely passive from the experiment POV � No instrumentation required on nodes 13 / 16

8000 Night or weekends 7000 Day and weekdays Global consumption (W) 6000 5000 4000 3000 2000 1000 0 Jan 29 2015 Feb 01 2015 Feb 04 2015 Feb 07 2015 Feb 10 2015 Feb 13 2015 Feb 16 2015 Feb 19 2015 Date Extracting power consumption trends ◮ Grid’5000 distinguishes between two time periods: � daytime – shared usage to prepare experiments � nights and week-ends – large scale experiments ◮ As a result, there are often free resources during the day ◮ Also, nodes are automatically shut down when not used 14 / 16

8000 Night or weekends 7000 Day and weekdays Global consumption (W) 6000 5000 4000 3000 2000 1000 0 Jan 29 2015 Feb 01 2015 Feb 04 2015 Feb 07 2015 Feb 10 2015 Feb 13 2015 Feb 16 2015 Feb 19 2015 Date Extracting power consumption trends ◮ Grid’5000 distinguishes between two time periods: � daytime – shared usage to prepare experiments � nights and week-ends – large scale experiments ◮ As a result, there are often free resources during the day ◮ Also, nodes are automatically shut down when not used ◮ Does this reflect in power consumption as seen by Kwapi? 14 / 16

Extracting power consumption trends ◮ Grid’5000 distinguishes between two time periods: � daytime – shared usage to prepare experiments � nights and week-ends – large scale experiments ◮ As a result, there are often free resources during the day ◮ Also, nodes are automatically shut down when not used ◮ Does this reflect in power consumption as seen by Kwapi? 8000 Night or weekends 7000 Day and weekdays Global consumption (W) 6000 5000 4000 3000 2000 1000 0 Jan 29 2015 Feb 01 2015 Feb 04 2015 Feb 07 2015 Feb 10 2015 Feb 13 2015 Feb 16 2015 Feb 19 2015 Date 14 / 16

Evaluating energy-aware schedulers ◮ DIET: energy-aware distributed computing middleware ◮ Scheduler starts computing nodes based on energy cost ◮ Kwapi provides a feedback loop 15 / 16

Conclusions ◮ Kwapi: the integrated monitoring solution of the Grid’5000 testbed ◮ Already widely used on Grid’5000 ◮ Available as free software ◮ Try it on your testbed, or on Grid’5000 (Open Access program) ◮ Future work: � Additional metrics � Integrate with other monitoring solutions (sFlow/NetFlow, collectd) � OML support: expose measurement points ◮ Demo 16 / 16

A Unified Monitoring Framework for Energy Consumption and Network - PowerPoint PPT Presentation

A Unified Monitoring Framework for Energy Consumption and Network Traffic Florentin Clouet, Simon Delamare, Jean-Patrick Gelas, Laurent Lefvre, Lucas Nussbaum, Clment Parisot, Laurent Pouilloux, Franois Rossigneux Grid5000 1 / 16

Basics of Unified Sports Ways to get involved with Unified Sports in Ohio Ohio 1 What are

SARVAM UCS Unified Communication Server Unified Communication Server for Modern Enterprises

A Unified Framework for the A Unified Framework for the Consumer-Grade Image Pipeline

Unified Straight and Curved Steel Girder Design Specifications Introduction Unified Steel

SPORTS! Unified Basketball Special Olympics U NIFIED B ASKETBALL Unified Basketball helps

UNIFIED MEMORY IN CUDA 6 MARK HARRIS NVIDIA CONFIDENTIAL Unified Memory Dramatically Lower

UNIFIED PAYMENTS AT A GLANCE DEAR MERCHANT, WELCOME TO UNIFIED PAYMENTS! At Unified Payments,

II.V. Consumption text, Ch 17 20 Objectives : Understand factors affecting consumption

Clean Energy Sources Wind Energy Hydro-Energy Bio-Energy Solar-Energy 1 Why Clean Energy

Meaning of Energy Audit: Energy audit is an official scientific study of energy consumption of an

Gavin Dillingham USGBC Energy Summit October 7, 2014 US Energy Consumption - 2002 US Energy

Energy Consumption in Energy Consumption in IP Networks IP Networks Rodney S. Tucker, Jayant

Energy Consumption and Desalination May 07, 2020 Juan Miguel Pinto, Energy Recovery Inc. NASDAQ:

Traditional Framework Downtown vs. Unified Shopping Center Downtown Unified Shopping Center

Update to the 2020 Unified Transportation Program Texas Transportation Commission 2020 Unified

2021 Unified Transportation Program Development Texas Transportation Commission 2021 Unified

Transportation Data Sharing Oregon Metro Perspective Jeff Frkonja, Robb Kirkman, Peter Bosa

with the Periscope Tuning Framework and the MPI Tools Interface Isaas A. Comprs Technical

Agustinus Nalwan Head of AI Carsales.com.au Carsales Car Buying & Selling Private

Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent

project: meshing technical, political, and social forces Jack C. Swearengen

ECE 697J Advanced Topics Advanced Topics ECE 697J in Computer Networks in Computer

Motivation Moores Law continues More transistors & memory controllers on modern

Natural Language Processing (CSE 517): Dependency Structure Noah Smith 2016 c University of

A Unified Monitoring Framework for Energy Consumption and Network - PowerPoint PPT Presentation

A Unified Monitoring Framework for Energy Consumption and Network Traffic Florentin Clouet, Simon Delamare, Jean-Patrick Gelas, Laurent Lefvre, Lucas Nussbaum, Clment Parisot, Laurent Pouilloux, Franois Rossigneux Grid5000 1 / 16

Basics of Unified Sports Ways to get involved with Unified Sports in Ohio Ohio 1 What are

SARVAM UCS Unified Communication Server Unified Communication Server for Modern Enterprises

A Unified Framework for the A Unified Framework for the Consumer-Grade Image Pipeline

Unified Straight and Curved Steel Girder Design Specifications Introduction Unified Steel

SPORTS! Unified Basketball Special Olympics U NIFIED B ASKETBALL Unified Basketball helps

UNIFIED MEMORY IN CUDA 6 MARK HARRIS NVIDIA CONFIDENTIAL Unified Memory Dramatically Lower

UNIFIED PAYMENTS AT A GLANCE DEAR MERCHANT, WELCOME TO UNIFIED PAYMENTS! At Unified Payments,

II.V. Consumption text, Ch 17 20 Objectives : Understand factors affecting consumption

Clean Energy Sources Wind Energy Hydro-Energy Bio-Energy Solar-Energy 1 Why Clean Energy

Meaning of Energy Audit: Energy audit is an official scientific study of energy consumption of an

Gavin Dillingham USGBC Energy Summit October 7, 2014 US Energy Consumption - 2002 US Energy

Energy Consumption in Energy Consumption in IP Networks IP Networks Rodney S. Tucker, Jayant

Energy Consumption and Desalination May 07, 2020 Juan Miguel Pinto, Energy Recovery Inc. NASDAQ:

Traditional Framework Downtown vs. Unified Shopping Center Downtown Unified Shopping Center

Update to the 2020 Unified Transportation Program Texas Transportation Commission 2020 Unified

2021 Unified Transportation Program Development Texas Transportation Commission 2021 Unified

Transportation Data Sharing Oregon Metro Perspective Jeff Frkonja, Robb Kirkman, Peter Bosa

with the Periscope Tuning Framework and the MPI Tools Interface Isaas A. Comprs Technical

Agustinus Nalwan Head of AI Carsales.com.au Carsales Car Buying &amp; Selling Private

Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent

project: meshing technical, political, and social forces Jack C. Swearengen

ECE 697J Advanced Topics Advanced Topics ECE 697J in Computer Networks in Computer

Motivation Moores Law continues More transistors &amp; memory controllers on modern

Natural Language Processing (CSE 517): Dependency Structure Noah Smith 2016 c University of

Agustinus Nalwan Head of AI Carsales.com.au Carsales Car Buying & Selling Private

Motivation Moores Law continues More transistors & memory controllers on modern