ARG Availability and reliability monitoring for e-Infrastructures - - PowerPoint PPT Presentation

arg
SMART_READER_LITE
LIVE PREVIEW

ARG Availability and reliability monitoring for e-Infrastructures - - PowerPoint PPT Presentation

ARG Availability and reliability monitoring for e-Infrastructures C. Kanellopoulos, GRNET K. Kagkelidis, GRNET/AUTH You can not manage, what you do not monitor Usage Responsiveness Availability Reliability Profit Performance Cost


slide-1
SLIDE 1

Availability and reliability monitoring for e-Infrastructures

ARG

  • C. Kanellopoulos, GRNET
  • K. Kagkelidis, GRNET/AUTH
slide-2
SLIDE 2

“You can not manage, what you do not monitor”

2

Status Availability

Reliability

Performance

Usage

Cost

Profit

Responsiveness

slide-3
SLIDE 3
  • In the last decade our notion of the e-Infrastructure has

dramatically changed

  • Software Defined Networks
  • Compute and data infrastructure as software

components

  • The boundaries and shape of the e-Infrastructure are

dynamically changing to match the needs of the individual user

  • Monitoring has to evolve in order to catch-up with the

dynamic nature of the systems

Dynamic e-Infrastructures

3

slide-4
SLIDE 4

A model for monitoring large scale distributed infrastructures

4

Messaging Infrastructure

Ext. Source Ext. Source Ext. Source

Agents Agents

slide-5
SLIDE 5

Agents

5

  • Agents: collect data and publish them to the Messaging

Infrastructure

  • Can be monitoring agents, accounting services etc

Messaging Infrastructure

Ext. Source Ext. Source Ext. Source

Agents Agents

slide-6
SLIDE 6
  • Nagios based monitoring
  • Auto-configuration of monitored

services

  • Nagios schedules and executes

probes against the monitored service end points

  • Metric results are retrieved by each

Nagios instance, which in turn publishes them to the MBN

Monitoring agents

6

IT ES UK FR VO VO VO Messaging Infrastructure

slide-7
SLIDE 7

Message Broker Network

7

  • The distributed message broker network is the neural

network of the system (Publish - subscribe)

  • Supports large number of concurrent publishers and

subscribers

Messaging Infrastructure

Ext. Source Ext. Source Ext. Source

Agents Agents

slide-8
SLIDE 8

Messaging in ARGO

8

Message Broker Network

  • ActiveMQ Message Brokers in Greece, Croatia and

Switzerland (now only in Greece and Croatia)

  • High Availability configuration. Auto-discovery through

the Information System

slide-9
SLIDE 9

External sources

9

Messaging Infrastructure

Ext. Source Ext. Source Ext. Source

Agents Agents

  • External sources: can be used as source for information

in order to provide e-Infrastructure specific details

  • Can be Configuration Databases, Management Systems

etc.

slide-10
SLIDE 10

Subscribers

10

  • Any number of concurrent subscribers.
  • E.g. SLA (A/R) Monitoring, Accounting, Billing,

Operations portals etc.

Messaging Infrastructure

Ext. Source Ext. Source Ext. Source

Agents Agents

slide-11
SLIDE 11

Abstract subscriber

11

Abstract subscriber

Web UI Message Broker Network Sync Components Data Store Stream Consumers Source Source Source

slide-12
SLIDE 12

The ARGO Framework

12

A/R Compute Engine A/R Compute Engine A/R Compute Engine

Web Service + Distributed Data Store IT ES UK FR VO VO VO Message Broker Network Sync Components Hadoop Cluster Stream Consumers Source Source Source Prefiltering Batch Layer View Layer

slide-13
SLIDE 13

ARGO A/R

13

slide-14
SLIDE 14
  • A/R consumer:
  • tap and listen to the Message Broker

Network

  • gather relevant messages
  • able to manage and keep relevant fields on

each message

  • deliver data to the A/R Compute Engine
  • Sync components:
  • topology information
  • established grouping of services
  • various weight information regarding grouping
  • downtime information
  • computation profiles
  • ...also deliver data to the A/R Compute

Engine

A/R Compute Engine

14

slide-15
SLIDE 15
  • A/R Compute Engine:
  • Picks up the data delivered
  • both metrics and supplementary sync

files

  • Stages the data (prefiltering process)
  • Executes Computations
  • Sending them as jobs to a Hadoop

Cluster

  • r Locally if desired
  • Results are stored to MongoDB for delivery

through the REST API

A/R Compute Engine

15

slide-16
SLIDE 16
  • Rest API:
  • Supports a number of requests to view the

result data

  • Retrieves data from the MongoDB
  • Authentication
  • Supports recomputation requests
  • WebUI:
  • Provides a UI dashboard with various views
  • n the a/r report results
  • Export of the results to various file formats
  • Transforms User Interaction to specific API

requests

A/R Compute Engine

16

slide-17
SLIDE 17

17

slide-18
SLIDE 18
  • Messaging Infrastructure
  • Introduce a REST API for publishing and consuming messages
  • Provide fine grained ACL support
  • Messaging Infrastructure as a Service for end users
  • ARGO
  • Standalone version of ARGO
  • Support for realtime monitoring at EGI scale
  • Improve the visualisation support
  • Custom of availability profiles for Cloud and Data Infrastructures
  • Support federated access through eduGAIN

Next steps

18

slide-19
SLIDE 19
  • More information at:
  • https://github.com/argoeu

19

Thank you