Automatic deployment of the Network Weather Service using the - - PowerPoint PPT Presentation

automatic deployment of the network weather service using
SMART_READER_LITE
LIVE PREVIEW

Automatic deployment of the Network Weather Service using the - - PowerPoint PPT Presentation

TOTO Automatic deployment of the Network Weather Service using the Effective Network View Arnaud Legrand and Martin Quinson Laboratoire de lInformatique du Parall elisme Ecole Normale Sup erieure de Lyon IPDPS 2004 Arnaud Legrand


slide-1
SLIDE 1

TOTO

Automatic deployment of the Network Weather Service using the Effective Network View

Arnaud Legrand and Martin Quinson

Laboratoire de l’Informatique du Parall´ elisme ´ Ecole Normale Sup´ erieure de Lyon

IPDPS 2004

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (1/15)

slide-2
SLIDE 2

The Network Weather Service Effective Network View Deploying the NWS using ENV

Outline

The Network Weather Service Overview Functionalities Configuration & Deployment Effective Network View Overview Mapping algorithm Summary Deploying the NWS using ENV Deployment design Example result Applying the deployment

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (2/15)

slide-3
SLIDE 3

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

The Network Weather Service overview

Overview

Goal: (Grid) system availabilities measurement and forecasting

Project from UCSB, used by AppLeS, Globus, NetSolve, Ninf, DIET, . . .

Architecture

Sensor: conducts measurements Memory: stores the results Forecaster: future tendencies (statistically) Name server: directory service (like LDAP)

Sensor Sensor Nameserver Memory Forecaster

Distributed system

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (3/15)

slide-4
SLIDE 4

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

The Network Weather Service overview

Overview

Goal: (Grid) system availabilities measurement and forecasting

Project from UCSB, used by AppLeS, Globus, NetSolve, Ninf, DIET, . . .

Architecture

Sensor: conducts measurements Memory: stores the results Forecaster: future tendencies (statistically) Name server: directory service (like LDAP)

Sensor Sensor Nameserver Memory Forecaster External source Test Test

Steady state: regular tests

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (3/15)

slide-5
SLIDE 5

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

The Network Weather Service overview

Overview

Goal: (Grid) system availabilities measurement and forecasting

Project from UCSB, used by AppLeS, Globus, NetSolve, Ninf, DIET, . . .

Architecture

Sensor: conducts measurements Memory: stores the results Forecaster: future tendencies (statistically) Name server: directory service (like LDAP)

Request Client Sensor Sensor Nameserver Memory Forecaster

Handling of a request

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (3/15)

slide-6
SLIDE 6

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

The Network Weather Service overview

Overview

Goal: (Grid) system availabilities measurement and forecasting

Project from UCSB, used by AppLeS, Globus, NetSolve, Ninf, DIET, . . .

Architecture

Sensor: conducts measurements Memory: stores the results Forecaster: future tendencies (statistically) Name server: directory service (like LDAP)

Client Sensor Sensor Nameserver Memory Forecaster

Handling of a request

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (3/15)

slide-7
SLIDE 7

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

The Network Weather Service overview

Overview

Goal: (Grid) system availabilities measurement and forecasting

Project from UCSB, used by AppLeS, Globus, NetSolve, Ninf, DIET, . . .

Architecture

Sensor: conducts measurements Memory: stores the results Forecaster: future tendencies (statistically) Name server: directory service (like LDAP)

Answer Client Sensor Sensor Nameserver Memory Forecaster

Handling of a request

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (3/15)

slide-8
SLIDE 8

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

NWS measurements and forecasting

Provided metrics

bandwidthTcp, latencyTcp (Default: 64Kb in 16Kb messages; buffer=32Kb), availableCpu (for an incoming process), currentCpu (for existing processes), connectTimeTcp, freeDisk, freeMemory, . . .

Statistical forecasting

Selection of the best statistical method (mean, median, gradian, last value, . . . ) Data = serie: D1, D2, . . . , Dn−1, Dn. We want Dn+1. Methods are applied on D1, D2, . . . , Dn−1. each one predict Dn. Selection of the best on Dn to predict Dn+1.

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (4/15)

slide-9
SLIDE 9

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

NWS configuration & deployment

Deployment requirements

Correction Do not let experiments interfere.

Two test packets on same link ⇒ each report half of bandwidth Clique: set on which tests are done in a mutually exclusive manner

Scalability Keep cliques small for sufficient frequency and reactivity. Completeness Estimate each host pair

⇒ aggregation when lacking direct measurement ⇒ cliques should follow sub-net tilling

Intrusiveness Conduct only needed test One pair is representative on a hub

Configuring NWS is a difficult task

Need to know both the tool and the topology (link with potential collisions).

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (5/15)

slide-10
SLIDE 10

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

NWS configuration & deployment

Deployment requirements

Correction Do not let experiments interfere.

Two test packets on same link ⇒ each report half of bandwidth Clique: set on which tests are done in a mutually exclusive manner

Scalability Keep cliques small for sufficient frequency and reactivity. Completeness Estimate each host pair

⇒ aggregation when lacking direct measurement ⇒ cliques should follow sub-net tilling

Intrusiveness Conduct only needed test One pair is representative on a hub

Configuring NWS is a difficult task

Need to know both the tool and the topology (link with potential collisions).

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (5/15)

slide-11
SLIDE 11

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

NWS configuration & deployment

Deployment requirements

Correction Do not let experiments interfere.

Two test packets on same link ⇒ each report half of bandwidth Clique: set on which tests are done in a mutually exclusive manner

Scalability Keep cliques small for sufficient frequency and reactivity. Completeness Estimate each host pair

⇒ aggregation when lacking direct measurement ⇒ cliques should follow sub-net tilling

Intrusiveness Conduct only needed test One pair is representative on a hub

Configuring NWS is a difficult task

Need to know both the tool and the topology (link with potential collisions).

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (5/15)

slide-12
SLIDE 12

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

NWS configuration & deployment

Deployment requirements

Correction Do not let experiments interfere.

Two test packets on same link ⇒ each report half of bandwidth Clique: set on which tests are done in a mutually exclusive manner

Scalability Keep cliques small for sufficient frequency and reactivity. Completeness Estimate each host pair

⇒ aggregation when lacking direct measurement ⇒ cliques should follow sub-net tilling

Intrusiveness Conduct only needed test One pair is representative on a hub

Configuring NWS is a difficult task

Need to know both the tool and the topology (link with potential collisions).

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (5/15)

slide-13
SLIDE 13

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Functionalities Configuration & Deployment

NWS configuration & deployment

Deployment requirements

Correction Do not let experiments interfere.

Two test packets on same link ⇒ each report half of bandwidth Clique: set on which tests are done in a mutually exclusive manner

Scalability Keep cliques small for sufficient frequency and reactivity. Completeness Estimate each host pair

⇒ aggregation when lacking direct measurement ⇒ cliques should follow sub-net tilling

Intrusiveness Conduct only needed test One pair is representative on a hub

Configuring NWS is a difficult task

Need to know both the tool and the topology (link with potential collisions).

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (5/15)

slide-14
SLIDE 14

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

The Effective Network View mapping solution

Overview

Goal: Mapping the network topology Authors: Gary Shao et Al (UCSD) Motivation: Master/slave scheduling Methodology: Active interference tests

Related work

Method Restricted Focus Routers Notes SNMP authorized path all passive, LAN traceroute ICMP path all level 3 of OSI pathchar root path all link bandwidth, slow Other no path din = dout tree tomography bipartite [Rabbat03] ENV no interference some tree only

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (6/15)

slide-15
SLIDE 15

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Mapping algorithm (1/4)

Naive algorith

For all hosts (a, b, c, d), measure: bw(ab): bandwidth on (ab) bwcd(ab): idem when (cd) is saturated Interference if

bwcd(ab) bw(ab)

≃ 0.5

Naivety is a bad habit

Network stabilization ⇒ 2 tests per minutes ⇒ 50 days for 20 hosts

ENV algorithm

Tree view: Interferences between streams from a master node to any ⇒ from O(n4) to O(n3) Various other optimizations to reduce the number of tests

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (7/15)

slide-16
SLIDE 16

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Mapping algorithm (1/4)

Naive algorith

For all hosts (a, b, c, d), measure: bw(ab): bandwidth on (ab) bwcd(ab): idem when (cd) is saturated Interference if

bwcd(ab) bw(ab)

≃ 0.5

Naivety is a bad habit

Network stabilization ⇒ 2 tests per minutes ⇒ 50 days for 20 hosts

ENV algorithm

Tree view: Interferences between streams from a master node to any ⇒ from O(n4) to O(n3) Various other optimizations to reduce the number of tests

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (7/15)

slide-17
SLIDE 17

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Mapping algorithm (1/4)

Naive algorith

For all hosts (a, b, c, d), measure: bw(ab): bandwidth on (ab) bwcd(ab): idem when (cd) is saturated Interference if

bwcd(ab) bw(ab)

≃ 0.5

Naivety is a bad habit

Network stabilization ⇒ 2 tests per minutes ⇒ 50 days for 20 hosts

ENV algorithm

Tree view: Interferences between streams from a master node to any ⇒ from O(n4) to O(n3) Various other optimizations to reduce the number of tests

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (7/15)

slide-18
SLIDE 18

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Algorithm (2/4): master-independent data collection

Structural topology

Topology first guess:

1

Each node traceroute to an external location

2

Merging results gives a tree

moby the−doors canaria popc myri sci

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (8/15)

slide-19
SLIDE 19

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Algorithm (3/4): master-dependent data collection

Successive refinements of the topology

Host to host bandwidth

split out machines having different bandwidth to the master

Pairwise host bandwidth

measure bandwidth concurrently compare to previous step split cluster if transfers independent

Internal cluster bandwidth Jammed bandwidth

bwbc (Ma) bw(Ma)

≃ 0.5 = ⇒ internal network shared

the-doors any host

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (9/15)

slide-20
SLIDE 20

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Algorithm (3/4): master-dependent data collection

Successive refinements of the topology

Host to host bandwidth

split out machines having different bandwidth to the master

Pairwise host bandwidth

measure bandwidth concurrently compare to previous step split cluster if transfers independent

Internal cluster bandwidth Jammed bandwidth

bwbc (Ma) bw(Ma)

≃ 0.5 = ⇒ internal network shared

the-doors cluster

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (9/15)

slide-21
SLIDE 21

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Algorithm (3/4): master-dependent data collection

Successive refinements of the topology

Host to host bandwidth

split out machines having different bandwidth to the master

Pairwise host bandwidth

measure bandwidth concurrently compare to previous step split cluster if transfers independent

Internal cluster bandwidth Jammed bandwidth

bwbc (Ma) bw(Ma)

≃ 0.5 = ⇒ internal network shared

cluster

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (9/15)

slide-22
SLIDE 22

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Algorithm (3/4): master-dependent data collection

Successive refinements of the topology

Host to host bandwidth

split out machines having different bandwidth to the master

Pairwise host bandwidth

measure bandwidth concurrently compare to previous step split cluster if transfers independent

Internal cluster bandwidth Jammed bandwidth

bwbc (Ma) bw(Ma)

≃ 0.5 = ⇒ internal network shared

the-doors cluster

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (9/15)

slide-23
SLIDE 23

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Algorithm (4/4): Result on the ENS-Lyon network

Physical topology

backbone_router canaria moby the−doors routlhpc giga_router 10 Mbps 100 Mbps myri1 myri2 popc0 sci0 sci1 sci2 sci4 sci5 sci6 myri0 sci3

✁ ✁ ✁ ✁ ✁ ✂ ✂ ✂ ✂ ✂ ✂ ✄ ✄ ✄

popc.private domain ens-lyon.fr domain

Effective topology from the-doors

moby canaria routlhpc the−doors myri0 popc0 sci0 sci3 sci2 sci4 sci5 sci6 sci1 myri1 myri2 Hub 3 Switch Hub 2 Hub 1

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (10/15)

slide-24
SLIDE 24

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Summary about ENV

Tradeoffs

Master/Slave: tree view only (price for efficiency?) Intrusiveness: inject large amount of traffic (price for simplicity?)

Known problems

Asymmetric routes: not taken into account (yet?)

Makes mapping faster, but such inconsistencies are common

Open questions

Reliability and accuracy: Platform evolution (⇒ mapping speed) Experimental thresholds (empirically determined)

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (11/15)

slide-25
SLIDE 25

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Summary about ENV

Tradeoffs

Master/Slave: tree view only (price for efficiency?) Intrusiveness: inject large amount of traffic (price for simplicity?)

Known problems

Asymmetric routes: not taken into account (yet?)

Makes mapping faster, but such inconsistencies are common

Open questions

Reliability and accuracy: Platform evolution (⇒ mapping speed) Experimental thresholds (empirically determined)

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (11/15)

slide-26
SLIDE 26

The Network Weather Service Effective Network View Deploying the NWS using ENV Overview Mapping algorithm Summary

Summary about ENV

Tradeoffs

Master/Slave: tree view only (price for efficiency?) Intrusiveness: inject large amount of traffic (price for simplicity?)

Known problems

Asymmetric routes: not taken into account (yet?)

Makes mapping faster, but such inconsistencies are common

Open questions

Reliability and accuracy: Platform evolution (⇒ mapping speed) Experimental thresholds (empirically determined)

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (11/15)

slide-27
SLIDE 27

The Network Weather Service Effective Network View Deploying the NWS using ENV Deployment design Example result Applying the deployment

Deployment design

Typical Grid testbeds are constellation of trees ⇒ hierarchical monitoring Manually: 2 levels (inter-organization vs intra-organization) Here: N levels (one per group)

Bottom-up algorithm along the tree

shared group (hub): every pair is representative of internal connectivity

⇒ Form a clique with two arbitrarily chosen hosts NWS cannot substitute a pair with the chosen one, must be application level

not shared group (switch): transfers interfere only if same host in both

(AB CD ⇔ {AB} ∩ {CD} = ∅)

⇒ Host-based locking needed (but not supported by NWS) ⇒ Form a clique with all hosts (ensure validity, deteriorate frequency)

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (12/15)

slide-28
SLIDE 28

The Network Weather Service Effective Network View Deploying the NWS using ENV Deployment design Example result Applying the deployment

Example on the ENS-Lyon network

Result of the algorithm in our case

moby canaria routlhpc the−doors myri0 popc0 sci0 sci3 sci2 sci4 sci5 sci6 sci1 myri1 myri2 Hub 3 Switch Hub 2 Hub 1

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (13/15)

slide-29
SLIDE 29

The Network Weather Service Effective Network View Deploying the NWS using ENV Deployment design Example result Applying the deployment

How to apply the configuration once computed?

Previous (NWS) solution

ssh to each host; pass options to daemons on command-line

Our solution

Make a global configuration file; dispatch it using e.g. NFS Manager script on each host to apply it (after ssh) Ease platform startup and shutdown

Future

Watchdog, or real management solution (JINI) would allow error detection and recovery

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (14/15)

slide-30
SLIDE 30

The Network Weather Service Effective Network View Deploying the NWS using ENV

Conclusion

NWS is the de-facto standard for Grid availability monitoring Ensuring correction, scalability, completeness, limiting intrusiveness requires topology knowledge (interferences: potential collisions) ENV provides an interference-focused network mapping Those informations sufficient for an efficient configuration planning

Open questions & future work

About ENV:

Asymmetric routes + tree limitation Automatic threshold discovery

About NWS:

Host-based locking Lookup: aggregation, substitute pairs

Automatic deployment of NWS using ENV:

Quantify quality of configuration (simulator) Platform evolutions Real management solution

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (15/15)

slide-31
SLIDE 31

The Network Weather Service Effective Network View Deploying the NWS using ENV

Conclusion

NWS is the de-facto standard for Grid availability monitoring Ensuring correction, scalability, completeness, limiting intrusiveness requires topology knowledge (interferences: potential collisions) ENV provides an interference-focused network mapping Those informations sufficient for an efficient configuration planning

Open questions & future work

About ENV:

Asymmetric routes + tree limitation Automatic threshold discovery

About NWS:

Host-based locking Lookup: aggregation, substitute pairs

Automatic deployment of NWS using ENV:

Quantify quality of configuration (simulator) Platform evolutions Real management solution

Arnaud Legrand and Martin Quinson Automatic deployment of the NWS using the ENV (15/15)