Resource Use Pattern Analysis for Opportunistic Grids Marcelo - - PowerPoint PPT Presentation

resource use pattern analysis for opportunistic grids
SMART_READER_LITE
LIVE PREVIEW

Resource Use Pattern Analysis for Opportunistic Grids Marcelo - - PowerPoint PPT Presentation

Goals UPA Simulation Implementation Experiments Related Conclusion Resource Use Pattern Analysis for Opportunistic Grids Marcelo Finger Germano C. Bezerra Danilo R. Conde Department of Computer Science (IME-USP) University of S ao


slide-1
SLIDE 1

Goals UPA Simulation Implementation Experiments Related Conclusion

Resource Use Pattern Analysis for Opportunistic Grids

Marcelo Finger Germano C. Bezerra Danilo R. Conde

Department of Computer Science (IME-USP) University of S˜ ao Paulo Supported by CNPq/Brazil project 550895/2007-8.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-2
SLIDE 2

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-3
SLIDE 3

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-4
SLIDE 4

Goals UPA Simulation Implementation Experiments Related Conclusion

Grids and Opportunism

Applications for Grid Computing

computationally intensive distributed heterogeneous environments

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-5
SLIDE 5

Goals UPA Simulation Implementation Experiments Related Conclusion

Grids and Opportunism

Applications for Grid Computing

computationally intensive distributed heterogeneous environments

Opportunistic Grid Computing

Idle time of machines

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-6
SLIDE 6

Goals UPA Simulation Implementation Experiments Related Conclusion

Grids and Opportunism

Applications for Grid Computing

computationally intensive distributed heterogeneous environments

Opportunistic Grid Computing

Idle time of machines High-performance computation

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-7
SLIDE 7

Goals UPA Simulation Implementation Experiments Related Conclusion

Grids and Opportunism

Applications for Grid Computing

computationally intensive distributed heterogeneous environments

Opportunistic Grid Computing

Idle time of machines High-performance computation Resource-owners have to give permission

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-8
SLIDE 8

Goals UPA Simulation Implementation Experiments Related Conclusion

Grids and Opportunism

Applications for Grid Computing

computationally intensive distributed heterogeneous environments

Opportunistic Grid Computing

Idle time of machines High-performance computation Resource-owners have to give permission QoS must remain high

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-9
SLIDE 9

Goals UPA Simulation Implementation Experiments Related Conclusion

Grids and Opportunism

Applications for Grid Computing

computationally intensive distributed heterogeneous environments

Opportunistic Grid Computing

Idle time of machines High-performance computation Resource-owners have to give permission QoS must remain high

InteGrade: opportunistic grid infrastructure

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-10
SLIDE 10

Goals UPA Simulation Implementation Experiments Related Conclusion

Effective Opportunistic Computing

Desirable: prediction of resource availability Prediction available for grid scheduler The better the prediction, the lower impact on QoS

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-11
SLIDE 11

Goals UPA Simulation Implementation Experiments Related Conclusion

Effective Opportunistic Computing

Desirable: prediction of resource availability Prediction available for grid scheduler The better the prediction, the lower impact on QoS Proposed Solution: Resource Use Pattern Analysis

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-12
SLIDE 12

Goals UPA Simulation Implementation Experiments Related Conclusion

UPA: (Resource) Use Pattern Analysis

Consists of: Detecting the local use pattern of each resource at each machine in the grid

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-13
SLIDE 13

Goals UPA Simulation Implementation Experiments Related Conclusion

UPA: (Resource) Use Pattern Analysis

Consists of: Detecting the local use pattern of each resource at each machine in the grid Basic Hypothesis: Each resource has a (temporal) “pattern” of use

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-14
SLIDE 14

Goals UPA Simulation Implementation Experiments Related Conclusion

Goal

To develop a method

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-15
SLIDE 15

Goals UPA Simulation Implementation Experiments Related Conclusion

Goal

To develop a method that automatically performs resource use pattern analysis

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-16
SLIDE 16

Goals UPA Simulation Implementation Experiments Related Conclusion

Goal

To develop a method that automatically performs resource use pattern analysis for machines belonging to opportunistic grids

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-17
SLIDE 17

Goals UPA Simulation Implementation Experiments Related Conclusion

Strategy

Discover the prototypical patterns of use (off-line)

Unsupervised machine learning Clustering analysis

Runtime prediction

Comparing prototypical with “current” pattern of use

Development method:

Simulation: parameter setting Implementation: LUPA module for InteGrade grid

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-18
SLIDE 18

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-19
SLIDE 19

Goals UPA Simulation Implementation Experiments Related Conclusion

Resource Use Objects

A resource use object: Sampled at every 5 min Span of 48h used for prediction Resources: CPU use, available RAM, disk space, swap space, network and disk I/O

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-20
SLIDE 20

Goals UPA Simulation Implementation Experiments Related Conclusion

Resource Use Objects

A resource use object: Sampled at every 5 min Span of 48h used for prediction Resources: CPU use, available RAM, disk space, swap space, network and disk I/O Objects represent availability (not only use)

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-21
SLIDE 21

Goals UPA Simulation Implementation Experiments Related Conclusion

The UPA Method

Resource Use Pattern Analysis

Unsupervised machine learning Obtain fixed number of use classes

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-22
SLIDE 22

Goals UPA Simulation Implementation Experiments Related Conclusion

The UPA Method

Resource Use Pattern Analysis

Unsupervised machine learning Obtain fixed number of use classes

A class represents a frequent use pattern E.g. busy work day, light work day, holiday, etc

Each class represented by a prototypical object

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-23
SLIDE 23

Goals UPA Simulation Implementation Experiments Related Conclusion

The UPA Method

Resource Use Pattern Analysis

Unsupervised machine learning Obtain fixed number of use classes

A class represents a frequent use pattern E.g. busy work day, light work day, holiday, etc

Each class represented by a prototypical object

Two phases:

training/learning phase: off-line execution/prediction phase: runtime

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-24
SLIDE 24

Goals UPA Simulation Implementation Experiments Related Conclusion

Off-line Learning

Inputs a large amount of objects Collected from regular operation Clustering is applied to training data Reliability depends on amount of data

At least 60 objects, or 2 months of data.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-25
SLIDE 25

Goals UPA Simulation Implementation Experiments Related Conclusion

Off-line Learning

Inputs a large amount of objects Collected from regular operation Clustering is applied to training data Reliability depends on amount of data

At least 60 objects, or 2 months of data.

Several parameters have to be set

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-26
SLIDE 26

Goals UPA Simulation Implementation Experiments Related Conclusion

Learning Parameters

Number of clusters: 5, 10

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-27
SLIDE 27

Goals UPA Simulation Implementation Experiments Related Conclusion

Learning Parameters

Number of clusters: 5, 10 Data normalisation: no normalisation, variational

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-28
SLIDE 28

Goals UPA Simulation Implementation Experiments Related Conclusion

Learning Parameters

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-29
SLIDE 29

Goals UPA Simulation Implementation Experiments Related Conclusion

Learning Parameters

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre Similarity measurement: Euclidean distance between points.

Between clusters: single linkage, complete linkage, centroid method, Ward’s method

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-30
SLIDE 30

Goals UPA Simulation Implementation Experiments Related Conclusion

Learning Parameters

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre Similarity measurement: Euclidean distance between points.

Between clusters: single linkage, complete linkage, centroid method, Ward’s method

Clustering algorithms: hierarchical, sequential, k-means, etc.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-31
SLIDE 31

Goals UPA Simulation Implementation Experiments Related Conclusion

Learning Parameters

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre Similarity measurement: Euclidean distance between points.

Between clusters: single linkage, complete linkage, centroid method, Ward’s method

Clustering algorithms: hierarchical, sequential, k-means, etc. Simulation was performed to choose parameters

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-32
SLIDE 32

Goals UPA Simulation Implementation Experiments Related Conclusion

Runtime Prediction

Recent Track Prototypical Element

6 12 18 24 30 36 42 48 25 50 75 100

d1 d2 d3

d3 < d2 < d1 Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-33
SLIDE 33

Goals UPA Simulation Implementation Experiments Related Conclusion

Runtime Prediction

Recent Track Prototypical Element

6 12 18 24 30 36 42 48 25 50 75 100

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-34
SLIDE 34

Goals UPA Simulation Implementation Experiments Related Conclusion

Predictions

Recent 24h record is compared against 48h use classes The closest class is the current use class. Predicted future in current use class: [now–48h] Request at 6am can predict 18 hours Request at 6pm can predict 6 hours (only)

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-35
SLIDE 35

Goals UPA Simulation Implementation Experiments Related Conclusion

Predictions

Recent 24h record is compared against 48h use classes The closest class is the current use class. Predicted future in current use class: [now–48h] Request at 6am can predict 18 hours Request at 6pm can predict 6 hours (only) Options for longer predictions:

Use less than 24 of current record.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-36
SLIDE 36

Goals UPA Simulation Implementation Experiments Related Conclusion

Predictions

Recent 24h record is compared against 48h use classes The closest class is the current use class. Predicted future in current use class: [now–48h] Request at 6am can predict 18 hours Request at 6pm can predict 6 hours (only) Options for longer predictions:

Use less than 24 of current record. Predicts < 48h Chain of predictions (not implemented yet)

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-37
SLIDE 37

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-38
SLIDE 38

Goals UPA Simulation Implementation Experiments Related Conclusion

Trace-driven Simulation

Data collected for CPU and RAM use, 120 days Linux machines with very different types of users:

1

a general purpose machine with more than 30 users;

2

a single user machine;

3

a general purpose machine with 6 users;

4

a multi-user machine employed for testing heavy computational linguistics programs.

Several parameters compared

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-39
SLIDE 39

Goals UPA Simulation Implementation Experiments Related Conclusion

Simulation Results

Correct predictions above 75% in all parameter combinations

Experiments

0.7500 0.8000 0.8500 0.9000 0.9500 1.0000

% Correct

Pure Normalised Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-40
SLIDE 40

Goals UPA Simulation Implementation Experiments Related Conclusion

Parameters Learned

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre Similarity measurement: Distance between clusters: points.

single linkage, complete linkage, centroid method, Ward’s method

Clustering algorithms: hierarchical, sequential, k-means, etc.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-41
SLIDE 41

Goals UPA Simulation Implementation Experiments Related Conclusion

Parameters Learned

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre Similarity measurement: Distance between clusters: points.

single linkage, complete linkage, centroid method, Ward’s method

Clustering algorithms: hierarchical, sequential, k-means, etc.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-42
SLIDE 42

Goals UPA Simulation Implementation Experiments Related Conclusion

Parameters Learned

Number of clusters: 5, 10 Data normalisation: no normalisation, variational Computation of prototypical element: centroid, centre Similarity measurement: Distance between clusters: points.

single linkage, complete linkage, centroid method, Ward’s method

Clustering algorithms: hierarchical, sequential, k-means, etc. Simulation validated UPA hypotheses on the use of past patterns to predict future behaviours

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-43
SLIDE 43

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-44
SLIDE 44

Goals UPA Simulation Implementation Experiments Related Conclusion

Implementation Objectives

Explore ways the scheduling of grid applications using the UPA method

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-45
SLIDE 45

Goals UPA Simulation Implementation Experiments Related Conclusion

Implementation Objectives

Explore ways the scheduling of grid applications using the UPA method Compare UPA scheduling with other methods

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-46
SLIDE 46

Goals UPA Simulation Implementation Experiments Related Conclusion

Implementation Objectives

Explore ways the scheduling of grid applications using the UPA method Compare UPA scheduling with other methods Identify initialisation strategies

because UPA needs a few months of resource use data collection to be reliable

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-47
SLIDE 47

Goals UPA Simulation Implementation Experiments Related Conclusion

Design Decisions

UPA data analysed locally, not centrally

Advantages: privacy, no network traffic due to UPA processing

Local resource Use Pattern Analyser (LUPA) module installed in all grid machines LUPA module is part of InteGrade middleware C++, on several Linuxes

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-48
SLIDE 48

Goals UPA Simulation Implementation Experiments Related Conclusion

LUPA Architecture

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-49
SLIDE 49

Goals UPA Simulation Implementation Experiments Related Conclusion

LUPA Submodules

Data Collection: resource use sampling. Pattern Analyser: off-line clustering. Patterns are recomputed once a day. Predictor: runtime predictions. Interface

double[] getPrediction(resource r, int hours);

returns a vector of values representing the r-use prediction for the next hours, in 5-minute intervals. LUPA is not yet integrated with the scheduler.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-50
SLIDE 50

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-51
SLIDE 51

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Goals

Compare UPA scheduling with other methods

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-52
SLIDE 52

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Goals

Compare UPA scheduling with other methods Scheduling methods for choosing n machines for h hours:

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-53
SLIDE 53

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Goals

Compare UPA scheduling with other methods Scheduling methods for choosing n machines for h hours: RR: round robin. No prediction

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-54
SLIDE 54

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Goals

Compare UPA scheduling with other methods Scheduling methods for choosing n machines for h hours: RR: round robin. No prediction last4: Predicts to the future average of last 4h.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-55
SLIDE 55

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Goals

Compare UPA scheduling with other methods Scheduling methods for choosing n machines for h hours: RR: round robin. No prediction last4: Predicts to the future average of last 4h. UPA

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-56
SLIDE 56

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Goals

Compare UPA scheduling with other methods Scheduling methods for choosing n machines for h hours: RR: round robin. No prediction last4: Predicts to the future average of last 4h. UPA Predicting methods choose n machines with highest prediction

  • f CPU availability for the next h hours

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-57
SLIDE 57

Goals UPA Simulation Implementation Experiments Related Conclusion

Experiment Description

15 machines, logs of 41–120 days Sequence of tests for a fixed set of parameters: n, h Instant tests (1 experiment ≈ 527 tests)

Choose a day with m valid machines 24 tests are executed for 3 methods, each for an hour of the day

performances(t) = 1 − n

i=1 use(mi, t, h)

n

s: scheduling algorithm n: number of chosen machines mi: machine chose to run for h hours starting at t use(mi, t, h): average CPU at mi for [t, t + h]

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-58
SLIDE 58

Goals UPA Simulation Implementation Experiments Related Conclusion

UPA versus last4

3 6 9 12 15 18 21 24 27 30 33 36 39 42 45

Experiment

20 40 60 80 100

% Better Performance

UPA last4 equiv

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-59
SLIDE 59

Goals UPA Simulation Implementation Experiments Related Conclusion

Comparison of Scheduling Methods

UPA last4 Performance Similar to last4, but better Similar to UPA, but worse Impact of ↑ m/n ↑ performance no effect Impact of ↑ h no effect no effect Performance of RR consistently below both last4 and UPA

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-60
SLIDE 60

Goals UPA Simulation Implementation Experiments Related Conclusion

Adaptative UPA Method

Use UPA if more than 21 days of data collection is available; else Use last4 if more than 4 hours of data collection is available; else Predict that resource use at request time persists.

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-61
SLIDE 61

Goals UPA Simulation Implementation Experiments Related Conclusion

Overhead Analysis

Running time for pattern analysis was always below 1s Running time for prediction calls was always below 3ms

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-62
SLIDE 62

Goals UPA Simulation Implementation Experiments Related Conclusion

Overhead Analysis

Running time for pattern analysis was always below 1s Running time for prediction calls was always below 3ms Measurements made on notebook [AMD Turion 64 1.8GHz CPU, 1GB RAM running Kubuntu 7.10 (32 bits) Linux] Running pattern analysis once a day poses negligible overhead

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-63
SLIDE 63

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-64
SLIDE 64

Goals UPA Simulation Implementation Experiments Related Conclusion

Correlated Availability

BOINC: Volunteer computing Correlated Availability in Internet-Distributed Systems (Kondo, Andrzejak & Anderson, 2008) Computes patterns of simultaneous CPU availability via clustering

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-65
SLIDE 65

Goals UPA Simulation Implementation Experiments Related Conclusion

Correlated Availability

BOINC: Volunteer computing Correlated Availability in Internet-Distributed Systems (Kondo, Andrzejak & Anderson, 2008) Computes patterns of simultaneous CPU availability via clustering Monitors 112,268 hosts Global monitoring and pattern discovery Several interesting clusters of machines discovered, with potential applications. Not originally intended for scheduling

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-66
SLIDE 66

Goals UPA Simulation Implementation Experiments Related Conclusion

Preemptive Resume Scheduling

[Roy and Livny, 2003]

Condor:

Combination of dedicated and opportunistic scheduling Opportunistic scheduling based on checkpointing Preemptive resume scheduling No prediction

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-67
SLIDE 67

Goals UPA Simulation Implementation Experiments Related Conclusion

Preemptive Resume Scheduling

[Roy and Livny, 2003]

Condor:

Combination of dedicated and opportunistic scheduling Opportunistic scheduling based on checkpointing Preemptive resume scheduling No prediction No need to predict job duration

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-68
SLIDE 68

Goals UPA Simulation Implementation Experiments Related Conclusion

Predictive Scheduling

[Yang, Schopf and Foster, 2003]

One-step-ahead CPU load prediction

Load Tendency Prediction Increase/decrease adaptation processes

Interval Load Prediction

Average over aggregated CPU load time series

Load Variance Prediction

Standard deviation time series Combined with one-step-ahead prediction

Conservative scheduling: combines interval and variance load prediction

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-69
SLIDE 69

Goals UPA Simulation Implementation Experiments Related Conclusion

Topics

1 Opportunistic Grids and Scheduling 2 The UPA Method 3 Simulation 4 Implementation 5 Experiments 6 Related Work 7 Conclusion

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-70
SLIDE 70

Goals UPA Simulation Implementation Experiments Related Conclusion

Conclusions

Some form of prediction is always preferable to no prediction UPA method compares favourably w.r.t. other methods Small overheads involved Method can be used for practical grid scheduling

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-71
SLIDE 71

Goals UPA Simulation Implementation Experiments Related Conclusion

Conclusions

Some form of prediction is always preferable to no prediction UPA method compares favourably w.r.t. other methods Small overheads involved Method can be used for practical grid scheduling

but needs task duration prediction

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-72
SLIDE 72

Goals UPA Simulation Implementation Experiments Related Conclusion

Ongoing and Future Works

Scheduler using LUPA information Automated learning of task duration prediction Long term predictions Preemptive task migration using UPA Automated “booking” of machine resources for future executions using UPA

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis

slide-73
SLIDE 73

Goals UPA Simulation Implementation Experiments Related Conclusion

www.integrade.org.br

mfinger@ime.usp.br

Marcelo Finger, Germano C. Bezerra, Danilo R. Conde IME-USP Use Pattern Analysis