Use of a Levy Distribution for Modeling Best Case Execution Time Variation
Jonathan Beard, Roger Chamberlain
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
Work also supported by:
1
Use of a Levy Distribution for Modeling Best Case Execution Time - - PowerPoint PPT Presentation
Use of a Levy Distribution for Modeling Best Case Execution Time Variation Jonathan Beard, Roger Chamberlain SBS Stream Based Supercomputing Lab http://sbs.wustl.edu Work also supported by: 1 Outline Motivation Stream Processing
Jonathan Beard, Roger Chamberlain
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
Work also supported by:
1
2
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
3
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
3
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
4
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
5
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
6
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
1 2 3 4
1 2 3 4
More allocation choices, NUMA node A or B to allocate stream.
7
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
1 2 3 4
1 2 3 4
More allocation choices, NUMA node A or B to allocate stream.
1 2 7
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
1 2 3 4
1 2 3 4
More allocation choices, NUMA node A or B to allocate stream.
1 2 7
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
B C Q1 Q2
A
A B C “Stream” is modeled as a Queue
8
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
B C Q1 Q2
A
A B C “Stream” is modeled as a Queue
8
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
Problem: Accurate measurement is very difficult. Is there a way to decide on a model without it.
9
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
10
Expected Observed
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
10
Expected Observed Is there a pattern of minimal variation within the systems we’re running on?
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
11
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
12
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
13
Ask for Time Receive Time Timer Thread Code
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
14
Timer Thread rdtsc clock_gettime
to serialize
issues
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
15
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
16
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
recommended serializers for each processor type
NUMA node as timer
minimize core context swaps
17
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
taxing
external to process
18
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
times, tied to a single core and run thousands of times
each run, environment collected
Number of processes executing on core Number of context swaps (voluntary, involuntary) Many others
19
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
20
Execution Time Error ( obs - mean )
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
21
Normal Distribution
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
22
Gumbel Distribution
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
23
Levy Distribution
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
23
Levy Distribution
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
requires fitting to each dataset to find where to truncate
both the number of processes per core and the expected execution time
approximate solution to truncation parameters without refitting
24
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
25
0.000025 0.00001 0.0000155 0.000015 0.0000145 0.000014
0.000025 0.00001 0.000014 0.0000135 0.000013 0.0000125
0.00006 0.00003 0.00005 0.000045 0.00004 0.000035 0.00003 0.000025 0.00002
0.00005 0.00002 0.00003 0.000025 0.00002 0.000015 0.00001
1 - 5 processes 6 - 10 processes 16 - 20 processes 11 - 15 processes
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
26
B Q1
Question: Can we use an M/M/1 queueing model to estimate the mean queue occupancy of this system?
between expected and realized distribution is associated with higher model accuracy.
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
27
B Q1
queue occupancy
process distribution using the truncated Levy distribution noise model
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
28
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
approximate BCETV
to accept or reject a stochastic queueing model based on distributional assumptions
convolved distribution highly correlates with queue model accuracy
29
SBS
Stream Based Supercomputing Lab
http://sbs.wustl.edu
30