Intro What’s the problem? BaTS Performance Evaluation
Bag-of-Tasks Scheduling under Budget Constraints
Ana-Maria Oprescu, Thilo Kielmann
project co-funded by the EC 7th Framework Programme
Ana-Maria Oprescu, Thilo Kielmann BaTS 1 / 17
◮ find resources (networks of workstations, clusters, grids, ...)
◮ sit in a queue
◮ run
◮ generally no accounting
◮ different by certain properties,
◮ some form of runtime distribution exists
◮ a "pay-per-hour" economic model for resource utilization
◮ we have all the tasks
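The "pay-per-hour" model in the assumptions above can be sketched as follows. This is a minimal illustration, not the authors' cost function: the names `machine_cost`, `price_per_atu` and `atu_minutes` are hypothetical, and the sketch assumes that any started hour is billed in full.

```python
import math

def machine_cost(busy_minutes: float, price_per_atu: float, atu_minutes: int = 60) -> float:
    """Cost of one machine when every started time unit is billed in full.

    Assumption (not from the slides): a machine that was never used costs nothing.
    """
    if busy_minutes <= 0:
        return 0.0
    billed_units = math.ceil(busy_minutes / atu_minutes)  # round up to whole hours
    return billed_units * price_per_atu

# Hypothetical prices: 61 minutes of work is billed as two full hours.
print(machine_cost(61, price_per_atu=3.0))
```

Under this model a task that overruns the hour boundary by one minute doubles the bill for that machine, which is why the scheduler has to care about how runtimes line up with billing units.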
Intro What’s the problem? BaTS Performance Evaluation Job Profiler Reconfigure Cluster Monitoring
◮ initialize the average with
◮ keep collected runtimes of
[Figure: sample size (n) versus BoT size (N)]
◮ update the moving average during BoT execution
◮ Estimate the runtime of running tasks using the average over
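The profiling steps above can be sketched as a small estimator. The slide text is cut off, so two points are assumptions made only for illustration: the average is seeded with the mean of the initial sample, and it is updated as a cumulative mean over all finished tasks. The class and method names are hypothetical.

```python
class RuntimeEstimator:
    """Running estimate of the mean task runtime (minutes)."""

    def __init__(self, sample_runtimes):
        if not sample_runtimes:
            raise ValueError("need at least one sampled runtime")
        # Assumption: seed the average with the mean of the initial sample.
        self.count = len(sample_runtimes)
        self.mean = sum(sample_runtimes) / self.count

    def record(self, runtime):
        """Fold the runtime of a finished task into the average."""
        self.count += 1
        self.mean += (runtime - self.mean) / self.count  # incremental mean update

    def estimate_remaining(self, tasks_left):
        """Estimated total machine-minutes still needed for the bag."""
        return tasks_left * self.mean


estimator = RuntimeEstimator([10, 20])   # initial sample, mean 15
estimator.record(30)                      # a task finishes during execution
print(estimator.mean, estimator.estimate_remaining(4))
```

The incremental update avoids storing every runtime; a windowed average would need the stored runtimes that the slide mentions keeping.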
◮ The BKP can be solved in pseudo-polynomial time, as 0-1 knapsack
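To illustrate the pseudo-polynomial claim, here is a textbook bounded knapsack solver that treats each allowed copy of an item as a separate 0-1 item. It is a generic sketch, not the BaTS formulation: reading an item as a machine type with an integer cost per hour, a speed value and a maximum machine count is an assumption made for the example, and `bounded_knapsack` is a hypothetical name.

```python
def bounded_knapsack(items, capacity):
    """Solve a bounded knapsack by dynamic programming.

    items: list of (cost, value, max_count) with non-negative integer costs.
    capacity: integer budget.
    Returns (best_value, counts), where counts[i] is how many copies of item i are used.
    Running time is O(capacity * sum of max_count), i.e. pseudo-polynomial.
    """
    # best[c] holds the best (value, counts) reachable with total cost <= c.
    best = [(0, (0,) * len(items))] * (capacity + 1)
    for idx, (cost, value, max_count) in enumerate(items):
        for _ in range(max_count):                    # each copy is one 0-1 item
            for c in range(capacity, cost - 1, -1):   # backwards: use a copy once
                prev_value, prev_counts = best[c - cost]
                if prev_value + value > best[c][0]:
                    counts = list(prev_counts)
                    counts[idx] += 1
                    best[c] = (prev_value + value, tuple(counts))
    return best[capacity]


# Hypothetical machine types: (cost per hour, speed, machines available).
print(bounded_knapsack([(3, 4, 2), (2, 3, 1)], capacity=7))
```

The table has one entry per budget unit, so the running time grows with the numeric value of the budget rather than with its encoding length, which is what pseudo-polynomial means here.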
◮ Individually requested from the cloud provider,
◮ Each machine has a different time left of the current ATU
◮ Runtime granularity ⇒ paid machine time possibly unused
◮ Time on machines we already paid for
◮ Actual speed (tasks/minute) achieved per cluster
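The bookkeeping described above can be sketched as follows. The function names are hypothetical, the sketch assumes a machine that sits exactly on an ATU boundary has just been charged for a fresh ATU, and it uses the average minutes per task (the inverse of the measured tasks/minute speed) to avoid rounding surprises.

```python
def minutes_left_in_atu(elapsed_minutes: float, atu_minutes: float = 60.0) -> float:
    """Paid minutes remaining in the ATU a machine is currently in.

    Assumption: at an exact ATU boundary a new ATU has just started.
    """
    return atu_minutes - (elapsed_minutes % atu_minutes)


def tasks_fitting_in_paid_time(elapsed_per_machine, avg_task_minutes, atu_minutes=60.0):
    """Whole tasks that still fit into time already paid for, summed over machines.

    A task cannot be split across machines, so the count is floored per machine;
    the leftover minutes are the paid but unused time the slide refers to.
    """
    total = 0
    for elapsed in elapsed_per_machine:
        left = minutes_left_in_atu(elapsed, atu_minutes)
        total += int(left // avg_task_minutes)
    return total


# Three machines that have been running for 10, 40 and 55 minutes; 15-minute tasks.
print(tasks_fitting_in_paid_time([10, 40, 55], avg_task_minutes=15))
```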
◮ N=1000 tasks ⇒ n=30 (sample set size)
◮ Normal distribution of runtime: avg=15 min, st. dev.=2.23
◮ Iosup et al. show bags typically have some normal distribution
◮ 2 clouds, 32 machines each
◮ Fast/slow machines emulated by modifying the sleep time
◮ Allocate through local site scheduler (without competing users)
◮ Accountable Time Unit = 1 hour
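The workload parameters above can be reproduced in a few lines. This is only an illustrative sketch: the slides do not say how the bag was generated or how the 30 tasks were chosen, so the sketch assumes independent normal draws and a simple random sample, and the function names and seeds are hypothetical.

```python
import random
import statistics

def make_bag(n_tasks=1000, mean_min=15.0, stdev_min=2.23, seed=42):
    """Synthetic bag of task runtimes (minutes) drawn from a normal distribution."""
    rng = random.Random(seed)  # fixed seed so the example is repeatable
    return [rng.gauss(mean_min, stdev_min) for _ in range(n_tasks)]

def sample_bag(bag, sample_size=30, seed=7):
    """Simple random sample without replacement (an assumption, see lead-in)."""
    rng = random.Random(seed)
    return rng.sample(bag, sample_size)

bag = make_bag()
sample = sample_bag(bag)
print(f"bag mean    = {statistics.mean(bag):.2f} min")
print(f"sample mean = {statistics.mean(sample):.2f} min")
```

With a standard deviation of 2.23 minutes, the mean of 30 sampled runtimes has a standard error of roughly 0.4 minutes, which gives a feel for why such a small sample can be enough for a bag of 1000 tasks.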
◮ self-scheduler (RR) always using 32+32 machines
◮ BaTS on initial config. 30+30 machines provided with
  ◮ budget B_{BaTS,RR} = cost incurred by running RR (C_{RR})
  ◮ budget B_{BaTS,BMin}, computed off-line as the cost incurred by
◮ Current work: fixing limitation by pre-sampling (even smaller)
◮ Early results promising
◮ DAGs instead of BoTs (dependencies)
◮ BaTS for MapReduce?
Thu Oct 21 02:40:06 2010
id       user      start        stop         state  nhosts  hosts
1152334  vpopescu  10/15 14:45  12/24 00:25  r      1       node010
1152611  ppouwels  10/20 20:00  10/21 08:00  r      1       node030
1152607  ppouwels  10/20 20:00  10/21 08:00  r      1       node059
1152608  ppouwels  10/20 20:00  10/21 08:00  r      1       node060
1152633  ppouwels  10/21 00:22  10/21 12:22  r      1       node062
1152606  ppouwels  10/20 20:00  10/21 08:00  r      1       node068
1152634  ppouwels  10/21 01:01  10/21 13:01  r      1       node078
1152604  mcd       10/20 17:01  10/21 23:02  r      1       node076
[amo@fs0 ~]$ finger ppouwels
Login: ppouwels                 Name: Petra Pouwels
Directory: /home5/ppouwels      Shell: /bin/bash
Office: VUMC, PJW.Pouwels@vumc.nl
Never logged in.
No mail.
No Plan.
[amo@fs0 ~]$ finger vpopescu
Login: vpopescu                 Name: Veronica Popescu
Directory: /home5/vpopescu      Shell: /bin/bash
Office: VUMC, v.popescu@vumc.nl
Never logged in.
No mail.
No Plan.
[amo@fs0 ~]$