Computing Cluster Usage Visualization Tool Compu&ng Cluster - - PowerPoint PPT Presentation

computing cluster usage visualization tool
SMART_READER_LITE
LIVE PREVIEW

Computing Cluster Usage Visualization Tool Compu&ng Cluster - - PowerPoint PPT Presentation

Computing Cluster Usage Visualization Tool Compu&ng Cluster Usage Visualiza&on Tool Sinisa Jovanovic Prometheus - Cluster Computer Cluster utilization December 2012 01.12. 31.12. Compu&ng


slide-1
SLIDE 1

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

Computing Cluster Usage Visualization Tool

slide-2
SLIDE 2

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

Computer Cluster utilization December 2012 01.12. – 31.12. Prometheus - Cluster

slide-3
SLIDE 3

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡ User Jobs submitted Jobs failed CPU time Runtime Memory used I/O cluster utilization alice 955637 41335 471kh 611kh 194.06EBs 10.52PB 12% hades 254821 3753 438kh 457kh 32.91EBs 2.12PB 9% kp2 77743 7357 172kh 174kh 654.27PBs 99.77TB 3% cbm 72549 5363 140kh 141kh 358.47PBs 461.83TB 2% ufk 182980 194 25kh 33kh 16.09EBs 662.18TB 0% bhs 1449 282 32kh 7kh 32.34PBs 7.43TB 0% had1 300 101 2kh 2kh 6.98PBs 8.81GB 0% hpc 8591 21s 143h35m 0.0Bs 321.22MB 0% kp1 27 12 5h25m 7h36m 18.41TBs 92.03GB 0% rz 5583 2579 5kh 8kh 15.73PBs 518.03GB 0% the 20932 2209 88kh 34kh 428.91PBs 839.56TB 0%

Groups

slide-4
SLIDE 4

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

alice ¡

1118766 668714 158213 1938 1600442 317677 27680 1832 540143 155132 233298 101109 526397 403789 24138 1313 821366 118723 7720 7828 247699 45379 129724 532833

slide-5
SLIDE 5

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

hades ¡

246274 5362 3185 151869 22671 80261 20 45169 78819 9972 120860

slide-6
SLIDE 6

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

kp2 ¡

77743 30582 40976 4502 1683 66465 10052 1215

slide-7
SLIDE 7

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

cbm ¡

70258 2291 56034 1898 10200 4417 23464 27462 7084 14539

slide-8
SLIDE 8

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

User Jobs submitted Jobs failed CPU time Runtime Memory used I/O akalweit 896 3 281h37m 293h35m 1.32PBs 5.9TB andrey 16543 901 29kh 29kh 78.11PBs 237.75TB anielski 2090 64h32m 88h41m 121.78TBs 3.51TB bailhach 852 46 116h38m 557h22m 384.34TBs 1.41TB bhess 127385 8015 120kh 183kh 64.48EBs 1.32PB bneuburg 8558 0s 143h28m 0.0Bs 308.36MB cbaumann 5920 103 11kh 12kh 46.86PBs 260.82TB cbehnke 12893 1 89h19m 725h35m 78.02TBs 6.58TB cdritsa 6994 1999 5kh 5kh 16.01EBs 32.29TB cfranco 24 13 832h47m 865h31m 959.98TBs 33.62TB christis 44 41 3h35m 5h17m 4.81TBs 253.82GB dlohner 212 2 180h6m 184h3m 490.2TBs 95.03GB doenigus 421 171h17m 180h49m 488.65TBs 6.9TB ebelolap 6480 11 17kh 17kh 48.3PBs 29.71TB ekrebs 6831 20 149h24m 501h41m 83.98TBs 204.14GB fbock 20941 1355 20kh 20kh 63.45PBs 343.72TB feldkamp 27 12 5h25m 7h36m 18.41TBs 92.03GB fischer 1154 405 46kh 3kh 270.1PBs 178.03TB fpetrov 1242 229 21kh 6kh 15.14PBs 7.28TB freidt 370 96 52h42m 59h18m 181.3TBs 2.13TB frettig 2377 97 731h46m 761h50m 2.88PBs 17.15TB

Users A - F

Jobs failed: Indicates the problem which

  • ccurred in case a job could not

be started on the execution host CPUA time: The cpu time usage Memory used: The used memory from all Jobs I/O: The amount of data transferred in input/output operations. Runtime: The complete running time from all Jobs

slide-9
SLIDE 9

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

User Jobs submitted Jobs failed CPU time Runtime Memory used I/O gromliuk 300 101 2kh 2kh 6.98PBs 8.81GB hadesdst 93867 226 399kh 400kh 16.81EBs 513.15TB harabasz 96219 61 9kh 10kh 21.49PBs 231.14TB iarsene 25485 284 578h58m 615h2m 2.56PBs 28.96TB jacek 745 350 35h34m 38h29m 89.96TBs 2.29TB jklein 1550 958h38m 1kh 2.82PBs 29.06TB jknedlik 4315 2557 4kh 7kh 13.15PBs 16.26GB jonderw 110368 3247 2kh 4kh 48.01EBs 93.79TB jthaeder 1602 316h39m 317h51m 93.56TBs 320.63GB jtscheus 13127 1827 20kh 20kh 53.47PBs 7.95TB kgill 6242 4kh 7kh 15.35PBs 244.28TB kisselan 1754 957 21kh 21kh 73.63PBs 1.27TB koenig 127 74h50m 77h11m 150.83TBs 133.13GB kryshen 284 13 1kh 2kh 6.26PBs 2.48TB kvantour 1246 114 2kh 2kh 5.36PBs 5.07GB lenhardt 310 11 7h0m 12h48m 35.24TBs 43.59GB lhuther 1077 795 1kh 1kh 701.99TBs 3.49TB lsilva 648 38 1kh 1kh 2.57PBs 192.01TB

Users G - L

Jobs failed: Indicates the problem which

  • ccurred in case a job could not

be started on the execution host CPUA time: The cpu time usage Memory used: The used memory from all Jobs I/O: The amount of data transferred in input/output operations. Runtime: The complete running time from all Jobs

slide-10
SLIDE 10

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

User Jobs submitted Jobs failed CPU time Runtime Memory used I/O marsland

21650 133 698h28m 1kh 2.95PBs 72.45TB

mheide

47069 343 14kh 26kh 50.24PBs 3.43PB

mkrzewic

20756 952 51kh 64kh 475.38PBs 218.7TB

mlorenz

3983 563 1kh 5kh 4.5PBs 158.73TB

mmarquar

36 9 1h49m 2h36m 4.67TBs 112.5GB

mvoelkl

138957 11451 35kh 60kh 48.13EBs 645.38TB

mwilde

232307 2510 77kh 81kh 16.26EBs 1.3PB

nicassio

6568 166 21kh 22kh 16.1EBs 351.97TB

  • busch

699 216h12m 227h39m 974.4TBs 2.91TB

  • ezdemir

493 47 27h48m 29h45m 86.26TBs 1.2TB

  • lgapech

137 1 79h21m 81h40m 142.19TBs 323.71GB

  • svoboda

2350 7 4kh 4kh 26.85PBs 367.25GB

pachmay

69179 643 32kh 39kh 127.73PBs 698.7TB

pgoergen

180 38 6kh 858h22m 12.46PBs 102.87GB

Users M - P

Jobs failed: Indicates the problem which

  • ccurred in case a job could not

be started on the execution host CPUA time: The cpu time usage Memory used: The used memory from all Jobs I/O: The amount of data transferred in input/output operations. Runtime: The complete running time from all Jobs

slide-11
SLIDE 11

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

User Jobs submitted Jobs failed CPU time Runtime Memory used I/O sellheim 12415 796 1kh 4kh 4.06PBs 72.73TB sheckel 23 4h28m 4h56m 8.65TBs 181.23GB skirsch 162 1 239h29m 294h12m 1.49PBs 1.55TB slebedev 8820 4426 49kh 49kh 119.76PBs 73.95TB slindal 14 14 0s 0s 0.0Bs 0.0B sschuchm 182980 194 25kh 33kh 16.09EBs 662.18TB sudol 969 18h34m 49h35m 22.44TBs 427.47GB sweber 2963 429 3kh 3kh 10.05PBs 50.85TB tbroeker 88 33 6h54m 8h3m 23.83TBs 314.39GB tneff 17455 895 39kh 27kh 152.76PBs 658.04TB trascanu 8 45m43s 53m55s 1.03TBs 46.31GB tscheib 11122 28 9kh 13kh 11.8PBs 683.37TB turany 1268 22 1kh 1kh 2.58PBs 501.77GB uhlig 75989 6400 150kh 152kh 580.65PBs 98.5TB uwesterh 97 32h39m 39h24m 131.13TBs 1.18TB vassilie 40422 12 41kh 42kh 106.03PBs 117.93TB vpenso 33 21s 7m2s 0.0Bs 12.86MB wiechula 99873 9127 53kh 64kh 264.26PBs 1.7PB zlecz 27 15 3kh 221h46m 4.74PBs 51.87GB

Users S - Z

Jobs failed: Indicates the problem which

  • ccurred in case a job could not

be started on the execution host CPUA time: The cpu time usage Memory used: The used memory from all Jobs I/O: The amount of data transferred in input/output operations. Runtime: The complete running time from all Jobs

slide-12
SLIDE 12

Compu&ng ¡Cluster ¡Usage ¡Visualiza&on ¡Tool ¡ Sinisa ¡Jovanovic ¡ ¡

26% ~ 29% utilization All groups: Total Runtime: 1467kh Total CPU-Time: 1373kh

Summary