IT & PH DEPT.
Process Monitoring Of Nightly Builds PH/SFT & IT/CF
Summer Student Willem Van Lint
Process Monitoring Of Nightly Builds PH/SFT & IT/CF Summer - - PowerPoint PPT Presentation
IT & PH DEPT. Process Monitoring Of Nightly Builds PH/SFT & IT/CF Summer Student Willem Van Lint Overview IT & PH DEPT. Nightly builds & problem statement Lemon monitoring framework Design of the process
Summer Student Willem Van Lint
Server Linux Client Get work unit DoBuild.py runs Compile ROOT Install ... Test ... Compile ... Install … Test ... ... Windows Client ... MySQL Mac Client ... Stores Web interface
PID TTY STAT TIME COMMAND 19485 ? Ss 0:00 /bin/sh 19486 ? S 0:00 \_ /bin/sh /afs/cern.ch/sw/lcg/app/nightlies/scripts/launch_client.sh lxbuild147 8002 19594 ? S 0:00 | \_ python /afs/cern.ch/sw/lcg/app/nightlies/scripts/client.py --machine lxbuild147 19600 ? Z 0:00 | \_ [uptime] <defunct> 15940 ? S 0:00 | \_ python /afs/cern.ch/sw/lcg/app/nightlies/scripts/doBuild.py --slots dev1 15947 ? S 0:00 | | \_ /bin/sh -c source{SITEROOT}/sw/contrib/ 21661 ? S 0:00 | | \_ cmt pkg_make 4 21683 ? S 0:00 | | \_ sh -c mkdir -p logs; 21690 ? S 0:00 | | \_ sh -x /build/nightlies/dev1/Fri/LCGCMT/LCGCMT_59 21695 ? R 5781:25 | | | \_ make -k -j4 21691 ? S 0:00 | | \_ tee -a logs/ROOT_x86_64-slc5-gcc43-dbg_make.log 8628 ? Z 0:00 | \_ [python] <defunct> 19487 ? S 0:00 \_ tee /afs/cern.ch/sw/lcg/app/nightlies/nightlies-logs/crncli64148.txt
Web browser
Lemon CLI
User
Oracle Database
Repository backend SQL
Nodes
Monitoring Agent
Sensor Sensor Sensor
RRDT
/ PHP apache
HTTP
Lemon-host-check
Applicati
TCP/UDP
Nodes
Monitoring Agent
Sensor Sensor Sensor
30010 MetricName exception.hangingcpu MetricClass alarm.exception Timing 20 5 Parameters Correlation (33:2 > 1000) && (33:1 > 0) Actuator /usr/bin/lemon-actuator-kill cputime $act_value_02 MaxRuns 3 900 Timeout 100
Monitoring Agent Sensor Sensor wrapper Metric module Actuator Exception Metric module
Machi ne Metric nr Time PID Total cpu time Project name lxbuild 148 5380 12825 52613 2912 9 Project0 lxbuild 148 5380 12825 52613 3813 5 Project1
Machine Metric nr Time PID Path Amount written Project lxbuild148 5381 Mon Aug 23 04:03:54 2010 16231 .../x86_64- slc5- gcc43-
12028 GAUDI lxbuild148 5381 Mon Aug 23 04:03:54 2010 17053 …/x86_64- slc5-icc11- dbg- tests.log COOL
Monitoring Agent Sensor Sensor wrapper client (repeater) Metric module Sensor wrapper server RPC Linux machine Windows/Mac machine(s) Sensor wrapper client (repeater) ...