Bright Cluster Manager
Advanced HPC cluster management made easy
Martijn de Vries
CTO Bright Computing
Bright Cluster Manager Advanced HPC cluster management made easy - - PowerPoint PPT Presentation
Bright Cluster Manager Advanced HPC cluster management made easy Martijn de Vries CTO Bright Computing About Bright Computing Bright Computing 1. Develops and supports Bright Cluster Manager for HPC systems and server farms 2. Incorporated
Martijn de Vries
CTO Bright Computing
2
3
I ndustry Governm ent Academ ia
4
5
6
CMDaemon
7
Cluster Management Shell Cluster Management GUI SSL / SOAP / X509 / IPtables Cluster Management Daemon Disk Ethernet Interconnect IPMI / iLO PDU CPU GPU Memory PBS Pro Torque Maui/MOAB Grid Engine SLURM LSF* Monitoring Automation Health Checks Management Compilers Libraries Debuggers Profilers Provisioning SLES / RHEL / CentOS / SL / Oracle EL SLES / RHEL / CentOS / SL / Oracle EL ScaleMP vSMP
8
9
12
17
20
CMDaemon
BMC BMC BMC
25
26
27
31
– Halt workload manager few (milli)seconds before job is executed – Check health of each reserved node – If unhealthy, take off line, inform system administrator – Hand job back to workload manager
– Run health check when node is not used – Run health check through queuing system
– Most thorough health check – Requires reboot
36
37
38
40
41
42