Data & Storage Services
CERN IT Department CH-1211 Genève 23 Switzerland
www.cern.ch/it
DSS
TSM Monitoring @ CERN
Daniele Francesco Kruse CERN IT/DSS
Presented by Giuseppe Lo Presti
20th HEPiX - Vancouver - October 2011
1. Physics data (using CASTOR for this)
2. User PCs (already backing up home AFS/DFS directories)
PB of archived data
million files
17 TSM Servers in production
80 TB of disk storage
TSM monitoring tool developed in-house
about potential issues
TSM Management Station
TSMMS daily report example:
TSMMS also sends an email for each error in each TSM server
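The per-error notification described above could be sketched roughly as follows. This is a minimal illustration, not the TSMMS implementation: the function name `build_error_emails`, the server names, the error strings, and the addresses are all hypothetical, and the messages are only built, not sent.

```python
from email.message import EmailMessage


def build_error_emails(errors_by_server, sender, recipient):
    """Build one message per (server, error) pair; nothing is sent here."""
    messages = []
    for server, errors in sorted(errors_by_server.items()):
        for error in errors:
            msg = EmailMessage()
            msg["From"] = sender
            msg["To"] = recipient
            msg["Subject"] = f"[TSMMS] {server}: {error}"
            msg.set_content(f"Server: {server}\nError: {error}\n")
            messages.append(msg)
    return messages


# Hypothetical sample data: placeholder server names and error texts.
errors_by_server = {
    "tsm-server-1": ["example error A"],
    "tsm-server-2": ["example error B", "example error C"],
}
emails = build_error_emails(
    errors_by_server, "tsmms@example.org", "admin@example.org"
)
print(len(emails))  # 3: one email per error per server
```

Keeping message construction separate from delivery makes the one-email-per-error rule easy to check without a mail server.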
division) and generates graphs and stats for each group
whenever they miss their periodic backup
history performance and stats, associated schedules, etc.
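The check for nodes that miss their periodic backup could look roughly like the sketch below. It assumes the last successful backup time of each node is already known; the node names, the one-day period, and the function `find_missed_backups` are hypothetical and only illustrate the idea.

```python
from datetime import datetime, timedelta


def find_missed_backups(last_backup_by_node, period, now):
    """Return the sorted names of nodes with no backup within `period`."""
    return sorted(
        node
        for node, last in last_backup_by_node.items()
        if last is None or now - last > period
    )


# Hypothetical sample data; `now` is fixed so the result is reproducible.
now = datetime(2011, 10, 24, 12, 0)
last_backup_by_node = {
    "node-a": datetime(2011, 10, 24, 2, 0),  # backed up 10 hours ago
    "node-b": datetime(2011, 10, 21, 2, 0),  # backed up over 3 days ago
    "node-c": None,                          # never backed up
}
missed = find_missed_backups(last_backup_by_node, timedelta(days=1), now)
print(missed)  # ['node-b', 'node-c']
```

The owners of the nodes in `missed` would then be the ones to notify.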
(heavily depending on the TSM 5 database schema)
design and architecture
the alerting tasks will be moved to Splunk
admin with proper information and alerts
Splunk
TSM Admin
Add nodes to TSM
Spot issues and solve them
Check DB space and Tape pools
Handle user support tickets

Need to find a suitable server ...
Need to have a clear view of DB and pools ...
Check quickly for any anomaly in the system
Scope reduced: Splunk does the rest!
Model Layer: TSMMS DB; TSM Server 1, TSM Server 2, TSM Server 3, TSM Server 4, ..., TSM Server N
Controller Layer (Display Logic)
View Layer (HTML and JavaScript Templates)
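The three layers named above can be illustrated with a small sketch. It is only an assumption about how such a split might look, not the actual TSMMS code: the `Model` class stands in for the TSMMS DB, and the field names `server` and `db_used_pct`, the templates, and the sorting rule are all hypothetical.

```python
from string import Template


class Model:
    """Model layer: stands in for the TSMMS DB holding per-server data."""

    def __init__(self, rows):
        self._rows = rows  # list of dicts with hypothetical fields

    def server_status(self):
        return list(self._rows)


# View layer: plain HTML templates with placeholders.
ROW_TEMPLATE = Template("<tr><td>$server</td><td>$db_used_pct%</td></tr>")
PAGE_TEMPLATE = Template("<table>$rows</table>")


def render(rows):
    """Fill the templates; no decisions about ordering are made here."""
    body = "".join(ROW_TEMPLATE.substitute(row) for row in rows)
    return PAGE_TEMPLATE.substitute(rows=body)


def status_page(model):
    """Controller layer (display logic): fullest database first."""
    rows = sorted(
        model.server_status(),
        key=lambda row: row["db_used_pct"],
        reverse=True,
    )
    return render(rows)


model = Model([
    {"server": "TSM Server 1", "db_used_pct": 40},
    {"server": "TSM Server 2", "db_used_pct": 75},
])
html = status_page(model)
print(html)
```

In this sketch the controller only decides what to show and in which order, so the templates can change without touching the data access.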