ORNL is managed by UT-Battelle for the US Department of Energy
Comparative I/O Workload Characterization of Two Leadership Class Storage Clusters
Presented by Sarp Oral
Raghul Gunasekaran, Sarp Oral, Jason Hill,
Enterprise Storage: controllers and large racks of disks are connected via InfiniBand. 36 DataDirect SFA12K-40 controller pairs with 2 TB NL-SAS drives and 8 InfiniBand FDR connections per pair.
Storage Nodes: run parallel file system software and manage incoming FS traffic. 288 Dell servers with 64 GB of RAM each.
SION II Network: provides connectivity between OLCF resources and primarily carries storage traffic. 1600 ports, 56 Gbit/sec InfiniBand switch complex.
Lustre Router Nodes: run parallel file system client software and forward I/O operations from HPC clients. 432 XK7 XIO nodes configured as Lustre routers on Titan.
Clients: Titan XK7 and other OLCF resources.
Link speeds: XK7 Gemini 3D Torus, 9.6 Gbytes/sec per direction; InfiniBand, 56 Gbit/sec; Serial ATA, 6 Gbit/sec.
Figure reference: S. Oral et al., "OLCF's 1 TB/s, next-generation Lustre file system," in Proceedings of the Cray User Group Conference, 2013.
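The component counts above allow a quick back-of-the-envelope check of the nominal InfiniBand capacity on the storage side. The following is a minimal sketch that uses only the figures quoted on the slide (36 controller pairs, 8 FDR connections per pair, 56 Gbit/sec per link). The function names are hypothetical, and the result is a nominal link-rate sum, not a measured or deliverable file system bandwidth.

```python
# Figures quoted on the architecture slide.
CONTROLLER_PAIRS = 36        # DataDirect SFA12K-40 controller pairs
FDR_LINKS_PER_PAIR = 8       # InfiniBand FDR connections per pair
FDR_GBIT_PER_SEC = 56        # nominal FDR link rate


def total_fdr_links() -> int:
    """Number of FDR links between the controllers and the storage nodes."""
    return CONTROLLER_PAIRS * FDR_LINKS_PER_PAIR


def nominal_capacity_gbit_per_sec() -> int:
    """Sum of nominal link rates over all controller-side FDR links."""
    return total_fdr_links() * FDR_GBIT_PER_SEC


def gbit_to_gbyte(gbit: float) -> float:
    """Convert gigabits to gigabytes (8 bits per byte)."""
    return gbit / 8


if __name__ == "__main__":
    links = total_fdr_links()
    gbit = nominal_capacity_gbit_per_sec()
    print(f"{links} FDR links, {gbit} Gbit/sec nominal, "
          f"{gbit_to_gbyte(gbit):.0f} Gbytes/sec nominal")
```

The link count works out to 288, the same as the number of storage node servers listed on the slide. The nominal sum is an upper bound only, since encoding overhead, protocol overhead, and disk performance all reduce what the file system can deliver.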
[Figure: diagram label "ICL (to OSS)"]
[Figure: data collection diagram. Labels: MySQL database; ddntool, management server; DDN SFA12KX (three controllers shown)]
[Figure, two panels. Y-axis: Percentage (%), 20 to 100. X-axes: DDN Couplets (1-36) and DDN Controllers (1-48). Series: Write.]
[Figure, two panels. Y-axis: % of Peak Bandwidth, 20 to 100. X-axes: DDN Controllers (1-48) and DDN Couplets (1-36). Series: Max Read, Max Write.]
Storage system usage over a month
[Figure, two panels. Panel labels: Cumulative Distribution Function (CDF); % of 32 PB; Aggregate bandwidth. Axes: Percentage (%), 40 to 60, versus Time (days of a month, 1 to 30); Distribution P(x), 0.2 to 1, versus Bandwidth (GB/s), ticks at 5, 10, 50, 100, 150, 200, 250.]
[Figure, two panels labeled Spider 1 and Spider 2: Probability Distribution Function (PDF) of request size. Y-axis: Distribution P(x), 0.1 to 0.7. X-axis: Request Size (4k, 8k, 16k, 32k, 64k, 128k, 512k, 1M, 2M, 4M). Series: Read, Write.]
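To make the request-size distribution plots concrete, here is a minimal sketch of how an empirical PDF over the size bins on the x-axis could be computed from a list of request sizes. The binning rule (each request goes to the smallest bin label that is at least its size, and anything above 4M is folded into the last bin) and the function name `request_size_pdf` are assumptions for illustration; the slides do not state how the controllers bucket requests.

```python
from bisect import bisect_left
from collections import Counter

# Bin labels as they appear on the x-axis of the slide (note: no 256k tick).
BIN_LABELS = ["4k", "8k", "16k", "32k", "64k", "128k", "512k", "1M", "2M", "4M"]
BIN_EDGES = [4 << 10, 8 << 10, 16 << 10, 32 << 10, 64 << 10,
             128 << 10, 512 << 10, 1 << 20, 2 << 20, 4 << 20]  # bytes


def request_size_pdf(sizes_bytes):
    """Return {bin label: fraction of requests} for a list of request sizes.

    Assumed rule: a request falls into the smallest bin whose edge is >= its
    size; requests larger than the last edge are counted in the last bin.
    """
    if not sizes_bytes:
        raise ValueError("need at least one request size")
    counts = Counter()
    for size in sizes_bytes:
        index = min(bisect_left(BIN_EDGES, size), len(BIN_EDGES) - 1)
        counts[BIN_LABELS[index]] += 1
    total = len(sizes_bytes)
    return {label: counts[label] / total for label in BIN_LABELS}


if __name__ == "__main__":
    # Hypothetical sample, not data from the presentation.
    sample = [4096, 4096, 3000, 1 << 20, 1 << 20, 1 << 20, 600 * 1024, 5 << 20]
    for label, fraction in request_size_pdf(sample).items():
        print(f"{label:>5}: {fraction:.3f}")
```

Running the same function separately over read and write requests would give the two series shown in each panel.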
[Figure, two panels: Cumulative Distribution Function (CDF) and Probability Distribution Function (PDF) of request latency. Y-axis: Distribution P(x). X-axis: Request Latency (ms), ticks at 16, 32, 64, 128, 256, 512, 1000. Series: Read, Write.]
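As an illustration of how a latency CDF like the one plotted above can be derived, the sketch below computes, for each latency tick on the x-axis, the fraction of requests that completed within that time. The function name `latency_cdf` and the sample values are hypothetical; the presentation does not describe the exact procedure used.

```python
from bisect import bisect_right

# Latency ticks (ms) as they appear on the x-axis of the slide.
THRESHOLDS_MS = [16, 32, 64, 128, 256, 512, 1000]


def latency_cdf(latencies_ms, thresholds=THRESHOLDS_MS):
    """Return [(threshold, P(latency <= threshold))] for each threshold."""
    if not latencies_ms:
        raise ValueError("need at least one latency sample")
    ordered = sorted(latencies_ms)
    total = len(ordered)
    # bisect_right counts how many samples are <= the threshold.
    return [(t, bisect_right(ordered, t) / total) for t in thresholds]


if __name__ == "__main__":
    # Hypothetical sample, not data from the presentation.
    sample = [1, 5, 16, 20, 40, 100, 300, 900, 1200, 2000]
    for threshold, fraction in latency_cdf(sample):
        print(f"<= {threshold:>4} ms: {fraction:.2f}")
```

A CDF that is already high at the smallest tick indicates that most requests complete quickly, while the distance from 1.0 at the 1000 ms tick shows the share of requests in the long tail.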