2010 Blue Waters Performance Modeling Workshop – Opening and Introduction
Torsten Hoefler
1
2010 Blue Waters Performance Modeling Workshop Opening and - - PowerPoint PPT Presentation
2010 Blue Waters Performance Modeling Workshop Opening and Introduction Torsten Hoefler With slides from: William Kramer, Marc Snir, William Gropp, IBM, and the Blue Waters team 1 Introduction and Overview My slides contain only public
1
2
3
4
5 On-line Storage Near-line Storage
L-Link Cables Super Node
(32 Nodes / 4 CEC)
P7 Chip (8 cores) SMP node (32 cores) Drawer (256 cores) SuperNode (1024 cores) Building Block Blue Waters System NPCF
6
7
Quad-chip MCM
8
MC1 MC 0 8c uP MC1 MC0 8c uP MC0 MC1 8c uP MC0 MC1 8c uP
A B X W B A W X B A Y X A B X Y C Z C Z W Z Z W C Y C Y
A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp
MC 0 MC 1 MC 0 MC 1 MC 0 MC 1 MC 0 MC 1
9
10
DIMM 15 DIMM 8 DIMM 9 DIMM 14 DIMM 4 DIMM 12 DIMM 13 DIMM 5 DIMM 11 DIMM 3 DIMM 2 DIMM 10 DIMM 0 DIMM 7 DIMM 6 DIMM 1
MC1 Mem Mem Mem Mem MC 0 Mem Mem 8c uP MC1 Mem Mem Mem Mem MC0 Mem Mem Mem Mem 8c uP 7 Inter-Hub Board Level L-Buses 3.0Gb/s @ 8B+8B, 90% sus. peak
D0-D15 Lr0-Lr23
320 GB/s 240 GB/s
28x XMIT/RCV pairs @ 10 Gb/s 832 624 5+5GB/s (6x=5+1)
Hub Chip Module
22+22GB/s 164 Ll0 22+22GB/s 164 Ll1 22+22GB/s 164 Ll2 22+22GB/s 164 Ll3 22+22GB/s 164 Ll4 22+22GB/s 164 Ll5 22+22GB/s 164 Ll6
12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x
10+10GB/s (12x=10+2)
12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x 12x
7+7GB/s 72 EG2
PCIe 61x
7+7GB/s 72 EG1
PCIe 16x
7+7GB/s 72 EG2
PCIe 8x MC0 Mem Mem Mem Mem MC1 Mem Mem Mem Mem 8c uP MC0 Mem Mem Mem Mem MC1 Mem Mem Mem Mem 8c uP Mem Mem
P7-0 P7-1 P7-3 P7-2
A B X W B A W X B A Y X A B X Y C Z C Z W Z Z W C Y C Y Z W Y X
A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp A Clk Grp B Clk Grp C Clk Grp D Clk Grp
MC 0 MC 1 MC 0 MC 1 MC 0 MC 1 MC 0 MC 1
DIMM 1
Mem Mem
DIMM 0
Mem Mem
DIMM 5
Mem Mem
DIMM 4
Mem Mem
DIMM 7
Mem Mem
DIMM 6
Mem Mem
DIMM 10
Mem Mem
DIMM 11
Mem Mem
DIMM 2
Mem Mem
DIMM 3
Mem Mem
DIMM 14
Mem Mem
DIMM 15
Mem Mem
DIMM 8
Mem Mem
DIMM 9
Mem Mem
DIMM 12
Mem Mem
DIMM 13
Mem Mem
EI-3 PHYs
Torrent
Diff PHYs
L local HUB To HUB Copper Board Wiring
L remote 4 Drawer Interconnect to Create a Supernode Optical LR0 Bus Optical
6x 6x
LR23 Bus Optical
6x 6x
LL0 Bus Copper
8B 8B 8B 8B
LL1 Bus Copper
8B 8B
LL2 Bus Copper
8B 8B
LL4 Bus Copper
8B 8B
LL5 Bus Copper
8B 8B
LL6 Bus Copper
8B 8B
LL3 Bus Copper
Diff PHYs
PX0 Bus
16x 16x
PCI-E IO PHY
Hot Plug Ctl
PX1 Bus
16x 16x
PCI-E IO PHY
Hot Plug Ctl
PX2 Bus
8x 8x
PCI-E IO PHY
Hot Plug Ctl FSI FSP1-A FSI FSP1-B I2C TPMD-A, TMPD-B SVIC MDC-A SVIC MDC-B I2C SEEPROM 1 I2C SEEPROM 2
24 L remote Buses
HUB to QCM Connections Address/Data
D Bus Interconnect of Supernodes Optical
D0 Bus Optical
12x 12x
D15 Bus Optical
12x 12x
16 D Buses 28 I2C I2C_0 + Int I2C_27 + Int
I2C To Optical Modules
TOD Sync 8B Z-Bus 8B Z-Bus TOD Sync 8B Y-Bus 8B Y-Bus TOD Sync 8B X-Bus 8B X-Bus TOD Sync 8B W-Bus 8B W-Bus
11
12
DCA-0 Connector (Top DCA) DCA-1 Connector (Bottom DCA) HUB 7 HUB 6 HUB 4 HUB 3 HUB 5 HUB 1 HUB HUB 2
P C I e 9 P C I e 10 P C I e 11 P C I e 12 P C I e 13 P C I e 14 P C I e 15 P C I e 16 P C I e 17
P1-C17-C1
P C I e 1 P C I e 2 P C I e 3 P C I e 4 P C I e 5 P C I e 6 P C I e 7 P C I e 8
Optical Fan-out from HUB Modules 2,304 Fiber 'L-Link'
64/40 Optical 'D-Link' 64/40 Optical 'D-Link'
P7-0 P7-2 P7-3 P7-1 QCM 0 U-P1-M1 P7-0 P7-2 P7-3 P7-1 QCM 1 U-P1-M2 P7-0 P7-2 P7-3 P7-1 QCM 2 U-P1-M3 P7-0 P7-2 P7-3 P7-1 QCM 3 U-P1-M4 P7-0 P7-2 P7-3 P7-1 QCM 4 U-P1-M5 P7-0 P7-2 P7-3 P7-1 QCM 5 U-P1-M6 P7-0 P7-2 P7-3 P7-1 QCM 6 U-P1-M7 P7-0 P7-2 P7-3 P7-1 QCM 7 U-P1-M8
P1-C16-C1 P1-C15-C1 P1-C14-C1 P1-C13-C1 P1-C12-C1 P1-C11-C1 P1-C10-C1 P1-C9-C1 P1-C8-C1 P1-C7-C1 P1-C6-C1 P1-C5-C1 P1-C4-C1 P1-C3-C1 P1-C2-C1 P1-C1-C1
N0-DIMM15 N0-DIMM14 N0-DIMM13 N0-DIMM12 N0-DIMM11 N0-DIMM10 N0-DIMM09 N0-DIMM08 N0-DIMM07 N0-DIMM06 N0-DIMM05 N0-DIMM04 N0-DIMM03 N0-DIMM02 N0-DIMM01 N0-DIMM00 N1-DIMM15 N1-DIMM14 N1-DIMM13 N1-DIMM12 N1-DIMM11 N1-DIMM10 N1-DIMM09 N1-DIMM08 N1-DIMM07 N1-DIMM06 N1-DIMM05 N1-DIMM04 N1-DIMM03 N1-DIMM02 N1-DIMM01 N1-DIMM00 N2-DIMM15 N2-DIMM14 N2-DIMM13 N2-DIMM12 N2-DIMM11 N2-DIMM10 N2-DIMM09 N2-DIMM08 N2-DIMM07 N2-DIMM06 N2-DIMM05 N2-DIMM04 N2-DIMM03 N2-DIMM02 N2-DIMM01 N2-DIMM00 N3-DIMM15 N3-DIMM14 N3-DIMM13 N3-DIMM12 N3-DIMM11 N3-DIMM10 N3-DIMM09 N3-DIMM08 N3-DIMM07 N3-DIMM06 N3-DIMM05 N3-DIMM04 N3-DIMM03 N3-DIMM02 N3-DIMM01 N3-DIMM00 N4-DIMM15 N4-DIMM14 N4-DIMM13 N4-DIMM12 N4-DIMM11 N4-DIMM10 N4-DIMM09 N4-DIMM08 N4-DIMM07 N4-DIMM06 N4-DIMM05 N4-DIMM04 N4-DIMM03 N4-DIMM02 N4-DIMM01 N4-DIMM00 N5-DIMM15 N5-DIMM14 N5-DIMM13 N5-DIMM12 N5-DIMM11 N5-DIMM10 N5-DIMM09 N5-DIMM08 N5-DIMM07 N5-DIMM06 N5-DIMM05 N5-DIMM04 N5-DIMM03 N5-DIMM02 N5-DIMM01 N5-DIMM00 N6-DIMM15 N6-DIMM14 N6-DIMM13 N6-DIMM12 N6-DIMM11 N6-DIMM10 N6-DIMM09 N6-DIMM08 N6-DIMM07 N6-DIMM06 N6-DIMM05 N6-DIMM04 N6-DIMM03 N6-DIMM02 N6-DIMM01 N6-DIMM00 N7-DIMM15 N7-DIMM14 N7-DIMM13 N7-DIMM12 N7-DIMM11 N7-DIMM10 N7-DIMM09 N7-DIMM08 N7-DIMM07 N7-DIMM06 N7-DIMM05 N7-DIMM04 N7-DIMM03 N7-DIMM02 N7-DIMM01 N7-DIMM00
13
4.6 TB/s Bisection BW BW of 1150 10G-E ports
DCA-0 Connector (Top DCA) DCA-1 Connector (Bottom DCA)
2nd Level Interconnect (1,024 cores)
HUB 7
61x96mmHUB 6
61x96mmHUB 4
61x96mmHUB 3
61x96mmHUB 5
61x96mmHUB 1
61x96mmHUB
61x96mmHUB 2
61x96mmP C I e 9 P C I e 10 P C I e 11 P C I e 12 P C I e 13 P C I e 14 P C I e 15 P C I e 16 P C I e 17 P C I e 1 P C I e 2 P C I e 3 P C I e 4 P C I e 5 P C I e 6 P C I e 7 P C I e 8
Optical Fan-out from HUB Modules 2,304 Fiber 'L-Link' 64/40 Optical 'D-Link'
FSP/CLK-A64/40 Optical 'D-Link'
FSP/CLK-B P7-0 P7-2 P7-3 P7-1 QCM 0 P7-0 P7-2 P7-3 P7-1 QCM 1 P7-0 P7-2 P7-3 P7-1 QCM 2 P7-0 P7-2 P7-3 P7-1 QCM 3 P7-0 P7-2 P7-3 P7-1 QCM 4 P7-0 P7-2 P7-3 P7-1 QCM 5 P7-0 P7-2 P7-3 P7-1 QCM 6 P7-0 P7-2 P7-3 P7-1 QCM 7DCA-0 Connector (Top DCA) DCA-1 Connector (Bottom DCA)
2nd Level Interconnect (1,024 cores)
HUB 7
61x96mmHUB 6
61x96mmHUB 4
61x96mmHUB 3
61x96mmHUB 5
61x96mmHUB 1
61x96mmHUB
61x96mmHUB 2
61x96mmP C I e 9 P C I e 10 P C I e 11 P C I e 12 P C I e 13 P C I e 14 P C I e 15 P C I e 16 P C I e 17 P C I e 1 P C I e 2 P C I e 3 P C I e 4 P C I e 5 P C I e 6 P C I e 7 P C I e 8
Optical Fan-out from HUB Modules 2,304 Fiber 'L-Link' 64/40 Optical 'D-Link'
FSP/CLK-A64/40 Optical 'D-Link'
FSP/CLK-B P7-0 P7-2 P7-3 P7-1 QCM 0 P7-0 P7-2 P7-3 P7-1 QCM 1 P7-0 P7-2 P7-3 P7-1 QCM 2 P7-0 P7-2 P7-3 P7-1 QCM 3 P7-0 P7-2 P7-3 P7-1 QCM 4 P7-0 P7-2 P7-3 P7-1 QCM 5 P7-0 P7-2 P7-3 P7-1 QCM 6 P7-0 P7-2 P7-3 P7-1 QCM 7DCA-0 Connector (Top DCA) DCA-1 Connector (Bottom DCA)
2nd Level Interconnect (1,024 cores)
HUB 7
61x96mmHUB 6
61x96mmHUB 4
61x96mmHUB 3
61x96mmHUB 5
61x96mmHUB 1
61x96mmHUB
61x96mmHUB 2
61x96mmP C I e 9 P C I e 10 P C I e 11 P C I e 12 P C I e 13 P C I e 14 P C I e 15 P C I e 16 P C I e 17 P C I e 1 P C I e 2 P C I e 3 P C I e 4 P C I e 5 P C I e 6 P C I e 7 P C I e 8
Optical Fan-out from HUB Modules 2,304 Fiber 'L-Link' 64/40 Optical 'D-Link'
FSP/CLK-A64/40 Optical 'D-Link'
FSP/CLK-B P7-0 P7-2 P7-3 P7-1 QCM 0 P7-0 P7-2 P7-3 P7-1 QCM 1 P7-0 P7-2 P7-3 P7-1 QCM 2 P7-0 P7-2 P7-3 P7-1 QCM 3 P7-0 P7-2 P7-3 P7-1 QCM 4 P7-0 P7-2 P7-3 P7-1 QCM 5 P7-0 P7-2 P7-3 P7-1 QCM 6 P7-0 P7-2 P7-3 P7-1 QCM 7DCA-0 Connector (Top DCA) DCA-1 Connector (Bottom DCA)
2nd Level Interconnect (1,024 cores)
HUB 7
61x96mmHUB 6
61x96mmHUB 4
61x96mmHUB 3
61x96mmHUB 5
61x96mmHUB 1
61x96mmHUB
61x96mmHUB 2
61x96mmP C I e 9 P C I e 10 P C I e 11 P C I e 12 P C I e 13 P C I e 14 P C I e 15 P C I e 16 P C I e 17 P C I e 1 P C I e 2 P C I e 3 P C I e 4 P C I e 5 P C I e 6 P C I e 7 P C I e 8
Optical Fan-out from HUB Modules 2,304 Fiber 'L-Link' 64/40 Optical 'D-Link'
FSP/CLK-A64/40 Optical 'D-Link'
FSP/CLK-B P7-0 P7-2 P7-3 P7-1 QCM 0 P7-0 P7-2 P7-3 P7-1 QCM 1 P7-0 P7-2 P7-3 P7-1 QCM 2 P7-0 P7-2 P7-3 P7-1 QCM 3 P7-0 P7-2 P7-3 P7-1 QCM 4 P7-0 P7-2 P7-3 P7-1 QCM 5 P7-0 P7-2 P7-3 P7-1 QCM 6 P7-0 P7-2 P7-3 P7-1 QCM 714
L-Link Cables Super Node
(32 Nodes / 4 CEC)
15
16
17
18
19
20