TSUBAME---A Year Later

SLIDE 1

TSUBAME---A Year Later

Satoshi Matsuoka, Professor/Dr.Sci.
Global Scientific Information and Computing Center, Tokyo Inst. Technology
& NAREGI Project, National Inst. Informatics

EuroPVM/MPI, Paris, France, Oct. 2, 2007

SLIDE 2

Topics for Today

  • Intro
  • Upgrades and other New stuff
  • New Programs
  • The Top 500 and Acceleration
  • Towards TSUBAME 2.0
SLIDE 3

The TSUBAME Production "Supercomputing Grid Cluster", Spring 2006-2010

  • Sun Galaxy 4 compute nodes (Opteron dual-core, 8-socket): 10,480 cores / 655 nodes, 21.4 Terabytes memory, 50.4 TeraFlops; OS Linux (SuSE 9, 10); NAREGI Grid MW
  • ClearSpeed CSX600 SIMD accelerator: 360 boards, 35 TeraFlops (current)
  • Storage: 1.0 Petabyte (Sun "Thumper", 48 x 500 GB disks per unit) + 0.1 Petabyte (NEC iStore); Lustre FS, NFS, CIFS, WebDAV (over IP); 50 GB/s aggregate I/O BW (diagram callouts: 1.5 PB, 70 GB/s)
  • NEC SX-8i (for porting)
  • Unified IB network: Voltaire ISR9288 Infiniband, 10 Gbps x2 (DDR next ver.), ~1310+50 ports, ~13.5 Terabits/s (3 Tbits bisection)
  • 10 Gbps+ external network
  • "Fastest Supercomputer in Asia", 29th Top500 @ 48.88 TF

SLIDE 4

Titech TSUBAME: ~76 racks, 350 m2 floor area, 1.2 MW (peak)

SLIDE 5

Photo captions: ~500 TB out of 1.1 PB; node rear; local Infiniband switch (288 ports), currently 2 GB/s per node, easily scalable to 8 GB/s per node; cooling towers (~32 units)

SLIDE 6

TSUBAME assembled like iPod…

  • NEC: main integrator, storage, operations
  • Sun: Galaxy compute nodes, storage, Solaris
  • AMD: Opteron CPU (Fab36)
  • Voltaire: Infiniband network
  • ClearSpeed: CSX600 accelerator
  • CFS: parallel FS (Lustre)
  • Novell: SuSE 9/10
  • NAREGI: Grid MW
  • Titech GSIC: us

Components sourced from the UK, Germany, Israel, the USA, and Japan.

SLIDE 7

The racks were ready; nodes arrive en masse

SLIDE 8

Design Principles of TSUBAME (1)

  • Capability and Capacity: have the cake and eat it, too!
    – High-performance, low-power x86 multi-core CPU
      • High INT-FP, high cost-performance, highly reliable
      • Latest process technology – high performance and low power
      • Best applications & software availability: OS (Linux/Solaris/Windows), languages/compilers/tools, libraries, Grid tools, all ISV applications
    – FAT node architecture (later)
      • Multicore SMP – most flexible parallel programming
      • High memory capacity per node (32/64/128(new) GB)
      • Large total memory – 21.4 Terabytes
      • Low node count – improved fault tolerance, eases network design
    – High-bandwidth Infiniband network, IP-based (over RDMA)
      • (Restricted) two-staged fat tree
      • High bandwidth (10-20 Gbps/link), multi-lane, low latency (<10 microsec), reliable/redundant (dual-lane)
      • Very large switch (288 ports) => low switch count, low latency
      • Resilient to all types of communications: nearest neighbor, scatter/gather collectives, embedding multi-dimensional networks
      • IP-based for flexibility, robustness, synergy with Grid & Internet
SLIDE 9

Design Principles of TSUBAME (2)

  • PetaByte large-scale, high-performance, reliable storage
    – All-disk storage architecture (no tapes), 1.1 Petabytes
      • Ultra-reliable SAN/NFS storage for /home (NEC iStore), 100 TB
      • Fast NAS/Lustre PFS for /work (Sun Thumper), 1 PB
    – Low cost / high performance SATA2 (500 GB/unit)
    – High-density packaging (Sun Thumper), 24 TeraBytes/4U
    – Reliability thru RAID6, disk rotation, SAN redundancy (iStore)
      • Overall HW data loss: once / 1000 years
    – High-bandwidth NAS I/O: ~50 GBytes/s Livermore benchmark
    – Unified storage and cluster interconnect: low cost, high bandwidth, unified storage view from all nodes w/o special I/O nodes or SW
  • Hybrid architecture: general-purpose scalar + SIMD vector acceleration w/ ClearSpeed CSX600
    – 35 Teraflops peak @ 90 KW (~1 rack of TSUBAME)
    – General-purpose programmable SIMD vector architecture

SLIDE 10

TSUBAME Architecture =

Commodity PC Cluster + Traditional FAT node Supercomputer + The Internet & Grid + (Modern) Commodity SIMD-Vector Acceleration + iPod (HW integration & enabling services)

SLIDE 11

TSUBAME Physical Installation

  • 3 rooms (600 m2), 350 m2 service area
  • 76 racks incl. network & storage, 46.3 tons
    – 10 storage racks
  • 32 AC units, 12.2 tons
  • Total 58.5 tons (excl. rooftop AC heat exchangers)
  • Max 1.2 MWatts
  • ~3 weeks construction time

Floor plan labels: 1st Floor, 2nd Floor A, 2nd Floor B; Titech Grid Cluster; TSUBAME & storage

SLIDE 12

TSUBAME Network: (Restricted) Fat Tree, IB-RDMA & TCP-IP

  • X4600 x 120 nodes (240 ports) per switch => 600 + 55 nodes, 1310 ports, 13.5 Tbps
  • IB 4x 10 Gbps x 2 per node
  • Voltaire ISR9288 switches
  • IB 4x 10 Gbps x 24 between switches
  • Bisection BW = 2.88 Tbps x 2
  • X4500 x 42 nodes (42 ports, IB 4x 10 Gbps) => 420 Gbps
  • Single-mode fiber for cross-floor connections
  • External Ether
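As a rough sanity check, the aggregate and bisection figures above follow from the port counts and ~10 Gbps per IB 4x link; a minimal sketch in C, assuming (our reading) that the "x 2" denotes two rails / two core planes:

    /* Back-of-the-envelope check of the network figures on this slide
     * (port counts taken from the slide; the two-plane assumption is ours). */
    #include <stdio.h>

    int main(void) {
        const double link_gbps   = 10.0;   /* IB 4x SDR, ~10 Gbps per link            */
        const int    node_ports  = 1310;   /* 655 compute nodes x 2 rails             */
        const int    extra_ports = 50;     /* storage/service ports (slide: "+50")    */
        const int    core_ports  = 288;    /* Voltaire ISR9288 port count             */
        const int    core_planes = 2;      /* two rails / two core planes (assumed)   */

        double aggregate_tbps = (node_ports + extra_ports) * link_gbps / 1000.0;
        double bisection_tbps = core_ports * link_gbps * core_planes / 1000.0;

        printf("aggregate injection BW ~ %.1f Tbps (slide: ~13.5 Tbps)\n", aggregate_tbps);
        printf("restricted fat-tree bisection ~ %.2f Tbps (slide: 2.88 Tbps x 2)\n",
               bisection_tbps);
        return 0;
    }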

SLIDE 13

The Benefits of Being "Fat Node"

  • Many HPC apps favor large SMPs
  • Flexible programming models---MPI, OpenMP, Java, ...
  • Lower node count – higher reliability/manageability
  • Full interconnect possible --- less cabling & smaller switches, multi-link parallelism, no "mesh" topologies

Machine                                   CPUs/Node   Memory/Node      Peak/Node
Typical PC Cluster                        2~4         1~8 GB           10~40 GF
IBM BG/L                                  2           0.5~1 GB         5.6 GF
TSUBAME (Tokyo Tech)                      16          32~128(new) GB   76.8 GF + 96 GF
The Earth Simulator                       8           16 GB            128 GF
Fujitsu PrimePower (Kyoto-U, Nagoya-U)    64~128      512 GB           532.48~799 GF
Hitachi SR11000 (U-Tokyo, Hokkaido-U)     8, 16       32~64 GB         60.8~135 GF
IBM eServer (SDSC DataStar)               8, 32       16~128 GB        48~217.6 GF

SLIDE 14

TSUBAME Cooling Density Challenge

  • Room 2F-B
    – 480 nodes, 1330 W/node max, 42 racks
    – Rack area = 2.5m x 33.2m = 83 m2 = 922 ft2
      • Rack space only---excludes CRC units
    – Max power = X4600 nodes 1330 W x 480 nodes + IB switches 3000 W x 4 = 650 KW
    – Power density ~= 700 W/ft2 (!)
  • Well beyond state-of-the-art datacenters (500 W/ft2)
    – Entire floor area ~= 14m x 14m ~= 200 m2 = 2200 ft2
    – But if we assume 70% added cooling power, as in the Earth Simulator, the total is 1.1 MW – still ~500 W/ft2
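The density figures above are straightforward arithmetic; a minimal sketch, assuming (as the slide does) a 70% cooling overhead as for the Earth Simulator:

    /* Quick check of the power-density arithmetic on this slide
     * (input values copied from the slide). */
    #include <stdio.h>

    int main(void) {
        const double node_w      = 1330.0;    /* max W per X4600 node            */
        const int    nodes       = 480;
        const double ib_switch_w = 3000.0;
        const int    ib_switches = 4;
        const double rack_ft2    = 922.0;     /* 2.5 m x 33.2 m = 83 m2          */
        const double floor_ft2   = 2200.0;    /* ~14 m x 14 m = ~200 m2          */

        double it_kw = (node_w * nodes + ib_switch_w * ib_switches) / 1000.0;
        printf("IT load          : %.0f kW\n", it_kw);                        /* ~650 kW */
        printf("rack-area density: %.0f W/ft2\n", it_kw * 1000.0 / rack_ft2); /* ~700    */

        double with_cooling_kw = it_kw * 1.7; /* +70% cooling, Earth Simulator style */
        printf("incl. cooling    : %.2f MW -> %.0f W/ft2 over the whole floor\n",
               with_cooling_kw / 1000.0, with_cooling_kw * 1000.0 / floor_ft2); /* ~500 */
        return 0;
    }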

SLIDE 15

TSUBAME Physical Installation

  • 700 W/ft2 in the hatched area
  • 500 W/ft2 for the whole room
  • High-density cooling & power reduction

(2nd Floor B floor plan: TSUBAME)

SLIDE 16

Cooling and Cabling at 700 W/ft2
---hot/cold row separation and rapid airflow---

Diagram labels:
  • 46U racks, 11 SunFire X4600 units each
  • Low ceiling (3 m): smaller air volume
  • CRC units, 25-27 degrees
  • 45 cm raised floor, cabling only---no floor cooling, no turbulent airflow causing hotspots
  • Cold row / isolated hot row; narrow aisles
  • Pressurized cool air increases effective air volume, evens flow
  • Isolation plate prevents Venturi effect

SLIDE 17

Photo captions:
  • Very narrow hot-row aisle---hot air from the nodes on the right is immediately absorbed and cooled by the CRC units on the left
  • Narrow cold-row aisle---no floor cooling, just cables underneath
  • Duct openings on the ceiling, and transparent isolation plates to prevent hot-cold mixture
  • Pressurized cold air blowing down from the ceiling duct---very strong wind

SLIDE 18

TSUBAME as No.1 in Japan circa 2006

All university national centers combined: total 45 TeraFlops, 350 Terabytes (circa 2006)

TSUBAME: >85 TeraFlops, 1.1 Petabytes, 4-year procurement cycle
  • Has beaten the Earth Simulator in both peak and Top500
  • Has beaten all the other univ. centers combined

SLIDE 19

Isolated High-End vs. "Everybody's Supercomputer"

The isolated high-end ("Might as well use my laptop"):
  • Different usage env. from the client's PC
  • No HP sharing with the client's PC
  • Special HW/SW, lack of ISV support
  • Lack of common development env. (e.g. Visual Studio)
  • Simple batch-based use; no interactive usage or good UI

=> Massive usage environment gap

Service-oriented idealism of the Grid: seamless integration of supercomputer resources with end-user and enterprise environments

Seamless, ubiquitous access and usage => breakthrough science through commoditization of supercomputing and Grid technologies: "Everybody's Supercomputer" ("Hmm, it's like my personal machine")

SLIDE 20

Grid Portal-based WebMO

Computational chemistry web portal for a variety of apps (Gaussian, NWChem, GAMESS, MOPAC, Molpro) (Prof. Takeshi Nishikawa @ GSIC)

Workflow: 1. SSO  2. Job mgmt  3. Edit molecules  4. Set conditions (runs on TSUBAME and WinCCS)

HPC services in educational activities to over 10,000 users
  • High-end education using supercomputers in undergrad labs
    – High-end simulations to supplement "physical" lab courses
  • Seamless integration of lab resources to SCs w/ grid technologies
  • Portal-based application usage

"My desktop scaled to 1000 CPUs!" ☺

SLIDE 21

TSUBAME General-Purpose DataCenter Hosting
As a core of IT consolidation: all university members == users

  • Campus-wide AAA system (April 2006)
    – 50 TB (for email), 9 Galaxy1 nodes
  • Campus-wide storage service (NEST)
    – 10s of GBs for everyone on campus
    – PC-mountable, but accessible directly from TSUBAME
    – Research repository
  • CAI, on-line courses (OCW = Open CourseWare)
  • Administrative hosting (VEST)

"I can back up ALL my data" ☺

SLIDE 22

Tsubame Status

How it’s flying about… (And doing some research too)

SLIDE 23

TSUBAME Timeline

  • 2005, Oct. 31: TSUBAME contract
  • Nov. 14th: announce @ SC2005
  • 2006, Feb. 28: stopped services of the old SCs
    – SX-5, Origin2000, HP GS320
  • Mar 1~Mar 7: moved the old machines out
  • Mar 8~Mar 31: TSUBAME installation
  • Apr 3~May 31: Experimental production phase 1
    – 32 nodes (512 CPUs), 97 Terabytes storage, free usage
    – May 1~8: whole-system Linpack, achieving 38.18 Teraflops (May 8th), #7 on the 28th Top500
  • June 1~Sep. 30: Experimental production phase 2
    – 299 nodes (4,748 CPUs), still free usage
  • Sep. 25-29: Linpack w/ ClearSpeed, 47.38 TF
  • Oct. 1: Full production phase
    – ~10,000 CPUs, several hundred Terabytes for SC
    – Innovative accounting: Internet-like Best Effort & SLA

SLIDE 24

TSUBAME Scheduling and Accounting
---Synonymity w/ Existing Social Infrastructures

  • Three account/queue types (VO-based) (REAL MONEY!)
    – Small FREE usage: "promotion trial (catch-and-bait)"
    – Service Level Agreement: "cell phones"
      • Exclusivity and other high QoS guarantees
    – Best Effort (new): "Internet ISP"
      • Flat allocation fee per each "UNIT"
  • Investment model for allocation (e.g. "stocks & bonds")
    – Open & extensive information, fair policy guarantee
    – Users make their own investment decisions---collective societal optimization (Adam Smith), c.f. top-down planned allocation (planned economy)

Example (Nano VO, max CPU = 192): 64-CPU units allocated per month, Jan-Mar
Dynamic machine-level resource allocation, priority SLA > BES > Small
Over 1,300 SC users, 10,000 accounts
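The SLA > Best Effort > Small ordering above is a policy statement; purely as an illustration (the type and field names here are ours, not TSUBAME's scheduler), competing jobs could be ranked like this:

    /* Illustrative sketch of the SLA > BES > Small(free) queue ordering,
     * with submission time as the tie-breaker within a class. */
    #include <stdio.h>
    #include <stdlib.h>

    typedef enum { QUEUE_SMALL = 0, QUEUE_BES = 1, QUEUE_SLA = 2 } queue_class;

    typedef struct {
        const char *name;
        queue_class cls;
        double      submit_time;   /* earlier submissions win ties within a class */
    } job;

    static int by_priority(const void *a, const void *b) {
        const job *x = a, *y = b;
        if (x->cls != y->cls) return (int)y->cls - (int)x->cls;   /* SLA first */
        return (x->submit_time > y->submit_time) - (x->submit_time < y->submit_time);
    }

    int main(void) {
        job q[] = { {"param-sweep",   QUEUE_SMALL, 10.0},
                    {"megabank-risk", QUEUE_SLA,   30.0},
                    {"mpi-cfd",       QUEUE_BES,   20.0} };
        qsort(q, 3, sizeof q[0], by_priority);
        for (int i = 0; i < 3; i++) printf("%d: %s\n", i, q[i].name);
        return 0;
    }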

SLIDE 25

Batch Queue Prediction on TSUBAME (work w/ Rich Wolski, UCSB)

  • Long wait times for small jobs due to massive parameter sweeps
  • Long wait times for large jobs due to long-running MPI jobs that are difficult to pre-empt and require app-specific QoS (e.g., memory)
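The batch-queue-prediction work referenced above estimates bounds on how long a newly submitted job will wait. A minimal sketch of the underlying idea only---an empirical quantile over historical waits, not the actual UCSB implementation:

    /* Sketch: predict "95% of similar jobs started within T seconds" from a
     * history of observed wait times (the history values are hypothetical). */
    #include <stdio.h>
    #include <stdlib.h>

    static int cmp_dbl(const void *a, const void *b) {
        double x = *(const double *)a, y = *(const double *)b;
        return (x > y) - (x < y);
    }

    /* Return the q-quantile (0..1) of n observed wait times, in seconds. */
    static double wait_quantile(double *waits, size_t n, double q) {
        qsort(waits, n, sizeof *waits, cmp_dbl);
        size_t idx = (size_t)(q * (double)(n - 1));
        return waits[idx];
    }

    int main(void) {
        /* hypothetical wait-time history (seconds) for, say, <=8-CPU jobs */
        double history[] = {30, 45, 60, 90, 120, 150, 300, 600, 1200, 7200};
        size_t n = sizeof history / sizeof history[0];
        printf("predicted 95%%-quantile wait: %.0f s\n",
               wait_quantile(history, n, 0.95));
        return 0;
    }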

SLIDE 26

New School Year

SLIDE 27

Tsubame in Magazines (e.g., Unix Magazine, a 20 page special)

SLIDE 28

For Details…

  • A ~70-page document that describes the policy, the implementation, and every other little detail… (by M. Hamakawa @ Sun Services, Japan)

SLIDE 29

Titech Supercomputer Contest "The 12th SuperCon"

  • High-school students (~10 teams out of 50 team applications)
  • Since 1995: Cray => Origin => TSUBAME
  • 700 CPUs allocated for 1 week
  • Sponsors

Multiple testimonies: "TSUBAME was so easy to use, just like my PC, but much faster!"

SLIDE 30

TSUBAME Application Profile

  • Large-scale codes, e.g. ports from the Earth Simulator
    – Simple porting is easy
    – Turning tuned vector code into cache-friendly "normal code" takes more time
  • Large-scale (>1,000~10,000 instances) parameter survey, ensemble, optimization, …
  • Lots of ISV code---Gaussian, Amber, …
  • Storage-intensive codes---visualization
  • => Often limited by memory, not CPUs
  • Must give users both EASE and a COMPELLING REASON to use TSUBAME

SLIDE 31

TSUBAME Applications---Massively Complex Turbulent Flow and its Visualization (by Tanahashi Lab and Aoki Lab, Tokyo Tech.)

Turbulent flow from an airplane; Taylor-Couette flow

SLIDE 32

AMBER Example: 1UAO with Water Molecules

  • Smallest protein, chignolin, in a TIP3P water buffer (30 Å radius)
  • 37,376 atoms
  • Cutoff 20.0 Å
  • 2.0 fs timestep

The three conditions show good scalability in the 30 Å and 40 Å cases.

SLIDE 33

TSUBAME Job Statistics

  • Dec. 2006 - Aug. 2007 (#jobs)
  • 797,886 jobs (~3,270 daily)
  • 597,438 serial jobs (74.8%)
  • 121,108 <=8p jobs (15.2%)
  • 129,398 ISV application jobs (16.2%)
  • However, >32p jobs account for 2/3 of cumulative CPU usage

[Chart: number of TSUBAME jobs vs. #processors per job (=1p, <=8p, <=16p, <=32p, <=64p, <=128p, >128p)]

Coexistence of ease-of-use in both
  • short-duration parameter survey
  • large-scale MPI
(Both are hard for a physically large-scale distributed grid)

SLIDE 34

TSUBAME Job Statistics for ISV Apps (# Processes)

[Chart: # of jobs over 10 months (except PGI_CDK) for ABAQUS, AMBER, AVS_Express (Developer + PCE), EnSight, Gaussian, GaussView, GROMACS, Mathematica, MATLAB, Molpro, MOPAC, MSC_NASTRAN, MSC_PATRAN, NWChem, POV-Ray, SAS, Tinker, UTChem, GAMESS; Amber 8%, Gaussian 1%]

SLIDE 35

Reprisal: TSUBAME Job Statistics for ISV Apps (# CPU Timeshare)

[Chart: CPU timeshare from Apr. 2006 to Jan. 2007 (ISV apps only) for ABAQUS, AMBER, AVS_Express, Discovery Studio, EnSight, Gaussian, GaussView, GROMACS, MaterialsExplorer, Materials Studio, Mathematica, MATLAB, Molpro, MOPAC, MSC_NASTRAN, MSC_PATRAN, NWChem, PGI_CDK; Gaussian 6%, Amber 35%]

Multi-user and ensemble! (60,000-way Gaussian ensemble job recorded on TSUBAME) => Throughput(!)

SLIDE 36

TSUBAME Draws Research Grants

  • "Computationism" Global Center-of-Excellence (Global COE) Program
    – Incubating math/computer science/HPC experts
    – $2~2.5 mil x 5 years
  • "Center of (Industrial) Innovation Program"
    – Industrial collaboration w/ high-end facilities
    – ~$1 mil x 5 years
  • More coming…
SLIDE 37

Computationism Approach to Science

Non-traditional computational modeling: apply non-traditional mathematical approaches, making the impossible (infeasible) possible.

Example) Proteomic interactions: 1000 x 1000 mutual interactions of proteins (P1 … P1000 against P1 … P1000), complex & large scale
  • Infeasible with traditional ab-initio approaches: 100s of years on a Petascale supercomputer
  • Structural matching [Y. Akiyama] ⇒ non-traditional modeling and approach, possible in a few months
  • Complexity: 1000 ⇒ 1000 x 1000
  • Drug design: narrowing the candidates

SLIDE 38

Educating "Computationism Experts"
Incubating Computing Generalists

Target profile (collaborates with a domain-scientist counterpart):
  • Theory of computing & applied math: algorithms, optimization theory, probabilistic theory, …
  • HPC & CS expertise: modeling, programming, systems, …
  • Computationism ideology: work with domain scientists; willing to study and understand the science and the discipline

SLIDE 39

Building the COE on TSUBAME

COE TSUBAME @ GSIC, Titech (diagram):
  • 10 Gbps InfiniBand, 2,304 ports; Super Titanet
  • Sun Fire X4600 nodes + ClearSpeed Advance accelerator boards (360 boards, 35 TFlops peak); Sun Fire X4500 storage; NEC iStorage S1800AT
  • 657 nodes, 5,256 CPUs, 10,512 cores; 50.6 TFlops (peak) scalar, 21.7 Terabytes memory; 85 TFlops (peak) total, 47.38 TFlops (Linpack); #1 in Asia
  • 24 Gbps; 62 storage nodes, 1.5 Petabytes disk storage; 0.1 PB RAID6
  • Roles: production HPC service, COE education, COE research, TSUBAME acceleration, TSUBAME storage extensions

SLIDE 40

Ministry of Edu. "Center of Innovation Program"

Industrial collaboration w/ high-end facilities: provide industrial access to TSUBAME (via Grid)

  • (x86) PC & WS apps in industry directly execute at x10~x100 scale
    – Not just CPU power but memory/storage/network, etc.
  • HPC-enabling non-traditional industries---ICT, financials, security, retail, services, … (incl. Java, Perl codes)
  • E.g. ultra-large-scale portfolio risk analysis by a megabank (ongoing)

SLIDE 41

Why are Industries Interested in TSUBAME?

  • Standard corporate x86 cluster env. vs. TSUBAME:

                    Std. corporate cluster   TSUBAME
Per node
  CPU cores         2~4                      16
  RAM               2~8 GB                   32~128 GB
  Disk (cap, BW)    500 GB, 50 MB/s          120 TB, 1 GB/s
  Network           1 Gbps                   20 Gbps
Per job
  CPU cores         32~128                   1,920
  RAM               128 GB                   3,840 GB
  Disk (cap, BW)    10 TB (NAS), 100 MB/s    120 TB, 3 GB/s
  Network           32 Gbps                  2.5 Tbps

  => roughly x10~x60 advantage

SLIDE 42

The Industry Usage is Real(!!!) and will be Stellar(!!!)

  • Two calls since July: 8 real industry apps for TSUBAME (and 18 others for the Nat'l Univ. Centers coalition)
  • Example: a Japanese megabank has run a real financial analysis app on 1/3 of TSUBAME, and is EXTREMELY happy with the stellar results
    – Only runnable with >20 GB mem, IB-based I/O
    – Stay tuned for follow-on announcements…
  • Big booster for non-dedicated commercial usage
    – The overall grid must be as such

SLIDE 43

Research: Grid Resource Sharing with Virtual Clusters ([CCGrid2007] etc.)

  • Virtual Cluster
    – Virtual Machines (VMs) as computing nodes
      • Per-user customization of the exec environment
      • Hides software heterogeneity
      • Seamless integration with the user's own resources
    – Interconnected via overlay networks
      • Hides network asymmetry
      • Overcomes private networks and firewalls

[Diagram: physical resources host Virtual Cluster A (User A, 128 nodes, MPI, Java) and Virtual Cluster B (User B, 200 nodes, MPI, gcc), extended with the user's own resources]

SLIDE 44

Our VPC Installer Architecture

[Diagram: the user submits a virtual cluster requirement; an installation server holds VM images and packages and deploys VMs across Site A and Site B]

  • Easy specification of the installation request
  • Scalable image transfer
  • Fast environment construction on VMs
  • Autonomic scheduling of VM resources

SLIDE 45

Scalability w/ # of VPC Nodes: Optimistic Extrapolation to 1000 VMs

[Chart: construction time (sec) vs. number of nodes, broken down into whole / transfer / installation; outliers likely due to some unstable HDDs]

  • If we pruned unreasonably slow HDDs… a 1000-VM virtual cluster in less than 1 minute!

SLIDE 46

TSUBAME Siblings---The Domino Effect on Major Japanese SCs

  • Sep. 6th, 2006---U-Tokyo, Kyoto-U, and U-Tsukuba announced a "common procurement procedure" for the next-gen SCs in 1H2008
    – 100-150 TFlops
    – HW: x86 cluster-like SC architecture
    – NW: Myrinet 10G or IB + Ethernet
    – SW: Linux + SCore, common Grid MW
  • Previously, ALL centers ONLY had dedicated SCs
  • Other centers will likely follow…
    – No other choice to balance widespread usage, performance, and prices
    – Makes EVERY sense for university mgmt.
  • (VERY) standardized SW stack and HW configuration
    – Adverse architecture diversity has been an impediment to Japanese Grid infrastructure

SLIDE 47

Japan's 9 Major University Computer Centers (excl. National Labs), circa Spring 2006
(10 Gbps SuperSINET interconnecting the centers)

  • Hokkaido University, Information Initiative Center: HITACHI SR11000, 5.6 Teraflops
  • Tohoku University, Information Synergy Center: NEC SX-7, NEC TX7/AzusA
  • University of Tokyo, Information Technology Center: HITACHI SR8000, HITACHI SR11000, 6 Teraflops; others (in institutes)
  • University of Tsukuba: FUJITSU VPP5000, PACS-CS, 14.5 TFlops
  • Tokyo Inst. Technology, Global Scientific Information and Computing Center: 2006 NEC/SUN TSUBAME, 85 Teraflops
  • National Inst. of Informatics: SuperSINET/NAREGI Testbed, 17 Teraflops
  • Nagoya University, Information Technology Center: FUJITSU PrimePower2500, 11 Teraflops
  • Kyoto University, Academic Center for Computing and Media Studies: FUJITSU PrimePower2500, 8.9 Teraflops
  • Osaka University, CyberMedia Center: NEC SX-5/128M8, HP Exemplar V2500/N, 1.2 Teraflops
  • Kyushu University, Computing and Communications Center: FUJITSU VPP5000/64, IBM Power5 p595, 5 Teraflops

~60 SC centers in Japan incl. the Earth Simulator
  • 10 Petaflop center by 2012

SLIDE 48

Japan's 9 Major University Computer Centers (excl. National Labs), circa 2008
(>40 Gbps SuperSINET3 interconnecting the centers)

  • Hokkaido University, Information Initiative Center: HITACHI SR11000, 5.6 Teraflops
  • Tohoku University, Information Synergy Center: NEC SX-7, NEC TX7/AzusA
  • University of Tokyo, Information Technology Center: NextGen x86, 150 Teraflops; HITACHI SR11000, 18 Teraflops; others (in institutes)
  • University of Tsukuba: 2006 PACS-CS, 14.5 TFlops; NextGen x86, 100-150 Teraflops
  • Tokyo Inst. Technology, Global Scientific Information and Computing Center: NEC/SUN TSUBAME, 85 Teraflops => 250 TFlops?
  • National Inst. of Informatics: NAREGI Testbed, 4 Teraflops
  • Nagoya University, Information Technology Center: FUJITSU PrimePower2500, 11 Teraflops
  • Kyoto University, Academic Center for Computing and Media Studies: NextGen x86, 100-150 Teraflops
  • Osaka University, CyberMedia Center: NEC SX-8 or SX-9; 2008 x86 cluster, 35 Teraflops
  • Kyushu University, Computing and Communications Center: 2007 x86, 50 TeraFlops? Fujitsu Primequest?; IBM Power5 p595, 5 Teraflops

x86 TSUBAME sibling domination. Still: a 10 Petaflop center by 2012???

SLIDE 49

TSUBAME Upgrades

SLIDE 50

TSUBAME Upgrade Plan (diagram, translated from Japanese):
  • ClearSpeed CSX600 SIMD accelerator, currently 35 TeraFlops; (1) divert 15 Teraflops of the cluster to advanced education/research (+ operations) and compensate the lost supercomputer performance with ClearSpeed (100 TF total)
  • Storage: 1 Petabyte (Sun "Thumper", 48 x 500 GB disks per unit), 0.1 Petabyte (NEC iStore), Lustre file system
  • (2) NEC SX-8 vector computer (for legacy codes, porting tests, etc.)
  • High-speed network, bisection BW up to 13 Tbps (400 Gbps to storage)
  • Sun/AMD HPC cluster (Opteron dual-core, 8-way): 10,480 cores / 655 nodes, 50 TeraFlops; OS currently Linux (Solaris, Windows under consideration); NAREGI grid middleware
  • Voltaire ISR9288 Infiniband, 10 Gbps x 288 ports
  • 10 Gbps+ external network
  • SE staffing increase: 2 system + 2 application engineers with improved skill levels (previously 3 total); (3) HD visualization / image display equipment, etc.

Towards a Multi-Petabyte Data Grid Infrastructure based on TSUBAME
  • Various public research DBs and mirrors---astro, bio, chemical
  • Various observational & simulation data
  • All historical archives of research publications, documents, home pages
  • TSUBAME ~100 TeraFlops, Petabytes of storage
  • Archival & data grid middleware
  • Petabytes of stable storage, data provenance, "archiving domain knowledge"
  • NESTRE system: all user storage (documents, etc.)

SLIDE 51

TSUBAME Network: (Restricted) Fat Tree, IB-RDMA & TCP-IP

  • X4600 x 120 nodes (240 ports) per switch => 600 + 55 nodes, 1310 ports, 13.5 Tbps
  • IB 4x 10 Gbps x 2 per node
  • Voltaire ISR9288 switches
  • IB 4x 10 Gbps x 24 between switches
  • Bisection BW = 2.88 Tbps x 2
  • X4500 x 42 nodes (42 ports, IB 4x 10 Gbps) => 420 Gbps
  • Single-mode fiber for cross-floor connections
  • External Ether

SLIDE 52

NESTRE (and the old cluster nodes it replaced)

Photos: previous life / now… NESTRE

SLIDE 53

TSUBAME Linpack and Acceleration

Heterogeneity both intra- and inter-node

SLIDE 54

GSIC's Past Supercomputers and TSUBAME: Top500 Performance History and Prediction

[Chart: Top500 ranking over time (1995 through ~2010) for GSIC machines, including a Cray (1995), NEC SX-5, and TSUBAME (2006), with and without upgrades]

SLIDE 55

ClearSpeed Advance Accelerator Board

Hardware
  • 25 W max power
  • CSX600 processor x 2 (96 GFLOPS peak)
  • IEEE 754 64-bit double-precision floating point
  • 133 MHz PCI-X host interface
  • Onboard memory: 1 GB (max 4 GB)
  • Internal memory bandwidth: 2 GBytes/s
  • Onboard memory bandwidth: 6.4 GBytes/s

Software
  • Standard numerical libraries
  • ClearSpeed Software Development Kit (SDK)

Applications and Libraries
  • Linear algebra: BLAS, LAPACK
  • Bio simulations: AMBER, GROMACS
  • Signal processing: FFT (1D, 2D, 3D), FIR, Wavelet
  • Various simulations: CFD, FEA, N-body
  • Image processing: filtering, image recognition, DCTs
  • Oil & gas: Kirchhoff time/wave migration

SLIDE 56

ClearSpeed Mode-of-Use

  • 1. User application acceleration
    – Matlab, Mathematica, Amber, Gaussian…
    – Transparent, offloaded from the Opterons
  • 2. Acceleration of standard libraries (see the sketch below)
    – BLAS/DGEMM, LAPACK, FFTW…
    – Transparent to users (Fortran/C bindings)
  • 3. User applications
    – Arbitrary user applications
    – Need MPI-like programming with a C dialect

Note: acceleration is "narrow band" => hard to scale
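Mode 2 means the application keeps calling the standard BLAS interface and the accelerated library is substituted underneath. A minimal sketch using the standard CBLAS DGEMM call (the matrix sizes are arbitrary; which library actually serves the call depends on what is linked):

    /* The application is oblivious to whether this DGEMM is served by the
     * host BLAS or by an accelerated library linked in its place. */
    #include <stdio.h>
    #include <stdlib.h>
    #include <cblas.h>

    int main(void) {
        const int M = 1024, N = 1024, K = 1024;
        double *A = malloc(sizeof(double) * M * K);
        double *B = malloc(sizeof(double) * K * N);
        double *C = calloc((size_t)M * N, sizeof(double));
        for (int i = 0; i < M * K; i++) A[i] = 1.0;
        for (int i = 0; i < K * N; i++) B[i] = 2.0;

        /* C = 1.0 * A * B + 0.0 * C */
        cblas_dgemm(CblasRowMajor, CblasNoTrans, CblasNoTrans,
                    M, N, K, 1.0, A, K, B, N, 0.0, C, N);

        printf("C[0] = %f (expect %f)\n", C[0], 2.0 * K);
        free(A); free(B); free(C);
        return 0;
    }

Whether the multiply runs on the Opterons or is offloaded to the board is then a link-time/runtime decision, which is what makes this mode transparent to users.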

SLIDE 57

ClearSpeed Matrix Library

[Diagram: the host passes input data to the board with a library call; the board computes and returns the output data]
[Chart: (MxB) x (BxM) multiplication speed (GFlops) vs. matrix size M, for B = 96, 384, 576, 768]

  • About 40 GFlops DGEMM w/ the old library
    – 70 GFlops with the new beta(!)
  • Performance heavily depends on matrix size
SLIDE 58

Issues in a (VERY) Heterogeneous HPL w/ Acceleration

  • How can we run HPL efficiently under the following conditions?
    – Need to use both the Opterons and ClearSpeed efficiently
      • About 70 GFlops by 16 Opteron cores
      • 30-40 GFlops by ClearSpeed (current)
    – Only 360 of the 655 TSUBAME nodes have ClearSpeed
    – Modification to the HPL code for heterogeneity
  • Our policy:
    – Introduce HPL processes (1) that compute with Opterons and (2) that compute with ClearSpeed
    – Make the workload of each HPL process (roughly) equal by oversubscription
SLIDE 59

Our Heterogeneous HPL Algorithm

Two types of HPL processes are introduced (see the sketch below):
  • Host processes use GOTO BLAS's DGEMM
  • SIMD processes throw DGEMM requests to the accelerator
    – An additional SIMD server process directly calls the CSXL DGEMM
    – mmap() is used for sharing matrix data between the SIMD process and the server
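A schematic of the two process types described above, with the accelerator path stubbed out (this is a sketch, not the actual TSUBAME HPL modification; the function and type names are ours):

    /* Every process calls the same dgemm_wrapper(); a "SIMD" process would
     * forward the request to the SIMD-server process (which calls the
     * accelerated CSXL DGEMM), while a host process calls the host BLAS. */
    #include <stdio.h>
    #include <cblas.h>

    typedef enum { HOST_PROCESS, SIMD_PROCESS } process_kind;

    static process_kind my_kind = HOST_PROCESS;  /* decided at startup, e.g. by rank */

    /* In the real setup the matrices would live in an mmap()ed shared region so
     * the SIMD server can read A/B and write C without copying. */
    static void dgemm_wrapper(int m, int n, int k,
                              const double *A, const double *B, double *C) {
        if (my_kind == SIMD_PROCESS) {
            /* stub: enqueue {m,n,k,A,B,C} to the SIMD server, wait for completion */
            fprintf(stderr, "would offload %dx%dx%d DGEMM to the accelerator\n", m, n, k);
        }
        /* host path (and fallback for the stub above) */
        cblas_dgemm(CblasRowMajor, CblasNoTrans, CblasNoTrans,
                    m, n, k, 1.0, A, k, B, n, 1.0, C, n);
    }

    int main(void) {
        double A[4] = {1, 2, 3, 4}, B[4] = {1, 0, 0, 1}, C[4] = {0, 0, 0, 0};
        dgemm_wrapper(2, 2, 2, A, B, C);
        printf("C = [%g %g; %g %g]\n", C[0], C[1], C[2], C[3]);
        return 0;
    }

On TSUBAME the next slide reports 3 host processes (x4 threads) plus 3 SIMD processes per accelerated node, which is how the per-process workloads are roughly equalized.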
SLIDE 60

Linpack Details

  • SunFire X4600 nodes in TSUBAME
    – Each has 16 Opteron cores, 32 GB memory
  • Three measurements:
    – Full CS: ClearSpeed boards on all nodes are used
    – Half CS: the number of ClearSpeed boards is half the number of nodes
      • Heterogeneous both intra- and inter-node
    – No CS: only Opteron CPUs are used
  • Number of processes per node:
    – With CS: 3 host processes (x4 threads) + 3 SIMD processes
    – W/o CS: 4 host processes (x4 threads)

SLIDE 61

Results (2)

Peak speeds (on 60 nodes):
  • Full CS: 5.203 TFlops (N=391680)
  • Half CS: 4.366 TFlops (N=345600)
  • No CS: 3.802 TFlops (N=391680)
Note: Half CS doesn't work (is very slow) with N=391680 because of the memory limitation

Block size NB:
  • 960 in Full CS / Half CS
  • 240 in No CS

[Chart: speed (GFlops) vs. matrix size N on 60 nodes, for Full CS / Half CS / No CS]

SLIDE 62

Experimental Results

  • 47.38 TF with 648 nodes and 360 accelerators (Sep.)
    – +24% improvement over No Acc (38.18 TF)
    – +25.5 GFlops per accelerator
    – Matrix size N=1148160 (it was 1334160 in No Acc)
    – 5.9 hours
  • NEW(!): with the new DGEMM, 48.88 TFlops / 62% efficiency

[Charts: speed (TFlops) and relative speed (No Acc = 1) vs. number of nodes (60, 350, 648), for Full Acc / Half Acc / No Acc; 47.38 TF vs. 38.18 TF at 648 nodes]

SLIDE 63

Onto TSUBAME 2.0

Petascale and beyond---but how?

SLIDE 64

TSUBAME Upgrades Towards Petaflops

[Roadmap chart, 1 TF to 10 PF over 2002-2012:]
  • Titech Campus Grid 1.3 TF
  • Earth Simulator 40 TF (2002)
  • BlueGene/L 360 TF (2005)
  • Titech Supercomputing Campus Grid (incl. TSUBAME) ~90 TF (2006)
  • KEK 59 TF BG/L + SR11000
  • TSUBAME 110 TF, storage 1.6 PB, 128 GB nodes (2007)
  • US Petascales (peak) (2007~8); U-Tokyo, Kyoto-U, Tsukuba 100-150 TF (2008); others
  • TSUBAME upgrade >300 TF (2008-2H), quad-core Opteron + acceleration => interim 200 TeraFlops @ 2008
  • 2010 TSUBAME 2.0: 1 PF sustained, >10 PB (2010-11) => sustained Petaflop @ 2010, sustain leadership in Japan
  • US HPCS (2010); Japanese "Keisoku" >10 PF (2011-12); US 10P (2011~12?)

SLIDE 65

In the Supercomputing Landscape, Petaflops-class is Already Here…in Early 2008

  • 2008Q1 TACC/Sun "Ranger": ~52,600 "Barcelona" Opteron CPU cores, ~500 TFlops, ~100 racks, ~300 m2 floorspace, 2.4 MW power, 1.4 km IB CX4 copper cabling, 2 Petabytes HDD
  • 2008 LLNL/IBM "BlueGene/P": ~300,000 PPC cores, ~1 PFlops, ~72 racks, ~400 m2 floorspace, ~3 MW power, copper cabling
  • >10 Petaflops, >million cores, >10s of Petabytes planned for 2011-2012 in the US, Japan, (EU), (other APAC)

Other Petaflops 2008/2009:
  • LANL/IBM "Roadrunner"
  • JICS/Cray(?) (NSF Track 2)
  • ORNL/Cray
  • ANL/IBM BG/P
  • EU machines (Julich…) …

SLIDE 66

Scaling to a PetaFlop in 2010 is Easy, Given the Existing TSUBAME

Year                              2003      2006     2008     2010     2012     2014     2015
Microns                           0.09      0.065    0.045    0.032    0.022    0.016    0.011
Scalar cores                      1         2        4        8        16       32       64
GFLOPS/socket                     6         24       48       96       192      384      768
Total kW for 1 PF (200W/socket)   3.3E+05   83333    41667    20833    10417    5208     2604
SIMD/Vector GFLOPS/board          -         96       192      384      768      1536     3072
Total kW for 1 PF (25W/board)     -         260.4    130.2    65.1     32.6     16.3     8.14

2009, conservatively assuming 0.065-0.045 microns, 4 cores, 48 GFlops/socket => 200 Teraflops, plus an 800-Teraflop accelerator board upgrade: a "commodity" Petaflop is easily achievable in 2009-2010.
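The accelerator row of the table is just "boards needed for 1 PFlops times 25 W per board"; a small sketch reproducing it (per-board peaks taken from the table):

    /* Boards needed for 1 PFlops and the resulting power at 25 W per board. */
    #include <stdio.h>

    int main(void) {
        const int    years[]        = {2006, 2008, 2010, 2012, 2014, 2015};
        const double gflops_board[] = {96, 192, 384, 768, 1536, 3072};
        const double board_watts    = 25.0;
        const double target_gflops  = 1.0e6;   /* 1 PFlops */

        for (int i = 0; i < 6; i++) {
            double boards = target_gflops / gflops_board[i];
            double kw     = boards * board_watts / 1000.0;
            printf("%d: %8.0f boards, %7.1f kW for 1 PF\n", years[i], boards, kw);
        }
        return 0;                              /* 2006 -> ~10417 boards, ~260.4 kW */
    }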

SLIDE 67

In Fact We Can Build One Now (!)

  • @Tokyo---one of the largest IDCs in the world (in Tokyo...)
  • Can fit a 10 PF machine here easily (>20 Rangers)
  • On top of a 55 KV / 6 GW substation
  • 150 m diameter (a small baseball stadium)
  • 140,000 m2 IDC floorspace
  • 70+70 MW power
  • The size of the entire Google(?) (~a million LP nodes)

SLIDE 68

Commodity Scaling to 2~10 PFs Circa 2011 (Cont'd)

  • Loosely coupled apps scale well
  • Impractical to assume memory-intensive, large-message apps (such as spectral methods) will scale to Petaflops
    – Strong technological scaling limits in memory size, bandwidth, etc.
    – Physical limits, e.g. power/cooling, $$$
    – Impracticality in resolution (because of the chaotic nature of the physics, etc.)
  • Why ensemble methods and coupled methods (which are scalable) are good
    – => Apps that worked "well on grids" (at small scale)

SLIDE 69

Nano-Science: Coupled Simulations on the Grid as the Sole Future for True Scalability

The only way to achieve true scalability! …between continuum (10^-6 m) and quanta (10^-9 m):
  • Material physics (infinite systems): fluid dynamics, statistical physics, condensed matter theory, …
  • Molecular science: quantum chemistry, molecular orbital methods, molecular dynamics, …
  • Multi-physics
    – E.g. fragmented MO could use 100,000 loosely coupled CPUs in pseudo parameter-sweep fashion
    – E.g. advanced MD requires mid-sized, tightly coupled SMP (#CPUs not the limit, but memory and BW)

Old HPC environment: decoupled resources, hard to use, special software, too general-purpose(!)
(Slide taken from my NAREGI Grid slide stack) => A tightly coupled "Grid" as the future Petascale machine

SLIDE 70

Reprisal: TSUBAME Job Statistics for ISV Apps (# CPU Timeshare)

[Chart: CPU timeshare from Apr. 2006 to Jan. 2007 (ISV apps only) for ABAQUS, AMBER, AVS_Express, Discovery Studio, EnSight, Gaussian, GaussView, GROMACS, MaterialsExplorer, Materials Studio, Mathematica, MATLAB, Molpro, MOPAC, MSC_NASTRAN, MSC_PATRAN, NWChem, PGI_CDK; Gaussian 55%, Amber 35%]

Multi-user and ensemble! (20,000-way Gaussian ensemble job recorded on TSUBAME) => Throughput(!)

SLIDE 71

Stanford Folding@Home

  • (Ensemble) GROMACS, Amber etc. on a volunteer grid
  • PS3: 1/2 (effective) Petaflops and growing (in a standard OS(!))
  • Accelerator (GPGPU): most Flops/CPU/unit
  • Combined, 71% of effective FLOPS from 14% of the CPUs
  • 7 Petaflops peak (SFP), 10% efficiency
    – Feasible NOW to build a useful 10 PF machine

Folding@Home client statistics, 2007-03-25 18:18:07
OS Type     TFLOPS   Active CPUs   GFLOPS/CPU
Windows     154      161,586       0.95
Mac/PPC     7        8,880         0.79
Mac/Intel   9        3,028         2.97
Linux       43       25,389        1.69
GPGPU       44       749           58.74
PS3         482      30,294        15.91
Total       739      229,926       3.21

SLIDE 72

Future Multi-Petascale Designs

  • Assuming an upper bound on machine cost
  • A homogeneous machine entails compromises in all applications
  • Heterogeneous grids of large resources would allow multiple design points to coexist
  • And this also applies within a single machine as well

[Diagram: a classic design point vs. new design points (more FLOPS vs. more storage/BW, App1 vs. App2) within a single machine, or aggregated as a tightly coupled "Grid"]

SLIDE 73

Biggest Problem is Power…

Machine                     CPU Cores   Watts       Peak GFLOPS   Peak MFLOPS/Watt   Watts/CPU Core   Ratio c.f. TSUBAME
TSUBAME (Opteron)           10,480      800,000     50,400        63.00              76.34            -
TSUBAME (w/ClearSpeed)      11,200      810,000     85,000        104.94             72.32            1.00
Earth Simulator             5,120       6,000,000   40,000        6.67               1171.88          0.06
ASCI Purple (LLNL)          12,240      6,000,000   77,824        12.97              490.20           0.12
AIST Supercluster           3,188       522,240     14,400        27.57              163.81           0.26
LLNL BG/L (rack)            2,048       25,000      5,734.4       229.38             12.21            2.19
Next Gen BG/P (rack)        4,096       30,000      16,384        546.13             7.32             5.20
TSUBAME 2.0 (2010 Q3/4)     160,000     810,000     2,048,000     2,528.40           5.06             24.09

TSUBAME 2.0: a x24 improvement in 4.5 years…? ~x1000 over 10 years
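The derived columns (MFLOPS/Watt, Watts/core, ratio vs. TSUBAME) are simple ratios of the raw columns; a small sketch reproducing two rows of the table:

    /* Recompute the efficiency columns from the raw CPU-core/Watt/GFLOPS data. */
    #include <stdio.h>

    struct machine { const char *name; double cores, watts, peak_gflops; };

    int main(void) {
        struct machine m[] = {
            {"TSUBAME (w/ClearSpeed)", 11200,  810000,   85000},
            {"TSUBAME 2.0 (2010Q3/4)", 160000, 810000, 2048000},
        };
        double baseline = m[0].peak_gflops * 1000.0 / m[0].watts;  /* TSUBAME MFLOPS/W */
        for (int i = 0; i < 2; i++) {
            double mflops_per_w = m[i].peak_gflops * 1000.0 / m[i].watts;
            printf("%-24s %8.2f MFLOPS/W  %7.2f W/core  ratio %.2f\n",
                   m[i].name, mflops_per_w, m[i].watts / m[i].cores,
                   mflops_per_w / baseline);
        }
        return 0;
    }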

SLIDE 74

The New JST-CREST "Ultra Low Power HPC" Project, 2007-2012

  • x1000 Flops/W improvement @ 10 years
  • Research elements (diagram): MRAM/PRAM/Flash etc.; ultra multi-core, slow & parallel (& ULP); ULP-HPC SIMD-Vector (GPGPU, etc.); VM job migration and power optimization; ULP-HPC networks; new massive & dense cooling technologies; zero-emission power sources; modeling & power optimization; application-level low-power algorithms

SLIDE 75

TSUBAME in Retrospect and Future

  • Increasing commoditization of the HPC space
    – CPUs (since Beowulf, ASCI Red, …)
    – High-BW memory, large-memory SMP
    – Very fast I/O (PCI-E, HT3, …)
    – High-BW interconnect (10GbE, IB => 100Gb)
    – Now SIMD-Vector (ClearSpeed, GPGPU, Cell…)
    – Next: extreme many-core, optical chip-chip interconnect, 3-D chip packaging, …
  • Technology => software stack & the right apps & meta-application schema
    – The same software stack on your laptop + Grid
    – DON'T focus on a single app or user efficiency
    – Meta-application schema, multi-user, infrastructure design
    – Learn from the Grid (!)
  • Proprietary architectures make no sense
    – Ecosystems and economics are THE KEY of future HPC(!)

SLIDE 76

Beyond Petascale: "Grid" Scalability is the Key

[Diagram: #users and capacity---today ~1,000 users / ~100 TF; beyond Petascale ~1,000,000 users / ~100 PF (>x1000); per-user baseline ~1 GF (x10^6 to x10^7)]

SLIDE 77

2016 A.D.: Deskside Petascale

  • 2006 A.D. Titech Supercomputing Grid, #1 in Asia: 100 TeraFlops, >10,000 CPUs, 1.5 MegaWatts, 300 m2
  • 2016 deskside workstation: >100 TeraFlops, 1.5 KiloWatts, 300 cm2

1000-times scaling down of a SC: but how? Simple scaling will not work
  • No more aggressive clock increases
  • Multi-core works, but gives less than x100

Need R&D as "Petascale Informatics" in CS and applications to achieve the x1000 breakthrough + What can a scientist or an engineer achieve with daily, personal use of petascale simulation?

SLIDE 78

Seasonal Corporate Usage

(Application form, p. 3)