Introduction to HPC, Leon Kos, UL - PRACE Autumn School 2013



SLIDE 1

PRACE Autumn School 2013 - Industry oriented HPC simulations, University of Ljubljana, Slovenia

Introduction to HPC, Leon Kos, UL

SLIDE 2

25 members of PRACE

  • Germany: GCS - GAUSS Centre for Supercomputing e.V.
  • Austria: JKU - Johannes Kepler University of Linz
  • Belgium: DGO6-SPW – Service Public de Wallonie
  • Bulgaria: NCSA - Executive agency
  • Cyprus: CaSToRC – The Cyprus Institute
  • Czech Republic: VŠB - Technical University of Ostrava
  • Denmark: DCSC - Danish Center for Scientific Computing
  • Finland: CSC - IT Center for Science Ltd.
  • France: GENCI - Grand Equipement National de Calcul Intensif
  • Greece: GRNET - Greek Research and Technology Network S.A.
  • Hungary: NIIFI - National Information Infrastructure Development Institute
  • Ireland: ICHEC - Irish Centre for High-End Computing
  • Israel: IUCC - Inter-University Computation Center
  • Italy: CINECA - Consorzio Interuniversitario
  • Norway: SIGMA – UNINETT Sigma AS
  • The Netherlands: SURFSARA – SARA Computing and Networking Services
  • Poland: PSNC – Instytut Chemii Bioorganicznej PAN
  • Portugal: FCTUC – Faculdade de Ciências e Tecnologia da Universidade de Coimbra
  • Serbia: IPB - Institute of Physics Belgrade
  • Slovenia: ULFME - University of Ljubljana, Faculty of Mechanical Engineering
  • Spain: BSC – Barcelona Supercomputing Center – Centro Nacional de Supercomputación
  • Sweden: SNIC – Vetenskapsrådet – Swedish Research Council
  • Switzerland: ETH – Eidgenössische Technische Hochschule Zürich
  • Turkey: UYBHM – Ulusal Yuksek Basarimli Hesaplama Merkezi
  • UK: EPSRC – The Engineering and Physical Sciences Research Council

SLIDE 3

Why supercomputing?

  • Weather, Climatology, Earth Science
    – degree of warming, scenarios for our future climate
    – understand and predict ocean properties and variations
    – weather and flood events
  • Astrophysics, Elementary particle physics, Plasma physics
    – systems and structures which span a large range of different length and time scales
    – quantum field theories like QCD, ITER
  • Material Science, Chemistry, Nanoscience
    – understanding complex materials, complex chemistry, nanoscience
    – the determination of electronic and transport properties
  • Life Science
    – systems biology, chromatin dynamics, large-scale protein dynamics, protein association and aggregation, supramolecular systems, medicine
  • Engineering
    – complex helicopter simulation, biomedical flows, gas turbines and internal combustion engines, forest fires, green aircraft
    – virtual power plant

SLIDE 4

Supercomputing drives science with simulations

  • Environment: Weather/Climatology, Pollution/Ozone Hole
  • Ageing Society: Medicine, Biology
  • Energy: Plasma Physics, Fuel Cells
  • Materials/Inf. Tech: Spintronics, Nanoscience

SLIDE 5

Computing shares in the TOP500 list

SLIDE 6

Large HPC systems around the world


SLIDE 7

FZJ

2010 1st PRACE System - JUGENE

  • BG/P by the Gauss Centre for Supercomputing at Juelich
    – 294,912 CPU cores, 144 TB memory
    – 1 PFlop/s peak performance, 825.5 TFlop/s Linpack
    – 600 I/O nodes (10GigE), >60 GB/s I/O
    – 2.2 MW power consumption
    – 35% for PRACE

SLIDE 8

GENCI

2011 2nd PRACE system – CURIE

  • Bull, 1.6 PF, 92,160 cores, 4 GB/core
  • Phase 1, December 2010, 105 TF
    – 360 four-socket Intel Nehalem-EX nodes with 8-core 2.26 GHz CPUs (11,520 cores), QDR InfiniBand fat tree
    – 800 TB, >30 GB/s local Lustre file system
  • Phase 1.5, Q2 2011
    – Conversion to 90 16-socket, 128-core, 512 GB nodes
  • Phase 2, Q4 2011, 1.5 PF
    – Intel Sandy Bridge
    – 10 PB, 230 GB/s file system

SLIDE 9

HLRS


2011 3rd PRACE System – HERMIT

  • Cray XE6 (Multi-year contract for $60+M)

– Phase 0 (2010): 10 TF, 84 dual-socket nodes with 8-core AMD Magny-Cours CPUs (1344 cores in total), 2 GHz, 2 GB/core, Gemini interconnect
– Phase 1 Step 1 (Q3 2011): AMD Interlagos, 16 cores, 1 PF, 2-4 GB/core, 2.7 PB file system, 150 GB/s I/O
– Phase 2 (2013): Cascade, first order for Cray, 4-5 PF

SLIDE 10

LRZ

2011/12 4th PRACE system

  • IBM iDataPlex (€83M including operational costs)
    – >14,000 Intel Sandy Bridge CPUs, 3 PF (~110,000 cores), 384 TB of memory
    – 10 PB GPFS file system with 200 GB/s I/O, 2 PB 10 GB/s NAS
    – LRZ <13 MW
    – Innovative hot-water cooling (60°C inlet, 65°C outlet) leading to 40 percent less energy consumption compared to an air-cooled machine

SLIDE 11

BSC and CINECA


  • 2012/2013 5th and 6th PRACE Systems

CINECA: 2.5 PF

BSC: <20 MW computing facility (10 MW, 2013)

SLIDE 12

Supercomputing at UL FME - HPCFS

  • Some examples of previous projects
SLIDE 13

What is HPCFS used for?

  • Complex engineering research problems demand parallel processing
  • Education of a new generation of students in the second cycle of the Bologna process
  • Cooperation with other GRID and HPC centres
SLIDE 14

Long-term goals

– Extension of computing capabilities
  • In-house development of custom codes
  • Installation of commercial and open-source codes (ANSYS Multiphysics, OpenFOAM, ...)
  • Cooperation in EU projects
  • Having HPC and the knowledge about it is an advantage
  • Introducing (young) researchers

– Center for modelling, simulations and optimization, in cooperation at several levels within the university and between universities

  • Promotion of FS/UL, science, research and increased awareness

SLIDE 15

Software at HPCFS

  • Linux (CentOS 6.4)
  • Remote desktop (NX)
  • Development environment and LSF batch scheduler
  • Compilers: C++, Fortran (Python, R, ...)
  • Parallel programming with MPI, OpenMP
  • Open-source and commercial packages for simulations (ANSYS)
  • Servers for support of research and development
SLIDE 16

Hardware of the cluster PRELOG at ULFME

  • 64 computing nodes
    – 768 cores (Intel Xeon X5670)
    – 1536 threads
  • 3 TB RAM
  • Login node
  • InfiniBand network, QDR x4 "fat tree"
  • File servers
    – NFS 25 TB
    – Lustre 12 TB + 22 TB
  • Virtualization servers
  • 1 Gbit connection to ARNES
SLIDE 17

Introduction to parallel computing

  • Usually a program is written for serial execution on one processor
  • We divide the problem into a series of commands that can be executed in parallel (see the sketch below)
  • Only one command at a time can be executed on one CPU
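
A minimal sketch of that division of work using OpenMP; the loop body, array size and reduction are illustrative assumptions, not taken from the slides:

```c
/* OpenMP sketch: a serial loop divided among the available threads.
 * The array size and the computed quantity are illustrative only. */
#include <omp.h>
#include <stdio.h>

int main(void)
{
    const int n = 1000000;
    static double a[1000000];
    double sum = 0.0;

    /* Serially, all iterations would run on one core; the directive
     * below distributes them across the threads of the node. */
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; i++) {
        a[i] = (double)i;
        sum += a[i];
    }

    printf("sum = %g (threads available: %d)\n", sum, omp_get_max_threads());
    return 0;
}
```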


SLIDE 18

Parallel programming models

  • Threading
  • OpenMP – automatic parallelization
  • Distributed memory model = Message Passing Interface (MPI) – manual parallelization needed
  • Hybrid model OpenMP/MPI (see the sketch after this list)
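
A minimal sketch of the hybrid model, assuming an MPI library and an OpenMP-capable compiler are available on the cluster (file name and launch details are assumptions): each MPI process handles message passing, and spawns OpenMP threads inside itself.

```c
/* Hybrid MPI + OpenMP sketch: distributed memory between ranks,
 * shared-memory threads inside each rank. Illustrative only. */
#include <mpi.h>
#include <omp.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    int rank, size;

    MPI_Init(&argc, &argv);                /* message passing between processes */
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    #pragma omp parallel                   /* threading inside each process */
    printf("rank %d of %d, thread %d of %d\n",
           rank, size, omp_get_thread_num(), omp_get_num_threads());

    MPI_Finalize();
    return 0;
}
```

It would typically be built with something like `mpicc -fopenmp hello_hybrid.c -o hello_hybrid` and submitted through the batch scheduler; the exact compiler wrapper and submission options depend on the installed environment.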


SLIDE 19

Embarrassingly simple parallel processing

  • Parallel processing of the same subproblems on multiple processors
  • No communication is needed between processes (see the sketch below)
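
A sketch of this pattern with MPI, where each rank works through its own subset of independent cases and never talks to the others; `simulate_case` is a hypothetical stand-in for the real per-case computation:

```c
/* Embarrassingly parallel sketch: ranks share out independent cases
 * statically and exchange no messages while working. */
#include <mpi.h>
#include <stdio.h>

static double simulate_case(int case_id)   /* hypothetical placeholder */
{
    return 2.0 * case_id;
}

int main(int argc, char **argv)
{
    const int total_cases = 1000;
    int rank, size;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Rank r handles cases r, r+size, r+2*size, ... */
    for (int i = rank; i < total_cases; i += size)
        printf("rank %d: case %d -> %g\n", rank, i, simulate_case(i));

    MPI_Finalize();
    return 0;
}
```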


SLIDE 20

Logical view of a computing node

  • Need to know the computer architecture
  • Interconnect bus for sharing memory between processors (NUMA interconnect)


SLIDE 21

Nodes interconnect

  • Distributed computing
  • Many nodes exchange messages over a high-speed, low-latency interconnect such as InfiniBand


SLIDE 22

Development of parallel codes

  • Good understanding of the problem being solved in parallel
  • How much of the problem can be run in parallel
  • Bottleneck analysis and profiling give a good picture of the scalability of the problem
  • We optimize and parallelize the parts that consume most of the computing time
  • The problem needs to be dissected into parts functionally and logically


SLIDE 23

Interprocess communications

  • Having little and infrequent communication between processes is best
  • Determine the largest block of code that can run in parallel and still provide scalability
  • Basic properties (a minimal measurement sketch follows this list)
    – response time (latency)
    – transfer speed (bandwidth)
    – interconnect capabilities
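
A minimal MPI ping-pong sketch for estimating the first two properties between two ranks; the message size and repetition count are illustrative choices:

```c
/* Ping-pong sketch: rank 0 and rank 1 bounce a buffer back and forth;
 * the round-trip time gives latency, the moved volume per time gives bandwidth. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    const int reps = 1000;
    const int bytes = 1 << 20;             /* 1 MiB message, illustrative */
    char *buf = malloc(bytes);
    int rank, size;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    if (size < 2) {
        if (rank == 0) fprintf(stderr, "run with at least 2 ranks\n");
        MPI_Finalize();
        return 1;
    }

    double t0 = MPI_Wtime();
    for (int i = 0; i < reps; i++) {
        if (rank == 0) {
            MPI_Send(buf, bytes, MPI_CHAR, 1, 0, MPI_COMM_WORLD);
            MPI_Recv(buf, bytes, MPI_CHAR, 1, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        } else if (rank == 1) {
            MPI_Recv(buf, bytes, MPI_CHAR, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            MPI_Send(buf, bytes, MPI_CHAR, 0, 0, MPI_COMM_WORLD);
        }
    }
    double dt = MPI_Wtime() - t0;

    if (rank == 0)                         /* one rep moves 2 * bytes in total */
        printf("round trip: %g us, bandwidth: %g MB/s\n",
               1e6 * dt / reps, 2.0 * bytes * reps / dt / 1e6);

    free(buf);
    MPI_Finalize();
    return 0;
}
```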


SLIDE 24

Parallel portion of the code determines code scalability

  • Amdahl's law: with parallel fraction p and N processors, Speedup(N) = 1/((1-p) + p/N); the maximum speedup for large N is 1/(1-p). For example, p = 0.95 caps the speedup at 20 (a small sketch follows).
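
A small sketch that evaluates the formula for an assumed parallel fraction p = 0.95 over a range of processor counts:

```c
/* Amdahl's law: speedup(N) = 1 / ((1 - p) + p / N) for parallel fraction p. */
#include <stdio.h>

static double amdahl(double p, int n)
{
    return 1.0 / ((1.0 - p) + p / n);
}

int main(void)
{
    const double p = 0.95;                                   /* assumed parallel fraction */
    for (int n = 1; n <= 1024; n *= 4)
        printf("N = %4d  speedup = %6.2f\n", n, amdahl(p, n));
    printf("limit for large N = %.1f\n", 1.0 / (1.0 - p));   /* = 20 for p = 0.95 */
    return 0;
}
```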


SLIDE 25

Questions and practicals on the HPCFS cluster

  • Demonstration of the work on the cluster by repeating:
  • Access with the NX client
  • Learning basic Linux commands
  • LSF scheduler commands
  • Modules
  • Development with OpenMP and OpenMPI parallel paradigms
  • Exercises and extensions of the basic ideas
  • Instructions available at http://hpc.fs.uni-lj.si/
