The Numerical Reproducibility Fair Trade: Facing the - PowerPoint PPT Presentation

The ¡Numerical ¡Reproducibility ¡Fair ¡Trade: ¡ ¡ Facing ¡the ¡Concurrency ¡Challenges ¡ ¡ at ¡the ¡Extreme ¡Scale ¡ Michela ¡Taufer ¡ University ¡of ¡Delaware ¡ ¡ Michela ¡Becchi ¡ University ¡of ¡Missouri ¡ ¡

From ¡MulD-‑core ¡to ¡Many ¡Cores ¡ Intel’s Xeon Phi – 50 x86 core NVIDIA K20 - 2,496 CUDA cores From: ¡hGp://www.altera.com/technology/system-‑design/ From ¡hGp://wccIech.com/nvidia-‑tesla-‑ arDcles/2012/mulDcore-‑many-‑core.html ¡ k20-‑gk110-‑specificaDons-‑unveiled/ ¡ 1 ¡

SoIware ¡ExecuDon ¡on ¡Many ¡Cores ¡ 2 ¡ From http://www.pgroup.com/lit/articles/insider/v2n4a1.htm

Plea ¡for ¡Joint ¡Effort ¡ • Homogenous ¡architectures ¡with ¡large ¡number ¡of ¡cores ¡ § Intel ¡Xeon ¡Phi ¡with ¡50 ¡x86 ¡cores ¡ § NVIDIA ¡K20/K40 ¡with ¡2,496 ¡CUDA ¡cores ¡ • Dynamic ¡mapping ¡of ¡tasks ¡to ¡cores ¡make ¡the ¡predicDon ¡of ¡ which ¡thread ¡is ¡assigned ¡to ¡which ¡core ¡not ¡feasible ¡ • While ¡computer ¡architects ¡focus ¡on ¡power, ¡bandwidth, ¡ energy ¡impacts, ¡…. ¡ The ¡computer ¡science ¡and ¡mathemaDcal ¡communiDes ¡must ¡join ¡ effort ¡to ¡study ¡and ¡miDgate ¡the ¡impact ¡of ¡rounding ¡errors ¡on ¡ result ¡reproducibility ¡when ¡concurrent ¡execuDons ¡burst ¡and ¡ workflow ¡determinism ¡vanishes ¡in ¡massively ¡parallel ¡systems ¡ ¡ 3 ¡

A ¡Case ¡Study: ¡Molecular ¡Dynamics ¡SimulaDons ¡ • MD ¡simulaDons ¡are ¡iteraDve ¡execuDons ¡of ¡M D ¡steps ¡ § Given ¡iniDal ¡atomic ¡posiDons ¡and ¡velociDes, ¡obtain ¡those ¡at ¡ later ¡Dmes ¡by ¡ integra(ng ¡ordinary ¡differen(al ¡equa(ons ¡ • IteraDve ¡steps ¡compute ¡forces ¡on ¡each ¡parDcle ¡due ¡to: ¡ § Bonded ¡interacDons ¡ ¡ § Non-‑bonded ¡interacDons ¡ ¡ • Different ¡flavors ¡of ¡MD ¡simulaDons: ¡ § Constant ¡energy ¡(NVE) ¡dynamics ¡is ¡the ¡original ¡method ¡of ¡ molecular ¡dynamics ¡and ¡is ¡performed ¡in ¡a ¡closed ¡environment ¡ with ¡a ¡constant ¡number ¡of ¡atoms, ¡volume, ¡and ¡energy ¡ § Constant ¡temperature ¡(NVT) ¡dynamics ¡ ¡use ¡velocity ¡ reassignment ¡to ¡changes ¡system ¡energy ¡states ¡at ¡specified ¡ intervals ¡and ¡is ¡performed ¡ ¡with ¡a ¡constant ¡number ¡of ¡atoms, ¡ volume, ¡and ¡temperature ¡ 4 ¡

MD ¡on ¡Many ¡Core ¡Plaforms ¡ MD ¡simulaDon ¡step: ¡ • Each ¡GPU-‑thread ¡computes ¡forces ¡ on ¡single ¡atoms ¡ § E.g., ¡bond, ¡angle, ¡dihedrals ¡ and, ¡nonbond ¡forces ¡ • Forces ¡are ¡added ¡to ¡compute ¡ acceleraDon ¡ • AcceleraDon ¡is ¡used ¡to ¡update ¡ Force ¡-‑> ¡AcceleraDon ¡-‑> ¡Velocity ¡ velociDes ¡ ¡ • VelociDes ¡are ¡used ¡to ¡update ¡the ¡ ¡ ¡-‑> ¡ ¡PosiDon ¡ posiDons ¡ 5 ¡

Moving ¡MD ¡onto ¡Accelerators ¡ • Enhancing ¡performance ¡of ¡MD ¡simulaDons ¡allows ¡simulaDons ¡ of ¡larger ¡Dme ¡scales ¡and ¡length ¡scales ¡ • GPU ¡compuDng ¡enables ¡large-‑scale ¡MD ¡simulaDon ¡ § SimulaDons ¡exhibit ¡speed-‑up ¡factors ¡of ¡X10-‑X30 ¡ ¡ MD ¡simulaDon ¡of ¡NaI ¡soluDon ¡system ¡ • Constant ¡energy ¡MD ¡simula(on ¡ ¡ containing ¡988 ¡waters, ¡18 ¡Na+, ¡and ¡ 18 ¡I−: ¡GPU ¡is ¡X15 ¡faster ¡than ¡CPU ¡ -‑-‑-‑-‑-‑ ¡Single ¡precision ¡ 6 ¡

Moving ¡MD ¡onto ¡Accelerators ¡ • Enhancing ¡performance ¡of ¡MD ¡simulaDons ¡allows ¡simulaDons ¡ of ¡larger ¡Dme ¡scales ¡and ¡length ¡scales ¡ • GPU ¡compuDng ¡enables ¡large-‑scale ¡MD ¡simulaDon ¡ § SimulaDons ¡exhibit ¡speed-‑up ¡factors ¡of ¡X10-‑X30 ¡ ¡ MD ¡simulaDon ¡of ¡NaI ¡soluDon ¡system ¡ • Constant ¡energy ¡MD ¡simula(on ¡ ¡ containing ¡988 ¡waters, ¡18 ¡Na+, ¡and ¡ 18 ¡I−: ¡GPU ¡is ¡X15 ¡faster ¡than ¡CPU ¡ -‑-‑-‑-‑-‑ ¡Single ¡precision ¡ 7 ¡

Moving ¡MD ¡onto ¡Accelerators ¡ • Enhancing ¡performance ¡of ¡MD ¡simulaDons ¡allows ¡simulaDons ¡ of ¡larger ¡Dme ¡scales ¡and ¡length ¡scales ¡ • GPU ¡compuDng ¡enables ¡large-‑scale ¡MD ¡simulaDon ¡ § SimulaDons ¡exhibit ¡speed-‑up ¡factors ¡of ¡X10-‑X30 ¡ ¡ MD ¡simulaDon ¡of ¡NaI ¡soluDon ¡system ¡ • GPU single precision GPU single precision containing ¡988 ¡waters, ¡18 ¡Na+, ¡and ¡ GPU double precision 18 ¡I−: ¡GPU ¡is ¡X15 ¡faster ¡than ¡CPU ¡ -‑-‑-‑-‑-‑ ¡Single ¡precision ¡ 8 ¡

Moving ¡MD ¡onto ¡Accelerators ¡ • Enhancing ¡performance ¡of ¡MD ¡simulaDons ¡allows ¡simulaDons ¡ of ¡larger ¡Dme ¡scales ¡and ¡length ¡scales ¡ • GPU ¡compuDng ¡enables ¡large-‑scale ¡MD ¡simulaDon ¡ § SimulaDons ¡exhibit ¡speed-‑up ¡factors ¡of ¡X10-‑X30 ¡ ¡ MD ¡simulaDon ¡of ¡NaI ¡soluDon ¡system ¡ • GPU double precision containing ¡988 ¡waters, ¡18 ¡Na+, ¡and ¡ 18 ¡I−: ¡GPU ¡is ¡X15 ¡faster ¡than ¡CPU ¡ 9 ¡

Code ¡Accuracy: ¡ ¡ Energy ¡FluctuaDons ¡ • A ¡plot ¡of ¡the ¡ energy ¡ fluctua(ons ¡versus ¡(me ¡step ¡ size ¡should ¡follow ¡an ¡ approximately ¡logarithmic ¡ trend ¡1 ¡ • Energy ¡fluctuaDons ¡are ¡ proporDonal ¡to ¡Dme ¡step ¡size ¡ for ¡large ¡Dme ¡step ¡size ¡ § Larger ¡than ¡0.5 ¡fs ¡ • A ¡different ¡behavior ¡for ¡step ¡ size ¡less ¡than ¡0.5 ¡fs ¡is ¡consistent ¡ with ¡results ¡previously ¡ presented ¡and ¡discussed ¡in ¡ other ¡work ¡2 ¡ 1 Allen and Tildesley, Oxford: Clarendon Press, (1987) 2 Bauer et al., J. Comput. Chem. 32(3): 375 – 385, 2011

Composite ¡Precision ¡ArithmeDc ¡ • Numerical ¡reproducibility ¡CANNOT ¡be ¡guaranteed ¡in ¡large-‑ scale ¡simulaDons ¡on ¡ANY ¡massively ¡parallel ¡plaform ¡ 1, ¡2 ¡ • High ¡precision ¡calculaDons ¡can ¡be ¡achieved ¡using ¡arbitrary ¡ precision ¡libraries ¡ 1 ¡ § One ¡criDcal ¡aspect ¡of ¡many ¡libraries ¡is ¡their ¡complexity ¡ • Composite ¡precision ¡arithmeDc ¡provides ¡a ¡simpler ¡soluDon ¡ § Composite ¡precision ¡represents ¡values ¡(results) ¡as ¡the ¡sum ¡of ¡floaDng-‑ point ¡numbers ¡of ¡arbitrarily ¡varying ¡orders ¡of ¡magnitude ¡ 3, ¡4 ¡ • Our ¡goal: ¡use ¡composite ¡precision ¡to ¡improve ¡numerical ¡ reproducibility ¡of ¡simulaDons ¡on ¡many ¡core ¡systems ¡ ¡ 1 Bailey (2005) ¡ 2 He and Ding (2000) 11 ¡ 3 Thall (2006) 4 Taufer et al. (2010)

Composite ¡Precision: ¡Data ¡Structure ¡ • In ¡2010, ¡double ¡operaDons ¡>> ¡float ¡operaDons ¡on ¡GPUs ¡ • Our ¡iniDal ¡representaDon ¡of ¡composite ¡precision ¡used ¡two ¡ float ¡numbers ¡(float2) ¡to ¡represent ¡a ¡floaDng ¡point ¡value ¡ struct float2{ float val; // Value float err; // Error ¡ } x 2 ; float2 x2 = x 2 .val + x 2 .err • Each ¡arithmeDc ¡operaDon ¡takes ¡float2s ¡as ¡parameters ¡and ¡ returns ¡float2s ¡ § Error ¡carried ¡through ¡each ¡operaDon ¡ ¡ § OperaDons ¡rely ¡on ¡self-‑compensaDon ¡of ¡rounding ¡errors ¡ 12 ¡

Composite ¡Precision: ¡AddiDon ¡ Implementa(on ¡ Pseudo-‑code ¡ float2 x 2, y 2, z 2 � float2 x 2, y 2, z 2 � float t � � Z 2 .val = x 2 .val + y 2 .val � z 2 = x 2 + y 2 � t = z 2 .val - x 2 .val � Z 2 .err = x 2 .val - (z 2 .val – t) + � (y 2 .val – t) + x 2 .err + y 2 .err � • MathemaDcally ¡z 2 .err ¡should ¡be ¡0 ¡ § But ¡errors ¡introduced ¡by ¡floaDng-‑point ¡operaDons ¡usually ¡result ¡in ¡ z 2 .err ¡being ¡non-‑zero ¡ • SubtracDon ¡is ¡the ¡same ¡as ¡addiDon, ¡but ¡y 2 .val ¡= ¡–y 2 .val ¡ ¡and ¡ y 2 .err ¡ ¡ ¡= ¡ ¡-‑y 2 .err ¡ ¡ 13 ¡

The Numerical Reproducibility Fair Trade: Facing the - PowerPoint PPT Presentation

The Numerical Reproducibility Fair Trade: Facing the Concurrency Challenges at the Extreme Scale Michela Taufer University of Delaware Michela Becchi

The Numerical Reproducibility Fair Trade: Facing the Concurrency Challenges at the Extreme Scale

Numerical reproducibility of high-performance computations using floating-point or interval

Computational Reproducibility in Production Physics Applications Numerical Reproducibility at

Fair Testing - O Wings We are learning to carry out a fair test. What is a fair test? Fair

Fun & Fact Filled Events! By Atiya Raja and Kelsey Bumsted Event Ideas - Fair Trade Wine

WFTO FAIR TRADE SUMMIT NEW IDEAS WFTO FAIR TRADE SUMMIT 17/09/2019 2 A UN UNIQUE QUE

THE COLLEGE FAIR What is a college fair? When should I attend a fair? Why should I go

SC SCIENCE FAIR IENCE FAIR Calallen Independent School District SCI SCIENCE ENCE FAIR FAIR

Who grows the food that we eat every day? FAIR TRADEBANANAS VS NON FAIR TRADE BANANAS NON F

Computational Reproducibility Daniel S. Katz Jennifer Freeman Smith Computational

Rigor, Reproducibility, and Transparency David T. Redden, PhD Co-Director, CCTS BERD Chair,

Worksheets Percy Liang UCI Reproducibility Symposium September 22, 2020 The current research

Reproducibility & Generalizability @ Twitter Strengthening Reproducibility in Network Science

Everware - lowering reproducibility barriers Andrey Ustyuzhanin Yandex School of Data Analysis

Facing Up to Faults Facing Up to Faults Facing Up to Faults (v.2.0.1) (v.2.0.1) (v.2.0.1)

PubPol 201 Module 3: International Trade Policy Class 1 Introduction to Trade and Trade Policy

BPQ DTNRG @ IETF-81 Bundle Protocol Query Extension Block draft-farrell-dtnrg-bpq-00 Stephen

Concurrency Mo+va+on Opera+ng systems (and applica+on programs)

Enterprise Mashups why do I care? Ross Mason, MuleSoft About Me Agenda What? Why? How?

To ESB or not to ESB Ross Mason MuleSoft About Me Agenda When to When to When to When to

On the intersection of Information Centric Networking and Delay Tolerant Networking (Lessons

Mathematical Logics Description Logic and Databases Fausto Giunchiglia and Mattia Fumagallli

How to write research papers? Ping HU, Kuniaki Saito A papers impact on your career Our image

Social network analytics Bart Baesens Professor Data Science at KU Leuven DataCamp Fraud

The Numerical Reproducibility Fair Trade: Facing the - PowerPoint PPT Presentation

The Numerical Reproducibility Fair Trade: Facing the Concurrency Challenges at the Extreme Scale Michela Taufer University of Delaware Michela Becchi

The Numerical Reproducibility Fair Trade: Facing the Concurrency Challenges at the Extreme Scale

Numerical reproducibility of high-performance computations using floating-point or interval

Computational Reproducibility in Production Physics Applications Numerical Reproducibility at

Fair Testing - O Wings We are learning to carry out a fair test. What is a fair test? Fair

Fun &amp; Fact Filled Events! By Atiya Raja and Kelsey Bumsted Event Ideas - Fair Trade Wine

WFTO FAIR TRADE SUMMIT NEW IDEAS WFTO FAIR TRADE SUMMIT 17/09/2019 2 A UN UNIQUE QUE

THE COLLEGE FAIR What is a college fair? When should I attend a fair? Why should I go

SC SCIENCE FAIR IENCE FAIR Calallen Independent School District SCI SCIENCE ENCE FAIR FAIR

Who grows the food that we eat every day? FAIR TRADEBANANAS VS NON FAIR TRADE BANANAS NON F

Computational Reproducibility Daniel S. Katz Jennifer Freeman Smith Computational

Rigor, Reproducibility, and Transparency David T. Redden, PhD Co-Director, CCTS BERD Chair,

Worksheets Percy Liang UCI Reproducibility Symposium September 22, 2020 The current research

Reproducibility &amp; Generalizability @ Twitter Strengthening Reproducibility in Network Science

Everware - lowering reproducibility barriers Andrey Ustyuzhanin Yandex School of Data Analysis

Facing Up to Faults Facing Up to Faults Facing Up to Faults (v.2.0.1) (v.2.0.1) (v.2.0.1)

PubPol 201 Module 3: International Trade Policy Class 1 Introduction to Trade and Trade Policy

BPQ DTNRG @ IETF-81 Bundle Protocol Query Extension Block draft-farrell-dtnrg-bpq-00 Stephen

Concurrency Mo+va+on Opera+ng systems (and applica+on programs)

Enterprise Mashups why do I care? Ross Mason, MuleSoft About Me Agenda What? Why? How?

To ESB or not to ESB Ross Mason MuleSoft About Me Agenda When to When to When to When to

On the intersection of Information Centric Networking and Delay Tolerant Networking (Lessons

Mathematical Logics Description Logic and Databases Fausto Giunchiglia and Mattia Fumagallli

How to write research papers? Ping HU, Kuniaki Saito A papers impact on your career Our image

Social network analytics Bart Baesens Professor Data Science at KU Leuven DataCamp Fraud

Fun & Fact Filled Events! By Atiya Raja and Kelsey Bumsted Event Ideas - Fair Trade Wine

Reproducibility & Generalizability @ Twitter Strengthening Reproducibility in Network Science