From math to peta-app: Challenges to practical computation with - PowerPoint PPT Presentation

From math to peta-app: Challenges to practical computation with tensor-based algorithms Robert J. Harrison harrisonrj@ornl.gov robert.harrison@utk.edu

Outline • Why tensors? – Many-body physics – Low-rank approximation for operators & functions • Math issues – Operators: accurate representations without an underlying integral form – Functions: efficient and accurate computation with local/global low-rank approximations • C/S issues – Generating efficient code for modern computers – High-level composition

The Electronic Schr ö dinger Equation • A 2nd-order, linear, partial differential equation in 3N dimensions (N electrons)  H  r = E  r  H   r ,t = i d  r ,t   dt Z  H =− 1 1 2 ∑ 2 − ∑  ∑ ∣ r  − r i ∣ ∑  ∇ i ∣ r i − r j ∣ i  j i i

NWChem Overview  Simulation capability for molecular science  Broad range of molecules, including biomolecules, nanoparticles and heavy elements  Electronic structure of molecules (non-relativistic, relativistic, ECPs, first and second derivatives)  Solid state capability (DFT plane-wave, CPMD)  Molecular dynamics, molecular mechanics  Emphasis on modularity and portability.  Freely distributed  Performance characteristics – designed for MPP  Portable – runs on a wide range of computers

Synthesis of High Performance Algorithms for Electronic Structure Calculations http://www.cis.ohio-state.edu/~gb/TCE • Collaboration between DOE/SciDAC, NSF/ITR and ORNL/LDRD • Objective: develop a high level programming tool that translates many-body quantum theory into efficient massively parallel codes. This is anticipated to revolutionize the rate of progress in this field by eliminating man-years of programming effort. • NSF Project: • Sadayappan (PI), Baumgartner, Cociorva, Pitzer (OSU) Bernholdt, Harrison (unfunded) (ORNL) Ramanujam (LSU) Nooijen (Waterloo) • DOE SciDAC: Harrison (PI), Hirata (PNNL) • DOE ORNL/LDRD: Bernholdt (PI, 2002-3) • Other SciDAC projects adopting this tool: Piecuch, Gordon • Also being applied to nuclear physics (Bernholdt and Dean)

CCSD Doubles Equation hbar[a,b,i,j] == sum[f[b,c]*t[i,j,a,c],{c}] -sum[f[k,c]*t[k,b]*t[i,j,a,c],{k,c}] +sum[f[a,c]*t[i,j,c,b],{c}] -sum[f[k,c]*t[k,a]*t[i,j,c,b],{k,c}] -sum[f[k,j]*t[i,k,a,b],{k}] -sum[f[k,c]*t[j,c]*t[i,k,a,b],{k,c}] -sum[f[k,i]*t[j,k,b,a],{k}] -sum[f[k,c]*t[i,c]*t[j,k,b,a],{k,c}] +sum[t[i,c]*t[j,d]*v[a,b,c,d],{c,d}] +sum[t[i,j,c,d]*v[a,b,c,d],{c,d}] +sum[t[j,c]*v[a,b,i,c],{c}] -sum[t[k,b]*v[a,k,i,j],{k}] +sum[t[i,c]*v[b,a,j,c],{c}] -sum[t[k,a]*v[b,k,j,i],{k}] -sum[t[k,d]*t[i,j,c,b]*v[k,a,c,d],{k,c,d}] -sum[t[i,c]*t[j,k,b,d]*v[k,a,c,d], {k,c,d}] -sum[t[j,c]*t[k,b]*v[k,a,c,i],{k,c}] +2*sum[t[j,k,b,c]*v[k,a,c,i],{k,c}] -sum[t[j,k,c,b]*v[k,a,c,i],{k,c}] -sum[t[i,c]*t[j,d]*t[k,b]*v[k,a,d,c],{k,c,d}] +2*sum[t[k,d]*t[i,j,c,b]*v[k,a,d,c],{k,c,d}] -sum[t[k,b]*t[i,j,c,d]*v[k,a,d,c],{k,c,d}] -sum[t[j,d]*t[i,k,c,b]*v[k,a,d,c],{k,c,d}] +2*sum[t[i,c]*t[j,k,b,d]*v[k,a,d,c],{k,c,d}] -sum[t[i,c]*t[j,k,d,b]*v[k,a,d,c],{k,c,d}] -sum[t[j,k,b,c]*v[k,a,i,c],{k,c}] -sum[t[i,c]*t[k,b]*v[k,a,j,c],{k,c}] -sum[t[i,k,c,b]*v[k,a,j,c],{k,c}] -sum[t[i,c]*t[j,d]*t[k,a]*v[k,b,c,d],{k,c,d}] -sum[t[k,d]*t[i,j,a,c]*v[k,b,c,d],{k,c,d}] -sum[t[k,a]*t[i,j,c,d]*v[k,b,c,d],{k,c,d}] +2*sum[t[j,d]*t[i,k,a,c]*v[k,b,c,d],{k,c,d}] -sum[t[j,d]*t[i,k,c,a]*v[k,b,c,d],{k,c,d}] -sum[t[i,c]*t[j,k,d,a]*v[k,b,c,d],{k,c,d}] -sum[t[i,c]*t[k,a]*v[k,b,c,j],{k,c}] +2*sum[t[i,k,a,c]*v[k,b,c,j],{k,c}] -sum[t[i,k,c,a]*v[k,b,c,j],{k,c}] +2*sum[t[k,d]*t[i,j,a,c]*v[k,b,d,c],{k,c,d}] -sum[t[j,d]*t[i,k,a,c]*v[k,b,d,c],{k,c,d}] -sum[t[j,c]*t[k,a]*v[k,b,i,c],{k,c}] -sum[t[j,k,c,a]*v[k,b,i,c],{k,c}] -sum[t[i,k,a,c]*v[k,b,j,c],{k,c}] +sum[t[i,c]*t[j,d]*t[k,a]*t[l,b]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[k,b]*t[l,d]*t[i,j,a,c]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[k,a]*t[l,d]*t[i,j,c,b]*v[k,l,c,d],{k,l,c,d}] +sum[t[k,a]*t[l,b]*t[i,j,c,d]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[j,c]*t[l,d]*t[i,k,a,b]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[j,d]*t[l,b]*t[i,k,a,c]*v[k,l,c,d],{k,l,c,d}] +sum[t[j,d]*t[l,b]*t[i,k,c,a]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[i,c]*t[l,d]*t[j,k,b,a]*v[k,l,c,d],{k,l,c,d}] +sum[t[i,c]*t[l,a]*t[j,k,b,d]*v[k,l,c,d],{k,l,c,d}] +sum[t[i,c]*t[l,b]*t[j,k,d,a]*v[k,l,c,d],{k,l,c,d}] +sum[t[i,k,c,d]*t[j,l,b,a]*v[k,l,c,d],{k,l,c,d}] +4*sum[t[i,k,a,c]*t[j,l,b,d]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[i,k,c,a]*t[j,l,b,d]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[i,k,a,b]*t[j,l,c,d]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[i,k,a,c]*t[j,l,d,b]*v[k,l,c,d],{k,l,c,d}] +sum[t[i,k,c,a]*t[j,l,d,b]*v[k,l,c,d], {k,l,c,d}] +sum[t[i,c]*t[j,d]*t[k,l,a,b]*v[k,l,c,d],{k,l,c,d}] +sum[t[i,j,c,d]*t[k,l,a,b]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[i,j,c,b]*t[k,l,a,d]*v[k,l,c,d],{k,l,c,d}] -2*sum[t[i,j,a,c]*t[k,l,b,d]*v[k,l,c,d],{k,l,c,d}] +sum[t[j,c]*t[k,b]*t[l,a]*v[k,l,c,i], {k,l,c}] +sum[t[l,c]*t[j,k,b,a]*v[k,l,c,i],{k,l,c}] -2*sum[t[l,a]*t[j,k,b,c]*v[k,l,c,i],{k,l,c}] +sum[t[l,a]*t[j,k,c,b]*v[k,l,c,i],{k,l,c}] -2*sum[t[k,c]*t[j,l,b,a]*v[k,l,c,i],{k,l,c}] +sum[t[k,a]*t[j,l,b,c]*v[k,l,c,i],{k,l,c}] +sum[t[k,b]*t[j,l,c,a]*v[k,l,c,i],{k,l,c}] +sum[t[j,c]*t[l,k,a,b]*v[k,l,c,i],{k,l,c}] +sum[t[i,c]*t[k,a]*t[l,b]*v[k,l,c,j],{k,l,c}] +sum[t[l,c]*t[i,k,a,b]*v[k,l,c,j],{k,l,c}] -2*sum[t[l,b]*t[i,k,a,c]*v[k,l,c,j],{k,l,c}] +sum[t[l,b]*t[i,k,c,a]*v[k,l,c,j],{k,l,c}] +sum[t[i,c]*t[k,l,a,b]*v[k,l,c,j],{k,l,c}] +sum[t[j,c]*t[l,d]*t[i,k,a,b]*v[k,l,d,c],{k,l,c,d}] +sum[t[j,d]*t[l,b]*t[i,k,a,c]*v[k,l,d,c],{k,l,c,d}] +sum[t[j,d]*t[l,a]*t[i,k,c,b]*v[k,l,d,c],{k,l,c,d}] -2*sum[t[i,k,c,d]*t[j,l,b,a]*v[k,l,d,c],{k,l,c,d}] -2*sum[t[i,k,a,c]*t[j,l,b,d]*v[k,l,d,c],{k,l,c,d}] +sum[t[i,k,c,a]*t[j,l,b,d]*v[k,l,d,c],{k,l,c,d}] +sum[t[i,k,a,b]*t[j,l,c,d]*v[k,l,d,c], {k,l,c,d}] +sum[t[i,k,c,b]*t[j,l,d,a]*v[k,l,d,c],{k,l,c,d}] +sum[t[i,k,a,c]*t[j,l,d,b]*v[k,l,d,c],{k,l,c,d}] +sum[t[k,a]*t[l,b]*v[k,l,i,j], {k,l}] +sum[t[k,l,a,b]*v[k,l,i,j],{k,l}] +sum[t[k,b]*t[l,d]*t[i,j,a,c]*v[l,k,c,d],{k,l,c,d}] +sum[t[k,a]*t[l,d]*t[i,j,c,b]*v[l,k,c,d], {k,l,c,d}] +sum[t[i,c]*t[l,d]*t[j,k,b,a]*v[l,k,c,d],{k,l,c,d}] -2*sum[t[i,c]*t[l,a]*t[j,k,b,d]*v[l,k,c,d],{k,l,c,d}] +sum[t[i,c]*t[l,a]*t[j,k,d,b]*v[l,k,c,d],{k,l,c,d}] +sum[t[i,j,c,b]*t[k,l,a,d]*v[l,k,c,d],{k,l,c,d}] +sum[t[i,j,a,c]*t[k,l,b,d]*v[l,k,c,d], {k,l,c,d}] -2*sum[t[l,c]*t[i,k,a,b]*v[l,k,c,j],{k,l,c}] +sum[t[l,b]*t[i,k,a,c]*v[l,k,c,j],{k,l,c}] +sum[t[l,a]*t[i,k,c,b]*v[l,k,c,j],{k,l,c}] ab = 〈 T 2 ∣ 0 〉 i j ∣ e +v[a,b,i,j] a b T 2  − T 1 − T 1     i j H e

Tensor Contraction Engine (TCE) • High-level domain-specific language for a class of problems in quantum chemistry/physics based on contraction of large multi-dimensional tensors • Specialized optimizing compiler – Produces F77+GA code, linked to runtime libs range V = 3000; S abij = ∑ A acik B befl C dfjk D cdel range O = 100; cefkl index a,b,c,d,e,f : V; index i,j,k,l : O; mlimit = 100GB; procedure P(in A[V,V,O,O], in B[V,V,V,O], in C[V,V,O,O], in D[V,V,V,O], out S[V,V,O,O])= begin S[a,b,i,j] == sum[ A[a,c,i,k] * B[b,e,f,l] * C[d,f,j,k] * D[c,d,e,l], {c,e,f,k,l}]; end

CCSD(T) run on Cray XT5 : 18 water FP performance at 90K cores: 358 TFlops (H 2 O) 18 54 atoms 918 basis functions Cc-pvtz(-f) basis E. Apra, ORNL

CCSD(T) run on Cray XT5 : 20 water Floating-Point performance at 92K cores: 475 TFlops Efficiency > 50% (H 2 O) 20 60 atoms 1020 basis functions Cc-pvtz(-f) basis E. Apra, ORNL

Multiresolution Adaptive Numerical Scientific Simulation Ariana Beste 1 , George I. Fann 1 , Robert J. Harrison 1,2 , Rebecca Hartman-Baker 1 , Judy Hill 1 1 Oak Ridge National Laboratory 2 University of Tennessee, Knoxville In collaboration with Gregory Beylkin 4 , Fernando Perez 4 , Lucas Monzon 4 , Martin Mohlenkamp 5 and others 4 University of Colorado 5 Ohio University harrisonrj@ornl.gov CSGF June 2008 11

Funding • DOE Office of Science, SciDAC, divisions of Advanced Scientific Computing Research and Basic Energy Science, under contract DE-AC05-00OR22725 with Oak Ridge National Laboratory, in part using the National Center for Computational Sciences. • NSF CHE 0625598: Cyber-infrastructure and Research Facilities: Chemical Computations on Future High-end Computers • NSF CNS-0509410: CAS-AES: An integrated framework for compile-time/run-time support for multi-scale applications on high-end systems • DARPA HPCS2: HPCS programming language evaluation CSGF June 2008 12

MADNESS objectives • Scaling to 1+M processors ASAP • High-level composition (matlab-like) for accurate and efficient solvers – Targeting graduate students in physical science • Correct scaling of cost with system size – Fast algorithms with guaranteed precision • Broad relevance – Applications now in chemistry, atomic physics, fusion, material science, nuclear physics http://code.google.com/p/m-a-d-n-e-s-s CSGF June 2008 13

High-level composition • Close to the physics E =〈 ∣ − 1 1 2  V ∣ 〉 ∫  2  x  2  y  dx dy 2 ∇ ∣ x − y ∣ operatorT op = CoulombOperator(k, rlo, thresh); functionT rho = psi*psi; double twoe = inner(apply(op,rho),rho); double pe = 2.0*inner(Vnuc*psi,psi); double ke = 0.0; for (int axis=0; axis<3; axis++) { functionT dpsi = diff(psi,axis); ke += inner(dpsi,dpsi); } double energy = ke + pe + twoe; CSGF June 2008 14

CSGF June 2008 15

From math to peta-app: Challenges to practical computation with - PowerPoint PPT Presentation

From math to peta-app: Challenges to practical computation with tensor-based algorithms Robert J. Harrison harrisonrj@ornl.gov robert.harrison@utk.edu Outline Why tensors? Many-body physics Low-rank approximation for operators

App App App App App App App App App App App App App App App App App App App App App App

Horizontal Vertically integrated Open interfaces Closed, proprietary Rapid innovation Slow

PEGASUS: A peta-scale graph mining system - Implementation and observations U. Kang, C. E.

Sefos A self-aware factored operating system A Traditional OS App 1 App 2 App 3 System call

MADNESS From Math to Peta-App Robert J. Harrison harrisonrj@ornl.gov robert.harrison@utk.edu 2

The Power of One Poorva Joshipura, Vice President International Affairs, PETA Foundation Board

www.localgloballearning.edu.au @LocGloLearning Kelsey Halbert Peta Salter Michael Singh

Challenges and Solutions for Peta- and Exa-Sacle Programming Tasuku Hiraishi Academic Center for

GUST e-Foundry MATH FONTS Latin Modern Math, ver. 1.959 T EX Gyre Bonum Math, ver. 1.005 T EX

Math 211 Math 211 Lecture #1 August 29, 2000 2 Welcome to Math 211 Welcome to Math 211 Math

WI-FI SMART APPLICATION Download the app Use this QR code tho download Wifi Smart app for OS

MOBILE APP PDF Client Presentation FREE APP FOR ALL The ultimate Mobile App for Fans, Drivers

MOBILE APP TUTORIAL TPL Trakker Mobile App How to Download the Mobile App? Step 1: Tap on

Todays Agenda: Discuss College Application Process: Common App, Coalition App, School App

Communis Smart App is an app that is both revolutionary and convenient to use. It is an app that

Google App Engine Guido van Rossum Stanford EE380 Colloquium, Nov 5, 2008 Google App Engine

1 How long will it take? Scheduling Compare with other tasks you have done Determine list of

iRODS&ARCS ShundeZhang shunde.zhang@arcs.org.au Whoarewe?

Midterm 2 Example Problems Exam on March 21 from 2-3:20 pm in CSC 12 Reference to the textbook,

S-38.180 Exercise 2: Integrated Services Mika Ilvesm aki Helsinki University of Technology

Challenges and Solutions for Peta- and Exa-Sacle Programming: Systems Perspective Richard

Design Patterns for Metamodel Design Domain-Specific Modeling Workshop Portland, Oregon October

r r r sstrtr str

Flow Shop and Job Shop Marco Chiarandini Department of Mathematics & Computer Science

From math to peta-app: Challenges to practical computation with - PowerPoint PPT Presentation

From math to peta-app: Challenges to practical computation with tensor-based algorithms Robert J. Harrison harrisonrj@ornl.gov robert.harrison@utk.edu Outline Why tensors? Many-body physics Low-rank approximation for operators

App App App App App App App App App App App App App App App App App App App App App App

Horizontal Vertically integrated Open interfaces Closed, proprietary Rapid innovation Slow

PEGASUS: A peta-scale graph mining system - Implementation and observations U. Kang, C. E.

Sefos A self-aware factored operating system A Traditional OS App 1 App 2 App 3 System call

MADNESS From Math to Peta-App Robert J. Harrison harrisonrj@ornl.gov robert.harrison@utk.edu 2

The Power of One Poorva Joshipura, Vice President International Affairs, PETA Foundation Board

www.localgloballearning.edu.au @LocGloLearning Kelsey Halbert Peta Salter Michael Singh

Challenges and Solutions for Peta- and Exa-Sacle Programming Tasuku Hiraishi Academic Center for

GUST e-Foundry MATH FONTS Latin Modern Math, ver. 1.959 T EX Gyre Bonum Math, ver. 1.005 T EX

Math 211 Math 211 Lecture #1 August 29, 2000 2 Welcome to Math 211 Welcome to Math 211 Math

WI-FI SMART APPLICATION Download the app Use this QR code tho download Wifi Smart app for OS

MOBILE APP PDF Client Presentation FREE APP FOR ALL The ultimate Mobile App for Fans, Drivers

MOBILE APP TUTORIAL TPL Trakker Mobile App How to Download the Mobile App? Step 1: Tap on

Todays Agenda: Discuss College Application Process: Common App, Coalition App, School App

Communis Smart App is an app that is both revolutionary and convenient to use. It is an app that

Google App Engine Guido van Rossum Stanford EE380 Colloquium, Nov 5, 2008 Google App Engine

1 How long will it take? Scheduling Compare with other tasks you have done Determine list of

iRODS&amp;ARCS ShundeZhang shunde.zhang@arcs.org.au Whoarewe?

Midterm 2 Example Problems Exam on March 21 from 2-3:20 pm in CSC 12 Reference to the textbook,

S-38.180 Exercise 2: Integrated Services Mika Ilvesm aki Helsinki University of Technology

Challenges and Solutions for Peta- and Exa-Sacle Programming: Systems Perspective Richard

Design Patterns for Metamodel Design Domain-Specific Modeling Workshop Portland, Oregon October

r r r sstrtr str

Flow Shop and Job Shop Marco Chiarandini Department of Mathematics &amp; Computer Science

iRODS&ARCS ShundeZhang shunde.zhang@arcs.org.au Whoarewe?

Flow Shop and Job Shop Marco Chiarandini Department of Mathematics & Computer Science