 
              Experimental Mathematics and High-Performance Computing David H Bailey Lawrence Berkeley National Lab “All truths are easy to understand once they are discovered; the point is to discover them.” – Galileo Galilei
The NERSC Computer Center at the Berkeley Laboratory � Seaborg: 6656-CPU IBM P3 system, 10 Tflop/s peak, 7.8 Tbye memory. � Bassi: 976-CPU IBM P5 system, 6.7 Tflop/s peak, 3.5 Tbyte memory. � Franklin: 9672 dual-core Opteron CPUs, 100 Tflop/s peak is now being installed.
Example of NERC Computations: Astrophysics � Multi-physics and multi-scale phenomena. � Large dynamic range in time and length. � Requires adaptive mesh refinement. � Dense linear algebra. � FFTs and spherical harmonic transforms. Supernova simulation: � Future 3-D model calculations will require 1,000,000 CPU-hours per run, on 100 Tflop/s peak system. Analysis of cosmic microwave background data: Graphic: T. Mezzacappa, J. Blondin, K.-L. Ma, et al (ORNL) 3x10 21 flops, 16 Tbyte mem � WMAP (now) 2x10 24 flops, 1.6 Pbyte mem � PLANCK (2007) 1x10 27 flops, 1 Ebyte mem � CMBpol (2015)
Characteristics of Modern High- Performance Scientific Computing � The ultimate objective is to advance the applied discipline: � Physics, chemistry, astronomy, biology, climate, engineering, biotech. � Advanced numerical algorithms and computing techniques: � FFTs, dense linear algebra, sparse linear algebra, iterative solvers, multigrid, highly parallel processing, dynamic data structures, etc. � State-of-the-art calculations require highly parallel computers: � Enormous computational requirements are common. � 1000+ CPUs are used in many calculations. � A pragmatic attitude prevails: “If it works, use it.” � Some combinatorial optimization algorithms are observed to work significantly better in practice than theory might suggest. � Gaussian elimination with partial pivoting is not guaranteed to work in all cases, yet it works fine in real applications. � The QR algorithm was used for many years before it was found to cycle in a simple 4x4 case. A proof of convergence is still elusive.
What Is Experimental Mathematics? “Experimental mathematics” is a term for the emerging discipline where state-of-the-art computing technology is aggressively applied to problems in mathematical research: � Actively exploring mathematical questions. � Computing explicit numerical examples. � Performing large symbolic manipulations. � Testing (and often falsifying) conjectures. � Investigating possible paths for formal proof. Hamming: “The purpose of computing is insight, not numbers.”
Books on Experimental Mathematics Mathematics by Experiment: Plausible Reasoning in the 21st Century Experiments in Mathematics: Computational Paths to Discovery Authors: Jonathan Borwein, DHB and (for vol. 2) Roland Girgensohn. Both books are now available on CD-ROM in a hyperlinked, searchable PDF format. Also, a FREE condensed version is available at: http://www.experimentalmath.info Coming soon (Mar 2007): Experimental Mathematics in Action . Authors: David Bailey, Jon Borwein, Neil Calder, Roland Girgensohn, Russell Luke and Victor Moll.
Experimental Mathematics as High-Performance Computing � The ultimate objective is to advance the applied discipline: � Here the “applied discipline” is pure mathematics! � Advanced numerical algorithms and computing techniques: � PSLQ, high-precision arithmetic, symbolic computing, FFTs, numerical analysis, evaluation of integrals and series, etc. � State-of-the-art calculations require highly parallel computers: � High-precision arithmetic greatly magnifies run times. � 1000+ CPUs have been used in several calculations. � A pragmatic attitude prevails: “If it works, use it.” � We do not know ahead of time what terms to use in an integer relation search – guessing which terms to try is still a black art. � Whereas the standard PSLQ algorithm is guaranteed to find relations, no proof is known for multi-pair PSLQ. � We do not fully understand why tanh-sinh quadrature works so well, especially in 2-D, 3-D, etc.
Examples of Large Experimental Math Computations Identification of the 4th bifurcation point of the logistic iteration: � Integer relation of size 121; 10,000 digit arithmetic. � Required 67 min on 48 CPUs = 54 CPU-hours. Finding a relation derived from roots of Lehmer’s polynomial: � Integer relation of size 125; 50,000 digit arithmetic. � Required 16 hours on 64 CPUs = 1024 CPU-hours. Numerical verification of a mathematical physics integral: � 1-D quadrature calculation; 20,000-digit arithmetic. � Required 45 min on 1024 CPUs = 768 CPU-hours. Numerical evaluation of an Ising theory integral: � 3-D quadrature of a very complicated function; 500-digit arithmetic. � Required 18.2 hours on 256 CPUs = 4659 CPU-hours. Authors: David Broadhurst, Jonathan Borwein, Richard Crandall, Roland Girgensohn and DHB
Computational Methods Used in Experimental Math � High-precision computation. � PSLQ (integer relation detection). � Symbolic computing tools. � Function evaluations: sin, exp, log, erf, gamma, zeta, polylog. � Fast Fourier transforms (FFTs). � Dense and sparse linear algebra. � Evaluation of definite integrals. � Evaluation of infinite series sums. � Error bounds on computed results. � Highly parallel computing. � Computer graphics. Note that except for the first three, these are all staples of modern applied mathematics and numerical analysis.
LBNL’s High-Precision Software (ARPREC and QD) � Low-level routines written in C++. � C++ and F-90 translation modules permit use with existing programs with only minor code changes. � Double-double (32 digits), quad-double, (64 digits) and arbitrary precision (>64 digits) available. � Special routines for extra-high precision (>1000 dig). � High-precision integer, real and complex datatypes. � Includes many common functions: sqrt, cos, exp, gamma, etc. � PSLQ, root finding, numerical integration. � An interactive “Experimental Mathematician’s Toolkit” is also available. Available at: http://www.experimentalmath.info This software is being used by physicists, climate modelers, chemists and engineers, in addition to mathematicians. Authors: Xiaoye Li, Yozo Hida, Brandon Thompson and DHB
ARPREC vs GMP ARPREC advantages: � Comparatively simple install procedure. � Simple arrays facilitate parallel implementations. � High-level Fortran-90/95 interface (not available for GMP). � High-level C++ interface (ARPREC’s is nicer than GMP’s). � FFT-based arithmetic for very high precision (> 1000 digits). GMP/MPFR advantages: � Better performance, especially for over 1000 digit precision and for transcendental functions. � Support of a large community. What is needed: Combine the high-level ARPREC Fortran and C++ interfaces with the GMP low-level routines. Issue: How can this be done and still facilitate parallel applications?
The PSLQ Integer Relation Algorithm Let (x n ) be a vector of real numbers. An integer relation algorithm finds integers (a n ) such that At the present time, the PSLQ algorithm of mathematician-sculptor Helaman Ferguson is the best-known integer relation algorithm. PSLQ was named one of ten “algorithms of the century” by Computing in Science and Engineering . High-precision arithmetic software is required: at least d x n digits, where d is the size (in digits) of the largest of the integers a k . Authors: Helaman Ferguson, Stephen Arno and DHB
The BBP Formula for Pi In 1996, a computer program running the PSLQ algorithm discovered this formula for pi: This formula permits one to directly calculate binary or hexadecimal (base-16) digits of pi beginning at an arbitrary starting position n, without needing to calculate any of the first n-1 digits, by means of a very simple algorithm that requires almost no memory. This formula is now used in the G95 compiler. Authors: Peter Borwein, Simon Plouffe and DHB
Some Other Similar BBP-Type Identities Authors: Peter Borwein, Simon Plouffe, David Broadhurst, Richard Crandall and DHB
Is There a Base-10 Formula for Pi? Note that there is both a base-2 and a base-3 BBP-type formula for π 2 . Base-2 and base-3 formulas are also known for a handful of other constants. Question: Is there any nonbinary (base-n, where n ≠ 2 b ) BBP-type formula for π ? Answer: No. This is ruled out in a 2004 paper. This does not rule out some completely different scheme for finding individual non-binary digits of π . Authors: Jon Borwein, David Borwein and Will Galway
Normality A real number x is said to be b-normal (or normal base b) if every m-long string of base-b digits appears, in the limit, with frequency b -m . Whereas it can be shown that almost all real numbers are b-normal (for any b), there are only a handful of explicit examples. It is not known whether any of the following are b-normal (for any b):
A Connection Between BBP Formulas and Normality Consider the sequence defined by x 0 = 0, and where { } denotes fractional part as before. Result: log(2) is 2-normal if and only if this sequence is equidistributed in the unit interval. In a similar vein, consider the sequence x 0 = 0, and Result: p is 16-normal if and only if this sequence is equidistributed in the unit interval. Authors: Richard Crandall and DHB
Recommend
More recommend