LinBox Lab University of Delaware D. Saunders, Z. Wan, D. Roche, C. - PowerPoint PPT Presentation

LinBox Lab – University of Delaware D. Saunders, Z. Wan, D. Roche, C. Devore (A. Duran, E. Schrag, R. Seagraves, B. Hovinen, ...). Thanks to the National Science Foundation 1

Tools for exact linear algebra http://linalg.org/ Mirror sites are maintained at linalg.org (North America) and linalg.net (Europe). Local links: org , net . Project LinBox: Exact computational linear algebra LinBox is a C++ template library for exact, high-performance linear algebra computation with sparse and structured matrices over the integers and over Overview finite fields. News People No stable releases available at this Download time Documentation Developer resources Current development version: 0.1.3 Links Support Comments? Bug reports? Please contact us at linbox@yahoogroups.com We offer related packages: (1) A gap share package for Simplicial GAP homology package Homology computation and for Smith Maple-LinBox package normal forms, (2) A package for access to linbox computation from Maple. We offer a server which provides linear algebra computations including Online computing servers the Smith normal form of a matrix. A second server computes the full homology of simplicial complexes. Use our compute cycles gratis. Comments? Bug reports? Please contact us at linbox@yahoogroups.com Page prepared by the LinBox team < linbox@yahoogroups.com > This page’s URL: 2 http://www.linalg.org/ (US), http://www.linalg.net/ (Europe) Page major version change: 4 August 2002 Page last updated: 7 March 2003 This material is based upon work supported by the National Science Foundation under grants 9726763, 9712362, 0098284, and 0112807. Any opinions, findings and conclusions or recomendations expressed in this material are those of the author(s) and do not necessarily reflect the

Problems solved by LinBox • Do exact rank, Smith form, determinant, system solve, min- poly, charpoly of integer matrices (via modular computation plus Chinese Remainder Algorithm or Hensel lifting). • Particularly, use rank and Smith form of { 0 , 1 } or { 0 , 1 , − 1 } matrices for Homology and other incidence matrix situations. – Homology of simplicial complexes. – multivariate polynomial equation system solving. • Problems may be huge (100,000 equations, millions of nonzero entries.) 3

Picture of Trefethen and TF class matrices Very sparse matrices, about 2 log n non-zero entries per row in Trefethen matrices. 4

Methods • Blackbox (BB) methods are excellent for large sparse matrices over finite fields. Wiedemann, Kaltofen-Saunders, Dumas- Saunders-Villard... • Sparse elimination (such as SuperLU of Demmel, et al) is excellent on matrices which are small, or slow to fill in. Duran adapted it to work over finite fields. • Other eliminations are fast by using floating point BLAS. 5

Example 1. Engineered algorithm for rank 1.1 1 0.9 Relative efficiencies best/t(A3) 0.8 best/(2t(A4)) 0.7 best/t(COLAMD) 0.6 best/t(BB) 0.5 0.4 0.3 0.2 0.1 0 Tref500 TF12 Rand600 IG5_10 Saylr3 Tref1000 TF13 F855 Rnd3_15 Rnd3_45 Rnd3_30 TF14 tols4000 Tref5000 Rnd6_30 Rnd6_45 TF15 Tref10000 IG5_15 Matrices ordered by size • Blackbox method • Generalized SuperLU • racing - guaranteed 1/2 efficiency of best of BB, GSLU • hybrid - elim until BB estimate is faster 6

TF family 50.000 45.000 BB 40.000 GSLU 35.000 30.000 speedup 25.000 20.000 15.000 10.000 5.000 0.000 107 236 552 1302 3160 7742 19321 matrix order The crossover is near order 1000 7

(slide from Williamsburg report) Conclusions An adaptive hybrid of elimination and blackbox methods is advis- able and effective for exact linear algebra over finite fields (and over the integers). A left looking elimination such as SuperLU lends itself to early determination of excess fill-in and switch to an indirect (blackbox) method. High performance exact linear algebra is implemented in LinBox, available at linalg.org. 8

Example 3: The Generic Design methodology Speedup of ZeroOne over SparseMatrix for 32 bit prime zeroone rep. speedup over sparse rep. 2.1 2 1.9 1.8 1.7 1.6 1.5 1.4 1.3 1.2 1.1 1 bcsstk29 bcsstk30 bcsstk31 bcsstk32 bcsstk33 matrix name ZeroOne takes 2/3 as long as SparseMatrix for matrix-vector products. 9

Example 2. Rank of matrices of rational functions with rational number coefficients. 2 x 2 +7  33 x 5 + x + 2  x x 100 − 3 23 x − 5   3 x 2 +4 94 x 4 + x 3 + 10 x     x 100 − 5 23 x − 5   5 x 2 +1 3 x 7 + x 2 − x  x  x 100 − 8 23 x − 5 ...evaluated at a random point (in this example x = 1).   1 / 2 36 − 1 / 2 − 1 / 4 7 / 18 105     3 − 1 / 7 1 / 3 ...mod a random prime (in this example p = 11).   6 3 5 8 1 6     3 3 4 10

• This is a very fast heuristic when p is a wordsize prime and the evaluation point is random from a sufficiently large set. • It becomes a slower Monte Carlo algorithm with a proven upper bound on the probability of error, if sufficiently many primes and points are used. • It becomes a very sloowww deterministic algorithm, if a really large number of points and primes are used (as calculated using formulas for bounds on determinants). • This work won Carl Devore and me the Computer Algebra Nederland Foundation Prize - 1000 Euros. 11

Example 4: Quickly and exactly solve a challenge problem In 2002, Prof. L. N. Trefethen posted “The SIAM 100-Dollar, 100-Digit Challenge”. ∗ Here is problem 7 (of 10): Let A be the 20 , 000 × 20 , 000 matrix whose entries are zero everywhere except for the primes 2 , 3 , 5 , 7 , ..., 224737 along the main diagonal and the number 1 in all the positions a ij with | i − j | = 1 , 2 , 4 , 8 , ..., 16384. What is the (1 , 1) entry of A − 1 ? ∗ http://web.comlab.ox.ac.uk/oucl/work/nick.trefethen/hundred.html. 12

The 20000 by 20000 matrix has over half a million nonzero entries. The exact answer is a fraction whose numerator and de- nominator each has 97,389 decimal digits. Our solutions of two years ago: • Parallel solution by LinBoxer Jean-Guillaume Dumas (Greno- ble, France): Solve mod 32 bit primes (use 12 thousand of them because of the size of the answer). Use Chinese Remainder Algorithm to combine the results. He ran 182 processors for four days using LinBox software (80 of them were the NSFRI cluster, the rest were PC’s in France). This method runs in O ∼ ( n 4 ) time. 13

• A couple of months later, Zhendong Wan (Newark, Delaware) Recomputed the result on strauss using Dixon lifting. Strauss was called ‘spare’ then - it was in a test period before going public. Its huge memory was necessary. The method needed 8GB. This method runs in O ∼ ( n 3 ) time. Zhendong’s solution two years later: • The exact answer can now be computed in 25 minutes on a cheap PC running Linux on a 1.9GHZ Pentium processor with 1GB memory (or in 12 minutes on a 3.2GHZ Intel Xeon processor). Only a few MB of memory is required. The method is a mixture of numeric approximation and symbolic exact computation. It runs in O ∼ ( n 2 ) time.

Methods Complexity Memory Run time O ∼ ( n 4 ) Quotient of two determinants a few MB Four days in parallel Wiedemann’s algorithm using 182 processors, Chinese remainder theorem 96 Intel 735 MHZ PIII, 6 1G 20 4 × 250MHZ sun ultra-45 O ∼ ( n 3 ) Solve Ax = e 1 = (1 , 0 , · , 0) 3.2 GB 12.5 days sequentially in by plain Dixon lifting a Sun Sun-Fire with for the dense case 750 MHZ Ultrasparcs and Rational reconstruction 8GB for each processors O ∼ ( n 2 ) Solve Ax = e 1 = (1 , 0 , · , 0) a few MB 25 minutes in a pc with by our methods above 1.9GHZ Intel P processor, Rational reconstruction and 1 GB memory The original work earned Zhendong a nice writeup in Trefethen’s report on the contest. The new fast method earned him a place the website of a followup book about the contest. http://www-m3. ma.tum.de/m3/bornemann/challengebook/Updates/index.html 14

Future work for the LinBox team • Theory: For the run time, best asymptotic lower bounds (problem complexity) � = best asymptotic upper bounds (algorithm complexity). – Design fast algorithms for general case. – Design fast algorithms for special matrix classes. – Prove any non-trivial lower bound. • Practice: Best practical algorithm is determined problem size and shape, by hardware properties, by the available tools. – Implement and test the best algorithms. – Improve the library design for genericity and performance. – Engineer the hybrid algorithms . – Continue to provide the best performing integer matrix computation package in the world. • Application: 15

– Homology - what is the geometry of huge, high dimensional, combi- natorial objects? – Graphics and medical imaging - quickly get the right shape. – Cryptology - for instance, the RSA challenge problems.

LinBox Lab University of Delaware D. Saunders, Z. Wan, D. Roche, C. - PowerPoint PPT Presentation

LinBox Lab University of Delaware D. Saunders, Z. Wan, D. Roche, C. Devore (A. Duran, E. Schrag, R. Seagraves, B. Hovinen, ...). Thanks to the National Science Foundation 1 Tools for exact linear algebra http://linalg.org/ Mirror sites are

Mega to Micro: Marine Debris Initiatives in Delaware Nicole Rodi Kari St. Laurent, Ph.D Delaware

Delaware Wetland Protection Vision and Strategic Plan September 25, 2013 ELI Delaware Wetland

Report of Precipitation and Long-Range Forecasts for Delaware David R. Legates Delaware State

Jie Fu (U. Pennsylvania) Jeffrey Heinz (Delaware) Adam Jardine (Delaware) Herbert G. Tanner

Premcor Delaware City Refinery Premcor Delaware City Refinery FCCU 20 ppm NO x Project FCCU 20

BEHAVIORAL HEALTH CONSORTIUM INTRODUCTION BEHAVIORAL HEALTH IN DELAWARE DELAWARE HAS ABOUT

Grant Workshop Large-Scale Grant Program 1 Overview Delaware Sustainable Energy Utility (DESEU)

Delaware Afterschool Network 1 Delaware Afterschool Network This is Afterschool

May 2019 Cheryl Couvillion Delaware Lottery SPORTS LOTTERY-HISTORY PASPA 1992 - Delaware is

Delaware River Basin Commission Updating TMDLs for PCBs for the Delaware Estuary Thomas J. Fikslin,

Delaware Basin Stream Management Program A partnership between NYCDEP and Delaware County Soil

University of Delaware University of Delaware Center for the Arts Center for the Arts Newark,

HCC@UF Lab Resources Overview (and Tour) Lisa Anthony, PhD January 12, 2017 HCC@UF Lab

Lab 7 Lab 6 Review Review for Lab 7 March 5, 2019 Sprenkle - CSCI111 1 Lab 7: Pair

Delawareans Without Health Insurance 2007 presented to Delaware Health Care Commision by

State of Delaware OFFICE OF GOVERNOR CARNEY FINANCIAL OVERVIEW FOR FISCAL YEAR 2021 January 30,

Selected Topics of Theoretical Computer Science (456-335/1) Petr Jan car Dept of Computer

H OW TO STUDY ARITHMETICAL FUNCTIONS ? O VERVIEW E RD OS AND TE R IELE M AIN RESULTS F UTURE

Shors Algorithm Ben Prather UIUC Algorithms Interest Group, Sep 30, 2016 History

Mathematical Background Chester Rebeiro March 7, 2017 Modular Arithmetic Division Theorem

Linear congruences: ax b (mod n ) for x Z a x = b in Z n (in particular x {

An Algebraic Approach to the Design of Block Ciphers Jos Valena scar Pereira Tiago

( t,w ) Threshold schemes " A master key ! (e.g. for a Certificate Authority) is very very

Implementation of RSA 2048 on GPUs Marcelo E. Kaihara EPFL LACAL Nov. 4, 2010 Motivation

LinBox Lab University of Delaware D. Saunders, Z. Wan, D. Roche, C. - PowerPoint PPT Presentation

LinBox Lab University of Delaware D. Saunders, Z. Wan, D. Roche, C. Devore (A. Duran, E. Schrag, R. Seagraves, B. Hovinen, ...). Thanks to the National Science Foundation 1 Tools for exact linear algebra http://linalg.org/ Mirror sites are

Mega to Micro: Marine Debris Initiatives in Delaware Nicole Rodi Kari St. Laurent, Ph.D Delaware

Delaware Wetland Protection Vision and Strategic Plan September 25, 2013 ELI Delaware Wetland

Report of Precipitation and Long-Range Forecasts for Delaware David R. Legates Delaware State

Jie Fu (U. Pennsylvania) Jeffrey Heinz (Delaware) Adam Jardine (Delaware) Herbert G. Tanner

Premcor Delaware City Refinery Premcor Delaware City Refinery FCCU 20 ppm NO x Project FCCU 20

BEHAVIORAL HEALTH CONSORTIUM INTRODUCTION BEHAVIORAL HEALTH IN DELAWARE DELAWARE HAS ABOUT

Grant Workshop Large-Scale Grant Program 1 Overview Delaware Sustainable Energy Utility (DESEU)

Delaware Afterschool Network 1 Delaware Afterschool Network This is Afterschool

May 2019 Cheryl Couvillion Delaware Lottery SPORTS LOTTERY-HISTORY PASPA 1992 - Delaware is

Delaware River Basin Commission Updating TMDLs for PCBs for the Delaware Estuary Thomas J. Fikslin,

Delaware Basin Stream Management Program A partnership between NYCDEP and Delaware County Soil

University of Delaware University of Delaware Center for the Arts Center for the Arts Newark,

HCC@UF Lab Resources Overview (and Tour) Lisa Anthony, PhD January 12, 2017 HCC@UF Lab

Lab 7 Lab 6 Review Review for Lab 7 March 5, 2019 Sprenkle - CSCI111 1 Lab 7: Pair

Delawareans Without Health Insurance 2007 presented to Delaware Health Care Commision by

State of Delaware OFFICE OF GOVERNOR CARNEY FINANCIAL OVERVIEW FOR FISCAL YEAR 2021 January 30,

Selected Topics of Theoretical Computer Science (456-335/1) Petr Jan car Dept of Computer

H OW TO STUDY ARITHMETICAL FUNCTIONS ? O VERVIEW E RD OS AND TE R IELE M AIN RESULTS F UTURE

Shors Algorithm Ben Prather UIUC Algorithms Interest Group, Sep 30, 2016 History

Mathematical Background Chester Rebeiro March 7, 2017 Modular Arithmetic Division Theorem

Linear congruences: ax b (mod n ) for x Z a x = b in Z n (in particular x {

An Algebraic Approach to the Design of Block Ciphers Jos Valena scar Pereira Tiago

( t,w ) Threshold schemes &quot; A master key ! (e.g. for a Certificate Authority) is very very

Implementation of RSA 2048 on GPUs Marcelo E. Kaihara EPFL LACAL Nov. 4, 2010 Motivation

( t,w ) Threshold schemes " A master key ! (e.g. for a Certificate Authority) is very very