Sparse Linear Algebra for the R Language: SparseM
(Details appeared in the Journal of Statistical Software, Vol. 8, Issue 6)
Roger Koenker, University of Illinois at Urbana-Champaign
Pin Ng, Northern Arizona University

Outline
Motivations
Design Philosophy
Incomplete List of Existing Algorithms for Sparse Matrices
Functionality
  Coercion -- Storage Modes
  Visualization
  Indexing and Binding
  Linear Algebra
  Linear Equation Solving
  Least Squares Problems
Some Potential Refinements
Performance
Conclusion
Motivations
Many Applications in Statistics Involve Large Sparse Matrices
Parametric linear regression involving longitudinal data with fixed effects
Indicator variables consist of a few ones and a large number of zero elements
Nonparametric regression
Design matrices in smoothing splines often contain fewer than 1% nonzero entries
[Figure: image plot of the sparse design matrix; axes: column (1-200) by row (1-700)]
A Typical Design Matrix of the Penalized Triogram in Koenker and Mizera (2003)
Significant Performance Improvements Can Be Achieved by Exploiting the Sparse Structure of the Matrices
Memory Utilization Computational Speed
Design Philosophy
Make the Usage as Transparent as Possible by Adhering to the Method-Dispatch Convention of R
Behave Like the Dense Matrix Counterparts Whenever Possible
Adhere to the GPL License
Incomplete List of Existing Sparse Matrix Utilities
Sparse Basic Linear Algebra Subroutines (Duff, Heroux and Pozo, 2002)
Handles only sparse matrix times dense matrix multiplication at Level 3
Limited matrix utilities
Sparskit (Saad, 1994)
Wide collection of matrix utilities, e.g. masking, sorting, permuting, extracting and filtering
Building blocks for SparseM
Algorithms for Solving Sparse Systems of Linear Equations
Iterative Methods (Sparskit)
Conjugate gradient method (CGM)
CGM on normal residual equation
Bi-conjugate gradient method (BCG)
BCG with partial pivoting
Transpose-free quasi-minimum residual method
Much Slower than Direct Methods for Our Applications
Reaffirmed by Haunschmid and Ueberhuber (2000), “Iterative Solvers for Sparse Systems”
Direct Methods
Watson Sparse Matrix Package (WSMP), Gupta (2000)
Proprietary license
MUltifrontal Massively Parallel Solver (MUMPS Version 4.2), Amestoy, Duff, L’Excellent and Koster (2002)
LU or Cholesky factorization
Written in Fortran 90
UMFPACK Version 4.0, Davis (2002)
LU factorization
General sparse solver, not written specifically for symmetric positive definite systems
Ng and Peyton (1993) Left-Looking Block Sparse Cholesky Algorithm
Specialized for symmetric positive definite matrices
Building block for PCx and SparseM
Functionality -- Coercion
Four Storage Modes and Four Sparse Classes
CSR (Compressed sparse row), "matrix.csr"
  ra: real array of the nnz non-zero elements
  ja: integer array of the nnz column indices for ra
  ia: integer array of (n+1) pointers to the beginning of each row in ra and ja
CSC (Compressed sparse column), "matrix.csc"
SSR (Symmetric sparse row), "matrix.ssr"
  Only the lower triangular non-zero entries are stored
SSC (Symmetric sparse column), "matrix.ssc"
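The ra/ja/ia triple is the standard compressed-row layout used across sparse libraries. A minimal sketch in Python with scipy.sparse (an analogue for illustration, not SparseM itself; note scipy uses 0-based indices where SparseM's Fortran storage is 1-based):

```python
import numpy as np
from scipy.sparse import csr_matrix

A = np.array([[4., 0., 0.],
              [0., 5., 2.],
              [1., 0., 3.]])
S = csr_matrix(A)

# ra: the nnz non-zero values, stored row by row
print(S.data)      # [4. 5. 2. 1. 3.]
# ja: the column index of each value (0-based here, 1-based in SparseM)
print(S.indices)   # [0 1 2 0 2]
# ia: n+1 pointers marking where each row begins in ra/ja
print(S.indptr)    # [0 1 3 5]
```

Row i's entries occupy positions ia[i] through ia[i+1]-1 of ra and ja, which is what makes row-wise operations cheap in CSR.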
Functionality -- Coercion
Usage:
as.matrix.csr(x, …)
as.matrix.csc(x, …)
as.matrix.ssr(x, …)
as.matrix.ssc(x, …)
as.matrix(x)
is.matrix.csr(x, …), is.matrix.csc(x, …), is.matrix.ssr(x, …), is.matrix.ssc(x, …)
nrow(x), ncol(x), dim(x), class(x)
Functionality -- Visualization
Allows Exploration of the Sparse Structure
Works on Matrices of Class "matrix.csr"
Usage:
image(x, col = c("white", "gray"), xlab = "column", ylab = "row", …)
Functionality -- Indexing and Binding
Operate on Matrices of Class "matrix.csr"
Work Just Like Their Dense Matrix Counterparts but Return Matrices of Class "matrix.csr"
Usage:
x[i,j]
x[i,j] <- value
cbind(…)
rbind(…)
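The sparse-in, sparse-out behavior can be sketched with scipy's analogous operations, where hstack and vstack play the roles of cbind and rbind (an assumed analogy for illustration, not SparseM's own functions):

```python
import numpy as np
from scipy.sparse import csr_matrix, hstack, vstack, issparse

S = csr_matrix(np.array([[1., 0.],
                         [0., 2.]]))

row = S[0, :]              # slicing returns a sparse matrix, not dense
wide = hstack([S, S])      # column binding (cbind analogue)
tall = vstack([S, S])      # row binding (rbind analogue)
print(issparse(row), wide.shape, tall.shape)
```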
Functionality -- Linear Algebra
Operations on the "matrix.csr" Class Yield Objects in the "matrix.csr" Class, with a Few Exceptions
Group Arith: "+", "-", "*", "/", "^", "%%", "%/%"
Group Compare: "==", "!=", ">", ">=", "<", "<="
Work and behave just like their dense matrix counterparts
“t” and “diff” Produce “matrix.csr” Class
Usage: t(x), diff(x, lag = 1, differences = 1)
“%*%” Produces “matrix.csr” Class if Both x and y Are “matrix.csr” Classes; Produces Dense “matrix” Class if y Is a Dense Vector
Usage: x%*%y
“diag” Returns a Dense Vector with Appropriate Zeros Reintroduced “diag<-” Returns a “matrix.csr” Class with the Diagonal Entries Replaced
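A sketch of the analogous class behavior in scipy (standing in for "%*%" dispatch, an illustrative substitute rather than SparseM itself): multiplying two sparse matrices keeps the result sparse, while multiplying by a dense vector yields a dense result.

```python
import numpy as np
from scipy.sparse import csr_matrix, issparse

S = csr_matrix(np.array([[2., 0.],
                         [0., 3.]]))

SS = S @ S                     # sparse %*% sparse stays sparse
v = S @ np.array([1., 1.])     # sparse %*% dense vector gives a dense vector
print(issparse(SS), issparse(v), v)
```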
Functionality -- Linear Equation Solving
“chol” Performs Cholesky Decomposition
Usage: chol(x, pivot = FALSE, nsubmax, nnzlmax, tmpmax, ...)
“backsolve” Performs a Triangular Back-Substitution
Usage: backsolve(r, x, k, upper.tri, transpose)
“solve” Combines “chol” and “backsolve”, and Will Compute the Inverse of a Matrix if the Right-Hand Side Is Missing
Usage: solve(a, b, ...)
For Systems of Linear Equations that Only Vary on the Right-hand-side, the Result from “chol” Can Be Reused
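The factor-once, solve-many pattern that "chol" plus "backsolve" supports can be sketched with scipy's sparse LU factorization standing in for SparseM's sparse Cholesky (a substitution for illustration only; scipy.sparse.linalg has no sparse Cholesky):

```python
import numpy as np
from scipy.sparse import csc_matrix
from scipy.sparse.linalg import splu

# A symmetric positive definite system
A = csc_matrix(np.array([[4., 1., 0.],
                         [1., 3., 1.],
                         [0., 1., 2.]]))

lu = splu(A)                           # factor once
x1 = lu.solve(np.array([1., 0., 0.]))
x2 = lu.solve(np.array([0., 0., 1.]))  # reuse the factor for a new right-hand side
```

Only the cheap triangular solves are repeated; the expensive factorization is amortized across all right-hand sides.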
Functionality -- Least Squares Problem
“slm” Emulates “lm” to Fit Linear Models
“slm” Processes a Formula Object the Same Way as “lm”, then Calls “slm.fit” for the Actual Fitting Using “chol” and “backsolve”
Usage: slm(formula, data, weights, na.action, method = "csr", contrasts = NULL, ...)
Methods “summary”, “print”, “fitted”, “residuals”, and “coef” Are Available for Class “slm” Objects
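The least-squares path through "chol" and "backsolve" amounts to solving the normal equations; a sketch with scipy, where spsolve stands in (as an assumed substitute) for the Cholesky factor-and-backsolve step:

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.linalg import spsolve

# Sparse design matrix for the model y = b0 + b1 * x
X = csr_matrix(np.array([[1., 0.],
                         [1., 1.],
                         [1., 2.],
                         [1., 3.]]))
y = np.array([1., 3., 5., 7.])

# Normal equations X'X b = X'y; spsolve replaces chol + backsolve here
b = spsolve((X.T @ X).tocsc(), X.T @ y)
print(b)   # coefficients of the exact fit y = 1 + 2x
```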
Potential Refinements
Replace model.matrix(Terms, m, contrasts) in “slm” with a Specialized Version for the “matrix.csr” Class
Add “crossprod”, “row”, “col”, “eigen”, “det”, “svd” and the Set of Functions for “qr” Decomposition
Performance
“solve” Applied to Koenker and Mizera’s (2003) Penalized Triograms Test Function
The penalized triogram problem solved is

  \min_{\alpha} \sum_{i=1}^{n} \rho_\tau(z_i - \alpha_i) + \lambda \sum_{j=1}^{n} |d_j' \alpha|

with test surface

  f(x, y) = \frac{40 \exp\{8[(x - .5)^2 + (y - .5)^2]\}}{\exp\{8[(x - .2)^2 + (y - .7)^2]\} + \exp\{8[(x - .7)^2 + (y - .2)^2]\}}

and simulated observations

  z_i = f(x_i, y_i) + u_i, \quad i = 1, \ldots, n
Median Execution Time (in Seconds) of 50 Replications
[Figure: log-log plot of median execution time vs. n (n = 100, 200, 500, 1000); fitted lines: rqfn: log(Time) = -11.49 + 2.53 log(n); srqfn: log(Time) = -11.16 + 1.51 log(n)]
rqfn: dense matrix version
srqfn: sparse matrix version
Storage (in Bytes) Requirement
[Figure: log-log plot of storage vs. n (n = 50 to 5000); fitted lines: rqfn: log(Storage) = 3.6 + 2 log(n), quadratic growth; srqfn: log(Storage) = 7.5 + 1 log(n), linear growth]