SLIDE 1

Numerical Linear Algebra Software

(based on slides written by Michael Grant)

  • BLAS, ATLAS
  • LAPACK
  • sparse matrices
Prof. S. Boyd, EE364b, Stanford University
SLIDE 2

Numerical linear algebra in optimization

most memory usage and computation time in optimization methods is spent on numerical linear algebra, e.g.,

  • constructing sets of linear equations (e.g., Newton or KKT systems)
    – matrix-matrix products, matrix-vector products, . . .
  • and solving them
    – factoring, forward and backward substitution, . . .

. . . so knowing about numerical linear algebra is a good thing

SLIDE 3

Why not just use Matlab?

  • Matlab (Octave, . . . ) is OK for prototyping an algorithm
  • but you’ll need to use a real language (e.g., C, C++, Python) when
    – your problem is very large, or has special structure
    – speed is critical (e.g., real-time)
    – your algorithm is embedded in a larger system or tool
    – you want to avoid proprietary software
  • in any case, the numerical linear algebra in Matlab is done using standard free libraries

SLIDE 4

How to write numerical linear algebra software

DON’T! whenever possible, rely on existing, mature software libraries

  • you can focus on the higher-level algorithm
  • your code will be more portable, less buggy, and will run faster—sometimes much faster

SLIDE 5

Netlib

the grandfather of all numerical linear algebra web sites: http://www.netlib.org

  • maintained by University of Tennessee, Oak Ridge National Laboratory, and colleagues worldwide
  • most of the code is public domain or freely licensed
  • much written in FORTRAN 77 (gasp!)
SLIDE 6

Basic Linear Algebra Subroutines (BLAS)

written by people who had the foresight to understand the future benefits of a standard suite of “kernel” routines for linear algebra

created and organized in three levels:

  • Level 1, 1973-1977: O(n) vector operations: addition, scaling, dot products, norms
  • Level 2, 1984-1986: O(n^2) matrix-vector operations: matrix-vector products, triangular matrix-vector solves, rank-1 and symmetric rank-2 updates
  • Level 3, 1987-1990: O(n^3) matrix-matrix operations: matrix-matrix products, triangular matrix solves, low-rank updates

SLIDE 7

BLAS operations

Level 1
  addition/scaling:        αx,  αx + y
  dot products, norms:     x^T y,  ‖x‖_2,  ‖x‖_1

Level 2
  matrix/vector products:  αAx + βy,  αA^T x + βy
  rank-1 updates:          A + αxy^T,  A + αxx^T
  rank-2 updates:          A + αxy^T + αyx^T
  triangular solves:       αT^{-1}x,  αT^{-T}x

Level 3
  matrix/matrix products:  αAB + βC,  αAB^T + βC,  αA^T B + βC,  αA^T B^T + βC
  rank-k updates:          αAA^T + βC,  αA^T A + βC
  rank-2k updates:         αA^T B + αB^T A + βC
  triangular solves:       αT^{-1}C,  αT^{-T}C

SLIDE 8

Level 1 BLAS naming convention

BLAS routines have a Fortran-inspired naming convention:

  cblas_ X XXXX
  prefix | data type | operation

data types:
  s  single precision real       c  single precision complex
  d  double precision real       z  double precision complex

operations:
  axpy   y ← αx + y
  dot    r ← x^T y
  nrm2   r ← ‖x‖_2 = √(x^T x)
  asum   r ← ‖x‖_1 = Σ_i |x_i|

example: cblas_ddot (double precision real dot product)
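
a minimal sketch of calling this routine from C (assuming a CBLAS implementation such as ATLAS or OpenBLAS is installed, linked with something like -lcblas):

    /* double precision real dot product via cblas_ddot */
    #include <stdio.h>
    #include <cblas.h>

    int main(void) {
        double x[3] = { 1.0, 2.0, 3.0 };
        double y[3] = { 4.0, 5.0, 6.0 };
        /* cblas_ddot(n, x, incx, y, incy); incx, incy are element strides */
        double r = cblas_ddot(3, x, 1, y, 1);
        printf("x^T y = %g\n", r);   /* prints 32 */
        return 0;
    }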

SLIDE 9

BLAS naming convention: Level 2/3

Level 2/3 BLAS routines add a two-letter matrix structure code:

  cblas_ X XX XXX
  prefix | data type | structure | operation

matrix structure:
  tr  triangular     tp  packed triangular     tb  banded triangular
  sy  symmetric      sp  packed symmetric      sb  banded symmetric
  he  Hermitian      hp  packed Hermitian      hb  banded Hermitian
  ge  general        gb  banded general

operations:
  mv    y ← αAx + βy
  sv    x ← A^{-1}x (triangular only)
  r     A ← A + xx^T
  r2    A ← A + xy^T + yx^T
  mm    C ← αAB + βC
  r2k   C ← αAB^T + αBA^T + βC

examples:
  cblas_dtrmv   double precision real triangular matrix-vector product
  cblas_dsyr2k  double precision real symmetric rank-2k update
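
for instance, a minimal sketch of a Level 2 call (the general matrix-vector product cblas_dgemv; the wrapper function and its name are illustrative):

    #include <cblas.h>

    /* y = alpha*A*x + beta*y, with A m-by-n, stored column-major with
       leading dimension m */
    void gemv_sketch(int m, int n, double alpha, const double *A,
                     const double *x, double beta, double *y) {
        cblas_dgemv(CblasColMajor, CblasNoTrans, m, n,
                    alpha, A, m, x, 1, beta, y, 1);
    }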

SLIDE 10

Using BLAS efficiently

always choose a higher-level BLAS routine over multiple calls to a lower-level BLAS routine; e.g., for

  A ← A + Σ_{i=1}^k x_i y_i^T,    A ∈ R^{m×n}, x_i ∈ R^m, y_i ∈ R^n

two choices:

  • k separate calls to the Level 2 routine cblas_dger:
      A ← A + x_1 y_1^T,  . . . ,  A ← A + x_k y_k^T
  • or a single call to the Level 3 routine cblas_dgemm:
      A ← A + XY^T,    X = [x_1 · · · x_k],  Y = [y_1 · · · y_k]

the Level 3 choice will perform much better (see the sketch below)
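
a minimal sketch of both options in C (column-major storage assumed; x_i and y_i are the columns of X and Y, and the wrapper functions are illustrative):

    #include <stddef.h>
    #include <cblas.h>

    /* option 1: k separate Level 2 rank-1 updates A = A + x_i*y_i^T */
    void update_level2(int m, int n, int k, double *A,
                       const double *X, const double *Y) {
        for (int i = 0; i < k; ++i)
            cblas_dger(CblasColMajor, m, n, 1.0,
                       X + (size_t)i * m, 1,   /* x_i: i-th column of X */
                       Y + (size_t)i * n, 1,   /* y_i: i-th column of Y */
                       A, m);
    }

    /* option 2: a single Level 3 call A = A + X*Y^T */
    void update_level3(int m, int n, int k, double *A,
                       const double *X, const double *Y) {
        cblas_dgemm(CblasColMajor, CblasNoTrans, CblasTrans,
                    m, n, k, 1.0, X, m, Y, n, 1.0, A, m);
    }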

SLIDE 11

Is BLAS necessary?

why use BLAS when writing your own routines is so easy?

  A ← A + XY^T,    A ∈ R^{m×n}, X ∈ R^{m×p}, Y ∈ R^{n×p}

  A_{ij} ← A_{ij} + Σ_{k=1}^p X_{ik} Y_{jk}

    /* all matrices stored column-major: A(i,j) is A[i + j*m], etc. */
    void matmultadd(int m, int n, int p, double *A,
                    const double *X, const double *Y)
    {
        for (int i = 0; i < m; ++i)
            for (int j = 0; j < n; ++j)
                for (int k = 0; k < p; ++k)
                    A[i + j * m] += X[i + k * m] * Y[j + k * n];
    }

SLIDE 12

Is BLAS necessary?

  • tuned/optimized BLAS will run faster than your home-brew version — often 10× or more
  • BLAS is tuned by selecting block sizes that fit well with your processor, cache sizes
  • ATLAS (automatically tuned linear algebra software) http://math-atlas.sourceforge.net uses automated code generation and testing methods to generate an optimized BLAS library for a specific computer
SLIDE 13

Improving performance through blocking

blocking is used to improve the performance of matrix/vector and matrix/matrix multiplications, Cholesky factorizations, etc.

  A + XY^T:

  [ A_{11}  A_{12} ]   [ X_{11} ]
  [ A_{21}  A_{22} ] + [ X_{21} ] [ Y_{11}^T  Y_{21}^T ]

  A_{11} ← A_{11} + X_{11} Y_{11}^T,   A_{12} ← A_{12} + X_{11} Y_{21}^T
  A_{21} ← A_{21} + X_{21} Y_{11}^T,   A_{22} ← A_{22} + X_{21} Y_{21}^T

  • optimal block size, and order of computations, depends on details of processor architecture, cache, memory
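
a minimal sketch of the idea in C (column-major storage; the block size NB is illustrative, since real libraries tune it to the machine):

    #include <stddef.h>

    #define NB 64   /* illustrative block size */

    /* blocked A = A + X*Y^T: each block of A is updated with a working
       set small enough to stay in cache */
    void blocked_update(int m, int n, int k, double *A,
                        const double *X, const double *Y) {
        for (int jb = 0; jb < n; jb += NB)
            for (int ib = 0; ib < m; ib += NB) {
                int ie = (ib + NB < m) ? ib + NB : m;
                int je = (jb + NB < n) ? jb + NB : n;
                for (int j = jb; j < je; ++j)
                    for (int p = 0; p < k; ++p)
                        for (int i = ib; i < ie; ++i)
                            A[i + (size_t)j * m] +=
                                X[i + (size_t)p * m] * Y[j + (size_t)p * n];
            }
    }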

SLIDE 14

Linear Algebra PACKage (LAPACK)

LAPACK contains subroutines for solving linear systems and performing common matrix decompositions and factorizations

  • first release: February 1992; latest version (3.0): May 2000
  • supersedes predecessors EISPACK and LINPACK
  • supports same data types (single/double precision, real/complex) and matrix structure types (symmetric, banded, . . . ) as BLAS
  • uses BLAS for internal computations
  • routines divided into three categories: auxiliary routines, computational routines, and driver routines

SLIDE 15

LAPACK computational routines

computational routines perform single, specific tasks

  • factorizations: LU, LL^T/LL^H, LDL^T/LDL^H, QR, LQ, QRZ, generalized QR and RQ
  • symmetric/Hermitian and nonsymmetric eigenvalue decompositions
  • singular value decompositions
  • generalized eigenvalue and singular value decompositions
SLIDE 16

LAPACK driver routines

driver routines call a sequence of computational routines to solve standard linear algebra problems, such as

  • linear equations: AX = B
  • linear least squares: minimize_x ‖b − Ax‖_2 (see the sketch below)
  • linear least-norm: minimize_y ‖y‖_2 subject to d = By
  • generalized linear least squares problems:
      minimize_x ‖c − Ax‖_2 subject to Bx = d
      minimize_y ‖y‖_2 subject to d = Ax + By
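
a minimal sketch of a driver call, solving a small least squares problem through LAPACKE, the C interface to LAPACK (assuming liblapacke is available):

    #include <stdio.h>
    #include <lapacke.h>

    int main(void) {
        /* minimize ||b - Ax||_2 with A 4-by-2, column-major */
        double A[8] = { 1, 1, 1, 1,     /* first column of A  */
                        1, 2, 3, 4 };   /* second column of A */
        double b[4] = { 6, 5, 7, 10 };
        /* dgels overwrites b; on exit its first n entries hold x */
        lapack_int info = LAPACKE_dgels(LAPACK_COL_MAJOR, 'N', 4, 2, 1,
                                        A, 4, b, 4);
        if (info != 0) return 1;
        printf("x = (%g, %g)\n", b[0], b[1]);   /* expect (3.5, 1.4) */
        return 0;
    }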

SLIDE 17

LAPACK example

solve the KKT system

  [ H  A^T ] [ x ]   [ a ]
  [ A   0  ] [ y ] = [ b ]

  x ∈ R^n, y ∈ R^m, H = H^T ≻ 0, m < n

option 1: driver routine dsysv uses computational routine dsytrf to compute the permuted LDL^T factorization

  [ H  A^T ]
  [ A   0  ] = PLDL^T P^T

and performs the remaining computations to compute the solution

  [ x ]                           [ a ]
  [ y ] = PL^{-T}D^{-1}L^{-1}P^T  [ b ]
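
a minimal sketch of option 1 through LAPACKE (an assumption; one could equally call the Fortran routine dsysv directly), on a tiny instance with H = 2I, A = [1 1]:

    #include <stdio.h>
    #include <lapacke.h>

    int main(void) {
        /* KKT matrix [H A^T; A 0] with n = 2, m = 1, column-major 3x3 */
        double K[9]   = { 2, 0, 1,
                          0, 2, 1,
                          1, 1, 0 };
        double rhs[3] = { 1, 1, 1 };   /* [a; b] */
        lapack_int ipiv[3];
        /* dsysv factors K via dsytrf and solves in one call */
        lapack_int info = LAPACKE_dsysv(LAPACK_COL_MAJOR, 'L', 3, 1,
                                        K, 3, ipiv, rhs, 3);
        if (info != 0) return 1;
        /* expect x = (0.5, 0.5), y = 0 */
        printf("x = (%g, %g), y = %g\n", rhs[0], rhs[1], rhs[2]);
        return 0;
    }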

SLIDE 18
option 2: block elimination

  y = (AH^{-1}A^T)^{-1}(AH^{-1}a − b),    x = H^{-1}a − H^{-1}A^T y

  • first we solve the system H[Z w] = [A^T a] using driver routine dspsv
  • then we construct and solve (AZ)y = Aw − b using dspsv again
  • finally, x = w − Zy

using this approach we could exploit structure in H, e.g., banded (a dense sketch follows)
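
a dense sketch of option 2 (the slide uses the packed routine dspsv; this sketch assumes H ≻ 0 and uses the unpacked Cholesky driver dposv plus BLAS, with an illustrative function name; H is overwritten by its factor):

    #include <stddef.h>
    #include <cblas.h>
    #include <lapacke.h>

    /* solve [H A^T; A 0][x; y] = [a; b] by block elimination;
       H is n-by-n, A is m-by-n, all column-major; Zw (n-by-(m+1)) and
       S (m-by-m) are caller-provided workspace */
    int kkt_block_elim(int n, int m, double *H, const double *A,
                       const double *a, const double *b,
                       double *Zw, double *S, double *x, double *y) {
        /* step 1: solve H [Z w] = [A^T a] */
        for (int j = 0; j < m; ++j)
            for (int i = 0; i < n; ++i)
                Zw[i + (size_t)j * n] = A[j + (size_t)i * m];
        for (int i = 0; i < n; ++i)
            Zw[i + (size_t)m * n] = a[i];
        if (LAPACKE_dposv(LAPACK_COL_MAJOR, 'L', n, m + 1, H, n, Zw, n))
            return 1;
        /* step 2: form S = AZ and y = Aw - b, then solve (AZ) y = Aw - b */
        cblas_dgemm(CblasColMajor, CblasNoTrans, CblasNoTrans,
                    m, m, n, 1.0, A, m, Zw, n, 0.0, S, m);
        for (int i = 0; i < m; ++i)
            y[i] = -b[i];
        cblas_dgemv(CblasColMajor, CblasNoTrans, m, n,
                    1.0, A, m, Zw + (size_t)m * n, 1, 1.0, y, 1);
        if (LAPACKE_dposv(LAPACK_COL_MAJOR, 'L', m, 1, S, m, y, m))
            return 1;
        /* step 3: x = w - Z y */
        for (int i = 0; i < n; ++i)
            x[i] = Zw[i + (size_t)m * n];
        cblas_dgemv(CblasColMajor, CblasNoTrans, n, m,
                    -1.0, Zw, n, y, 1, 1.0, x, 1);
        return 0;
    }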

SLIDE 19

What about other languages?

BLAS and LAPACK routines can be called from C, C++, Java, Python, . . .

an alternative is to use a “native” library, such as

  • C++: Boost uBlas, Matrix Template Library
  • Python: NumPy/SciPy, CVXOPT
  • Java: JAMA
SLIDE 20

Sparse matrices

[spy diagram of a 40×40 sparse matrix, nz = 505]

  • A ∈ R^{m×n} is sparse if it has “enough zeros that it pays to take advantage of them” (J. Wilkinson)
  • usually this means nnz, the number of elements known to be nonzero, is small: nnz ≪ mn

SLIDE 21

Sparse matrices

sparse matrices can save memory and time

  • storing A ∈ R^{m×n} using double precision numbers
    – dense: 8mn bytes
    – sparse: ≈ 16·nnz bytes or less, depending on storage format
  • operation y ← y + Ax:
    – dense: mn flops
    – sparse: nnz flops
  • operation x ← T^{-1}x, T ∈ R^{n×n} triangular, nonsingular:
    – dense: n^2/2 flops
    – sparse: nnz flops

SLIDE 22

Representing sparse matrices

  • several methods used
  • simplest (but typically not used) is to store the data as a list of (i, j, A_{ij}) triples
  • column compressed format: an array of pairs (A_{ij}, i), and an array of pointers into this array that indicate the start of a new column (see the sketch below)
  • for high-end work, exotic data structures are used
  • sadly, no universal standard (yet)
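
a minimal sketch of a column compressed representation in C, together with the y ← y + Ax operation from the previous slide (one multiply-add per stored nonzero); the struct and field names are illustrative, not from any particular library:

    typedef struct {
        int m, n;       /* dimensions */
        int nnz;        /* number of stored nonzeros */
        int *colptr;    /* length n+1: column j occupies entries
                           colptr[j] .. colptr[j+1]-1 */
        int *rowind;    /* length nnz: row index of each entry */
        double *val;    /* length nnz: value of each entry */
    } csc_matrix;

    /* y = y + A*x: nnz multiply-adds */
    void csc_gaxpy(const csc_matrix *A, const double *x, double *y) {
        for (int j = 0; j < A->n; ++j)
            for (int p = A->colptr[j]; p < A->colptr[j + 1]; ++p)
                y[A->rowind[p]] += A->val[p] * x[j];
    }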
SLIDE 23

Sparse BLAS?

sadly there is not (yet) a standard sparse matrix BLAS library

  • the “official” sparse BLAS:
    http://www.netlib.org/blas/blast-forum
    http://math.nist.gov/spblas
  • C++: Boost uBlas, Matrix Template Library, SparseLib++
  • Python: SciPy, PySparse, CVXOPT
SLIDE 24

Sparse factorizations

libraries for factoring/solving systems with sparse matrices

  • most comprehensive: SuiteSparse (Tim Davis) http://www.cise.ufl.edu/research/sparse/SuiteSparse (see the sketch below)
  • others include SuperLU, TAUCS, SPOOLES
  • typically include
    – A = PLL^T P^T Cholesky
    – A = PLDL^T P^T for symmetric indefinite systems
    – A = P_1 LU P_2^T for general (nonsymmetric) matrices
    where P, P_1, P_2 are permutations or orderings
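
a minimal sketch of solving Ax = b by sparse Cholesky with SuiteSparse’s CHOLMOD (a sketch from the documented API, error handling omitted; A is read in Matrix Market form from stdin and assumed symmetric positive definite):

    #include <stdio.h>
    #include <cholmod.h>

    int main(void) {
        cholmod_common c;
        cholmod_start(&c);
        cholmod_sparse *A   = cholmod_read_sparse(stdin, &c);
        cholmod_dense  *rhs = cholmod_ones(A->nrow, 1, A->xtype, &c);
        cholmod_factor *L   = cholmod_analyze(A, &c);  /* ordering + symbolic */
        cholmod_factorize(A, L, &c);                   /* numeric A = P L L^T P^T */
        cholmod_dense  *x   = cholmod_solve(CHOLMOD_A, L, rhs, &c);
        cholmod_free_dense(&x, &c);
        cholmod_free_factor(&L, &c);
        cholmod_free_dense(&rhs, &c);
        cholmod_free_sparse(&A, &c);
        cholmod_finish(&c);
        return 0;
    }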

SLIDE 25

Sparse orderings

sparse orderings can have a dramatic effect on the sparsity of a factorization

[three 50×50 spy diagrams; see descriptions below]

  • left: spy diagram of original NW arrow matrix (nz = 148)
  • center: spy diagram of Cholesky factor with no permutation (P = I) (nz = 1275)
  • right: spy diagram of Cholesky factor with the best permutation (permute 1 → n) (nz = 99)

SLIDE 26

Sparse orderings

  • general problem of choosing the ordering that produces the sparsest factorization is hard
  • but, several simple heuristics are very effective
  • more exotic ordering methods, e.g., nested dissection, can work very well
SLIDE 27

Symbolic factorization

  • for Cholesky factorization, the ordering can be chosen based only on the sparsity pattern of A, and not its numerical values
  • factorization can be divided into two stages: symbolic factorization and numerical factorization
    – when solving multiple linear systems with identical sparsity patterns, the symbolic factorization can be computed just once (see the sketch below)
    – more effort can go into selecting an ordering, since it will be amortized across multiple numerical factorizations
  • ordering for LDL^T factorization usually has to be done on the fly, i.e., based on the data
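
a sketch of this amortization, reusing the (assumed) CHOLMOD calls from the factorization slide: analyze once, then factorize and solve repeatedly for matrices sharing one sparsity pattern:

    #include <cholmod.h>

    /* solve T systems whose matrices share one sparsity pattern */
    void solve_sequence(cholmod_sparse *A, cholmod_dense *b, int T,
                        cholmod_common *c) {
        cholmod_factor *L = cholmod_analyze(A, c);  /* symbolic: once */
        for (int t = 0; t < T; ++t) {
            /* ... update the numerical values in A (same pattern) ... */
            cholmod_factorize(A, L, c);             /* numeric: each time */
            cholmod_dense *x = cholmod_solve(CHOLMOD_A, L, b, c);
            cholmod_free_dense(&x, c);
        }
        cholmod_free_factor(&L, c);
    }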

SLIDE 28

Other methods

we list some other areas in numerical linear algebra that have received significant attention:

  • iterative methods for sparse and structured linear systems
  • parallel and distributed methods (MPI)
  • fast linear operators: fast Fourier transforms (FFTs), convolutions, state-space linear system simulations

there is considerable existing research, and accompanying public domain (or freely licensed) code
