A Case for Malleable Thread-Level Linear Algebra Libraries: The LU - PowerPoint PPT Presentation

Feb 25, 2024 •93 likes •361 views

A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization with Partial Pivoting Sandra Cataln, Jose R. Herrero, Enrique S. Quintana-Ort, Rafael Rodrguez-Snchez, Robert van de Geijn BLIS Retreat, 19-20th September

A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization with Partial Pivoting Sandra Catalán, Jose R. Herrero, Enrique S. Quintana-Ortí, Rafael Rodríguez-Sánchez, Robert van de Geijn BLIS Retreat, 19-20th September 2016, Austin (Texas)
Motivation Increase number of threads BLAS → TLP Nested TLP + TP LAPACK → TP (runtime) 2
Why malleability Ta 3th Tb 5th . . . . Ta . . . Tb 3
Why malleability Ta 3th Tb 5th Ta 3th Tb 5th . . . . . . . . Ta . Ta . . . . Tb 8th Tb DLA library modification to allow number of threads expansion 3
LU as an example b size is important: - Too small → Low GEMM performance - Too large → Too many panel factorization flops 4
Optimal block size 5
Optimal block size 6
The panel factorization relevance Less than 2% of the flops 17.5% of the time 7
Dealing with the panel factorization Look-ahead: Overlap the factorization of the “next” panel with the update of the “current” trailing submatrix. 8
Look Ahead LU 9
Our setup ● Intel Xeon E5-2603 v3 ● 6 cores at 1.6 Ghz ● BLIS 0.1.8 ● BLIS Loop 4 ( jr ) parallelized ● Extrae 3.3.0 ● Panel factorization via blocked algorithm ● Two block sizes b o and b i ● Inner LU involve small-grained computations and little parallelism 10
Look Ahead LU Performance 11
Look Ahead LU Performance 11
Towards malleability ● P threads in the panel factorization ● R threads in the update ● Panel factorization less expensive than update – P threads will join R team eventually – BLAS does not allow to modify the number of working threads 12
Static re-partitioning ● Workaround: split the update into several GEMM ● Drawbacks: – Lower GEMM throughput (packing and suboptimal blocks) – Decision on which loop to parallelize and the granularity of the partitioning 13
Malleable thread-level BLAS ● Solving static partitioning issues: – Only one GEMM call → no extra data movements – BLIS takes care of the partitioning and granularity 14
How Malleability behaves 15
And the small case... 15
What if panel factorization is more expensive than the update ● If R finish before P → Stop panel factorization – RL LU. Keep a copy of the panel – Use LL LU. Sincronization among threads follows the same idea 16
Look ahead via runtimes ✔ TP execution ✔ Adaptative-depth look-ahead ✗ Re-packing and data movements (many GEMM calls) ✗ Block size fixes the granularity of the tasks ✗ Rarely exploit TP+TLP 17
Experimental results ● LU, LU_LA, LU_MB, LU_OS ● Square matrices from n=500 to n=12,000 ● b o was tested for values from 32 to 512 in steps of 32 ● b i was evaluated for 16 and 32 18
Performance comparison 19
Performance comparison 20
Conclusions ● Malleable implementation of DLA library ● Competitive results (small matrices) ● Pending strategies to be applied (Early termination) 21
THANK YOU

Recommend

13 IN THIS CHAPTER Benefits of Thread Pooling 308 Considerations and Costs of Thread

Thread Pooling CHAPTER 13 IN THIS CHAPTER Benefits of Thread Pooling 308 Considerations and Costs of Thread Pooling 308 A Generic Thread Pool: ThreadPool 309 A Specialized Worker Thread Pool: HttpServer 319 Techniques

556 views • 34 slides

Chapter 1 What is Linear Algebra? Chapter 1 What is Linear Algebra? The study of linear

Chapter 1 What is Linear Algebra? Chapter 1 What is Linear Algebra? The study of linear functions. The word linear means straight or flat . y = 0 + 1 x Linear functions involve only addition and scalar multiplication. Chapter 1 Higher

1.2k views • 56 slides

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE core topics important Linear Combinations x 1 2x 2 + x 1 = x 2 =1 Linear Combinations Algebra Linear Combinations

1.58k views • 117 slides

Lecture 14: Dense Linear Algebra David Bindel 18 Oct 2010 Where we are This week: dense

Lecture 14: Dense Linear Algebra David Bindel 18 Oct 2010 Where we are This week: dense linear algebra Next week: sparse linear algebra Numerical linear algebra in a nutshell Basic problems Linear systems: Ax = b Least

539 views • 36 slides

Linear Algebra Linear algebra has become as basic and as applicable as calculus, and

Mathematical Tools for Neural and Cognitive Science Fall semester, 2018 Section 1: Linear Algebra Linear Algebra Linear algebra has become as basic and as applicable as calculus, and fortunately it is easier - Gilbert Strang, Linear

360 views • 10 slides

To thread or not to thread? Why PETSc favors MPI-only Plenary Discussion PETSc User Meeting 2016

To thread or not to thread? Why PETSc favors MPI-only Plenary Discussion PETSc User Meeting 2016 Based on: MS35 - To Thread or Not To Thread April 13, 2016 SIAM PP , Paris The Big Picture The Big Picture

208 views • 9 slides

PV Math Department MCL Vision Credit Options Credit General General/Post- College Honors

PV Math Department MCL Vision Credit Options Credit General General/Post- College Honors Secondary Bound 1 Pre-Algebra Algebra 1 Algebra 1 Algebra 1 2 Algebra 1 Algebra 2/ Algebra 2 H. Algebra 2 3 Algebra Geometry H. Geometry

179 views • 3 slides

CASTER ASSEMBLY SCALE 1 : 1 DRAWN 1/26/2010 swaters A A CHECKED TITLE QA CASTER ASSEMBLY

4 3 2 1 PARTS LIST ITEM QTY PART NUMBER MATERIAL 1 1 TOP PLATE MALLEABLE IRON 2 2 AXLE SUPPORT MALLEABLE IRON 3 1 AXLE SAE 1020 4 2 BUSHING BRONZE 5 1 WHEEL MALLEABLE IRON B B CASTER ASSEMBLY SCALE 1 : 1 DRAWN

434 views • 7 slides

Malleable Proof Systems and Applications Melissa Chase (MSR Redmond) Markulf Kohlweiss (MSR

Malleable Proof Systems and Applications Melissa Chase (MSR Redmond) Markulf Kohlweiss (MSR Cambridge) Anna Lysyanskaya (Brown University) Sarah Meiklejohn (UC San Diego) 1 Non-malleable cryptography Twenty years ago, saw a strong emphasis on

1.6k views • 145 slides

Non-Malleable Codes for Partial Functions with Manipulation Detection Aggelos Kiayias Feng-Hao

Non-Malleable Codes for Partial Functions with Manipulation Detection Aggelos Kiayias Feng-Hao Liu Yiannis Tselekounis Edin. & FAU CRYPTO 2018 Outline Introduction to non-malleable codes Adversarial model, motivation Results,

772 views • 55 slides

Linear algebra explained in four pages Excerpt from the N O BULLSHIT GUIDE TO LINEAR ALGEBRA by

Linear algebra explained in four pages Excerpt from the N O BULLSHIT GUIDE TO LINEAR ALGEBRA by Ivan Savov Abstract This document will review the fundamental ideas of linear algebra. B. Matrix operations We will learn about matrices, matrix

333 views • 4 slides

Matrices Basic Linear Algebra Overview Lecture will cover why matrices and linear algebra

Matrices Basic Linear Algebra Overview Lecture will cover why matrices and linear algebra are so important basic terminology Gauss-Jordan elimination LU factorisation error estimation libraries Basic Linear Algebra 2

564 views • 27 slides

MATRICES AND LINEAR ALGEBRA Linear Algebra Matrix manipulation is the original essence of

MATRICES AND LINEAR ALGEBRA Linear Algebra Matrix manipulation is the original essence of Matlab; hence the name MATrix LABoratory. In this section we will cover the basics of linear algebra, the ways of using Matlab in this context, and

1.08k views • 28 slides

Expressive Linear Algebra in Haskell Henning Thielemann 2019-08-21 Expressive Linear Algebra in

Expressive Linear Algebra in Haskell Expressive Linear Algebra in Haskell Henning Thielemann 2019-08-21 Expressive Linear Algebra in Haskell Motivation 1 Motivation 2 Solution 3 More realistic problem 4 More features 5 Closing Expressive

640 views • 48 slides

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Linear, Linear, Linear CS7616 Pattern Recognition A. Bobick CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive Computing Linear, Linear, Linear CS7616 Pattern Recognition A. Bobick Administrivia

685 views • 64 slides

Chapter 4: Vectors, Matrices, and Linear Algebra Scott Owen & Greg Corrado Linear Algebra is

Chapter 4: Vectors, Matrices, and Linear Algebra Scott Owen & Greg Corrado Linear Algebra is strikingly similar to the algebra you learned in high school, except that in the place of ordinary single numbers, it deals with vectors. Many of

650 views • 11 slides

The Iwahori-Hecke Algebra, the Ramanujan Conjecture, and Expander Graphs Cristina Ballantine

The Iwahori-Hecke Algebra, the Ramanujan Conjecture, and Expander Graphs Cristina Ballantine College of the Holy Cross ICERM April 16, 2013 Graph Theory Graph X = ( V , E ) Graph Theory Graph X = ( V , E ) V = { v 1 , v 2 , . . .

1.17k views • 89 slides

N = 2 superconformal field theory and operator algebras Yasu Kawahigashi University of Tokyo

N = 2 superconformal field theory and operator algebras Yasu Kawahigashi University of Tokyo Paris May 26, 2011 Yasu Kawahigashi (Univ. Tokyo) N = 2 SCFT and OA Paris May 26, 2011 1 / 17 Operator algebraic approach to conformal field theory

205 views • 19 slides

Positivity results for cluster algebras from surfaces Gregg Musiker (MSRI/MIT) (Joint work with

Positivity results for cluster algebras from surfaces Gregg Musiker (MSRI/MIT) (Joint work with Ralf Schiffler (University of Connecticut) and Lauren Williams (University of California, Berkeley)) AMS 2009 Eastern Sectional October 25, 2009

714 views • 59 slides

Constraints in Universal Algebra Ross Willard University of Waterloo, CAN SSAOS 2014 September

Constraints in Universal Algebra Ross Willard University of Waterloo, CAN SSAOS 2014 September 7, 2014 Lecture 1 R. Willard (Waterloo) Constraints in Universal Algebra SSAOS 2014 1 / 23 Outline Lecture 1 : Intersection problems and

677 views • 23 slides

On algebraic description of the Goldman-Turaev Lie bialgebra Yusuke Kuno Tsuda College 7 March

On algebraic description of the Goldman-Turaev Lie bialgebra Yusuke Kuno Tsuda College 7 March 2016 (joint work with Nariya Kawazumi (University of Tokyo)) Contents Introduction 1 Goldman bracket 2 Turaev cobracket 3 Yusuke Kuno (Tsuda

588 views • 27 slides

Algebra of waves A. A. Kutsenko Jacobs University, Bremen, Germany February 25, 2019 1 Some

Algebra of waves A. A. Kutsenko Jacobs University, Bremen, Germany February 25, 2019 1 Some aspects of mathematical theory of waves Algebras of Spectral Inverse integral theory problems operators Traces and Waves Representation

629 views • 49 slides

Formalizing o-minimality Reid Barton University of Pittsburgh January 6, 2020 FoMM / Lean

Formalizing o-minimality Reid Barton University of Pittsburgh January 6, 2020 FoMM / Lean Together The story Johan Commelin and I are interested in formalizing the theory of o-minimal structures . The story Johan Commelin and I are

747 views • 50 slides

$Patterns Occurring during GEMTEX Confrac Expansion P.L.Douillet of Quadratic Numbers$

Patterns Occurring during GEMTEX Confrac Expansion P.L.Douillet of Quadratic Numbers

E.N.S.A.I.T. (Roubaix) Patterns Occurring during GEMTEX Confrac Expansion P.L.Douillet of Quadratic Numbers 8/01/2004 Pierre L. Douillet douillet@ensait.fr key idea : x 0 start from a positive number subtract as much as you can 1

495 views • 18 slides