High Performance Matrix Inversion Based on LU Factorization for Multicore Architectures

Nov 17, 2023


Jack Dongarra, Mathieu Faverge, Hatem Ltaief, Piotr Luszczek
High Performance Matrix Inversion Based on LU Factorization for Multicore Architectures
Presented by Piotr Luszczek
Preliminaries
Problem Statement: given $A \in \mathbb{R}^{n \times n}$, factor $PA = LU$, compute $U \to U^{-1}$ and $L \to L^{-1}$, and obtain $A^{-1} \in \mathbb{R}^{n \times n}$.
To Keep in Mind... "In the vast majority of practical computational problems, it is unnecessary and inadvisable to actually compute $A^{-1}$." (Forsythe, Malcolm, and Moler)
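The problem statement lists the factorization and the two triangular inversions but not the identity that joins them. A short derivation, using only the symbols already on the slide and assuming $P$ is a permutation matrix (so $P^{-1} = P^{T}$):

```latex
PA = LU
\;\Longrightarrow\; A = P^{T} L U
\;\Longrightarrow\; A^{-1} = (P^{T} L U)^{-1} = U^{-1} L^{-1} P
```

Right-multiplying $U^{-1} L^{-1}$ by $P$ permutes its columns, which is why the last stage of the inversion is a set of column interchanges.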
Data Layouts for Matrix Elements: column-major (LAPACK and derivatives) and tile (PLASMA)
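The difference between the two layouts can be sketched as an index mapping. The following is a minimal illustration, not PLASMA's conversion routine: the function names are hypothetical, the matrix dimensions are assumed to be multiples of the tile size `nb`, and tiles are assumed to be ordered column by column with each tile stored column-major.

```python
def column_major_index(i, j, m):
    """Offset of element (i, j) in an m-row matrix stored column by column."""
    return i + j * m


def tile_index(i, j, m, nb):
    """Offset of element (i, j) when the matrix is stored as nb x nb tiles.

    Assumptions (not from the slides): m is a multiple of nb, tiles are
    ordered column by column, and each tile is itself column-major.
    """
    tiles_per_col = m // nb
    ti, tj = i // nb, j // nb      # which tile holds the element
    li, lj = i % nb, j % nb        # position inside that tile
    tile_number = ti + tj * tiles_per_col
    return tile_number * nb * nb + li + lj * nb


def to_tile_layout(col_major, m, n, nb):
    """Copy a column-major m x n matrix into tile layout."""
    tiled = [None] * (m * n)
    for j in range(n):
        for i in range(m):
            src = column_major_index(i, j, m)
            tiled[tile_index(i, j, m, nb)] = col_major[src]
    return tiled


# A 4 x 4 matrix whose entries equal their column-major offsets.
example = to_tile_layout(list(range(16)), 4, 4, 2)
```

In tile layout every `nb x nb` tile occupies one contiguous stretch of memory, so a task that works on a single tile touches a single contiguous block instead of `nb` strided column segments.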
Tasks and DAGs
Block LU Inversion
● For each panel: LU factorization
    DGETF2( )
    DLASWP( )
    DLASWP( )
    DTRSM( )
    DGEMM( )
● For each panel: invert U
    DTRMM( )
    DTRSM( )
    DTRTI2( )
● For each panel: invert L
    DLACPY( )
    DLASET( )
    DGEMM( )
    DTRSM( )
● DLASWP( ) column interchanges

Tile LU Inversion
● For each diagonal tile: LU factorization
    - DGETRFR( ) parallel recursive LU
    for each tail tile panel
      - DLASWP( )
    for each tail tile
      - DGEMM( )
    for each left tile panel
      - DLASWP( )
● For each diagonal tile: invert U
    for each tile in panel
      - DTRSM( )
    for each tail tile
      - DGEMM( )
    for each left panel tile
      - DTRSM( )
    - DTRTRI( )
● For each left tile: invert L
    - DLACPY( )
    - DLASET( )
    ...
Queuing Functions with QUARK
QUARK_Insert_Task( panel_LU_task,
    M, matrix_1, INPUT,
    N, matrix_2, INOUT,
    1, result, OUTPUT,
    K, buffer, SCRATCH,
    0);
DAGs of Tasks, Each Stage Separately: 1 – LU factorization; 2 – computation of $L^{-1}$; 3 – computation of $U^{-1}$; 4 – column swapping
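The stages in the block LU inversion listing (LU factorization, invert U, invert L, column interchanges) can be sketched on a small dense matrix. This is an unblocked, sequential illustration in plain Python under stated assumptions: the function names are hypothetical, no LAPACK or PLASMA routine is called, and the explicit product of the two triangular inverses is a simplification rather than the in-place scheme used by the library kernels.

```python
def lu_factor(a):
    """Return (lu, piv) with P A = L U; unit-lower L and U share `lu`."""
    n = len(a)
    lu = [row[:] for row in a]
    piv = list(range(n))               # row k of P A is row piv[k] of A
    for k in range(n):
        p = max(range(k, n), key=lambda r: abs(lu[r][k]))
        if lu[p][k] == 0.0:
            raise ValueError("matrix is singular")
        lu[k], lu[p] = lu[p], lu[k]    # row interchange (partial pivoting)
        piv[k], piv[p] = piv[p], piv[k]
        for i in range(k + 1, n):
            lu[i][k] /= lu[k][k]
            for j in range(k + 1, n):
                lu[i][j] -= lu[i][k] * lu[k][j]
    return lu, piv


def invert_upper(lu):
    """Invert the upper triangle U of `lu` by back substitution."""
    n = len(lu)
    x = [[0.0] * n for _ in range(n)]
    for j in range(n):
        x[j][j] = 1.0 / lu[j][j]
        for i in range(j - 1, -1, -1):
            s = sum(lu[i][k] * x[k][j] for k in range(i + 1, j + 1))
            x[i][j] = -s / lu[i][i]
    return x


def invert_unit_lower(lu):
    """Invert the unit lower triangle L of `lu` by forward substitution."""
    n = len(lu)
    x = [[0.0] * n for _ in range(n)]
    for j in range(n):
        x[j][j] = 1.0
        for i in range(j + 1, n):
            x[i][j] = -sum(lu[i][k] * x[k][j] for k in range(j, i))
    return x


def lu_inverse(a):
    """Compute the inverse of `a` as U^{-1} L^{-1} P."""
    n = len(a)
    lu, piv = lu_factor(a)
    u_inv = invert_upper(lu)
    l_inv = invert_unit_lower(lu)
    prod = [[sum(u_inv[i][k] * l_inv[k][j] for k in range(n))
             for j in range(n)] for i in range(n)]
    inv = [[0.0] * n for _ in range(n)]
    for k in range(n):                 # column interchanges: apply P on the right
        for i in range(n):
            inv[i][piv[k]] = prod[i][k]
    return inv
```

Calling `lu_inverse([[2.0, 1.0], [4.0, 3.0]])` exercises all four stages, including one row interchange during factorization and the matching column interchange at the end.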
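The access flags in the call above (INPUT, INOUT, OUTPUT, SCRATCH) are what allow a runtime to derive a task DAG from a sequential stream of task insertions. The sketch below is a toy dependency tracker written for illustration only: the `TaskGraph` class, its method, and the tile names are hypothetical and do not reproduce the QUARK API or its implementation.

```python
INPUT, OUTPUT, INOUT, SCRATCH = "INPUT", "OUTPUT", "INOUT", "SCRATCH"


class TaskGraph:
    """Toy dependency tracker; hypothetical, not the QUARK implementation."""

    def __init__(self):
        self.tasks = []          # task names in insertion order
        self.edges = set()       # (earlier_task_id, later_task_id)
        self.last_writer = {}    # data name -> id of the last writing task
        self.readers = {}        # data name -> ids reading since last write

    def _add_edge(self, src, dst):
        if src != dst:
            self.edges.add((src, dst))

    def insert_task(self, name, *args):
        """Register a task; args are (data_name, access_flag) pairs."""
        tid = len(self.tasks)
        self.tasks.append(name)
        for data, flag in args:
            if flag == SCRATCH:
                continue         # private workspace creates no dependency
            if flag in (INPUT, INOUT) and data in self.last_writer:
                self._add_edge(self.last_writer[data], tid)      # read after write
            if flag in (OUTPUT, INOUT):
                for reader in self.readers.get(data, []):
                    self._add_edge(reader, tid)                  # write after read
                if data in self.last_writer:
                    self._add_edge(self.last_writer[data], tid)  # write after write
                self.last_writer[data] = tid
                self.readers[data] = []
            else:
                self.readers.setdefault(data, []).append(tid)
        return tid


# One step of a tile LU on a 2 x 2 grid of tiles (tile names are illustrative).
g = TaskGraph()
t0 = g.insert_task("getrf", ("A00", INOUT))
t1 = g.insert_task("trsm", ("A00", INPUT), ("A01", INOUT))
t2 = g.insert_task("trsm", ("A00", INPUT), ("A10", INOUT))
t3 = g.insert_task("gemm", ("A10", INPUT), ("A01", INPUT), ("A11", INOUT))
t4 = g.insert_task("getrf", ("A11", INOUT))
```

The two `trsm` tasks depend only on the first `getrf`, so they may run concurrently; the `gemm` waits for both, and the second `getrf` waits for the `gemm`.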
DAGs of Tasks, All Stages Overlapped
Execution Traces: No Overlap of Stages vs. Overlap of Stages
The Case for Nested Parallelism
Panel Factorization as the Sequential Bottleneck (figure; task labels: xGETRF-REC, Swap + xTRSM, xGEMM)
Panel Factorization is On Critical Path of DAG
Parallel Panel Factorization: Data Partitioning
Parallel Panel Factorization: Algorithm
Quick Performance Experiment
Results
Performance on AMD MagnyCours, 4x12=48 cores
LU Inversion's Power Profile: LAPACK
LU Inversion's Power Profile: MKL
LU Inversion's Power Profile: PLASMA
This work was sponsored by NSF, DOE, and Microsoft.
