Key concepts for parallel out-of-core LU factorization

J. J. Dongarra, S. Hammarling, D. W. Walker

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

This paper considers key ideas in the design of out-of-core dense LU factorization routines. A left-looking variant of the LU factorization algorithm is shown to require less I/O to disk than the right-looking variant, and is used to develop a parallel, out-of-core implementation. This implementation makes use of a small library of parallel I/O routines, together with ScaLAPACK and PBLAS routines. Results for runs on an Intel Paragon are presented and interpreted using a simple performance model.

Original languageEnglish
Pages (from-to)13-31
Number of pages19
JournalComputers and Mathematics with Applications
Volume35
Issue number7
DOIs
StatePublished - Apr 1998
Externally publishedYes

Funding

This paper presents a prototype for the design of a parallel software library for the out-of core solution of dense linear systems, and presents timing results for runs on an Intel Paragon. Other The authors acknowledge the use of the Intel Paragon XP/S 5 computer, located in the Oak Ridge National Laboratory Center for Computational Sciences (CCS), funded by the Department of Energy's Mathematical, Information, and Computational Sciences (MICS) Division of the Office of Computational and Technology Research (OCTR).

FundersFunder number
OCTR
Office of Computational and Technology Research
U.S. Department of Energy
Oak Ridge National Laboratory

    Keywords

    • Dense matrices
    • Out-of-core LU factorization
    • Parallel computing
    • Performance

    Fingerprint

    Dive into the research topics of 'Key concepts for parallel out-of-core LU factorization'. Together they form a unique fingerprint.

    Cite this