TY - JOUR
T1 - Design and implementation of the parallel out-of-core ScaLAPACK LU, QR, and Cholesky factorization routines
AU - D'Azevedo, Eduardo
AU - Dongarra, Jack
PY - 2000/12/25
Y1 - 2000/12/25
N2 - This paper describes the design and implementation of three core factorization routines - LU, QR, and Cholesky - included in the out-of-core extension of ScaLAPACK. These routines allow the factorization and solution of a dense system that is too large to fit entirely in physical memory. The full matrix is stored on disk and the factorization routines transfer sub-matrice panels into memory. The 'left-looking' column-oriented variant of the factorization algorithm is implemented to reduce the disk I/O traffic. The routines are implemented using a portable I/O interface and utilize high-performance ScaLAPACK factorization routines as in-core computational kernels. We present the details of the implementation for the out-of-core ScaLAPACK factorization routines, as well as performance and scalability results on a Beowulf Linux cluster.
AB - This paper describes the design and implementation of three core factorization routines - LU, QR, and Cholesky - included in the out-of-core extension of ScaLAPACK. These routines allow the factorization and solution of a dense system that is too large to fit entirely in physical memory. The full matrix is stored on disk and the factorization routines transfer sub-matrice panels into memory. The 'left-looking' column-oriented variant of the factorization algorithm is implemented to reduce the disk I/O traffic. The routines are implemented using a portable I/O interface and utilize high-performance ScaLAPACK factorization routines as in-core computational kernels. We present the details of the implementation for the out-of-core ScaLAPACK factorization routines, as well as performance and scalability results on a Beowulf Linux cluster.
UR - http://www.scopus.com/inward/record.url?scp=0034487070&partnerID=8YFLogxK
U2 - 10.1002/1096-9128(20001225)12:15<1481::AID-CPE540>3.0.CO;2-V
DO - 10.1002/1096-9128(20001225)12:15<1481::AID-CPE540>3.0.CO;2-V
M3 - Article
AN - SCOPUS:0034487070
SN - 1040-3108
VL - 12
SP - 1481
EP - 1493
JO - Concurrency Practice and Experience
JF - Concurrency Practice and Experience
IS - 15
ER -