TY - JOUR
T1 - Fast Cholesky factorization on GPUs for batch and native modes in MAGMA
AU - Abdelfattah, Ahmad
AU - Haidar, Azzam
AU - Tomov, Stanimire
AU - Dongarra, Jack
N1 - Publisher Copyright:
© 2016
PY - 2017/5
Y1 - 2017/5
N2 - This paper presents a GPU-accelerated Cholesky factorization for two different modes of operation. The first one is the batch mode, where many independent factorizations on small matrices can be performed concurrently. This mode supports fixed size and variable size problems, and is found in many scientific applications. The second mode is the native mode, where one factorization is performed on a large matrix without any CPU involvement, which allows the CPU do other useful work. We show that, despite the different workloads, both modes of operation share a common code-base that uses the GPU only. We also show that the developed routines achieve significant speedups against a multicore CPU using the MKL library, and against a GPU implementation by cuSOLVER. This work is part of the MAGMA library.
AB - This paper presents a GPU-accelerated Cholesky factorization for two different modes of operation. The first one is the batch mode, where many independent factorizations on small matrices can be performed concurrently. This mode supports fixed size and variable size problems, and is found in many scientific applications. The second mode is the native mode, where one factorization is performed on a large matrix without any CPU involvement, which allows the CPU do other useful work. We show that, despite the different workloads, both modes of operation share a common code-base that uses the GPU only. We also show that the developed routines achieve significant speedups against a multicore CPU using the MKL library, and against a GPU implementation by cuSOLVER. This work is part of the MAGMA library.
KW - Batched execution
KW - Cholesky factorization
KW - GPU computing
UR - http://www.scopus.com/inward/record.url?scp=85008712417&partnerID=8YFLogxK
U2 - 10.1016/j.jocs.2016.12.009
DO - 10.1016/j.jocs.2016.12.009
M3 - Article
AN - SCOPUS:85008712417
SN - 1877-7503
VL - 20
SP - 85
EP - 93
JO - Journal of Computational Science
JF - Journal of Computational Science
ER -