Newly Released Capabilities in the Distributed-Memory SuperLU Sparse Direct Solver

Xiaoye S. Li, Paul Lin, Yang Liu, Piyush Sao

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

We present the new features available in the recent release of SuperLU_DIST, Version 8.1.1. SuperLU_DIST is a distributed-memory parallel sparse direct solver. The new features include (1) a 3D communication-avoiding algorithm framework that trades off inter-process communication for selective memory duplication, (2) multi-GPU support for both NVIDIA GPUs and AMD GPUs, and (3) mixed-precision routines that perform single-precision LU factorization and double-precision iterative refinement. Apart from the algorithm improvements, we also modernized the software build system to use CMake and Spack package installation tools to simplify the installation procedure. Throughout the article, we describe in detail the pertinent performance-sensitive parameters associated with each new algorithmic feature, show how they are exposed to the users, and give general guidance of how to set these parameters. We illustrate that the solver's performance both in time and memory can be greatly improved after systematic tuning of the parameters, depending on the input sparse matrix and underlying hardware.

Original languageEnglish
Article number10
JournalACM Transactions on Mathematical Software
Volume49
Issue number1
DOIs
StatePublished - Mar 21 2023

Funding

This research was supported in part by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. Department of Energy Office of Science and the National Nuclear Security Administration, and in part by the Scientific Discovery through Advanced Computing (SciDAC) Program under the Office of Science at the U.S. Department of Energy.

Keywords

  • Additional Key Words and PhrasesSparse direct solver
  • GPU
  • communication-avoiding
  • mixed precision

Fingerprint

Dive into the research topics of 'Newly Released Capabilities in the Distributed-Memory SuperLU Sparse Direct Solver'. Together they form a unique fingerprint.

Cite this