Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning

  • Hartwig Anzt
  • , Jack Dongarra
  • , Goran Flegar
  • , Enrique S. Quintana-Ortí
  • , Andrés E. Tomás

Research output: Contribution to journalConference articlepeer-review

4 Scopus citations

Abstract

In this work we present new kernels for the generation and application of block-Jacobi precon-ditioners that accelerate the iterative solution of sparse linear systems on graphics processing units (GPUs). Our approach departs from the conventional LU factorization and decomposes the diagonal blocks of the matrix using the Gauss-Huard method. When enhanced with column pivoting, this method is as stable as LU with partial/row pivoting. Due to extensive use of GPU registers and integration of implicit pivoting, our variable size batched Gauss-Huard implementation outperforms the batched version of LU factorization. In addition, the application kernel combines the conventional two-stage triangular solve procedure, consisting of a backward solve followed by a forward solve, into a single stage that performs both operations simultaneously.

Original languageEnglish
Pages (from-to)1783-1792
Number of pages10
JournalProcedia Computer Science
Volume108
DOIs
StatePublished - 2017
EventInternational Conference on Computational Science ICCS 2017 - Zurich, Switzerland
Duration: Jun 12 2017Jun 14 2017

Funding

This material is supported by the U.S. Department of Energy Office of Science, Office of Ad-vanced Scientific Computing Research, Applied Mathematics program under Award #DE-SC-0010042. The researchers from UJI were supported by project TIN2014-53495-R of MINECO and FEDER.

Keywords

  • Gauss-Huard factorization
  • Gauss-Jordan elimination
  • Sparse linear systems
  • block-Jacobi preconditioner
  • graphics processing units (GPUs)
  • iterative methods
  • linear systems

Fingerprint

Dive into the research topics of 'Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning'. Together they form a unique fingerprint.

Cite this