Variable-Size Batched Condition Number Calculation on GPUs

Hartwig Anzt, Jack Dongarra, Goran Flegar, Thomas Grutzmacher

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

We present a kernel that is designed to quickly compute the condition number of a large collection of tiny matrices on a graphics processing unit (GPU). The matrices can differ in size and the process integrates the use of pivoting to ensure a numerically-stable matrix inversion. The performance assessment reveals that, in double precision arithmetic, the new GPU kernel achieves up to 550 GFLOPs (billions of floating-point operations per second) and 800 GFLOPs on NVIDIA's P100 and V100 GPUs, respectively. The results also demonstrate a considerable speed-up with respect to a workflow that computes the condition number via launching a set of four batched kernels. In addition, we present a variable-size batched kernel for the computation of the matrix infinity norm. We show that this memory-bound kernel achieves up to 90% of the sustainable peak bandwidth.

Original languageEnglish
Title of host publicationProceedings - 2018 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages132-139
Number of pages8
ISBN (Electronic)9781538677698
DOIs
StatePublished - Jul 2 2018
Event30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018 - Lyon, France
Duration: Sep 24 2018Sep 27 2018

Publication series

NameProceedings - 2018 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018

Conference

Conference30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018
Country/TerritoryFrance
CityLyon
Period09/24/1809/27/18

Funding

ACKNOWLEDGMENTS This work was partly funded by the U.S. Department of Energy Office of Science, Office of Advanced Scientific Computing Research, Applied Mathematics program under Award Number DE-SC-0010042. H. Anzt was supported by the “Impuls und Vernetzungsfond” of the Helmholtz Association under grant VH-NG-1241. G. Flegar was supported by projects TIN2014-53495-R of the Spanish Ministerio de Economía y Competitividad and the EU H2020 project 732631 OPRECOMP.

FundersFunder number
EU H2020732631
U.S. Department of Energy Office of Science
Advanced Scientific Computing ResearchDE-SC-0010042
Ministerio de Economía y Competitividad
Helmholtz AssociationTIN2014-53495-R, VH-NG-1241

    Fingerprint

    Dive into the research topics of 'Variable-Size Batched Condition Number Calculation on GPUs'. Together they form a unique fingerprint.

    Cite this