Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations

Hartwig Anzt, Yuhsiang M. Tsai, Ahmad Abdelfattah, Terry Cojean, Jack Dongarra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

GPU accelerators have become an Important backbone for scientific high performance-computing, and the performance advances obtained from adopting new GPU hardware are significant. In this paper we take a first look at NVIDIA's newest server-line GPU, the A100 architecture, part of the Ampere generation. Specifically, we assess its performance for sparse and batch computations, as these routines are relied upon in many scientific applications, and compare to the performance achieved on NVIDIA's previous server-line GPU.

Original languageEnglish
Title of host publicationProceedings of PMBS 2020
Subtitle of host publicationPerformance Modeling, Benchmarking and Simulation of High Performance Computer Systems, Held in conjunction with SC 2020: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages26-38
Number of pages13
ISBN (Electronic)9781665422659
DOIs
StatePublished - Nov 2020
Event2020 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS 2020 - Virtual, Atlanta, United States
Duration: Nov 12 2020 → …

Publication series

NameProceedings of PMBS 2020: Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, Held in conjunction with SC 2020: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference2020 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS 2020
Country/TerritoryUnited States
CityVirtual, Atlanta
Period11/12/20 → …

Funding

ACKNOWLEDGMENTS This work was supported by the “Impuls und Vernetzungs-fond” of the Helmholtz Association under grant VH-NG-1241 and by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. Department of Energy Office of Science and the National Nuclear Security Administration. The authors would like to thank the Steinbuch Centre for Computing (SCC) of the Karlsruhe Institute of Technology for providing access to an NVIDIA A100 GPU.

FundersFunder number
U.S. Department of Energy
National Nuclear Security Administration
Helmholtz Association17-SC-20-SC, VH-NG-1241

    Keywords

    • Batched Linear Algebra
    • NVIDIA A100 GPU
    • Sparse Linear Algebra
    • Sparse Matrix Vector Product

    Fingerprint

    Dive into the research topics of 'Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations'. Together they form a unique fingerprint.

    Cite this