Performance implications of nonuniform device topologies in scalable heterogeneous architectures

Jeremy S. Meredith, Philip C. Roth, Kyle L. Spafford, Jeffrey S. Vetter

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

This article considers trends in heterogeneous system design, particularly for GPUs. Using the Keeneland Initial Delivery System, the authors examine the performance implications of increased parallelism and specialized hardware on parallel scientific applications. They examine how nonuniform data-transfer performance across the node-level topology can impact performance. Finally, they help users of GPU-based systems avoid performance problems related to this nonuniformity.

Original languageEnglish
Article number5989784
Pages (from-to)66-75
Number of pages10
JournalIEEE Micro
Volume31
Issue number5
DOIs
StatePublished - Sep 2011

Funding

The submitted manuscript has been auth ored by Oak Ridge National Laboratory, which is managed by UT-Battelle under contract DE-AC05-00OR22725 to the US government. Accordingly, the US government retains a nonexclusive, royalty-free license to publish or reproduce the published form of this contribution, or allow others to do so, for US government purposes. This research was sponsored in part by the Office of Advanced Scientific Computing Research in the US Department of Energy, the NSF award OCI-0910735, and DARPA contract HR0011-10-9-0008. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the US government.

FundersFunder number
US Department of Energy
National Science FoundationOCI-0910735
Defense Advanced Research Projects AgencyHR0011-10-9-0008
Advanced Scientific Computing Research

    Keywords

    • GPU
    • data-transfer performance
    • heterogeneous GPUs
    • nonuniformity

    Fingerprint

    Dive into the research topics of 'Performance implications of nonuniform device topologies in scalable heterogeneous architectures'. Together they form a unique fingerprint.

    Cite this