Performance implications of nonuniform device topologies in scalable heterogeneous architectures

Jeremy S. Meredith, Philip C. Roth, Kyle L. Spafford, Jeffrey S. Vetter

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

This article considers trends in heterogeneous system design, particularly for GPUs. Using the Keeneland Initial Delivery System, the authors examine the performance implications of increased parallelism and specialized hardware on parallel scientific applications. They examine how nonuniform data-transfer performance across the node-level topology can impact performance. Finally, they help users of GPU-based systems avoid performance problems related to this nonuniformity.

Original languageEnglish
Article number5989784
Pages (from-to)66-75
Number of pages10
JournalIEEE Micro
Volume31
Issue number5
DOIs
StatePublished - Sep 2011

Funding

The submitted manuscript has been auth ored by Oak Ridge National Laboratory, which is managed by UT-Battelle under contract DE-AC05-00OR22725 to the US government. Accordingly, the US government retains a nonexclusive, royalty-free license to publish or reproduce the published form of this contribution, or allow others to do so, for US government purposes. This research was sponsored in part by the Office of Advanced Scientific Computing Research in the US Department of Energy, the NSF award OCI-0910735, and DARPA contract HR0011-10-9-0008. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the US government.

Keywords

  • GPU
  • data-transfer performance
  • heterogeneous GPUs
  • nonuniformity

Fingerprint

Dive into the research topics of 'Performance implications of nonuniform device topologies in scalable heterogeneous architectures'. Together they form a unique fingerprint.

Cite this