Using hybrid model OpenSHMEM+CUDA to implement the SHOC benchmark suite

Megan Grodowitz, Eduardo D’Azevedo, Sarah Powers, Neena Imam

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This work describes the process of porting the Scalable HeterOgeneous Computing (SHOC) benchmark suite from the hybrid MPI+CUDA implementation to OpenSHMEM+CUDA. SHOC includes a wide variety of benchmark kernels used to measure accelerator performance in both single node and cluster configurations. The hybrid model implementation attempts to place all major computation on accelerator devices, and uses MPI to synchronize and aggregate results. In some cases, MPI Groups are used to gradually reduce the number of accelerators used for computation as the problem size drops. Porting this behavior to OpenSHMEM required implementing several synchronizing collective operations, and using SHMEM teams to replace MPI Group functionality. Benchmark results on a Cray XK7 system with one GPU per compute node show that SHMEM performance is equal to MPI performance in these hybrid tasks. These results and porting experience show that using OpenSHMEM for accelerator devices benefits from adding functionality for synchronization and teams, and would further benefit from adding support for communication within accelerator kernels. (Notice: This manuscript has been authored by UT-Battelle, LLC under Contract No. DE- AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). This research used resources of the Center for Computational Sciences at Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. De-AC05- 00OR22725.)

Original languageEnglish
Title of host publicationOpenSHMEM and Related Technologies
Subtitle of host publicationEnhancing OpenSHMEM for Hybrid Environments - 3rd Workshop, OpenSHMEM 2016, Revised Selected Papers
EditorsManjunath Gorentla Venkata, Neena Imam, Swaroop Pophale, Tiffany M. Mintz
PublisherSpringer Verlag
Pages204-216
Number of pages13
ISBN (Print)9783319509945
DOIs
StatePublished - 2016
Event3rd workshop on OpenSHMEM and Related Technologies, OpenSHMEM 2016 - Baltimore, United States
Duration: Aug 2 2016Aug 4 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10007 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd workshop on OpenSHMEM and Related Technologies, OpenSHMEM 2016
Country/TerritoryUnited States
CityBaltimore
Period08/2/1608/4/16

Keywords

  • CUDA
  • Parallel computing
  • Programming models
  • SHMEM

Fingerprint

Dive into the research topics of 'Using hybrid model OpenSHMEM+CUDA to implement the SHOC benchmark suite'. Together they form a unique fingerprint.

Cite this