Balancing performance and portability with containers in HPC: An OpenSHMEM example

Thomas Naughton, Lawrence Sorrillo, Adam Simpson, Neena Imam

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

There is a growing interest in using Linux containers to streamline software development and application deployment. A container enables the user to bundle the salient elements of the software stack from an application’s perspective. In this paper, we discuss initial experiences in using the Open MPI implementation of OpenSHMEM with containers on HPC resources. We provide a brief overview of two container runtimes, Docker & Singularity, highlighting elements that are of interest for HPC users. The Docker platform offers a rich set of services that are widely used in enterprise environments, whereas Singularity is an emerging container runtime that is specifically written for use on HPC systems. We describe our procedure for container assembly and deployment that strives to maintain the portability of the container-based application. We show performance results for the Graph500 benchmark running along the typical continuum of development testbed up to full production supercomputer (ORNL’s Titan). The results show consistent performance between the native and Singularity (container) tests. The results also showed an unexplained drop in performance when using the Cray Gemini network with Open MPI’s OpenSHMEM, which was unrelated to the container usage.

Original languageEnglish
Title of host publicationOpenSHMEM and Related Technologies
Subtitle of host publicationBig Compute and Big Data Convergence - 4th Workshop, OpenSHMEM 2017, Revised Selected Papers
EditorsManjunath Gorentla Venkata, Neena Imam, Swaroop Pophale
PublisherSpringer Verlag
Pages130-142
Number of pages13
ISBN (Print)9783319738130
DOIs
StatePublished - 2018
Event4th Workshop on OpenSHMEM and Related Technologies: Big Compute and Big Data Convergence, OpenSHMEM 2017 - Annapolis, United States
Duration: Aug 7 2017Aug 9 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10679 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference4th Workshop on OpenSHMEM and Related Technologies: Big Compute and Big Data Convergence, OpenSHMEM 2017
Country/TerritoryUnited States
CityAnnapolis
Period08/7/1708/9/17

Funding

This work was sponsored by the U.S. Department of Energy’s Office of Advanced Scientific Computing Research. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a nonexclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http:// energy.gov/downloads/doe-public-access-plan). This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725.

FundersFunder number
DOE Office of Science
U.S. Department of EnergyDE-AC05-00OR22725
Advanced Scientific Computing Research

    Fingerprint

    Dive into the research topics of 'Balancing performance and portability with containers in HPC: An OpenSHMEM example'. Together they form a unique fingerprint.

    Cite this