A Layered Approach for Modular Container Construction and Orchestration in HPC Environments

Quincy Wofford, Patrick G. Bridges, Patrick Widener

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Large-scale, high-throughput computational science faces an accelerating convergence of software and hardware. Software container-based solutions have become common in cloud-based datacenter environments, and are considered promising tools for addressing heterogeneity and portability concerns. However, container solutions reflect a set of assumptions which complicate their adoption by developers and users of scientific workflow applications. Nor are containers a universal solution for deployment in high-performance computing (HPC) environments which have specialized and vertically integrated scheduling and runtime software stacks. In this paper, we present a container design and deployment approach which uses modular layering to ease the deployment of containers into existing HPC environments. This layered approach allows operating system integrations, support for different communication and performance monitoring libraries, and application code to be defined and interchanged in isolation. We describe in this paper the details of our approach, including specifics about container deployment and orchestration for different HPC scheduling systems. We also describe how this layering method can be used to build containers for two separate applications, each deployed on clusters with different batch schedulers, MPI networking support, and performance monitoring requirements. Our experience indicates that the layered approach is a viable strategy for building applications intended to provide similar behavior across widely varying deployment targets.

Original languageEnglish
Title of host publicationScienceCloud 2021 - Proceedings of the 11th Workshop on Scientific Cloud Computing
PublisherAssociation for Computing Machinery, Inc
Pages1-8
Number of pages8
ISBN (Electronic)9781450383851
DOIs
StatePublished - Jun 21 2021
Externally publishedYes
Event11th Workshop on Scientific Cloud Computing, ScienceCloud 2021 - Virtual, Online, Sweden
Duration: Jun 21 2021 → …

Publication series

NameScienceCloud 2021 - Proceedings of the 11th Workshop on Scientific Cloud Computing

Conference

Conference11th Workshop on Scientific Cloud Computing, ScienceCloud 2021
Country/TerritorySweden
CityVirtual, Online
Period06/21/21 → …

Funding

This paper was supported in part by the National Science Foundation under Grant No. OAC-1807563. This work was funded in part by Los Alamos National Laboratory, supported by the US Department of Energy contract DE-FC02-06ER25750 (Los Alamos Publication Number LA-UR-20-27116). Sandia National Laboratories is a multi-mission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525 (SAND2020-9527C).

Keywords

  • containers
  • distributed applications
  • high performance computing
  • scientific computing
  • user defined software stack

Fingerprint

Dive into the research topics of 'A Layered Approach for Modular Container Construction and Orchestration in HPC Environments'. Together they form a unique fingerprint.

Cite this