A Heterogeneous MPI+PPL Task Scheduling Approach for Asynchronous Many-Task Runtime Systems

John Holmen, Damodar Sahasrabudhe, Martin Berzins

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Asynchronous many-task runtime systems and MPI+X hybrid parallelism approaches have shown promise for helping manage the increasing complexity of nodes in current and emerging high performance computing (HPC) systems, including those for exascale the increasing architectural diversity of these systems, however, poses challenges for runtimes supporting more homogeneous HPC systems. Performance portability layers (PPL) have shown promise for helping manage this diversity. This paper describes a heterogeneous MPI+PPL task scheduling approach for combining these promising solutions with additional consideration for parallel third party libraries facing similar challenges to help prepare such a runtime for the diverse heterogeneous systems accompanying exascale computing. This approach is demonstrated using a heterogeneous MPI+Kokkos task scheduler and the accompanying portable abstractions [16] implemented in the Uintah Computational Framework, an asynchronous many-task runtime system, with additional consideration for hypre, a parallel third party library. Results are shown for two challenging problems executing workloads representative of typical Uintah applications these results show performance improvements up to 4.4x when using this scheduler and the accompanying portable abstractions [16] to port a previously MPI-Only problem to Kokkos::OpenMP and Kokkos::CUDA to improve complex heterogeneous node use. Good strong-scaling to 1,024 NVIDIA V100 GPUs and 512 IBM POWER9 processor are also shown using MPI+Kokkos::OpenMP+Kokkos::CUDA at scale.

Original languageEnglish
Title of host publicationPEARC 2021 - Practice and Experience in Advanced Research Computing 2021
Subtitle of host publicationEvolution Across All Dimensions
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450382922
DOIs
StatePublished - Jul 17 2021
Event5th Practice and Experience in Advanced Research Computing Conference: Evolution Across All Dimensions, PEARC 2021 - Virtual, Online, United States
Duration: Jul 19 2021Jul 22 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference5th Practice and Experience in Advanced Research Computing Conference: Evolution Across All Dimensions, PEARC 2021
Country/TerritoryUnited States
CityVirtual, Online
Period07/19/2107/22/21

Funding

This material is based upon work supported by the Department of Energy, National Nuclear Security Administration, under Award Number(s) DE-NA0002375. This research used resources of the Lawrence Livermore National Laboratory. Support for J. K. Holmen and D. Sahasrabudhe comes from the University of Texas at Austin under Award Number(s) UTA19-001215 and a gift from the Intel Parallel Computing Centers Program. We would like to thank LLNL for extended access to Lassen with special thanks to Erik Draeger for post-PSAAP II support. We would like to thank ANL for the opportunity to participate in the Aurora Early Science Program. Additionally, we would like to thank all involved with the CCMSC and Uintah, past and present, with special thanks to Brad Peter-son, Jeremy Thornock, Derek Harris, Oscar Díaz-Ibarra, and Todd Harman for Kokkos-related ARCHES efforts.

Keywords

  • Asynchronous Many-Task Runtime System
  • Parallelism and Concurrency
  • Performance Portability
  • Portability
  • Software Engineering

Fingerprint

Dive into the research topics of 'A Heterogeneous MPI+PPL Task Scheduling Approach for Asynchronous Many-Task Runtime Systems'. Together they form a unique fingerprint.

Cite this