Performance assessment of ensembles of in situ workflows under resource constraints

Tu Mai Anh Do, Loïc Pottier, Rafael Ferreira da Silva, Silvina Caíno-Lores, Michela Taufer, Ewa Deelman

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Scientific breakthroughs in biomolecular methods and improvements in hardware technology have shifted from a long-running simulation to a large set of shorter simulations running simultaneously, called an ensemble. In an ensemble, simulations are usually coupled with analyses of data produced by the simulations. In situ methods can be used to analyze large volumes of data generated by scientific simulations at runtime (i.e., simulations and analyses are performed concurrently). In this work, we study the execution of ensemble-based simulations paired with in situ analyses using in-memory staging methods. Using an ensemble of molecular dynamics in situ workflows with multiple simulations and analyses, we first show that collecting traditional metrics such as makespan, instructions per cycle, memory usage, or cache miss ratio is not sufficient to characterize complex behaviors of ensembles. We propose a method to evaluate the performance of ensembles of workflows that captures multiple resource usage aspects: resource efficiency, resource allocation, and resource provisioning. Experimental results demonstrate that the proposed method can effectively distinguish the performance of different component placements in an ensemble with up to 32 ensemble members. By evaluating different co-location scenarios, our proposed performance indicators demonstrate benefits of co-locating simulation and coupled analyses within a compute node.

Original languageEnglish
Article numbere7111
JournalConcurrency and Computation: Practice and Experience
Volume35
Issue number20
DOIs
StatePublished - Sep 10 2023

Funding

This work is funded by NSF contracts #1741040, #1741057, and #1841758; and DOE contract #DE‐SC0012636. This research used resources of NERSC, a U.S. DOE Office of Science Facility operated under contract #DE‐AC02‐05CH11231 and OLCF at the Oak Ridge National Laboratory supported by the Office of Science in the U.S. DOE under contract #DE‐AC05‐00OR22725. National Science Foundation, Grant/Award Numbers: 1741040, 1741057, 1841758; U.S. Department of Energy, Grant/Award Numbers: DE‐AC02‐05CH11231, DE‐AC05‐00OR22725, DE‐SC0012636 Funding information

Keywords

  • ensemble workflow
  • high-performance computing
  • in situ model
  • molecular dynamics
  • scientific workflow

Fingerprint

Dive into the research topics of 'Performance assessment of ensembles of in situ workflows under resource constraints'. Together they form a unique fingerprint.

Cite this