Abstract
Scientific breakthroughs in biomolecular methods and improvements in hardware technology have shifted from a long-running simulation to a large set of shorter simulations running simultaneously, called an ensemble. In an ensemble, simulations are usually coupled with analyses of data produced by the simulations. In situ methods can be used to analyze large volumes of data generated by scientific simulations at runtime (i.e., simulations and analyses are performed concurrently). In this work, we study the execution of ensemble-based simulations paired with in situ analyses using in-memory staging methods. Using an ensemble of molecular dynamics in situ workflows with multiple simulations and analyses, we first show that collecting traditional metrics such as makespan, instructions per cycle, memory usage, or cache miss ratio is not sufficient to characterize complex behaviors of ensembles. We propose a method to evaluate the performance of ensembles of workflows that captures multiple resource usage aspects: resource efficiency, resource allocation, and resource provisioning. Experimental results demonstrate that the proposed method can effectively distinguish the performance of different component placements in an ensemble with up to 32 ensemble members. By evaluating different co-location scenarios, our proposed performance indicators demonstrate benefits of co-locating simulation and coupled analyses within a compute node.
Original language | English |
---|---|
Article number | e7111 |
Journal | Concurrency and Computation: Practice and Experience |
Volume | 35 |
Issue number | 20 |
DOIs | |
State | Published - Sep 10 2023 |
Funding
This work is funded by NSF contracts #1741040, #1741057, and #1841758; and DOE contract #DE‐SC0012636. This research used resources of NERSC, a U.S. DOE Office of Science Facility operated under contract #DE‐AC02‐05CH11231 and OLCF at the Oak Ridge National Laboratory supported by the Office of Science in the U.S. DOE under contract #DE‐AC05‐00OR22725. National Science Foundation, Grant/Award Numbers: 1741040, 1741057, 1841758; U.S. Department of Energy, Grant/Award Numbers: DE‐AC02‐05CH11231, DE‐AC05‐00OR22725, DE‐SC0012636 Funding information
Keywords
- ensemble workflow
- high-performance computing
- in situ model
- molecular dynamics
- scientific workflow