Assessing Resource Provisioning and Allocation of Ensembles of in Situ Workflows

Tu Mai Anh Do, Loïc Pottier, Rafael Ferreira Da Silva, Silvina Caíno-Lores, Michela Taufer, Ewa Deelman

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Scientific breakthroughs in biomolecular methods and improvements in hardware technology have shifted from a single long-running simulation to a large set of shorter simulations running simultaneously, called an ensemble. In an ensemble, each independent simulation is usually coupled with several analyses that apply identical or distinct algorithms on data produced by the corresponding simulation. Today, In situ methods are used to analyze large volumes of data generated by scientific simulations at runtime. This work studies the execution of ensemble-based simulations paired with In situ analyses using in-memory staging methods. Because simulations and analyses forming an ensemble typically run concurrently, deploying an ensemble requires efficient co-location-aware strategies, making sure the data flow between simulations and analyses that form an In situ workflow is efficient. Using an ensemble of molecular dynamics In situ workflows with multiple simulations and analyses, we first show that collecting traditional metrics such as makespan, instructions per cycle, memory usage, or cache miss ratio is not sufficient to characterize the complex behaviors of ensembles. Thus, we propose a method to evaluate the performance of ensembles of workflows that captures resource usage (efficiency), resource allocation, and component placement. Experimental results demonstrate that our proposed method can effectively capture the performance of different component placements in an ensemble. By evaluating different co-location scenarios, our performance indicator demonstrates improvements of up to four orders of magnitude when co-locating simulation and coupled analyses within a single computational host.

Original languageEnglish
Title of host publication50th International Conference on Parallel Processing Workshop, ICPP 2021 - Proceedings
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450384414
DOIs
StatePublished - Aug 9 2021
Externally publishedYes
Event50th International Conference on Parallel Processing Workshop, ICPP 2021 - Virtual, Online, United States
Duration: Aug 9 2021Aug 12 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference50th International Conference on Parallel Processing Workshop, ICPP 2021
Country/TerritoryUnited States
CityVirtual, Online
Period08/9/2108/12/21

Funding

This work is funded by NSF contracts #1741040 and #1741057; and DOE contract #DE-SC0012636. This research used resources of NERSC, a U.S. Department of Energy Office of Science User Facility operated under Contract No. DE-AC02-05CH11231.

Keywords

  • Ensemble workflow
  • High-performance computing
  • In situ model
  • Molecular dynamics
  • Scientific workflow

Fingerprint

Dive into the research topics of 'Assessing Resource Provisioning and Allocation of Ensembles of in Situ Workflows'. Together they form a unique fingerprint.

Cite this