Co-scheduling Ensembles of In Situ Workflows

Tu Mai Anh Do, Loic Pottier, Rafael Ferreira Da Silva, Frederic Suter, Silvina Caino-Lores, Michela Taufer, Ewa Deelman

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Molecular dynamics (MD) simulations are widely used to study large-scale molecular systems. HPC systems are ideal platforms to run these studies, however, reaching the necessary simulation timescale to detect rare processes is challenging, even with modern supercomputers. To overcome the timescale limitation, the simulation of a long MD trajectory is replaced by multiple short-range simulations that are executed simultaneously in an ensemble of simulations. Analyses are usually co-scheduled with these simulations to efficiently process large volumes of data generated by the simulations at runtime, thanks to in situ techniques. Executing a workflow ensemble of simulations and their in situ analyses requires efficient co-scheduling strategies and sophisticated management of computational resources so that they are not slowing down each other. In this paper, we propose an efficient method to co-schedule simulations and in situ analyses such that the makespan of the workflow ensemble is minimized. We present a novel approach to allocate resources for a workflow ensemble under resource constraints by using a theoretical framework modeling the workflow ensemble's execution. We evaluate the proposed approach using an accurate simulator based on the WRENCH simulation framework on various workflow ensemble configurations. Results demonstrate the significance of co-scheduling simulations and in situ analyses that couple data together to benefit from data locality, in which inefficient scheduling decisions can lead to slowdown in makespan up to a factor of 30.

Original languageEnglish
Title of host publicationProceedings of WORKS 2022
Subtitle of host publication17th Workshop on Workflows in Support of Large-Scale Science, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages43-51
Number of pages9
ISBN (Electronic)9781665451918
DOIs
StatePublished - 2022
Event17th IEEE/ACM Workshop on Workflows in Support of Large-Scale Science, WORKS 2022 - Dallas, United States
Duration: Nov 13 2022Nov 18 2022

Publication series

NameProceedings of WORKS 2022: 17th Workshop on Workflows in Support of Large-Scale Science, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference17th IEEE/ACM Workshop on Workflows in Support of Large-Scale Science, WORKS 2022
Country/TerritoryUnited States
CityDallas
Period11/13/2211/18/22

Funding

This work is funded by NSF contracts #1741040, #1741057, and #1841758. This research used resources of the OLCF at ORNL, which is supported by the Office of Science of the U.S. DOE under Contract No. DE-AC05-00OR22725.

FundersFunder number
OLCF
National Science Foundation1741057, 1841758, 1741040
U.S. Department of EnergyDE-AC05-00OR22725
Office of Science

    Keywords

    • co-scheduling
    • high-performance computing
    • in situ
    • molecular dynamics
    • workflow ensemble

    Fingerprint

    Dive into the research topics of 'Co-scheduling Ensembles of In Situ Workflows'. Together they form a unique fingerprint.

    Cite this