Abstract
Scientific breakthroughs in biomolecular methods and improvements in hardware technology have shifted from a single long-running simulation to a large set of shorter simulations running simultaneously, called an ensemble. In an ensemble, each independent simulation is usually coupled with several analyses that apply identical or distinct algorithms on data produced by the corresponding simulation. Today, In situ methods are used to analyze large volumes of data generated by scientific simulations at runtime. This work studies the execution of ensemble-based simulations paired with In situ analyses using in-memory staging methods. Because simulations and analyses forming an ensemble typically run concurrently, deploying an ensemble requires efficient co-location-aware strategies, making sure the data flow between simulations and analyses that form an In situ workflow is efficient. Using an ensemble of molecular dynamics In situ workflows with multiple simulations and analyses, we first show that collecting traditional metrics such as makespan, instructions per cycle, memory usage, or cache miss ratio is not sufficient to characterize the complex behaviors of ensembles. Thus, we propose a method to evaluate the performance of ensembles of workflows that captures resource usage (efficiency), resource allocation, and component placement. Experimental results demonstrate that our proposed method can effectively capture the performance of different component placements in an ensemble. By evaluating different co-location scenarios, our performance indicator demonstrates improvements of up to four orders of magnitude when co-locating simulation and coupled analyses within a single computational host.
Original language | English |
---|---|
Title of host publication | 50th International Conference on Parallel Processing Workshop, ICPP 2021 - Proceedings |
Publisher | Association for Computing Machinery |
ISBN (Electronic) | 9781450384414 |
DOIs | |
State | Published - Aug 9 2021 |
Externally published | Yes |
Event | 50th International Conference on Parallel Processing Workshop, ICPP 2021 - Virtual, Online, United States Duration: Aug 9 2021 → Aug 12 2021 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 50th International Conference on Parallel Processing Workshop, ICPP 2021 |
---|---|
Country/Territory | United States |
City | Virtual, Online |
Period | 08/9/21 → 08/12/21 |
Funding
This work is funded by NSF contracts #1741040 and #1741057; and DOE contract #DE-SC0012636. This research used resources of NERSC, a U.S. Department of Energy Office of Science User Facility operated under Contract No. DE-AC02-05CH11231.
Keywords
- Ensemble workflow
- High-performance computing
- In situ model
- Molecular dynamics
- Scientific workflow