Determining Levels of Detail for Simulators of Parallel and Distributed Computing Systems via Automated Calibration

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

There are two sources of inaccuracy when simulating parallel and distributed computing systems: (i) a simulator implemented at an insufficient level of detail; and (ii) incorrectly calibrated simulation parameter values. Increasing the simulator’s level of detail can improve accuracy, but at the cost of higher space, time, and/or software complexity. Furthermore, evaluating the intrinsic accuracy of a simulator requires that its parameters be well-calibrated. Making decisions regarding the level of detail is thus challenging. We propose a methodology for instantiating the simulation calibration process and a framework for automating this process, which makes it possible to pick appropriate levels of detail for any simulator. We demonstrate the usefulness of our approach via two case studies for two different domains.

Original languageEnglish
Title of host publicationProceedings of 2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops
PublisherAssociation for Computing Machinery, Inc
Pages1452-1463
Number of pages12
ISBN (Electronic)9798400718717
DOIs
StatePublished - Nov 15 2025
Event2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops - St. Louis, United States
Duration: Nov 16 2025Nov 21 2025

Publication series

NameProceedings of 2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops

Conference

Conference2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops
Country/TerritoryUnited States
CitySt. Louis
Period11/16/2511/21/25

Funding

This research was partially supported by National Science Foundation awards #2106059, #2411154, and #2106147. The technical support and advanced computing resources from University of Hawaii Information Technology Services Research Cyberinfrastructure, funded in part by the National Science Foundation CC* awards #2201428 and #2232862 are gratefully acknowledged. Finally, this research used resources of the OLCF at ORNL, which is supported by DOE’s Office of Science under Contract No. DEAC05-00OR22725. This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344. This material is based on work partially supported by LLNL LDRD 22-SI-004 (LLNL-CONF-2002494).

Keywords

  • Simulation of parallel and distributed computing systems
  • simulation accuracy
  • simulation calibration

Fingerprint

Dive into the research topics of 'Determining Levels of Detail for Simulators of Parallel and Distributed Computing Systems via Automated Calibration'. Together they form a unique fingerprint.

Cite this