Abstract
We characterize the GPU energy usage of two widely adopted exascale-ready applications representing two classes of particle and mesh solvers: (i) QMCPACK, a quantum Monte Carlo package, and (ii) AMReX-Castro, an adaptive mesh astrophysical code. We analyze power, temperature, utilization, and energy traces from double-/single (mixed)-precision benchmarks on NVIDIA’s A100 and H100 and AMD’s MI250X GPUs using queries in NVML and rocm_smi_lib, respectively. We explore application-specific metrics to provide insights on energy vs. performance trade-offs. Our results suggest that mixed-precision energy savings range between 6–25% on QMCPACK and 45% on AMReX-Castro. Also, we found gaps in the AMD tooling used on Frontier GPUs that need to be understood, while query resolutions on NVML have little variability between 1 ms-1 s. Overall, application level knowledge is crucial to define energy-cost/science-benefit opportunities for the codesign of future supercomputer architectures in the post-Moore era.
| Original language | English |
|---|---|
| Title of host publication | High Performance Computing - ISC High Performance 2025 International Workshops, Revised Selected Papers |
| Editors | Sarah Neuwirth, Arnab Kumar Paul, Tobias Weinzierl, Erin Claire Carson |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 177-190 |
| Number of pages | 14 |
| ISBN (Print) | 9783032076113 |
| DOIs | |
| State | Published - 2026 |
| Event | 40th International Conference on High Performance Computing, ISC High Performance 2025 - Hamburg, Germany Duration: Jun 10 2025 → Jun 13 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 16091 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 40th International Conference on High Performance Computing, ISC High Performance 2025 |
|---|---|
| Country/Territory | Germany |
| City | Hamburg |
| Period | 06/10/25 → 06/13/25 |
Funding
This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (https://www.energy.gov/doe-public-access-plan). This material is based on work supported by the DOE’s Office of Science, Office of Advanced Scientific Computing Research through EXPRESS: 2023 Exploratory Research for Extreme Scale Science. PRCK was supported by the DOE’s Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division as part of the Computational Materials Sciences Program and the Center for Predictive Simulation of Functional Materials. This research used resources of the Oak Ridge Leadership Computing Facility and the Experimental Computing Laboratory at the Oak Ridge National Laboratory, which is supported by the DOE’s Office of Science under Contract No. DE-AC05-00OR22725. WG would like to acknowledge Brandon Tran from the University of Wisconsin for the valuable discussion on NVML.
Keywords
- Energy efficiency
- GPU Power
- HPC Applications
Fingerprint
Dive into the research topics of 'Characterizing GPU Energy Usage in Exascale-Ready Portable Science Applications'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver