Skip to main navigation Skip to search Skip to main content

Characterizing GPU Energy Usage in Exascale-Ready Portable Science Applications

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We characterize the GPU energy usage of two widely adopted exascale-ready applications representing two classes of particle and mesh solvers: (i) QMCPACK, a quantum Monte Carlo package, and (ii) AMReX-Castro, an adaptive mesh astrophysical code. We analyze power, temperature, utilization, and energy traces from double-/single (mixed)-precision benchmarks on NVIDIA’s A100 and H100 and AMD’s MI250X GPUs using queries in NVML and rocm_smi_lib, respectively. We explore application-specific metrics to provide insights on energy vs. performance trade-offs. Our results suggest that mixed-precision energy savings range between 6–25% on QMCPACK and 45% on AMReX-Castro. Also, we found gaps in the AMD tooling used on Frontier GPUs that need to be understood, while query resolutions on NVML have little variability between 1 ms-1 s. Overall, application level knowledge is crucial to define energy-cost/science-benefit opportunities for the codesign of future supercomputer architectures in the post-Moore era.

Original languageEnglish
Title of host publicationHigh Performance Computing - ISC High Performance 2025 International Workshops, Revised Selected Papers
EditorsSarah Neuwirth, Arnab Kumar Paul, Tobias Weinzierl, Erin Claire Carson
PublisherSpringer Science and Business Media Deutschland GmbH
Pages177-190
Number of pages14
ISBN (Print)9783032076113
DOIs
StatePublished - 2026
Event40th International Conference on High Performance Computing, ISC High Performance 2025 - Hamburg, Germany
Duration: Jun 10 2025Jun 13 2025

Publication series

NameLecture Notes in Computer Science
Volume16091 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference40th International Conference on High Performance Computing, ISC High Performance 2025
Country/TerritoryGermany
CityHamburg
Period06/10/2506/13/25

Funding

This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (https://www.energy.gov/doe-public-access-plan). This material is based on work supported by the DOE’s Office of Science, Office of Advanced Scientific Computing Research through EXPRESS: 2023 Exploratory Research for Extreme Scale Science. PRCK was supported by the DOE’s Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division as part of the Computational Materials Sciences Program and the Center for Predictive Simulation of Functional Materials. This research used resources of the Oak Ridge Leadership Computing Facility and the Experimental Computing Laboratory at the Oak Ridge National Laboratory, which is supported by the DOE’s Office of Science under Contract No. DE-AC05-00OR22725. WG would like to acknowledge Brandon Tran from the University of Wisconsin for the valuable discussion on NVML.

Keywords

  • Energy efficiency
  • GPU Power
  • HPC Applications

Fingerprint

Dive into the research topics of 'Characterizing GPU Energy Usage in Exascale-Ready Portable Science Applications'. Together they form a unique fingerprint.

Cite this