GreenMD: Energy-efficient Matrix Decomposition on Heterogeneous Multi-GPU Systems

Hadi Zamani, Laxmi Bhuyan, Jieyang Chen, Zizhong Chen

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The current trend of performance growth in HPC systems is accompanied by a massive increase in energy consumption. In this article, we introduce GreenMD, an energy-efficient framework for heterogeneous systems for LU factorization utilizing multi-GPUs. LU factorization is a crucial kernel from the MAGMA library, which is highly optimized. Our aim is to apply DVFS to this application by leveraging slacks intelligently on both CPUs and multiple GPUs. To predict the slack times, accurate performance models are developed separately for both CPUs and GPUs based on the algorithmic knowledge and manufacturer's specifications. Since DVFS does not reduce static energy consumption, we also develop undervolting techniques for both CPUs and GPUs. Reducing voltage below threshold values may give rise to errors; hence, we extract the minimum safe voltages (VsafeMin) for the CPUs and GPUs utilizing a low overhead profiling phase and apply them before execution. It is shown that GreenMD improves the CPU, GPU, and total energy about 59%, 21%, and 31%, respectively, while delivering similar performance to the state-of-the-art linear algebra MAGMA library.

Original languageEnglish
Article number3583590
JournalACM Transactions on Parallel Computing
Volume10
Issue number2
DOIs
StatePublished - Jun 20 2023

Funding

This work is partly supported by NSF Grant 1907401.

Keywords

  • Additional Key Words and PhrasesParallel computation
  • energy efficiency
  • heterogeneous multi GPUs systems
  • high performance applications

Fingerprint

Dive into the research topics of 'GreenMD: Energy-efficient Matrix Decomposition on Heterogeneous Multi-GPU Systems'. Together they form a unique fingerprint.

Cite this