Abstract
Here, the IRIS programming model is evaluated as a method to improve performance portability for heterogeneous systems that use LU matrix factorization. LU (lower-upper) factorization is considered one of the most important numerical linear algebra operations used in multiple high-performance computing and scientific applications. IRIS enables the separation of the algorithm’s definition from the tuning by using tasks + dependencies. This considerably reduces the effort required to achieve performance portability on heterogeneous systems. One IRIS code can use different settings depending on the underlying hardware features. Different configurations are evaluated on two different heterogeneous systems to achieve important speedups for the reference code with minimal changes to the source code.
Original language | English |
---|---|
Title of host publication | Euro-Par 2022 |
Subtitle of host publication | Parallel Processing Workshops - Euro-Par 2022 International Workshops, Revised Selected Papers |
Editors | Jeremy Singer, Yehia Elkhatib, Dora Blanco Heras, Patrick Diehl, Nick Brown, Aleksandar Ilic |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 17-31 |
Number of pages | 15 |
ISBN (Print) | 9783031312083 |
DOIs | |
State | Published - 2023 |
Event | 28th International European Conference on Parallel and Distributed Computing , Euro-Par 2022 - Glasgow, United Kingdom Duration: Aug 22 2022 → Aug 26 2022 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13835 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 28th International European Conference on Parallel and Distributed Computing , Euro-Par 2022 |
---|---|
Country/Territory | United Kingdom |
City | Glasgow |
Period | 08/22/22 → 08/26/22 |
Funding
J. Kim—Now at NVIDIA. Notice: This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid up, irrevocable, world-wide license to publish or reproduce the published form of the manuscript, or allow others to do so, for U.S. Government purposes. The DOE will provide public access to these results in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Keywords
- CPU
- GPU
- Heterogeneity
- IRIS
- LU factorization
- Performance portability
- Tasking