Abstract
SYCL is a portable programming model. Toward the goal of a better understanding of performance portability of SYCL kernels on GPUs, we select a bioinformatics kernel for computing the all-pairs distance as a case study. After migrating the kernel from CUDA to HIP and SYCL, we evaluate the performance of the CUDA, HIP, and SYCL kernels on NVIDIA V100 and AMD MI210 GPUs. We analyze the GPU instructions from the kernels to explain performance gaps between SYCL and CUDA/HIP. We hope that the findings are valuable for improving performance portability of SYCL.
Original language | English |
---|---|
Title of host publication | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 366-372 |
Number of pages | 7 |
ISBN (Electronic) | 9798350311990 |
DOIs | |
State | Published - 2023 |
Event | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 - St. Petersburg, United States Duration: May 15 2023 → May 19 2023 |
Publication series
Name | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 |
---|
Conference
Conference | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 |
---|---|
Country/Territory | United States |
City | St. Petersburg |
Period | 05/15/23 → 05/19/23 |
Funding
We appreciate the reviewers for their comments and suggestions. The research used resources at the Experimental Computing Lab and the Compute and Data Environment for Science at Oak Ridge National Laboratory. This research was supported by the US Department of Energy Advanced Scientific Computing Research program under Contract No. DE-AC05-00OR22725. We appreciate the reviewers for their comments and suggestions. The research used resources at the Experimental Computing Lab and the Compute and Data Environment for Science at Oak Ridge National Laboratory. This research was supported by the US Department of Energy Advanced Scientific Computing Research program under Contract No. DE-AC05- 00OR22725.
Keywords
- Heterogeneous computing
- Performance portabuity
- programming model