Abstract
Sum reduction is a primitive operation in parallel computing while SYCL is a promising heterogeneous programming language. In this paper, we describe the SYCL implementations of integer sum reduction using atomic functions, shared local memory, vectorized memory accesses, and parameterized workload sizes. Evaluating the reduction kernels shows that we can achieve 1.4X speedup over the open-source implementations of sum reduction for a sufficiently large number of integers on an Intel integrated GPU.
Original language | English |
---|---|
Title of host publication | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 652-655 |
Number of pages | 4 |
ISBN (Electronic) | 9781665435772 |
DOIs | |
State | Published - Jun 2021 |
Event | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - Virtual, Portland, United States Duration: May 17 2021 → … |
Publication series
Name | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021 |
---|
Conference
Conference | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 |
---|---|
Country/Territory | United States |
City | Virtual, Portland |
Period | 05/17/21 → … |
Funding
ACKNOWLEDGMENT We sincerely appreciate the reviewers’ constructive criticism. This research was supported by the US Department of Energy Advanced Scientific Computing Research program under Contract No. DE-AC05-00OR22725. The results presented were obtained using the Intel DevCloud. This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Keywords
- GPU
- Reduction
- SYCL