TY - GEN
T1 - A case study of integer sum reduction using atomics
AU - Jin, Zheming
AU - Finkel, Hal
N1 - Publisher Copyright:
© 2018 Copyright is held by the owner/author(s). Publication rights
PY - 2018/6/20
Y1 - 2018/6/20
N2 - This paper presents the implementations of integer sum reduction using atomic functions on FPGA, CPU, and GPU platforms. We explain the implementations and optimizations of the kernel using an OpenCL-based high-level synthesis flow for an FPGA. In addition, we describe the optimizations of the reduction using directives for a multi-core CPU and a GPU. The experimental results show that the reduction on an Nvidia K80 GPU is 3.4X and 6.7X faster than an Intel Xeon 16-core CPU and an Arria 10 GX1150 FPGA, respectively. However, the FPGA consumes 4.4X and 2.3X less power than the CPU and GPU, respectively. The performance per watt on the FPGA is 2.2X higher than that on the CPU and 2.9X lower than that on the GPU.
AB - This paper presents the implementations of integer sum reduction using atomic functions on FPGA, CPU, and GPU platforms. We explain the implementations and optimizations of the kernel using an OpenCL-based high-level synthesis flow for an FPGA. In addition, we describe the optimizations of the reduction using directives for a multi-core CPU and a GPU. The experimental results show that the reduction on an Nvidia K80 GPU is 3.4X and 6.7X faster than an Intel Xeon 16-core CPU and an Arria 10 GX1150 FPGA, respectively. However, the FPGA consumes 4.4X and 2.3X less power than the CPU and GPU, respectively. The performance per watt on the FPGA is 2.2X higher than that on the CPU and 2.9X lower than that on the GPU.
UR - http://www.scopus.com/inward/record.url?scp=85056625889&partnerID=8YFLogxK
U2 - 10.1145/3241793.3241809
DO - 10.1145/3241793.3241809
M3 - Conference contribution
AN - SCOPUS:85056625889
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2018
PB - Association for Computing Machinery
T2 - 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2018
Y2 - 20 June 2018 through 22 June 2018
ER -