Abstract
OpenCL promotes code portability, and natively supports vectorized data types, which allows developers to potentially take advantage of the single-instruction-multiple-data instructions on CPUs, GPUs, and FPGAs. FPGAs are becoming a promising heterogeneous computing component. In our study, we choose a kernel used in frequent pattern compression as a case study of OpenCL kernel vectorizations on the three computing platforms. We describe different pattern matching approaches for the kernel, and manually vectorize the OpenCL kernel by a factor ranging from 2 to 16. We evaluate the kernel on an Intel Xeon 16-core CPU, an NVIDIA P100 GPU, and a Nallatech 385A FPGA card featuring an Intel Arria 10 GX1150 FPGA. Compared to the optimized kernel that is not vectorized, our vectorization can improve the kernel performance by a factor of 16 on the FPGA. The performance improvement ranges from 1 to 11.4 on the CPU, and from 1.02 to 9.3 on the GPU. The effectiveness of kernel vectorization depends on the work-group size.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 27th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2019 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 330 |
| Number of pages | 1 |
| ISBN (Electronic) | 9781728111315 |
| DOIs | |
| State | Published - Apr 2019 |
| Externally published | Yes |
| Event | 27th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2019 - San Diego, United States Duration: Apr 28 2019 → May 1 2019 |
Publication series
| Name | Proceedings - 27th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2019 |
|---|
Conference
| Conference | 27th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2019 |
|---|---|
| Country/Territory | United States |
| City | San Diego |
| Period | 04/28/19 → 05/1/19 |
Funding
The research was supported by the U.S. Department of Energy, Office of Science, under contract DE-AC02-06CH11357 and made use of the Argonne Leadership Computing Facility, a DOE Office of Science User Facility XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE
Keywords
- FPGA
- OpenCL
- Vectorization