Abstract
FPGAs are becoming promising heterogeneous computing components for high-performance computing. In this paper, we evaluate the resource utilizations, performance, and performance per watt of our implementations of the LULESH kernels in OpenCL on an Arria10-based FPGA platform. LULESH is a complex proxy application in the CORAL benchmark suite. We choose two representative kernels “CalcFBHourglassForceForElems” and “EvalEOSForElems” from the application in our study. Compared with the baseline implementations, our optimizations improve the performance by a factor of 1.65X and 2.96X for the two kernels on the FPGA, respectively. Using directives for accelerator programming, we also evaluate the performance of the kernels on an Intel Xeon 16-core CPU and an Nvidia K80 GPU. We find that the FPGA, constrained by the memory bandwidth, can perform 1.05X to 3.4X better than the CPU and GPU for small problem sizes. For the first kernel, the performance per watt on the FPGA is 1.59X and 7.1X higher than that on an Intel Xeon 16-core CPU and an Nvidia K80 GPU, respectively. For the second kernel, the performance per watt on the GPU is 1.82X higher than that on the FPGA. However, the performance per watt on the FPGA is 1.77X higher than that on the CPU.
| Original language | English |
|---|---|
| Title of host publication | Applied Reconfigurable Computing - 15th International Symposium, ARC 2019, Proceedings |
| Editors | Christian Hochberger, Andreas Koch, Pedro Diniz, Brent Nelson, Roger Woods |
| Publisher | Springer Verlag |
| Pages | 199-213 |
| Number of pages | 15 |
| ISBN (Print) | 9783030172268 |
| DOIs | |
| State | Published - 2019 |
| Externally published | Yes |
| Event | 15th International Symposium on Applied Reconfigurable Computing, ARC 2019 - Darmstadt, Germany Duration: Apr 9 2019 → Apr 11 2019 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 11444 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 15th International Symposium on Applied Reconfigurable Computing, ARC 2019 |
|---|---|
| Country/Territory | Germany |
| City | Darmstadt |
| Period | 04/9/19 → 04/11/19 |
Funding
Acknowledgments. The research was supported by the U.S. Department of Energy, Office of Science, under contract DE-AC02-06CH11357 and made use of the Argonne Leadership Computing Facility, a DOE Office of Science User Facility.
Keywords
- FPGA
- Kernel optimizations
- LULESH
- OpenCL