Abstract
A processor, with a GPU and a CPU integrated on the same chip, is a promising low-power system for floating-point intensive applications. While an integrated GPU is not designed to outperform a discrete GPU due to its power, area, and thermal constraints, there is a need to better understand the performance of a floating-point intensive kernel using an integrated GPU. Toward this end, we choose a representative floating-point intensive kernel as a case study. We port the kernel with a vendor-neutral framework, analyze the compiler optimizations of the kernel at the assembly code, evaluate the relationship between floating-point operations per second and arithmetic intensity, and compare the performance and power of the kernel implementations on the CPU and GPU. Our key findings are: 1) Compared to an un-optimized kernel, the floating-point optimizations improve the performance of the single-and double-precision floating-point kernels executing on an Intel® GEN8 Iris Pro GPU by 15.4X and 5.4X, respectively; the optimizations also improve the performance of the two kernels by 5.6X and 3.4X on an Intel® Xeon® E3 CPU, respectively. 2) Achieving peak floating-point operations per second on the GPU requires much higher arithmetic intensity than that on the CPU. 3) Running the floating-point intensive kernel on the processor consumes 48 Watts, which is very close to the thermal power draw of the processor. The floating-point optimization can reduce the average GPU power from 35.7 W to 22.7 W for the double-precision kernel, and from 33.1 W to 8.8 W for the single-precision kernel.
Original language | English |
---|---|
Title of host publication | 2019 10th International Green and Sustainable Computing Conference, IGSC 2019 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781728154169 |
DOIs | |
State | Published - Oct 2019 |
Externally published | Yes |
Event | 10th International Green and Sustainable Computing Conference, IGSC 2019 - Alexandria, United States Duration: Oct 21 2019 → Oct 24 2019 |
Publication series
Name | 2019 10th International Green and Sustainable Computing Conference, IGSC 2019 |
---|
Conference
Conference | 10th International Green and Sustainable Computing Conference, IGSC 2019 |
---|---|
Country/Territory | United States |
City | Alexandria |
Period | 10/21/19 → 10/24/19 |
Funding
ACKNOWLEDGMENT Results presented were obtained using the Chameleon testbed supported by the National Science Foundation. This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.
Keywords
- GFLOPS
- Integrated GPU
- OpenCL
- floating-point intensive