Performance of Floating-point Intensive Kernels on Low-power Processor-A Case Study with Geodesic Distance Kernel

Zheming Jin, Paulius Velesko, Hal Finkel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A processor, with a GPU and a CPU integrated on the same chip, is a promising low-power system for floating-point intensive applications. While an integrated GPU is not designed to outperform a discrete GPU due to its power, area, and thermal constraints, there is a need to better understand the performance of a floating-point intensive kernel using an integrated GPU. Toward this end, we choose a representative floating-point intensive kernel as a case study. We port the kernel with a vendor-neutral framework, analyze the compiler optimizations of the kernel at the assembly code, evaluate the relationship between floating-point operations per second and arithmetic intensity, and compare the performance and power of the kernel implementations on the CPU and GPU. Our key findings are: 1) Compared to an un-optimized kernel, the floating-point optimizations improve the performance of the single-and double-precision floating-point kernels executing on an Intel® GEN8 Iris Pro GPU by 15.4X and 5.4X, respectively; the optimizations also improve the performance of the two kernels by 5.6X and 3.4X on an Intel® Xeon® E3 CPU, respectively. 2) Achieving peak floating-point operations per second on the GPU requires much higher arithmetic intensity than that on the CPU. 3) Running the floating-point intensive kernel on the processor consumes 48 Watts, which is very close to the thermal power draw of the processor. The floating-point optimization can reduce the average GPU power from 35.7 W to 22.7 W for the double-precision kernel, and from 33.1 W to 8.8 W for the single-precision kernel.

Original languageEnglish
Title of host publication2019 10th International Green and Sustainable Computing Conference, IGSC 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728154169
DOIs
StatePublished - Oct 2019
Externally publishedYes
Event10th International Green and Sustainable Computing Conference, IGSC 2019 - Alexandria, United States
Duration: Oct 21 2019Oct 24 2019

Publication series

Name2019 10th International Green and Sustainable Computing Conference, IGSC 2019

Conference

Conference10th International Green and Sustainable Computing Conference, IGSC 2019
Country/TerritoryUnited States
CityAlexandria
Period10/21/1910/24/19

Funding

ACKNOWLEDGMENT Results presented were obtained using the Chameleon testbed supported by the National Science Foundation. This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.

FundersFunder number
National Science Foundation
Office of ScienceDE-AC02-06CH11357

    Keywords

    • GFLOPS
    • Integrated GPU
    • OpenCL
    • floating-point intensive

    Fingerprint

    Dive into the research topics of 'Performance of Floating-point Intensive Kernels on Low-power Processor-A Case Study with Geodesic Distance Kernel'. Together they form a unique fingerprint.

    Cite this