Empirical performance modeling of GPU kernels using active learning

Prasanna Balaprakash, Karl Rupp, Azamat Mametjanov, Robert B. Gramacy, Paul D. Hovland, Stefan M. Wild

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

We focus on a design-of-experiments methodology for developing empirical performance models of GPU kernels. Recently, we developed an iterative active learning algorithm that adaptively selects parameter configurations in batches for concurrent evaluation on CPU architectures in order to build performance models over the parameter space. In this paper, we illustrate the adoption of the algorithm when concurrent evaluations are not possible, which is particularly useful in the absence of GPU clusters. We present an empirical study of the algorithm on a diverse set of GPU kernels and hardware. We show that even when concurrent evaluations are not possible, the default batch mode of the algorithm yields better models and the iterative active learning algorithm reduces the overall time required to obtain high-quality empirical performance models for GPU kernels.

Original languageEnglish
Title of host publicationParallel Computing
Subtitle of host publicationAccelerating Computational Science and Engineering (CSE)
PublisherIOS Press BV
Pages646-655
Number of pages10
ISBN (Print)9781614993803
DOIs
StatePublished - 2014
Externally publishedYes

Publication series

NameAdvances in Parallel Computing
Volume25
ISSN (Print)0927-5452

Keywords

  • performance modeling
  • sequential design of experiments

Fingerprint

Dive into the research topics of 'Empirical performance modeling of GPU kernels using active learning'. Together they form a unique fingerprint.

Cite this