Generating efficient tensor contractions for GPUs

Thomas Nelson, Axel Rivera, Prasanna Balaprakash, Mary Hall, Paul D. Hovland, Elizabeth Jessup, Boyana Norris

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

28 Scopus citations

Abstract

Many scientific and numerical applications, including quantum chemistry modeling and fluid dynamics simulation, require tensor product and tensor contraction evaluation. Tensor computations are characterized by arrays with numerous dimensions, inherent parallelism, moderate data reuse and many degrees of freedom in the order in which to perform the computation. The best-performing implementation is heavily dependent on the tensor dimensionality and the target architecture. In this paper, we map tensor computations to GPUs, starting with a high-level tensor input language and producing efficient CUDA code as output. Our approach is to combine tensor-specific mathematical transformations with a GPU decision algorithm, machine learning and auto tuning of a large parameter space. Generated code shows significant performance gains over sequential and Open MP parallel code, and a comparison with Open ACC shows the importance of auto tuning and other optimizations in our framework for achieving efficient results.

Original languageEnglish
Title of host publicationProceedings - 2015 44th International Annual Conference on Parallel Processing, ICPP 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages969-978
Number of pages10
ISBN (Electronic)9781467375870
DOIs
StatePublished - Dec 8 2015
Externally publishedYes
Event44th International Conference on Parallel Processing, ICPP 2015 - Beijing, China
Duration: Sep 1 2015Sep 4 2015

Publication series

NameProceedings of the International Conference on Parallel Processing
Volume2015-December
ISSN (Print)0190-3918

Conference

Conference44th International Conference on Parallel Processing, ICPP 2015
Country/TerritoryChina
CityBeijing
Period09/1/1509/4/15

Keywords

  • Autotuning
  • GPUs
  • Tensor contraction

Fingerprint

Dive into the research topics of 'Generating efficient tensor contractions for GPUs'. Together they form a unique fingerprint.

Cite this