cLMAGMA: High performance dense linear algebra with OpenCL

Chongxiao Cao, Mark Gates, Jack Dongarra, Piotr Luszczek, Peng Du, Stanimire Tomov

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents the design and implementation of several fundamental dense linear algebra (DLA) algorithms in OpenCL. In particular, these are linear system solvers and eigenvalue problem solvers. Further, we give an overview of the clMAGMA library, an open source, high performance OpenCL library that incorporates various optimizations, and in general provides the DLA functionality of the popular LAPACK library on heterogeneous architectures. The LAPACK compliance and use of OpenCL simplify the use of clMAGMA in applications, while providing them with portable performance. High performance is obtained through the use of the high-performance OpenCL BLAS, hardware- and OpenCL-specific tuning, and a hybridization methodology, where we split the algorithm into computational tasks of various granularities. Execution of those tasks is efficiently scheduled over the heterogeneous hardware components by minimizing data movements and mapping algorithmic requirements to the architectural strengths of the various heterogeneous hardware components.

Original languageEnglish
Title of host publicationProceedings of the International Workshop on OpenCL 2013 and 2014, IWOCL 2014
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450330077
DOIs
StatePublished - May 12 2014
EventInternational Workshop on OpenCL 2013 and 2014, IWOCL 2014 - Bristol, United Kingdom
Duration: May 11 2014May 12 2014

Publication series

NameACM International Conference Proceeding Series
Volume11-12-May-2014

Conference

ConferenceInternational Workshop on OpenCL 2013 and 2014, IWOCL 2014
Country/TerritoryUnited Kingdom
CityBristol
Period05/11/1405/12/14

Fingerprint

Dive into the research topics of 'cLMAGMA: High performance dense linear algebra with OpenCL'. Together they form a unique fingerprint.

Cite this