Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency

Hatem Ltaief, Piotr Luszczek, Jack Dongarra

Research output: Contribution to journal > Article > peer-review

16 Scopus citations

Abstract

This paper presents the power profile of two high performance dense linear algebra libraries, i.e., LAPACK and PLASMA. The former is based on block algorithms that use the fork-join paradigm to achieve parallel performance. The latter uses fine-grained task parallelism that recasts the computation to operate on submatrices called tiles. In this way, tile algorithms are formed. We show results from the power profiling of the most common routines, which permits us to clearly identify the different phases of the computations. This allows us to isolate the bottlenecks in terms of energy efficiency. Our results show that PLASMA surpasses LAPACK not only in terms of performance but also in terms of energy efficiency.

Original language: English
Pages (from-to): 277-287
Number of pages: 11
Journal: Computer Science - Research and Development
Volume: 27
Issue number: 4
State: Published - Nov 2012
Externally published: Yes

Keywords

  • Dense linear algebra
  • Energy efficiency
  • Multicore architectures
  • Power profile
  • Tile algorithms
