Performance analysis and optimisation of two-sided factorization algorithms for heterogeneous platform

Khairul Kabir, Azzam Haidar, Stanimire Tomov, Jack Dongarra

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

Many applications, ranging from big data analytics to nanostructure designs, require the solution of large dense singular value decomposition (SVD) or eigenvalue problems. A first step in the solution methodology for these problems is the reduction of the matrix at hand to condensed form by two-sided orthogonal transformations. This step is standardly used to significantly accelerate the solution process. We present a performance analysis of the main two-sided factorizations used in these reductions: the bidiagonalization, tridiagonalization, and the upper Hessenberg factorizations on heterogeneous systems of multicore CPUs and Xeon Phi coprocessors. We derive a performance model and use it to guide the analysis and to evaluate performance. We develop optimized implementations for these methods that get up to 80% of the optimal performance bounds.

Original languageEnglish
Pages (from-to)180-190
Number of pages11
JournalProcedia Computer Science
Volume51
Issue number1
DOIs
StatePublished - 2015
EventInternational Conference on Computational Science, ICCS 2002 - Amsterdam, Netherlands
Duration: Apr 21 2002Apr 24 2002

Funding

This material is based upon work supported by the National Science Foundation under Grant No. ACI-1339822, the Department of Energy, Intel and the Russian Scientific Fund, Agreement N14-11-00190.

Keywords

  • Eigensolver
  • Multicore
  • Task-based programming
  • Xeon Phi

Fingerprint

Dive into the research topics of 'Performance analysis and optimisation of two-sided factorization algorithms for heterogeneous platform'. Together they form a unique fingerprint.

Cite this