Abstract
Many applications, ranging from big data analytics to nanostructure designs, require the solution of large dense singular value decomposition (SVD) or eigenvalue problems. A first step in the solution methodology for these problems is the reduction of the matrix at hand to condensed form by two-sided orthogonal transformations. This step is standardly used to significantly accelerate the solution process. We present a performance analysis of the main two-sided factorizations used in these reductions: the bidiagonalization, tridiagonalization, and the upper Hessenberg factorizations on heterogeneous systems of multicore CPUs and Xeon Phi coprocessors. We derive a performance model and use it to guide the analysis and to evaluate performance. We develop optimized implementations for these methods that get up to 80% of the optimal performance bounds.
Original language | English |
---|---|
Pages (from-to) | 180-190 |
Number of pages | 11 |
Journal | Procedia Computer Science |
Volume | 51 |
Issue number | 1 |
DOIs | |
State | Published - 2015 |
Event | International Conference on Computational Science, ICCS 2002 - Amsterdam, Netherlands Duration: Apr 21 2002 → Apr 24 2002 |
Funding
This material is based upon work supported by the National Science Foundation under Grant No. ACI-1339822, the Department of Energy, Intel and the Russian Scientific Fund, Agreement N14-11-00190.
Keywords
- Eigensolver
- Multicore
- Task-based programming
- Xeon Phi