Abstract
One of the main obstacles to the efficient solution of scientific problems is the problem of tuning software, both to the available architecture and to the user problem at hand. We describe approaches for obtaining tuned high-performance kernels and for automatically choosing suitable algorithms. Specifically, we describe the generation of dense and sparse Basic Linear Algebra Subprograms (BLAS) kernels, and the selection of linear solver algorithms. However, the ideas presented here extend beyond these areas, which can be considered proof of concept.
Original language | English |
---|---|
Pages (from-to) | 293-311 |
Number of pages | 19 |
Journal | Proceedings of the IEEE |
Volume | 93 |
Issue number | 2 |
DOIs | |
State | Published - Feb 2005 |
Keywords
- Adaptive methods
- Basic Linear Algebra Subprograms (BLAS)
- Dense kernels
- Iterative methods
- Linear systems
- Matrix-matrix product
- Matrix-vector product
- Performance optimization
- Preconditioners
- Sparse kernels