Abstract
Based on the premise that preconditioners needed for scientific computing are not only required to be robust in the numerical sense, but also scalable for up to thousands of light-weight cores, we argue that this two-fold goal is achieved for the recently developed self-adaptive multi-elimination preconditioner. For this purpose, we revise the underlying idea and analyze the performance of implementations realized in the PARALUTION and MAGMA open-source software libraries on GPU architectures (using either CUDA or OpenCL), Intel’s Many Integrated Core Architecture, and Intel’s Sandy Bridge processor. The comparison with other well-established preconditioners like multi-coloured Gauss- Seidel, ILU(0) and multi-colored ILU(0), shows that the twofold goal of a numerically stable cross-platform performant algorithm is achieved.
| Original language | English |
|---|---|
| Title of host publication | High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Revised Selected Papers |
| Editors | Osni Marques, Michel Dayde, Kengo Nakajima |
| Publisher | Springer Verlag |
| Pages | 115-123 |
| Number of pages | 9 |
| ISBN (Print) | 9783319173528 |
| DOIs | |
| State | Published - 2015 |
| Event | 11th International Conference on High Performance Computing for Computational Science, VECPAR 2014 - Eugene, United States Duration: Jun 30 2014 → Jul 3 2014 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 8969 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 11th International Conference on High Performance Computing for Computational Science, VECPAR 2014 |
|---|---|
| Country/Territory | United States |
| City | Eugene |
| Period | 06/30/14 → 07/3/14 |
Funding
This work has been supported by the Linnaeus centre of excellence UPMARC, Uppsala Programming for Multicore Architectures Research Center, the Russian Scientific Fund (Agreement N14-11-00190), DOE grant #DE-SC0010042, NVIDIA, and the NSF grant # ACI-1339822.