Abstract
Climate and weather can be predicted statistically via geospatial Maximum Likelihood Estimates (MLE), as an alternative to running large ensembles of forward models. The MLE-based iterative optimization procedure requires the solving of large-scale linear systems that performs a Cholesky factorization on a symmetric positive-definite covariance matrix-a demanding dense factorization in terms of memory footprint and computation. We propose a novel solution to this problem: at the mathematical level, we reduce the computational requirement by exploiting the data sparsity structure of the matrix off-diagonal tiles by means of low-rank approximations; and, at the programming-paradigm level, we integrate PaRSEC, a dynamic, task-based runtime to reach unparalleled levels of efficiency for solving extreme-scale linear algebra matrix operations. The resulting solution leverages fine-grained computations to facilitate asynchronous execution while providing a flexible data distribution to mitigate load imbalance. Performance results are reported using 3D synthetic datasets up to 42M geospatial locations on 130, 000 cores, which represent a cornerstone toward fast and accurate predictions of environmental applications.
Original language | English |
---|---|
Title of host publication | Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2020 |
Publisher | Association for Computing Machinery |
ISBN (Electronic) | 9781450379939 |
DOIs | |
State | Published - Jun 29 2020 |
Event | 7th Annual Platform for Advanced Scientific Computing Conference, PASC 2020 - Geneva, Switzerland Duration: Jun 29 2020 → Jul 1 2020 |
Publication series
Name | Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2020 |
---|
Conference
Conference | 7th Annual Platform for Advanced Scientific Computing Conference, PASC 2020 |
---|---|
Country/Territory | Switzerland |
City | Geneva |
Period | 06/29/20 → 07/1/20 |
Funding
This research was supported in part by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. Department of Energy Office of Science and the National Nuclear Security Administration. The authors would like also to thank Cray Inc. and Intel in the context of the Cray Center of Excellence and Intel Parallel Computing Center awarded to the Extreme Computing Research Center at KAUST. For computer time, this research used the Shaheen-2 supercomputer hosted at the Supercomputing Laboratory at KAUST.
Keywords
- Asynchronous execution
- Dynamic runtime system
- High performance computing
- Load balancing
- Low-rank matrix computations