Abstract
Traditional algorithms for first-principles molecular dynamics (FPMD) simulations only gain a modest capability increase from current petascale computers, due to their O(N3 ) complexity and their heavy use of global communications. To address this issue, we are developing a truly scalable O(N) complexity FPMD algorithm, based on density functional theory (DFT), which avoids global communications. The computational model uses a general nonorthogonal orbital formulation for the DFT energy functional, which requires knowledge of selected elements of the inverse of the associated overlap matrix. We present a scalable algorithm for approximately computing selected entries of the inverse of the overlap matrix, based on an approximate inverse technique, by inverting local blocks corresponding to principal submatrices of the global overlap matrix. The new FPMD algorithm exploits sparsity and uses nearest neighbor communication to provide a computational scheme capable of extreme scalability. Accuracy is controlled by the mesh spacing of the finite difference discretization, the size of the localization regions in which the electronic orbitals are confined, and a cutoff beyond which the entries of the overlap matrix can be omitted when computing selected entries of its inverse. We demonstrate the algorithm's excellent parallel scaling for up to O(100K) atoms on O(100K) processors, with a wall-clock time of O(1) minute per molecular dynamics time step.
Original language | English |
---|---|
Pages (from-to) | C353-C375 |
Journal | SIAM Journal on Scientific Computing |
Volume | 36 |
Issue number | 4 |
DOIs | |
State | Published - 2014 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2014 Society for Industrial and Applied Mathematics.
Keywords
- Density functional theory
- Gram matrix inverse
- Large scale molecular dynamics
- Linear scaling algorithms
- Parallel approximate inverse