TY - JOUR
T1 - Massively parallel and linear-scaling algorithm for second-order Møller–Plesset perturbation theory applied to the study of supramolecular wires
AU - Kjærgaard, Thomas
AU - Baudin, Pablo
AU - Bykov, Dmytro
AU - Eriksen, Janus Juul
AU - Ettenhuber, Patrick
AU - Kristensen, Kasper
AU - Larkin, Jeff
AU - Liakh, Dmitry
AU - Pawłowski, Filip
AU - Vose, Aaron
AU - Wang, Yang Min
AU - Jørgensen, Poul
N1 - Publisher Copyright:
© 2016 Elsevier B.V.
PY - 2017/3/1
Y1 - 2017/3/1
N2 - We present a scalable cross-platform hybrid MPI/OpenMP/OpenACC implementation of the Divide–Expand–Consolidate (DEC) formalism with portable performance on heterogeneous HPC architectures. The Divide–Expand–Consolidate formalism is designed to reduce the steep computational scaling of conventional many-body methods employed in electronic structure theory to linear scaling, while providing a simple mechanism for controlling the error introduced by this approximation. Our massively parallel implementation of this general scheme has three levels of parallelism, being a hybrid of the loosely coupled task-based parallelization approach and the conventional MPI +X programming model, where X is either OpenMP or OpenACC. We demonstrate strong and weak scalability of this implementation on heterogeneous HPC systems, namely on the GPU-based Cray XK7 Titan supercomputer at the Oak Ridge National Laboratory. Using the “resolution of the identity second-order Møller–Plesset perturbation theory” (RI-MP2) as the physical model for simulating correlated electron motion, the linear-scaling DEC implementation is applied to 1-aza-adamantane-trione (AAT) supramolecular wires containing up to 40 monomers (2440 atoms, 6800 correlated electrons, 24 440 basis functions and 91 280 auxiliary functions). This represents the largest molecular system treated at the MP2 level of theory, demonstrating an efficient removal of the scaling wall pertinent to conventional quantum many-body methods.
AB - We present a scalable cross-platform hybrid MPI/OpenMP/OpenACC implementation of the Divide–Expand–Consolidate (DEC) formalism with portable performance on heterogeneous HPC architectures. The Divide–Expand–Consolidate formalism is designed to reduce the steep computational scaling of conventional many-body methods employed in electronic structure theory to linear scaling, while providing a simple mechanism for controlling the error introduced by this approximation. Our massively parallel implementation of this general scheme has three levels of parallelism, being a hybrid of the loosely coupled task-based parallelization approach and the conventional MPI +X programming model, where X is either OpenMP or OpenACC. We demonstrate strong and weak scalability of this implementation on heterogeneous HPC systems, namely on the GPU-based Cray XK7 Titan supercomputer at the Oak Ridge National Laboratory. Using the “resolution of the identity second-order Møller–Plesset perturbation theory” (RI-MP2) as the physical model for simulating correlated electron motion, the linear-scaling DEC implementation is applied to 1-aza-adamantane-trione (AAT) supramolecular wires containing up to 40 monomers (2440 atoms, 6800 correlated electrons, 24 440 basis functions and 91 280 auxiliary functions). This represents the largest molecular system treated at the MP2 level of theory, demonstrating an efficient removal of the scaling wall pertinent to conventional quantum many-body methods.
KW - Linear Scaling Quantum Chemistry
KW - Massively Parallel Quantum Chemistry Implementation
KW - Method development
KW - Supramolecular wires
UR - http://www.scopus.com/inward/record.url?scp=85006984587&partnerID=8YFLogxK
U2 - 10.1016/j.cpc.2016.11.002
DO - 10.1016/j.cpc.2016.11.002
M3 - Article
AN - SCOPUS:85006984587
SN - 0010-4655
VL - 212
SP - 152
EP - 160
JO - Computer Physics Communications
JF - Computer Physics Communications
ER -